BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 010945
         (497 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  743 bits (1919), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/470 (77%), Positives = 412/470 (87%), Gaps = 4/470 (0%)

Query: 28  LTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPAT 87
           + +L   P    +   F+ TC     S  TNPP   V+TFW+WL DQ VVS K+P++P  
Sbjct: 19  IKTLKNPPIIHSRRPPFSLTCL---RSLETNPPPP-VQTFWKWLFDQGVVSGKTPVKPGI 74

Query: 88  FPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK 147
            PEGLGLVAQRDIA+NE VLEVP +FWINPD VAASEIGS+C GLKPW+SVALFLIREK 
Sbjct: 75  VPEGLGLVAQRDIARNEAVLEVPKRFWINPDAVAASEIGSVCGGLKPWVSVALFLIREKL 134

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           +++SPWR YLDILPE T+ST++WSEEELVE+QGTQL +TTLGVKEYVQ+E+LKVEEE+IL
Sbjct: 135 RDESPWRSYLDILPEYTNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVEEEVIL 194

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIK 267
           P+ QLFP P+TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP ITTE+YA+EIK
Sbjct: 195 PHSQLFPFPVTLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPSITTEEYAWEIK 254

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           GAGLFSRD LFSLRTPV VKAGEQVLIQYDL+KSNAELALDYGFIES+ +RN+YTLTLEI
Sbjct: 255 GAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEI 314

Query: 328 SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
           SESDPFFGDKLDIAE+NGL E AYFDIVLG++LP AML YLRLVALGG DAFLLESIFRN
Sbjct: 315 SESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAAMLPYLRLVALGGPDAFLLESIFRN 374

Query: 388 TIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
           TIWGHL+LPVS ANEELIC+V++DACKSALSG+ TTIEEDE+L + GNL PRL IAVGVR
Sbjct: 375 TIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVR 434

Query: 448 AGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWEPK 497
            GEK+VLQQID IF+ERE ELD+LEYYQERRLKDLGL GEQG+IIFWE K
Sbjct: 435 TGEKKVLQQIDGIFRERERELDQLEYYQERRLKDLGLCGEQGEIIFWESK 484


>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 502

 Score =  738 bits (1904), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/507 (74%), Positives = 424/507 (83%), Gaps = 27/507 (5%)

Query: 9   FSLSTSPPTFVSSSKTLKLLTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFW 68
           F+LS+S  TF+S    LK  + +TK PSF LK   F  + SL S++  T PP + +E FW
Sbjct: 5   FTLSSS--TFLSP---LKPFSPVTKIPSFHLKKPFFITSSSLQSSA--TAPPQS-LEIFW 56

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
           QWL DQ VVS KSP +P    EGLGL+A+RDIA+NEVVLE+P K WINPD VAAS+IG++
Sbjct: 57  QWLSDQGVVSGKSPAKPGVVKEGLGLIAERDIARNEVVLEIPKKLWINPDAVAASDIGNV 116

Query: 129 CSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFW----------------- 170
           CSGLKPWISVALFLIREK KKE S W  YLDILP+ T+ST++W                 
Sbjct: 117 CSGLKPWISVALFLIREKLKKEGSTWWPYLDILPDTTNSTIYWWVLLVAFYVLVLSFQRR 176

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILR 230
           SEEEL ELQGTQLL TTLGVKEY+Q E+ KVEEEI+LP+K+LFP PITLDDFLWAFGILR
Sbjct: 177 SEEELAELQGTQLLRTTLGVKEYMQREFAKVEEEILLPHKELFPSPITLDDFLWAFGILR 236

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           SRAFSRLRGQNLVLIPLADLINHSP ITTEDYAYEIKG GLFSR+LLFSLR+P+ VK+GE
Sbjct: 237 SRAFSRLRGQNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSRELLFSLRSPISVKSGE 296

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
           QVLIQYDLNKSNAELALDYGFIE   DRN YTLTL+ISESDPFFGDKLDIAETNG GE+A
Sbjct: 297 QVLIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGSGETA 356

Query: 351 YFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVR 410
            FDIVLG  LPPAML YLRLVALGGTDAFLLESIFRNTIWGHL+LP+S ANEELICRVVR
Sbjct: 357 DFDIVLGNPLPPAMLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEELICRVVR 416

Query: 411 DACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDE 470
           DACKSALSG+HTTIEEDE+ L+  +L+PRL IAVG+RAGEK+VLQQI+++FK+RE ELDE
Sbjct: 417 DACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQQINDVFKQRESELDE 475

Query: 471 LEYYQERRLKDLGLVGEQGDIIFWEPK 497
            EYYQERRLK+LGLVGEQG+IIFWE K
Sbjct: 476 FEYYQERRLKELGLVGEQGEIIFWESK 502


>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
          Length = 507

 Score =  732 bits (1889), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/462 (77%), Positives = 405/462 (87%), Gaps = 4/462 (0%)

Query: 28  LTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPAT 87
           + +L   P    +   F+ TC     S  TNPP   V+TFW+WL DQ VVS K+P++P  
Sbjct: 19  IKTLKNPPIIHSRRPPFSLTCL---RSLETNPPPP-VQTFWKWLFDQGVVSGKTPVKPGI 74

Query: 88  FPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK 147
            PEGLGLVAQRDIA+NE VLEVP +FWINPD VAASEIGS+C GLKPW+SVALFLIREK 
Sbjct: 75  VPEGLGLVAQRDIARNEAVLEVPKRFWINPDAVAASEIGSVCGGLKPWVSVALFLIREKL 134

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           +++SPWR YLDILPE T+ST++WSEEELVE+QGTQL +TTLGVKEYVQ+E+LKVEEE+IL
Sbjct: 135 RDESPWRSYLDILPEYTNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVEEEVIL 194

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIK 267
           P+ QLFP P+TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP ITTE+YA+EIK
Sbjct: 195 PHSQLFPFPVTLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPSITTEEYAWEIK 254

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           GAGLFSRD LFSLRTPV VKAGEQVLIQYDL+KSNAELALDYGFIES+ +RN+YTLTLEI
Sbjct: 255 GAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEI 314

Query: 328 SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
           SESDPFFGDKLDIAE+NGL E AYFDIVLG++LP AML YLRLVALGG DAFLLESIFRN
Sbjct: 315 SESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAAMLPYLRLVALGGPDAFLLESIFRN 374

Query: 388 TIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
           TIWGHL+LPVS ANEELIC+V++DACKSALSG+ TTIEEDE+L + GNL PRL IAVGVR
Sbjct: 375 TIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVR 434

Query: 448 AGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQG 489
            GEK+VLQQID IF+ERE ELD+LEYYQERRLKDLGL GEQG
Sbjct: 435 TGEKKVLQQIDGIFRERERELDQLEYYQERRLKDLGLCGEQG 476


>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 482

 Score =  727 bits (1876), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/444 (78%), Positives = 394/444 (88%)

Query: 54  SATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKF 113
           SA ++     V  FW+WL DQ VVS KSP  PA  PEGLGLVA+RDI +NEVVLE+P + 
Sbjct: 39  SALSSELPENVRNFWKWLGDQGVVSGKSPAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL 98

Query: 114 WINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEE 173
           WINP+TV AS+IG LC GLKPW+SVALFLIREK +E+S WR+YLD+LP+ TDSTVFWSEE
Sbjct: 99  WINPETVTASKIGPLCGGLKPWVSVALFLIREKYEEESSWRLYLDMLPQSTDSTVFWSEE 158

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRA 233
           EL EL+GTQLLSTTLGVKEYV+NE+LK+E+EI+LPNK LF   ITLDDF+WAFGIL+SRA
Sbjct: 159 ELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRITLDDFIWAFGILKSRA 218

Query: 234 FSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
           FSRLRGQNLVLIPLADLINH+P ITTEDYAYEIKGAGLFSRDLLFSL++PV VKAGEQV 
Sbjct: 219 FSRLRGQNLVLIPLADLINHNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVY 278

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           IQYDLNKSNAELALDYGF+ES  +RN+YTLT+EI ESDPFFGDKLDIAETN +GE+ YFD
Sbjct: 279 IQYDLNKSNAELALDYGFVESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFD 338

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDAC 413
           +V G+TLP  MLQYLRLVALGG+DAFLLESIF NTIWGHL+LPVS +NEELICRVVRDAC
Sbjct: 339 VVDGQTLPAGMLQYLRLVALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDAC 398

Query: 414 KSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEY 473
           KSALSGF TTIEEDE+LL+ G LDPRL +A+ +R GEKRVLQQID+IFK+RELELD LEY
Sbjct: 399 KSALSGFSTTIEEDEKLLEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKDRELELDVLEY 458

Query: 474 YQERRLKDLGLVGEQGDIIFWEPK 497
           YQERRLKDLGLVGEQGDIIFWE K
Sbjct: 459 YQERRLKDLGLVGEQGDIIFWETK 482


>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
 gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase, chloroplastic; AltName:
           Full=Aldolases N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase-like; Short=AtLSMT-L;
           Short=LSMT-like enzyme; Flags: Precursor
 gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
          Length = 482

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/466 (76%), Positives = 400/466 (85%), Gaps = 3/466 (0%)

Query: 32  TKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEG 91
           ++ PSF L SR   +     SAS++  P    V  FW+WLRDQ VVS KS   PA  PEG
Sbjct: 20  SQNPSF-LFSRPKKSLVRPISASSSELP--ENVRNFWKWLRDQGVVSGKSVAEPAVVPEG 76

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDS 151
           LGLVA+RDI +NEVVLE+P + WINP+TV AS+IG LC GLKPW+SVALFLIREK +E+S
Sbjct: 77  LGLVARRDIGRNEVVLEIPKRLWINPETVTASKIGPLCGGLKPWVSVALFLIREKYEEES 136

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            WRVYLD+LP+ TDSTVFWSEEEL EL+GTQLLSTTLGVKEYV+NE+LK+E+EI+LPNK 
Sbjct: 137 SWRVYLDMLPQSTDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKD 196

Query: 212 LFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGL 271
           LF   ITLDDF+WAFGIL+SRAFSRLRGQNLVLIPLADLINH+P I TEDYAYEIKGAGL
Sbjct: 197 LFSSRITLDDFIWAFGILKSRAFSRLRGQNLVLIPLADLINHNPAIKTEDYAYEIKGAGL 256

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
           FSRDLLFSL++PV VKAGEQV IQYDLNKSNAELALDYGF+ES   RN+YTLT+EI ESD
Sbjct: 257 FSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESD 316

Query: 332 PFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG 391
           PFFGDKLDIAE+N +GE+ YFDIV G+TLP  MLQYLRLVALGG DAFLLESIF NTIWG
Sbjct: 317 PFFGDKLDIAESNKMGETGYFDIVDGQTLPAGMLQYLRLVALGGPDAFLLESIFNNTIWG 376

Query: 392 HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEK 451
           HL+LPVS  NEELICRVVRDACKSALSGF TTIEEDE+LL  G L+PRL +A+ +R GEK
Sbjct: 377 HLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEK 436

Query: 452 RVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWEPK 497
           RVLQQID+IFK+RELELD LEYYQERRLKDLGLVGEQGDIIFWE K
Sbjct: 437 RVLQQIDQIFKDRELELDILEYYQERRLKDLGLVGEQGDIIFWETK 482


>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
 gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
          Length = 490

 Score =  700 bits (1807), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/501 (73%), Positives = 414/501 (82%), Gaps = 30/501 (5%)

Query: 9   FSLSTSPPTFVSSSKTLKLLTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQ----V 64
           F+LS SP      SK  K  T  TK PS   K           S ++  +PP A     V
Sbjct: 5   FTLSLSP------SKPFKNSTFFTKNPSLHFKKPL--------SVTSFQSPPAAAAAAAV 50

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE 124
           +TFWQWL DQ VVS K+P RP   P+GLGLVAQRDI++NEVVLE+P K WINPD VAASE
Sbjct: 51  QTFWQWLSDQDVVSAKTPARPGLVPQGLGLVAQRDISRNEVVLEIPKKLWINPDVVAASE 110

Query: 125 IGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
           IG++C G+KPW+SVALFLIREK KEDS WR YLD+LPE T+ST+FWSEEEL ELQGTQLL
Sbjct: 111 IGNVCGGVKPWVSVALFLIREKLKEDSTWRPYLDVLPESTNSTIFWSEEELAELQGTQLL 170

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVL 244
           STTLGVK Y++ E+LKVEEEI++P+KQLFP P+TLDDF WAFGILRSR+FSRLRGQNLVL
Sbjct: 171 STTLGVKSYLRREFLKVEEEILVPHKQLFPSPVTLDDFSWAFGILRSRSFSRLRGQNLVL 230

Query: 245 IPLADL-----------INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
           IPLADL           +NHSP IT ED  YEIKGAGLFSRDL+FSLR+P+ +KAGEQVL
Sbjct: 231 IPLADLCNFLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSRDLIFSLRSPISLKAGEQVL 290

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           IQY+LN SNAELA+DYGFIE+KSDRN YTLTL+ISESDPFFGDKLDIAETNGLGE A FD
Sbjct: 291 IQYNLNLSNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIADFD 350

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDAC 413
           IVLG  LPP +L YLRLVALGGTD+FLLESIFRNTIWGHL+LPVS ANEELICRVVRDAC
Sbjct: 351 IVLGNPLPPTLLPYLRLVALGGTDSFLLESIFRNTIWGHLELPVSRANEELICRVVRDAC 410

Query: 414 KSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEY 473
           KSALSG+HTTIEEDE+ L+G  L+PRL IAVG+RAGEK+VLQQI+EIFK+R+ ELDELEY
Sbjct: 411 KSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEIFKQRQSELDELEY 469

Query: 474 YQERRLKDLGLVGEQGDIIFW 494
           YQERRLKDLGLVGEQG+IIFW
Sbjct: 470 YQERRLKDLGLVGEQGEIIFW 490


>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/437 (75%), Positives = 385/437 (88%), Gaps = 2/437 (0%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + V TFWQW+R + +VS K+ ++PA FPEGLGL   ++++KNEVVLEVP +FWINPD VA
Sbjct: 68  SNVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWINPDAVA 127

Query: 122 ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
            SEIG++CSGLKPWISVALFLIRE  K DS WR YLDILP+ TDSTVFWSEEEL E+QGT
Sbjct: 128 DSEIGNVCSGLKPWISVALFLIRENLKGDSRWRRYLDILPQETDSTVFWSEEELAEIQGT 187

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN 241
           QLLSTTL VKEYV++E+LKVEEEI+L +K LFP  ITLDDF WAFGILRSRAFSRLRGQN
Sbjct: 188 QLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRITLDDFFWAFGILRSRAFSRLRGQN 247

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
           LVLIP ADL+NHS  +TTE++A+E+KG AGLFS D+LFSLR+P+ VKAG+QV IQYDL K
Sbjct: 248 LVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYDLKK 307

Query: 301 SNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTL 360
           SNA+LALDYGFIE KSDRNAYTLTLEI ESD FF DKLDIAETNGL ++AYFDI+L R  
Sbjct: 308 SNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIILERPF 367

Query: 361 PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGF 420
           PPAML +LRL+ALGGTDAFLLES+FRN++WGHL++PVS ANEELIC+VVR+AC++ALSG+
Sbjct: 368 PPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGY 427

Query: 421 HTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLK 480
           HTTIEEDE+ L+  NLD RLRIAVG+R GEKRVLQQI +IFK+RELELD+LEYYQERRLK
Sbjct: 428 HTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKDRELELDQLEYYQERRLK 486

Query: 481 DLGLVGEQGDIIFWEPK 497
           DLGL GEQG+IIFWE K
Sbjct: 487 DLGLCGEQGEIIFWETK 503


>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
           methyltransferase; Short=rbcMT; Flags: Precursor
 gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
 gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
          Length = 491

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/435 (75%), Positives = 390/435 (89%), Gaps = 1/435 (0%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           V+TFWQWL  + VV+ K+P++P   PEGLGLVA+RDIAK E VL+VP +FWINPD VA S
Sbjct: 57  VQTFWQWLCKEGVVTTKTPVKPGIVPEGLGLVAKRDIAKGETVLQVPKRFWINPDAVAES 116

Query: 124 EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL 183
           EIG++CSGLKPWISVALFL+REK ++DS W+ Y+D+LP+ TDST++WSEEEL E+QGTQL
Sbjct: 117 EIGNVCSGLKPWISVALFLLREKWRDDSKWKYYMDVLPKSTDSTIYWSEEELSEIQGTQL 176

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
           LSTT+ VK+YVQNE+ KVEEE+IL NKQLFP PITLDDF WAFGILRSRAFSRLR QNL+
Sbjct: 177 LSTTMSVKDYVQNEFQKVEEEVILRNKQLFPFPITLDDFFWAFGILRSRAFSRLRNQNLI 236

Query: 244 LIPLADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
           L+P ADL NH+  +TTED+A+E++G AGLFS DLLFSLR+P+ +KAG+Q+ IQYDLNKSN
Sbjct: 237 LVPFADLTNHNARVTTEDHAHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSN 296

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP 362
           A++ALDYGFIE  S R+A+TLTLEISESD F+GDKLDIAETNG+GE+AYFDI +G++LPP
Sbjct: 297 ADMALDYGFIEPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPP 356

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            M+ YLRLVALGGTDAFLLESIFRN++WGHL LPVS ANEELIC+VVRDACKSALSG+HT
Sbjct: 357 TMIPYLRLVALGGTDAFLLESIFRNSVWGHLGLPVSRANEELICKVVRDACKSALSGYHT 416

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDL 482
           TIEEDE+L++ GNL  RL+IAVG+R GEKRVL+QID+IF+ERELELDELEYY ERRLKDL
Sbjct: 417 TIEEDEKLMEEGNLSTRLQIAVGIRLGEKRVLKQIDDIFRERELELDELEYYGERRLKDL 476

Query: 483 GLVGEQGDIIFWEPK 497
           GLVGEQGDIIFWEPK
Sbjct: 477 GLVGEQGDIIFWEPK 491


>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score =  677 bits (1748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/437 (75%), Positives = 384/437 (87%), Gaps = 2/437 (0%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + V TFWQW+R + +VS K+ ++PA FPEGLGL   ++++KNEVVLEVP +FWINPD VA
Sbjct: 68  SNVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWINPDAVA 127

Query: 122 ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
            SEIG++CSGLKPWISVALFLIRE  K DS WR YLDILP+ TDSTVFWSEEEL E+QGT
Sbjct: 128 DSEIGNVCSGLKPWISVALFLIRENLKGDSRWRRYLDILPQETDSTVFWSEEELAEIQGT 187

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN 241
           QLLSTTL VKEYV++E+LKVEEEI+L +K LFP  ITLDDF WAFGILRSRAFSRLRGQN
Sbjct: 188 QLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRITLDDFFWAFGILRSRAFSRLRGQN 247

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
           LVLIP ADL+NHS  +TTE++A+E+KG AGLFS D+L SLR+P+ VKAG+QV IQYDL K
Sbjct: 248 LVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYDLKK 307

Query: 301 SNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTL 360
           SNA+LALDYGFIE KSDRNAYTLTLEI ESD FF DKLDIAETNGL ++AYFDI+L R  
Sbjct: 308 SNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIILERPF 367

Query: 361 PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGF 420
           PPAML +LRL+ALGGTDAFLLES+FRN++WGHL++PVS ANEELIC+VVR+AC++ALSG+
Sbjct: 368 PPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGY 427

Query: 421 HTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLK 480
           HTTIEEDE+ L+  NLD RLRIAVG+R GEKRVLQQI +IFK+RELELD+LEYYQERRLK
Sbjct: 428 HTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKDRELELDQLEYYQERRLK 486

Query: 481 DLGLVGEQGDIIFWEPK 497
           DLGL GEQG+IIFWE K
Sbjct: 487 DLGLCGEQGEIIFWETK 503


>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
          Length = 474

 Score =  674 bits (1740), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/466 (72%), Positives = 383/466 (82%), Gaps = 11/466 (2%)

Query: 32  TKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEG 91
           ++ PSF L SR   +     SAS++  P    V  FW+WLRDQ VVS KS   PA  PEG
Sbjct: 20  SQNPSF-LFSRPKKSLVRPISASSSELP--ENVRNFWKWLRDQGVVSGKSVAEPAVVPEG 76

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDS 151
           LGLVA+RDI +NEVVLE+P + WINP+TV AS+IG LC GLKPW+SVALFLIREK +E+S
Sbjct: 77  LGLVARRDIGRNEVVLEIPKRLWINPETVTASKIGPLCGGLKPWVSVALFLIREKYEEES 136

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            WRVYLD+LP+ TDSTVFWSEEEL EL+GTQLLSTTLGVKEYV+NE+LK+E+EI+LPNK 
Sbjct: 137 SWRVYLDMLPQSTDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKD 196

Query: 212 LFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGL 271
           LF   ITLDDF+WAFGIL   + + +           + INH+P I TEDYAYEIKGAGL
Sbjct: 197 LFSSRITLDDFIWAFGILNRESLTSMFE--------FEQINHNPAIKTEDYAYEIKGAGL 248

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
           FSRDLLFSL++PV VKAGEQV IQYDLNKSNAELALDYGF+ES   RN+YTLT+EI ESD
Sbjct: 249 FSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESD 308

Query: 332 PFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG 391
           PFFGDKLDIAE+N +GE+ YFDIV G+TLP  MLQYLRLVALGG DAFLLESIF NTIWG
Sbjct: 309 PFFGDKLDIAESNKMGETGYFDIVDGQTLPAGMLQYLRLVALGGPDAFLLESIFNNTIWG 368

Query: 392 HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEK 451
           HL+LPVS  NEELICRVVRDACKSALSGF TTIEEDE+LL  G L+PRL +A+ +R GEK
Sbjct: 369 HLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEK 428

Query: 452 RVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWEPK 497
           RVLQQID+IFK+RELELD LEYYQERRLKDLGLVGEQGDIIFWE K
Sbjct: 429 RVLQQIDQIFKDRELELDILEYYQERRLKDLGLVGEQGDIIFWETK 474


>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Spinacia oleracea]
          Length = 491

 Score =  664 bits (1713), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/489 (67%), Positives = 395/489 (80%), Gaps = 14/489 (2%)

Query: 11  LSTSPPTFVSSSKTLKL--LTSLTKKPSFR--LKSRAFAATCSLHSASATTNPPTAQVET 66
           + +S  TF++  KT +   L   T  P+F+  L  R F           T  PP  +++ 
Sbjct: 8   IPSSNSTFLNPFKTTQHSKLHFATPSPTFKNPLSIRCF-------RPPETDTPP--EIQK 58

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           FW WL D+ ++SPK P++P   PEGLGLVAQ+DI++NEVVLEVP KFWINPDTVAASEIG
Sbjct: 59  FWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWINPDTVAASEIG 118

Query: 127 SLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
           S+C+GLKPW+SVALFL+REKK    S W+ Y+DILP+ T+ST++WSEEEL ELQG+QLL+
Sbjct: 119 SVCNGLKPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTIYWSEEELSELQGSQLLN 178

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLI 245
           TTLGVKE V NE+ K+EEE+++P+KQLFP  +T DDF WAFG+LRSRAF+ L GQ+LVLI
Sbjct: 179 TTLGVKELVANEFAKLEEEVLVPHKQLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLI 238

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           PLADL NHSP IT   YA+EI+GAGLFSR+L+FSLR P PVKAG+QVLIQYDLNKSNAEL
Sbjct: 239 PLADLANHSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAEL 298

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAML 365
           ALDYG  ES+S+RNAYTLTLEI ESD F+GDKLDIAE+NG+GESAYFDIVL + LP  ML
Sbjct: 299 ALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVLEQPLPANML 358

Query: 366 QYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            YLRLVALGG DAFLLESIFRN+IWGHLDLP+S ANEELIC+V+RDAC SALSG+ TTI 
Sbjct: 359 PYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIA 418

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLV 485
           EDE+LL  G++DPRL IA+ +R GEK+VLQQIDE FKERE+EL   EYYQERRLKDLGL 
Sbjct: 419 EDEKLLAEGDIDPRLEIAITIRLGEKKVLQQIDEEFKEREMELGGYEYYQERRLKDLGLA 478

Query: 486 GEQGDIIFW 494
           G QG+ + W
Sbjct: 479 GAQGEKLPW 487


>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
           [Spinacia oleracea]
 gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase II [Spinacia oleracea]
          Length = 495

 Score =  658 bits (1698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/493 (66%), Positives = 395/493 (80%), Gaps = 18/493 (3%)

Query: 11  LSTSPPTFVSSSKTLKL--LTSLTKKPSFR--LKSRAFAATCSLHSASATTNPPTAQVET 66
           + +S  TF++  KT +   L   T  P+F+  L  R F           T  PP  +++ 
Sbjct: 8   IPSSNSTFLNPFKTTQHSKLHFATPSPTFKNPLSIRCF-------RPPETDTPP--EIQK 58

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           FW WL D+ ++SPK P++P   PEGLGLVAQ+DI++NEVVLEVP KFWINPDTVAASEIG
Sbjct: 59  FWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWINPDTVAASEIG 118

Query: 127 SLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
           S+C+GLKPW+SVALFL+REKK    S W+ Y+DILP+ T+ST++WSEEEL ELQG+QLL+
Sbjct: 119 SVCNGLKPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTIYWSEEELSELQGSQLLN 178

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLI 245
           TTLGVKE V NE+ K+EEE+++P+KQLFP  +T DDF WAFG+LRSRAF+ L GQ+LVLI
Sbjct: 179 TTLGVKELVANEFAKLEEEVLVPHKQLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLI 238

Query: 246 PLADL----INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
           PLADL     NHSP IT   YA+EI+GAGLFSR+L+FSLR P PVKAG+QVLIQYDLNKS
Sbjct: 239 PLADLWVQQANHSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKS 298

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           NAELALDYG  ES+S+RNAYTLTLEI ESD F+GDKLDIAE+NG+GESAYFDIVL + LP
Sbjct: 299 NAELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVLEQPLP 358

Query: 362 PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
             ML YLRLVALGG DAFLLESIFRN+IWGHLDLP+S ANEELIC+V+RDAC SALSG+ 
Sbjct: 359 ANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYS 418

Query: 422 TTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKD 481
           TTI EDE+LL  G++DPRL IA+ +R GEK+VLQQIDE FKERE+EL   EYYQERRLKD
Sbjct: 419 TTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVLQQIDEEFKEREMELGGYEYYQERRLKD 478

Query: 482 LGLVGEQGDIIFW 494
           LGL G QG+ + W
Sbjct: 479 LGLAGAQGEKLPW 491


>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
           Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
           Precursor
 gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
           N-methyltransferase [Pisum sativum]
          Length = 489

 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/487 (66%), Positives = 397/487 (81%), Gaps = 12/487 (2%)

Query: 9   FSLSTSPPTFVSSSKTLKLLTSLT-KKPSFRLKSRAFAATCSLHSASATTNPP-TAQVET 66
           FS  +  P    ++K     TS T K P   LK R+F+A      AS  T P  +  V+T
Sbjct: 5   FSGGSVSPFLFHTNKG----TSFTPKAPILHLK-RSFSAKSV---ASVGTEPSLSPAVQT 56

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           FW+WL+++ V++ K+P++ +   EGLGLVA +DI++N+V+L+VP + WINPD VAASEIG
Sbjct: 57  FWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDAVAASEIG 116

Query: 127 SLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLST 186
            +CS LKPW+SV LFLIRE+ +EDS W+ Y  ILP+ TDST++WSEEEL ELQG+QLL T
Sbjct: 117 RVCSELKPWLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKT 176

Query: 187 TLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIP 246
           T+ VKEYV+NE LK+E+EIILPNK+LFP P+TLDDF WAFGILRSRAFSRLR +NLV++P
Sbjct: 177 TVSVKEYVKNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVP 236

Query: 247 LADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           +ADLINHS G+TTED+AYE+KG AGLFS D LFSL++P+ VKAGEQV IQYDLNKSNAEL
Sbjct: 237 MADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAEL 296

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAML 365
           ALDYGFIE   +R+AYTLTLEISESDPFF DKLD+AE+NG  ++AYFDI   RTLPP +L
Sbjct: 297 ALDYGFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPPGLL 356

Query: 366 QYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            YLRLVALGGTDAFLLES+FR+TIWGHL+L VS  NEEL+C+ VR+ACKSAL+G+HTTIE
Sbjct: 357 PYLRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIE 416

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLV 485
           +D +L + GNLD RL IAVG+R GEK VLQQID IF+++ELELD+LEYYQERRLKDLGL 
Sbjct: 417 QDRELKE-GNLDSRLAIAVGIREGEKMVLQQIDGIFEQKELELDQLEYYQERRLKDLGLC 475

Query: 486 GEQGDII 492
           GE GDI+
Sbjct: 476 GENGDIL 482


>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/465 (70%), Positives = 386/465 (83%), Gaps = 4/465 (0%)

Query: 29  TSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATF 88
            S + K SF    R  +A C     +  +  P   V+TFWQWL+++ VVS K+P++P   
Sbjct: 19  NSFSSKGSFLHLKRPLSANCVASLGTEVSVSPA--VDTFWQWLKEEGVVSGKTPVKPGVV 76

Query: 89  PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK 148
           PEGLGLVA +DI++NEVVL+VP + WINPD VAASEIG +CSGLKPW++VALFLIRE+ +
Sbjct: 77  PEGLGLVALKDISRNEVVLQVPKRLWINPDAVAASEIGKVCSGLKPWLAVALFLIRERSR 136

Query: 149 EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
            DS W+ Y  ILP+ TDST++WSEEEL ELQGTQLL+TT  VK+YVQNE+ ++EEEII+P
Sbjct: 137 SDSLWKHYFSILPKETDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLEEEIIIP 196

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKG 268
           NK+LFP  ITLDDF WAFGILRSRAFSRLR +NLV+IPLADLINHS  +TT+D+AYEIKG
Sbjct: 197 NKKLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTDDHAYEIKG 256

Query: 269 -AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
            AGLFS D LFSLR+P+ +KAG+QV IQYDLNKSNAELALDYGFIE  +DRNAYTLTL+I
Sbjct: 257 AAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTDRNAYTLTLQI 316

Query: 328 SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
           SESDPFFGDKLDIAE+NG GE+AYFDI   R LPP +L YLRLVALGGTDAFLLESIFRN
Sbjct: 317 SESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPYLRLVALGGTDAFLLESIFRN 376

Query: 388 TIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
           +IWGHL+LPVS  NEELICRVVR+ CK+AL+G+HTTIEED++L +   LD R  IAVG+R
Sbjct: 377 SIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQKLKE-AKLDSRHAIAVGIR 435

Query: 448 AGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDII 492
            GEK +LQQIDEIFKE+ELEL +LEYYQERRLKDLGL GE GDI+
Sbjct: 436 EGEKNLLQQIDEIFKEKELELAQLEYYQERRLKDLGLCGENGDIL 480


>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
          Length = 444

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/430 (71%), Positives = 373/430 (86%), Gaps = 2/430 (0%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           V+TFW+WL+++ V++ K+P++ +   EGLGLVA +DI++N+V+L+VP + WINPD VAAS
Sbjct: 10  VQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDAVAAS 69

Query: 124 EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL 183
           EIG +CS LKPW+SV LFLIRE+ +EDS W+ Y  ILP+ TDST++WSEEEL ELQG+QL
Sbjct: 70  EIGRVCSELKPWLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQL 129

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
           L TT+ VKEYV+NE LK+E+EIILPNK+LFP P+TLDDF WAFGILRSRAFSRLR +NLV
Sbjct: 130 LKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLV 189

Query: 244 LIPLADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
           ++P+ADLINHS G+TTED+AYE+KG AGLFS D LFSL++P+ VKAGEQV IQYDLNKSN
Sbjct: 190 VVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSN 249

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP 362
           AELALDYGFIE   +R+AYTLTLEISESDPFF DKLD+AE+NG  ++AYFDI   RTLPP
Sbjct: 250 AELALDYGFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP 309

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +L YLRLVALGGTDAFLLES+FR+TIWGHL+L VS  NEEL+C+ VR+ACKSAL+G+HT
Sbjct: 310 GLLPYLRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHT 369

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDL 482
           TIE+D + L+ GNLD RL IAVG+R GEK VLQQID IF+++ELELD+LEYYQERRLKDL
Sbjct: 370 TIEQDRE-LKEGNLDSRLAIAVGIREGEKMVLQQIDGIFEQKELELDQLEYYQERRLKDL 428

Query: 483 GLVGEQGDII 492
           GL GE GDI+
Sbjct: 429 GLCGENGDIL 438


>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
          Length = 440

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/430 (71%), Positives = 373/430 (86%), Gaps = 2/430 (0%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           V+TFW+WL+++ V++ K+P++ +   EGLGLVA +DI++N+V+L+VP + WINPD VAAS
Sbjct: 6   VQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDAVAAS 65

Query: 124 EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL 183
           EIG +CS LKPW+SV LFLIRE+ +EDS W+ Y  ILP+ TDST++WSEEEL ELQG+QL
Sbjct: 66  EIGRVCSELKPWLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQL 125

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
           L TT+ VKEYV+NE LK+E+EIILPNK+LFP P+TLDDF WAFGILRSRAFSRLR +NLV
Sbjct: 126 LKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLV 185

Query: 244 LIPLADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
           ++P+ADLINHS G+TTED+AYE+KG AGLFS D LFSL++P+ VKAGEQV IQYDLNKSN
Sbjct: 186 VVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSN 245

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP 362
           AELALDYGFIE   +R+AYTLTLEISESDPFF DKLD+AE+NG  ++AYFDI   RTLPP
Sbjct: 246 AELALDYGFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP 305

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +L YLRLVALGGTDAFLLES+FR+TIWGHL+L VS  NEEL+C+ VR+ACKSAL+G+HT
Sbjct: 306 GLLPYLRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHT 365

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDL 482
           TIE+D + L+ GNLD RL IAVG+R GEK VLQQID IF+++ELELD+LEYYQERRLKDL
Sbjct: 366 TIEQDRE-LKEGNLDSRLAIAVGIREGEKMVLQQIDGIFEQKELELDQLEYYQERRLKDL 424

Query: 483 GLVGEQGDII 492
           GL GE GDI+
Sbjct: 425 GLCGENGDIL 434


>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score =  648 bits (1671), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/465 (69%), Positives = 385/465 (82%), Gaps = 4/465 (0%)

Query: 29  TSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATF 88
            S   K SF    R  +A C     +  +  P   V+TFWQWL+++ VVS K+P++P+  
Sbjct: 19  NSFPSKGSFIHLKRPLSANCVASLGTEVSVSPA--VDTFWQWLKEEGVVSAKTPVKPSVV 76

Query: 89  PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK 148
           PEGLGLVA +DI++NEVVL+VP + WINPD VAASEIG +C GLKPW++VALFLIRE+ +
Sbjct: 77  PEGLGLVALKDISRNEVVLQVPKRLWINPDAVAASEIGKVCIGLKPWLAVALFLIRERSR 136

Query: 149 EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
            +S W+ Y  +LP+ TDST++WSEEEL ELQGTQLL+TT  VK+YV+NEY ++EEEIILP
Sbjct: 137 SNSLWKHYFSVLPKETDSTIYWSEEELSELQGTQLLNTTRSVKQYVENEYRRLEEEIILP 196

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKG 268
           NK+LFP P+TLDDF WAFGILRSRAFSRLR +NLV+IP AD INHS  +TTED+AYEIKG
Sbjct: 197 NKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDHAYEIKG 256

Query: 269 -AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
            AGLFS D LFSLR+P+ +KAG+QV IQYDLNKSNAELALDYGFIE  +DRNAYTLTL+I
Sbjct: 257 AAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNADRNAYTLTLQI 316

Query: 328 SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
           SESDPFFGDKLDIAE+NG GE+AYFDI   R LPP +L YLRLVALGGTDAFLLESIFRN
Sbjct: 317 SESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLVALGGTDAFLLESIFRN 376

Query: 388 TIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
           +IWGHL+LPVS  NEELICRVVR+ CK+AL+G+HTTIEED++L +   LD R  IAVG+R
Sbjct: 377 SIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQKLKE-AKLDSRHAIAVGIR 435

Query: 448 AGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDII 492
            GEK++LQQIDE FKE+ELELD+LEYYQERRLKDLGL GE GDI+
Sbjct: 436 EGEKQLLQQIDETFKEKELELDQLEYYQERRLKDLGLCGENGDIL 480


>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
 gi|223949395|gb|ACN28781.1| unknown [Zea mays]
 gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 489

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 291/447 (65%), Positives = 351/447 (78%), Gaps = 5/447 (1%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMK 112
           A+A+T    + +E F +WL           I PA  PEGLGLVA RD+ + EVV EVP K
Sbjct: 44  AAASTTATVSAMEDFRRWLASHGAGDGGKAI-PAAVPEGLGLVAARDLPRGEVVAEVPKK 102

Query: 113 FWINPDTVAASEIGSLCSG---LKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTV 168
            W++ D VAAS+IG  C G   L+PW++VAL L+ E  +  DSPW  YL ILP  TDST+
Sbjct: 103 LWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGADSPWAPYLAILPRQTDSTI 162

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGI 228
           FWSEEEL+E+QGTQLLSTT+GVKEYVQ+E+  V+ EII  NK LFP  IT DDFLWAFG+
Sbjct: 163 FWSEEELLEIQGTQLLSTTVGVKEYVQSEFDSVQAEIISTNKDLFPGSITFDDFLWAFGM 222

Query: 229 LRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
           LRSR F  LRG  L LIP ADL+NHSP IT+E  ++EIKG GLF R+L+FSLRTPV VK+
Sbjct: 223 LRSRVFPELRGDKLALIPFADLVNHSPNITSEGSSWEIKGKGLFGRELMFSLRTPVNVKS 282

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGE 348
           G+Q+ IQYDL+KSNAELALDYGF+ES   R+++T+TLEISESDPF+GDKLDIAE NGLGE
Sbjct: 283 GQQIYIQYDLDKSNAELALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGE 342

Query: 349 SAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRV 408
           +AYFD++L   LPP ML YLRL+ +GGTDAFLLE++FRN++WGHL+LP+S  NEE IC+ 
Sbjct: 343 TAYFDVILNEPLPPQMLPYLRLLCIGGTDAFLLEALFRNSVWGHLELPLSPDNEESICQA 402

Query: 409 VRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELEL 468
           +RDACKSAL+ +HTTIEEDE+L    NL PRL IA+GVRAGEK+VLQ ID IFK+RE EL
Sbjct: 403 MRDACKSALADYHTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVLQHIDGIFKQREEEL 462

Query: 469 DELEYYQERRLKDLGLVGEQGDIIFWE 495
           D LEYYQERRLKDLGLV + G+IIFWE
Sbjct: 463 DGLEYYQERRLKDLGLVSDNGEIIFWE 489


>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
          Length = 649

 Score =  591 bits (1524), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 294/449 (65%), Positives = 356/449 (79%), Gaps = 7/449 (1%)

Query: 54  SATTNPPT---AQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEV 109
           S+T   PT   A ++ F +W+      +      PA  PE GLGLVA RD+ + EV+ EV
Sbjct: 200 SSTAVAPTTKDAALQEFRRWVSSHGSDAGSGAAAPAAVPEVGLGLVAARDLPRGEVLAEV 259

Query: 110 PMKFWINPDTVAASEIGSLCS--GLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDS 166
           P K W++ D VAAS++G      GL+PW++VAL L+RE  +   SPW  YL ILP  TDS
Sbjct: 260 PKKLWLDADAVAASDLGGAVGRGGLRPWVAVALLLLREAARGAGSPWAPYLAILPRQTDS 319

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAF 226
           T+FWSEEEL+E+QGTQLLSTT+GVKEYVQ+E+  VE EII  N++LFP  +T +DFLWAF
Sbjct: 320 TIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFESVEAEIISENRELFPGTVTFNDFLWAF 379

Query: 227 GILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPV 286
           GILRSR F+ LRG  L LIP ADL+NHS  IT+++ ++EIKG GLF RD++FSLRTPV V
Sbjct: 380 GILRSRVFAELRGDKLALIPFADLVNHSDDITSKESSWEIKGKGLFGRDVVFSLRTPVNV 439

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           K+GEQ+ IQYDL+KSNAELALDYGF ES S R+AYTLTLEISESDPF+ DKLDIAE NG+
Sbjct: 440 KSGEQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGM 499

Query: 347 GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
           GE+AYFDIVLG +LPP ML YLRL+ LGGTDAFLLE++FRN +WGHL+LPVS  NEE IC
Sbjct: 500 GETAYFDIVLGESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAIC 559

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKEREL 466
           +V+R+ACKSAL  +HTTIEEDE+LL   NL PRL+IAV VRAGEK+VLQQID+IFK+RE 
Sbjct: 560 QVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQQIDDIFKQREE 619

Query: 467 ELDELEYYQERRLKDLGLVGEQGDIIFWE 495
           ELD LEYYQERRLKD+GLVG+ G+IIFWE
Sbjct: 620 ELDGLEYYQERRLKDIGLVGDNGEIIFWE 648


>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Oryza sativa Japonica Group]
 gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  587 bits (1512), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 289/436 (66%), Positives = 350/436 (80%), Gaps = 4/436 (0%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEG-LGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++ F +W+      +      PA  PEG LGLVA RD+ + EV+ EVP K W++ D VAA
Sbjct: 59  LQEFRRWVSSHGADAGAGAAAPAAVPEGGLGLVAARDLPRGEVLAEVPKKLWLDADAVAA 118

Query: 123 SEIGSLCS--GLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           S++G      GL+PW++VAL L+RE  +   SPW  YL ILP  TDST+FWSEEEL+E+Q
Sbjct: 119 SDLGGAVGRGGLRPWVAVALLLLREAARGAGSPWAPYLAILPRQTDSTIFWSEEELLEIQ 178

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG 239
           GTQLLSTT+GVKEYVQ+E+  VE EII  N++LFP  +T +DFLWAFGILRSR F+ LRG
Sbjct: 179 GTQLLSTTMGVKEYVQSEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELRG 238

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
             L LIP ADL+NHS  IT+++ ++EIKG GLF RD++FSLRTPV VK+GEQ+ IQYDL+
Sbjct: 239 DKLALIPFADLVNHSDDITSKESSWEIKGKGLFGRDVVFSLRTPVNVKSGEQIYIQYDLD 298

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT 359
           KSNAELALDYGF ES S R+AYTLTLEISESDPF+ DKLDIAE NG+GE+AYFDIVLG +
Sbjct: 299 KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLGES 358

Query: 360 LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
           LPP ML YLRL+ LGGTDAFLLE++FRN +WGHL+LPVS  NEE IC+V+R+ACKSAL  
Sbjct: 359 LPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSALGA 418

Query: 420 FHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
           +HTTIEEDE+LL   NL PRL+IAV VRAGEK+VLQQID+IFK+RE ELD LEYYQERRL
Sbjct: 419 YHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQQIDDIFKQREEELDGLEYYQERRL 478

Query: 480 KDLGLVGEQGDIIFWE 495
           KD+GLVG+ G+IIFWE
Sbjct: 479 KDIGLVGDNGEIIFWE 494


>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 492

 Score =  580 bits (1496), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 296/485 (61%), Positives = 368/485 (75%), Gaps = 7/485 (1%)

Query: 15  PPTFVSSSKTLKLLTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQ 74
           PP  +S      LL  L+++P   L     A   +  +A+A + P  A ++ F +W+  Q
Sbjct: 10  PPQLLSGHGPPPLL-RLSRRPRPLLHLLPRATAATTGTAAAVSTPSDAALQNFRRWISSQ 68

Query: 75  KVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC-SG-- 131
              +  +   P   PEGLGLVA R++ + EVV EVP K W++ D VAAS+IG  C SG  
Sbjct: 69  GADTGAA--SPTVVPEGLGLVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGD 126

Query: 132 LKPWISVALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV 190
           L+PW+SV+L ++RE  +  DS W  YL ILP  TDST+FWSEEEL+E+QGTQLLSTT+GV
Sbjct: 127 LRPWVSVSLLILREAARGGDSLWAPYLAILPRQTDSTIFWSEEELLEIQGTQLLSTTMGV 186

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADL 250
           KEYVQ+E+  VE +II PNK LFP  IT DDFLWAFGILRSR F  LRG  L LIP ADL
Sbjct: 187 KEYVQSEFDNVEAKIIGPNKDLFPDTITFDDFLWAFGILRSRVFPELRGDKLALIPFADL 246

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
           INHS  IT++   +EI+G G   RD++FSLRTP+ VK+GEQV +QYDL+KSNAELALDYG
Sbjct: 247 INHSADITSKQSCWEIQGKGFLGRDVVFSLRTPMEVKSGEQVYVQYDLDKSNAELALDYG 306

Query: 311 FIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRL 370
           F E+ S R++YTLTLEISESDPF+GDKLDIAE NG+GE+AYFD+VLG +LPP M+ YLRL
Sbjct: 307 FTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRL 366

Query: 371 VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
           + LGGTDAFLLE++FRN +WG L+LPVS  NEE IC+V++ ACKSAL+ +HTTIEEDE+L
Sbjct: 367 LCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALTAYHTTIEEDEEL 426

Query: 431 LQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGD 490
           L+  +L  R +IAV VRAGEK+VLQQI++IFKERE ELD+LEYYQERRLKDLG +G+ GD
Sbjct: 427 LKREDLQSRHQIAVEVRAGEKKVLQQINDIFKEREQELDDLEYYQERRLKDLGFLGDNGD 486

Query: 491 IIFWE 495
           IIFWE
Sbjct: 487 IIFWE 491


>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
 gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
          Length = 489

 Score =  541 bits (1393), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 278/437 (63%), Positives = 337/437 (77%), Gaps = 7/437 (1%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           +E F +WL    V        PA  PEGLGLVA RD+ + EVV EVP K W++ D VAAS
Sbjct: 54  LEDFRRWLASHSVGDGGKTF-PAAVPEGLGLVAARDLPRGEVVAEVPKKLWMDADAVAAS 112

Query: 124 EIGSLCSG----LKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
           +IG  C G    L+PW++VAL L+ E  +  DSPW  YL ILP  TDST+F +  +   L
Sbjct: 113 DIGRACGGGGGGLRPWVAVALLLLSEVARGADSPWAPYLAILPRQTDSTIFCAGLKKSSL 172

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR 238
           +  +LLSTT+GVKEYVQ+E+  V+ EII  NK LFP  IT DDFLWAFGILRSR F  LR
Sbjct: 173 R-YKLLSTTVGVKEYVQSEFDSVQAEIISRNKDLFPGSITFDDFLWAFGILRSRVFPELR 231

Query: 239 GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
           G  L L+P ADL+NHSP IT+E  ++EIKG GLF R+ +FSLRTPV VK+G+Q+ IQYDL
Sbjct: 232 GDKLALVPFADLVNHSPDITSEGSSWEIKGKGLFGREPMFSLRTPVDVKSGQQIYIQYDL 291

Query: 299 NKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
           +KSNAELALDYGF+ES   R++YT+TLEISESDPF+GDKLDIAE N LGE+AYFDI+L  
Sbjct: 292 DKSNAELALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIILDE 351

Query: 359 TLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALS 418
            LPP ML YLRL+ +GGTDAF+LE++FRN++WGHL+LP+S  NEE IC+V+RDACKSAL+
Sbjct: 352 PLPPQMLPYLRLLCIGGTDAFILEALFRNSVWGHLELPLSPDNEESICQVMRDACKSALA 411

Query: 419 GFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERR 478
            +HTTIEEDE+L +  NL PRL IA+GVRAGEK+VLQ ID IFK+RE ELD LEYYQERR
Sbjct: 412 AYHTTIEEDEELSERENLQPRLTIAIGVRAGEKKVLQHIDNIFKQREQELDGLEYYQERR 471

Query: 479 LKDLGLVGEQGDIIFWE 495
           LKDLGLV + G+IIFWE
Sbjct: 472 LKDLGLVSDNGEIIFWE 488


>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  540 bits (1390), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 270/411 (65%), Positives = 331/411 (80%), Gaps = 4/411 (0%)

Query: 89  PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG---LKPWISVALFLIRE 145
           P  LGLVA+R++ + EVV EVP K W++ D VAAS +G +C     L+PW+SV+L ++RE
Sbjct: 38  PPVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILRE 97

Query: 146 KKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
             +  DS W  YL ILP  TDST+FWSEEEL+E+QGTQLLSTT+GVKEYVQ+E+  VE  
Sbjct: 98  AARGGDSLWAPYLAILPRQTDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVEAG 157

Query: 205 IILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
           II  NK LFP  IT DDFLWAFG+LRSR F  LRG  L LIP ADLINH+  IT+++  +
Sbjct: 158 IINVNKDLFPGTITFDDFLWAFGVLRSRVFPELRGDKLALIPFADLINHNGDITSKESCW 217

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
           EIKG G   RD +FSLRTPV VK+GEQ+ +QYDL+KSNAELALDYGF ES S R++YTLT
Sbjct: 218 EIKGKGFLGRDTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLT 277

Query: 325 LEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESI 384
           LEISESDPF+ DKLDIAE NG+GE+AYFD+VLG +LPP M+ YLRL+ LGGTDAFLLE++
Sbjct: 278 LEISESDPFYEDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRLLCLGGTDAFLLEAL 337

Query: 385 FRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAV 444
           FRN +W HL+LPVS  NEE IC+V+++ACKSAL+ +HTTIEEDE+LL+  +L  R +IAV
Sbjct: 338 FRNKVWEHLELPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQIAV 397

Query: 445 GVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWE 495
            VR GEK+VL+QI++IFKERE ELD+LEYYQERRLKDLG +G+ GDIIFWE
Sbjct: 398 EVRVGEKKVLEQINDIFKEREQELDDLEYYQERRLKDLGFIGDNGDIIFWE 448


>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 453

 Score =  528 bits (1361), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 264/403 (65%), Positives = 324/403 (80%), Gaps = 4/403 (0%)

Query: 97  QRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG---LKPWISVALFLIREKKKE-DSP 152
           +R++ + EVV EVP K W++ D VAAS +G +C     L+PW+SV+L ++RE  +  DS 
Sbjct: 50  ERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSL 109

Query: 153 WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQL 212
           W  YL ILP  TDST+FWSEEEL+E+QGTQLLSTT+GVKEYVQ+E+  VE  II  NK L
Sbjct: 110 WAPYLAILPRQTDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVEAGIINVNKDL 169

Query: 213 FPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
           FP  IT DDFLWAFG+LRSR F  LRG  L LIP ADLINH   IT+++  +EIKG G  
Sbjct: 170 FPGTITFDDFLWAFGVLRSRVFPELRGDKLALIPFADLINHDGDITSKESCWEIKGKGFL 229

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
            RD +FSLRTPV VK+GEQ+ +QYDL+KSNAELALDYGF ES S R++YTLTLEISESDP
Sbjct: 230 GRDTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLTLEISESDP 289

Query: 333 FFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGH 392
           F+ DKLDIAE NG+GE+AYFD+VLG +LPP M+ YLRL+ LGGTDAFLLE++FRN +W H
Sbjct: 290 FYEDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEH 349

Query: 393 LDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKR 452
           L+LPVS  NEE IC+V+++ACKSAL+ +HTTIEEDE+LL+  +L  R +IAV VR GEK+
Sbjct: 350 LELPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKK 409

Query: 453 VLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWE 495
           VL+QI++IFKERE ELD+LEYYQERRLKDLG +G+ GDIIFWE
Sbjct: 410 VLEQINDIFKEREQELDDLEYYQERRLKDLGFIGDNGDIIFWE 452


>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
          Length = 350

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 203/331 (61%), Positives = 251/331 (75%), Gaps = 5/331 (1%)

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILR 230
           SEEEL EL+GTQLL++TLG K+YVQ+E+LKV+EEII PN+ LF   IT  DFLWAFGILR
Sbjct: 21  SEEELAELKGTQLLNSTLGYKDYVQSEFLKVQEEIIDPNRHLFDSDITQKDFLWAFGILR 80

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           SRAF    G NL ++P ADL+NH   I  E+ ++E K  GLF R    ++R P   + GE
Sbjct: 81  SRAFPPFIGDNLAMVPFADLVNHGFSINVEEPSWERKVTGLFDRQEALTMRAPAAFRTGE 140

Query: 291 QVLIQYDLNKSNAELALDYGFIE----SKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           QVL+QY +NKSN +LALDYGF+E    + S+R+ +TLTLEISESDPFF DKLDIAE NG+
Sbjct: 141 QVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIAELNGM 200

Query: 347 GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
             +AYFDI  G+ +P +ML +LRL+ALGGTDAFLLE +FR+++W HL LPVS  NE  IC
Sbjct: 201 ETTAYFDITQGQGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLSLPVSQENEAAIC 260

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKEREL 466
           +VV D C+S LSG+ TTIEEDE LL G  LD RL IA   R GEK+VLQ+I + F+E  +
Sbjct: 261 KVVLDGCQSTLSGYGTTIEEDEALL-GRKLDARLEIAAVTRLGEKQVLQEIQKSFEEYIV 319

Query: 467 ELDELEYYQERRLKDLGLVGEQGDIIFWEPK 497
             D+LEYYQERRLK+LGL+GE G+IIFWEPK
Sbjct: 320 NRDQLEYYQERRLKNLGLMGEIGEIIFWEPK 350


>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
 gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
          Length = 439

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 198/407 (48%), Positives = 280/407 (68%), Gaps = 3/407 (0%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKED 150
           GLGL A+R ++KN+ V+ +P   W++ DTV  SEIG  C GL+PWI+VAL+L+ EK K  
Sbjct: 20  GLGLFAKRSVSKNQEVVSIPKTLWMDADTVRRSEIGECCEGLRPWIAVALYLLHEKAKPH 79

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S W  Y+ +LP   DS +FWSEEEL EL+GTQLLS+  G KE+++ EY KV  E+I P  
Sbjct: 80  SDWSAYIRVLPRTLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMTEVIEPRP 139

Query: 211 QLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
            +F R + TL+ F WAFGILRSR F  L G NL L+PLAD +NH  G+T ED  +++K A
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDPGWKVKSA 199

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES--KSDRNAYTLTLEI 327
           G+F+R    +L+        ++VLIQY   K NA+LA DYGF++S  K++R+++TLTL++
Sbjct: 200 GVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLTLQV 259

Query: 328 SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
           S S+ F  DK+DIA+  GL  +AYF++   +  P  M+ YLRL+AL G+D+FLLE++FRN
Sbjct: 260 SLSERFADDKVDIAQMAGLDSTAYFNLYRNQGPPEDMIAYLRLIALFGSDSFLLEALFRN 319

Query: 388 TIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
           T+W HL LP+S  NEE IC  + + C++ L  + +TI+ED  LL    L  R ++AV VR
Sbjct: 320 TVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKMAVVVR 379

Query: 448 AGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFW 494
            GEKR+LQ+  + F+    +LD LE+Y ERRL+DLGL+ ++G++  W
Sbjct: 380 LGEKRILQEQLQWFETHLADLDRLEFYAERRLRDLGLLDDKGNMTPW 426


>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
 gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
          Length = 430

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 197/407 (48%), Positives = 280/407 (68%), Gaps = 3/407 (0%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKED 150
           GLGL A+R ++KN+ V+ +P   W++ DTV  SEIG  C+GL+PWI+VAL+L+ EK K  
Sbjct: 20  GLGLFAKRSVSKNQEVVSIPKTLWMDVDTVRRSEIGECCAGLRPWIAVALYLLHEKAKPH 79

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S W  Y+ +LP   DS +FWSEEEL EL+GTQLLS+  G KE+++ EY KV  E+I P  
Sbjct: 80  SDWSAYIRVLPRTLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMTEVIEPRP 139

Query: 211 QLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
            +F R + TL+ F WAFGILRSR F  L G NL L+PLAD +NH  G+T ED  + +K A
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDPYWHVKSA 199

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES--KSDRNAYTLTLEI 327
           G+F+R    +L+        ++VL+QY   K NA+LA DYGF++S  K++R+++TLTL++
Sbjct: 200 GVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLTLQV 259

Query: 328 SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
           S S+ F  DK+DIA+  GL  +AYF++   +  P  M+ YLRL+AL G+D+FLLE++FRN
Sbjct: 260 SLSERFADDKVDIAQMAGLDSTAYFNLYRNQGPPEDMIAYLRLIALFGSDSFLLEALFRN 319

Query: 388 TIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
           T+W HL LP+S  NEE IC  + + C++ L  + +TI+ED  LL    L  R ++AV VR
Sbjct: 320 TVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKMAVVVR 379

Query: 448 AGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFW 494
            GEKR+LQ+  + F+    +LD LE+Y ERRL+DLGL+ ++G++  W
Sbjct: 380 LGEKRILQEQLQWFETHLADLDRLEFYAERRLRDLGLLDDKGNMTPW 426


>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 212/448 (47%), Positives = 280/448 (62%), Gaps = 18/448 (4%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           AQV+T W W +   +      ++PA   EGLGL+AQR +   + +L VP   WIN   V 
Sbjct: 41  AQVQTIWSWAQSHGI--QGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ 98

Query: 122 ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
            S +G  C GLKPW++VALFLI E     S WR YLD LP+  DS +FWS+EEL EL GT
Sbjct: 99  NSSLGKACEGLKPWVAVALFLIHESSNPSSKWRPYLDSLPKSLDSPLFWSDEELAELVGT 158

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQ 240
           QLL +  G  E+++NEY  + EE++ PN ++F P   T D F WAFGILRSR FS L G+
Sbjct: 159 QLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVYTFDGFKWAFGILRSRTFSPLTGE 218

Query: 241 NLVLIPLADLINHSPGITTEDYAYEIKGAGLF------SRDLLFSLRTPVPVKAGEQVLI 294
           ++ L+P+ADL+NH  G+     ++  KG   F      S DLL ++R      AGEQVL+
Sbjct: 219 DIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLL-TVRASANFSAGEQVLM 277

Query: 295 QYDLNKSNAELALDYGFIESKS--------DRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           QY   KSNA+LALDYGF+E           +R++  L+LEIS  D F  DK DI E NG 
Sbjct: 278 QYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADILEINGF 337

Query: 347 GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
             S  FD+  G+     M+ +LRL AL G D+FLLE++FRN  WGH+ LPVS  NEE +C
Sbjct: 338 QCSMQFDLSRGQGPSDEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVSRDNEEALC 397

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKEREL 466
             + +  K+AL G+ TT+E+D +LL  G+L  R+ IAV VR GEKRV+Q++   F+ R  
Sbjct: 398 TSMLEGLKAALDGYSTTVEQDMELLARGDLSTRMEIAVVVRLGEKRVMQELQTWFEARLE 457

Query: 467 ELDELEYYQERRLKDLGLVGEQGDIIFW 494
           ELD LEYY ERRL++LGL+ E G +  W
Sbjct: 458 ELDTLEYYAERRLRNLGLMDEGGYMTPW 485


>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
 gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
          Length = 206

 Score =  331 bits (849), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 157/205 (76%), Positives = 181/205 (88%)

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
           Q+ IQYDL+KSNAELALDYGF ES S R+AYTLTLEISESDPF+ DKLDIAE NG+GE+A
Sbjct: 1   QIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETA 60

Query: 351 YFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVR 410
           YFDIVLG +LPP ML YLRL+ LGGTDAFLLE++FRN +WGHL+LPVS  NEE IC+V+R
Sbjct: 61  YFDIVLGESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIR 120

Query: 411 DACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDE 470
           +ACKSAL  +HTTIEEDE+LL   NL PRL+IAV VRAGEK+VLQQID+IFK+RE ELD 
Sbjct: 121 NACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQQIDDIFKQREEELDG 180

Query: 471 LEYYQERRLKDLGLVGEQGDIIFWE 495
           LEYYQERRLKD+GLVG+ G+IIFWE
Sbjct: 181 LEYYQERRLKDIGLVGDNGEIIFWE 205


>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
 gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
          Length = 509

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 172/481 (35%), Positives = 247/481 (51%), Gaps = 36/481 (7%)

Query: 35  PSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKV-VSPKSP--IRPATFPEG 91
           P  + + RA A   S+ S +       A  +  W WL  + V VS  SP  +  A    G
Sbjct: 35  PLSQRRPRAVAVDASVDSRT------QADFDALWAWLGSEGVDVSKVSPALVDAAPGGRG 88

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KED 150
            GLVA  DI   + VL +P   W+  DT  AS IG+ C     WI+VAL L+ E+   E 
Sbjct: 89  WGLVAAEDIGGGDAVLAIPRSLWMTVDTALASPIGAHCGDEAGWIAVALQLLHERSIGEK 148

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S W  Y++ LP   D+ +FWS EE+  L GTQLL    G   Y +  + +++E     N 
Sbjct: 149 SRWAAYVNALPAQLDAPLFWSAEEVATLTGTQLLDAAAGYDSYARGTWARLKESAFDANP 208

Query: 211 QLFPRPITLDD--FLWAFGILRSRAFSRL-RGQNLVLIPLADLINHSPGITTEDYAYE-- 265
            +FP     D+  FLWAFGILRSR  + + +G ++ L+P  D+ NHS G++++ +     
Sbjct: 209 DVFPSD-AFDEPSFLWAFGILRSRCQAPVDQGADIALVPGLDMANHS-GLSSQTWTLNNG 266

Query: 266 --------IKGAGLFSRDLLFSLRTPVPVKA----GEQVLIQYDLNKSNAELALDYGFIE 313
                    K  G         LRT    K     G +V + Y   K + +LALDYGF +
Sbjct: 267 GVAAVFGGGKSGGSML------LRTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYGFTD 320

Query: 314 SKSDRNAYTLT-LEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVA 372
           + + R  Y L  + I ESDP   DK+D+ E  GL E+  F +       P +  ++RL+ 
Sbjct: 321 AFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFVLRAFEDPEPELRVFMRLLN 380

Query: 373 LGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
           L G DAFLLE+IFR   WG +  PVS  NE+  C  + + C+ AL G+ T +E+D ++ +
Sbjct: 381 LKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMINGCEEALRGYATRVEDDRRVAE 440

Query: 433 GGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDII 492
              +  RLR+A  VR GEK+ L      F   E  LD +EYYQERRL+ L L+ + G+  
Sbjct: 441 DPGVGHRLRLAARVRMGEKQALADALGFFSGVEARLDSMEYYQERRLRSLNLLDKDGNST 500

Query: 493 F 493
           +
Sbjct: 501 Y 501


>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
           variabilis]
          Length = 398

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 156/398 (39%), Positives = 225/398 (56%), Gaps = 7/398 (1%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKED-SP 152
           LV  + + K E +  VP   WI  DT   S+IGS  +GL+ W+++ALFL+ E+   + S 
Sbjct: 1   LVCSKAVNKGEQLFAVPEAAWITADTAQQSQIGSHLTGLESWLAIALFLLHERAMGNASR 60

Query: 153 WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQL 212
           W  Y+ +LP  + S V W E +L ELQG+Q+L T  G + Y Q  + +++ E+  PN Q 
Sbjct: 61  WAPYIALLPADSGSPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEVFGPNSQA 120

Query: 213 F-PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIK-GAG 270
           F P     D FLWA   +R+RA   L G N+ L+PLAD++   P    +   +++K   G
Sbjct: 121 FDPIVFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWPPDSAGWQLKQTGG 180

Query: 271 LFSRDLLFSL--RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS 328
           LF      +L       + AG+ + + +   KS+ +L +D+G I+   ++ +Y LTLE+S
Sbjct: 181 LFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPLVNQPSYALTLELS 240

Query: 329 ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNT 388
           + D  + DK DI E N L ES    +   R     +L  LRL+ L GTDAFLLESIFRN 
Sbjct: 241 KEDRNYDDKADILELNELAESTEHILRADRAPDAGLLPVLRLLNLSGTDAFLLESIFRNE 300

Query: 389 IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP--RLRIAVGV 446
           +W H+ LPVS  NE    + + D C +AL+ + T+I+ED  L+  G+L P  R + AV V
Sbjct: 301 VWEHMQLPVSEDNERGCYQQLIDGCTAALAAYPTSIDEDLALMASGSLQPGSRRQSAVRV 360

Query: 447 RAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGL 484
           R GEK  L      F+ R   L+ +EYY ERRLK LGL
Sbjct: 361 RLGEKEALDATLRYFEGRISRLENMEYYAERRLKRLGL 398


>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 503

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 166/459 (36%), Positives = 257/459 (55%), Gaps = 24/459 (5%)

Query: 47  TCSLHS-ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEV 105
            CS  + A   T PP +       W+  + +   K  +RP      L LV  +   K + 
Sbjct: 40  VCSAQAQADVETLPPLSA------WVEQRGLPLKKLNVRPEIVEGDLCLVVSKPTKKGQP 93

Query: 106 VLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD 165
           ++ VP   W+    V +S IGSL   L+PW+ +ALFL+ E+ K D+ W+ +LD +P   D
Sbjct: 94  LVAVPSSAWLTQQVVRSSSIGSLVEDLEPWLQIALFLLHERSKPDAAWQGFLDSIPAAPD 153

Query: 166 STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLW 224
             +FWSEEEL +L+GTQLLS+  G +++ + +Y ++EE++  P+++ FP +   LDDFLW
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEEQLFAPHREAFPPKSHQLDDFLW 213

Query: 225 AFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPV 284
           A   +RSR  S L G+++ L+PLADL+ H   +    +  ++ G GLFS+     +    
Sbjct: 214 AVATVRSRVHSPLDGEDVALVPLADLVQHRK-LQGARWQLQLAG-GLFSKAQALVVEAQR 271

Query: 285 PVKAGEQVLIQY-------DLNKSNAELALDYGFIES---KSD----RNAYTLTLEISES 330
               GE V + +       D  K ++++ LDYG +++   ++D    +  + L+L + E 
Sbjct: 272 DYAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFILSLALPED 331

Query: 331 DPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIW 390
           D ++ DK DI E NGL E+A F +         +L +LRL+ L G DAFLLE +FRN  W
Sbjct: 332 DKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEPLFRNEAW 391

Query: 391 GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGE 450
           GH+  PVS ANE  +   + + C++AL G+ T+I++D + L+      RL  A+ VR GE
Sbjct: 392 GHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDTQPGTRLEKAILVRLGE 451

Query: 451 KRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQG 489
           K  L      F++R   L  LE+YQERRLK LGL+ + G
Sbjct: 452 KETLDATLAFFEDRYDRLATLEFYQERRLKRLGLLDDDG 490


>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
 gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
          Length = 484

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 145/408 (35%), Positives = 228/408 (55%), Gaps = 11/408 (2%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG--LKPWISVALFLIREK-KKED 150
           L+A  D  + +V+  VP   W++ ++V  + +G L +   L+PW+ +AL L+ ++     
Sbjct: 71  LIASTDAQQGDVLFSVPDSAWLSAESVKKAAVGKLAAAAGLEPWLQIALQLVADRFGSTK 130

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S    Y   +PE  D+ + WSE+EL ELQGTQ+L T  G   + ++ + +++  +   N 
Sbjct: 131 SELSAYAASIPEDLDTPLLWSEDELQELQGTQVLQTLGGYLTFFRSTFQQLQSGLFTSNP 190

Query: 211 QLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
             FP  I TL  FLWA   +RSR+   L G  + L PL +L++H     ++     ++ A
Sbjct: 191 AAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALAPLTELVSHRRAANSK---LSVRSA 247

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           GLF R  +  L     ++ GE + + Y   K +  + +DYG ++  S +  Y+LTL++ +
Sbjct: 248 GLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPD 307

Query: 330 SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTI 389
           SD F  DKLDI E+N L +S  +++         ML +LRL+ L G+DAFLLESIFRN +
Sbjct: 308 SDRFIDDKLDILESNDLPQSVVYNLTPDEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDV 367

Query: 390 WGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED--EQLLQGGNLD-PRLRIAVGV 446
           WG +  PVS  NEE +C  + +  ++AL G+ TTI++D  E   QG      R   A+ +
Sbjct: 368 WGFMQEPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAELRAQGSRAKGSRREAALLI 427

Query: 447 RAGEKRVLQQIDEIFKERE-LELDELEYYQERRLKDLGLVGEQGDIIF 493
           R GEK  L  +   F++R   +L  L YYQERRL+ LGLV ++G   +
Sbjct: 428 RLGEKEALDAVARFFEDRRATQLKRLVYYQERRLRRLGLVDDEGRTTY 475


>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 538

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 160/461 (34%), Positives = 228/461 (49%), Gaps = 36/461 (7%)

Query: 62  AQVETFWQWL-RDQKVVSPKSPIRPATFP--EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           A  +  W WL R+   V+  SP      P   G GLVA RD+   +  + VP   W+  +
Sbjct: 77  ADFDALWTWLEREGADVASVSPALVDATPGGRGWGLVATRDVGGGDAAIVVPRALWMTKE 136

Query: 119 TVAASEIGSLCS--GLKPWISVALFLIREKK-KEDSPWRVYLDILP--ECTDSTVFWSEE 173
           T  AS+IG+        PW ++AL L+ EK   +DS W  Y+  LP  E  D+ +FWS E
Sbjct: 137 TAFASKIGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVEALDAPLFWSSE 196

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP--------ITLDDFLWA 225
           EL EL GTQLL+   G   YV+  +  ++E     +  LF            +  +FLWA
Sbjct: 197 ELAELAGTQLLANAAGYDSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWA 256

Query: 226 FGILRSRAFSRL-RGQNLVLIPLADLINHSPGITTEDYAYEIKG-AGLFSRDLLFSLRTP 283
           FG+LRSRA   + +G+++ LIP  D+ NH  G+ ++ +     G A +F           
Sbjct: 257 FGVLRSRALPPVDQGESIALIPGIDMANHD-GLCSQTWQLNNGGIAAVFGGRGGADGGGS 315

Query: 284 V----------PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT-LEISESDP 332
           V            K GE++   Y     +++ ALDYGF+++   R  Y L  L I E D 
Sbjct: 316 VLLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFCSRPGYVLGPLSIPEDDV 375

Query: 333 FFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGH 392
              DK+D+    GL ES  F I      PP M+ ++RL+ L   DAFLLE+IFR   W  
Sbjct: 376 NAFDKMDVLSVAGLKESPAFTIRAFEDPPPEMVVFMRLLNLKNDDAFLLEAIFRQECWAL 435

Query: 393 LDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKR 452
           +  PVS  NE          C+ AL  + T IE+D  +    +  PRLR+A  VR GEK+
Sbjct: 436 ISDPVSPDNE------ADAGCEEALGAYATKIEDDRGVADDADASPRLRLAARVRMGEKQ 489

Query: 453 VLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIF 493
            L+++   +   E  LD  EYYQERRL+ L L+   G+  +
Sbjct: 490 ALEEVLGYYASVEARLDRTEYYQERRLRSLNLLDRDGNSTY 530


>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 475

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 139/407 (34%), Positives = 216/407 (53%), Gaps = 18/407 (4%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS--GLKPWISVALFLIREK-KKED 150
           LVA  D+   E ++ VP   W++   VA + +G L S  GL+PW+ +AL L+ E+     
Sbjct: 71  LVASADVQPGESLIVVPDAAWVSVPNVAKTTVGKLASSAGLEPWLQLALVLVAERFGSAK 130

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S    Y   LPE   + + WSEEE   L GTQ+  T      + ++ + +++  +   N 
Sbjct: 131 SELAGYASSLPEDLGTPLLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQAGLFTANP 190

Query: 211 QLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
             FP  + TL +F+WA   +RSR+   L G  + L PL DL++H     T+     ++ +
Sbjct: 191 AAFPPAVFTLPNFVWAVAAVRSRSHPPLEGDKIALAPLVDLVSHRRAANTK---LSVRSS 247

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           GLF R  +  +     ++ GE + + Y   K +  + LDYG +++ S +  Y+LTL + E
Sbjct: 248 GLFGRGQVAVVEATRAIRKGEALGMDYAPGKLDGPVLLDYGVMDTASPKPGYSLTLTLDE 307

Query: 330 SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTI 389
           SD F  DK DI E  GL  S  + I   +     M+ +LRL+ +   DAFLLESIFRN  
Sbjct: 308 SDKFVDDKADIVEGAGLRPSMTYSITPDQQPGEEMMAFLRLMNIKAMDAFLLESIFRNE- 366

Query: 390 WGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN--LDPRLRIAVGVR 447
                  VS  NEE +C ++ +  ++AL+G+ TT+++D   L+  +  L  R   A+ VR
Sbjct: 367 -------VSEGNEEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVR 419

Query: 448 AGEKRVLQQIDEIFKEREL-ELDELEYYQERRLKDLGLVGEQGDIIF 493
            GEK  L  +   F++R   +L  L YYQERRL+ LGLV ++G   +
Sbjct: 420 LGEKESLDAVARFFEDRRATQLKRLVYYQERRLRRLGLVDDEGRTTY 466


>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
 gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
          Length = 529

 Score =  211 bits (538), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 140/432 (32%), Positives = 214/432 (49%), Gaps = 37/432 (8%)

Query: 96  AQRDIAKNEVVLEVPMKFWINPDT-VAASEIG------SLCSGLKPWISVALFLIREKKK 148
           A R +A+    + VP   WI P+  +   E+G       +  GL  W ++AL L++E+++
Sbjct: 93  ATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLALTLLKERER 152

Query: 149 -EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            E+S +  Y+  LPE   S +FW+ EEL E+QGTQLL    G   YV+  Y  +   +  
Sbjct: 153 GEESKYAAYVKTLPEVLHSPLFWNAEELSEIQGTQLLDNAAGYDGYVRGVYETLRTGMFA 212

Query: 208 PNKQLFP--RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
            +  +F      + D+F WAFGILRSR  +   G N+ L+P  DL+NHS   +     + 
Sbjct: 213 KHADVFDVEGAFSEDNFRWAFGILRSRTMAPCDGANIALVPGVDLVNHS---SLSQARWR 269

Query: 266 IKG------AGLFSR---DLLFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFIES 314
           + G      AGLF     D   S R      +   E + + Y+   ++   ALD+GF+++
Sbjct: 270 VSGGVAGAVAGLFGGGKGDDGVSARVECDRALNVNEPLYVNYNPEGTDTSFALDFGFVDT 329

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG 374
            +    Y L+L + E DP   DKLD+ +  GLGE+  F +       P +  +LRL+   
Sbjct: 330 ITPSPGYALSLSVPEDDPNVFDKLDVLDVCGLGETPTFTLRAYSDPDPDLRTFLRLLNCK 389

Query: 375 GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT-TIEEDEQLLQ- 432
             DAFLLE++FR   W  +  P+S  NE   C  + D   +ALS + +  ++E++  L  
Sbjct: 390 NQDAFLLEALFRQQCWSLISEPLSRENEADCCASMTDGVAAALSAYASRALDEEKAYLMS 449

Query: 433 ---------GGNLDPRLR--IAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKD 481
                    G +   R+R  +AV VR  EK  L +    F      LD +EYYQERRL+ 
Sbjct: 450 PPSARRAAAGDDERARVRKDVAVRVRLAEKSTLIETASFFDVIASGLDGMEYYQERRLRS 509

Query: 482 LGLVGEQGDIIF 493
           L L+ E G   +
Sbjct: 510 LNLLDEDGSSTY 521


>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
 gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
          Length = 588

 Score =  197 bits (502), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 135/422 (31%), Positives = 205/422 (48%), Gaps = 32/422 (7%)

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK---------PWISVALFL 142
            G+ A+  + +    + +P + W+  D   A+E   + + L+         PW+ VAL L
Sbjct: 75  FGVRAKTTLRRGTRAMVIPREVWM--DATRATEDADVGAALRDARYDAVKQPWVRVALLL 132

Query: 143 IREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           ++E+++  D  +  Y+  LP+  DS +FWS +EL ++ GTQLL    G   YV+  Y ++
Sbjct: 133 LKERERGADGEFAAYVATLPKTLDSPLFWSADELRDIAGTQLLDNAAGYDAYVRAVYEEL 192

Query: 202 EEEIILPNKQLFPRPITLDD--FLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITT 259
           +  + +     F      D+  F WAFGILRSR  + L G N+ L+P  DLINHS  ++ 
Sbjct: 193 KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGANVALVPGLDLINHS-SLSG 251

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKA------GEQVLIQYDLNKSNAELALDYGFIE 313
             +     G                 V+       G ++ + YD    +++ ALDYGFI+
Sbjct: 252 ARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEIFVNYDPEGIDSKFALDYGFID 311

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
             +    Y LTL I E D    DKLD+ ET GL E+  F +         +  +LRL+  
Sbjct: 312 VVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTFTLRPYSDPDRELRTFLRLLHC 371

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT-TIEEDEQLL- 431
             TDAFLLE++FR   W  +  P+S  NE   C  +      ALS + T ++++++  L 
Sbjct: 372 KDTDAFLLEALFRQQCWSLISEPLSRENEADCCASITRGVADALSAYETRSLDQEKAFLT 431

Query: 432 ------QGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLV 485
                 QGG    R  IAV  R  EK  L +    F      LD+LEYYQERRL+ L L+
Sbjct: 432 TPPNERQGG---IRQEIAVRARLSEKSALLEASAFFDVVASSLDKLEYYQERRLRSLNLL 488

Query: 486 GE 487
            +
Sbjct: 489 ND 490


>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 426

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 122/428 (28%), Positives = 205/428 (47%), Gaps = 32/428 (7%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           V T  +WL+    V     IR     EG G+ A++ +   +V L +P K  +N ++ A S
Sbjct: 7   VTTLLEWLKANGGVDNLLDIRYLGKLEGHGVFAKQALTSGQVTLRIPFKLTMNIESAARS 66

Query: 124 EIGSLCSGLKPWIS----VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
           ++  +     P I     +AL L+ E+ K+ DS +  ++  LP   D  VFWSE EL EL
Sbjct: 67  DLARVLEKY-PQIPDDEVLALHLMHERSKRSDSFFAPFIASLPTTFDLPVFWSESELNEL 125

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR--PITLDDFLWAFGILRSRAFSR 236
           +GT +L  T  +K+ +Q ++  + + ++    ++F     +TL+D+ WA  ++ SRAF  
Sbjct: 126 KGTNVLLLTQLMKQQLQRDFENIHQAVVEDFPEVFALLPTLTLEDYTWAMSVIWSRAFGV 185

Query: 237 LRGQNL--VLIPLADLINH--SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
            R +    VL P  D+ NH  S  I  +D+    +   + +         P  V AG  +
Sbjct: 186 TREKKYLRVLCPAMDMFNHDVSLRILLDDFVSFDEETQMLTH------HVPKEVAAGSAL 239

Query: 293 LIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYF 352
            I Y    SNA+L   YGF+  ++ R A    ++I  +DP+   K  + ++N L     +
Sbjct: 240 QISYG-QYSNAKLLFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTY 298

Query: 353 DI---VLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
           D    +    +   +L  LR++ +   +  L +  F  +I       +S  NE  +   +
Sbjct: 299 DFCGTLFENDVDERLLATLRVILMNEQEIRLYKKAFETSI-------ISIRNELAVYENL 351

Query: 410 RDACKSALSGFHTTIEEDEQLL--QGGNLDPRLRIAVGVRAGEKRVLQQ-IDEIFKEREL 466
           ++ C+  L+ F TT+EEDE +L        PRL  AV VR  +K+VL   ID + K +++
Sbjct: 352 QNTCRRKLANFATTLEEDEAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQV 411

Query: 467 ELDELEYY 474
               LE Y
Sbjct: 412 LASNLEMY 419


>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 446

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 177/368 (48%), Gaps = 31/368 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----VALFLIREK 146
           GLGLVA   I + E +  VP+    + D+V AS +G +    +P +     +AL L+ E 
Sbjct: 86  GLGLVATAPIKQGETLATVPLNLCFSMDSVRASPLGKVIGEFEPALGDASLIALQLLYEA 145

Query: 147 K-KEDSPWRVYLDILP----ECTDSTVFWSEEELVELQGTQLLSTTLGVKE----YVQNE 197
                S + VY+  LP    +  D  +FWS  E    QG    S+T  + E     V  +
Sbjct: 146 HMGPKSKYAVYIKSLPRPGQDGFDHPLFWSTAE----QGVLAKSSTRNLGETLIDAVAED 201

Query: 198 YLKVEEEIILPN-KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           Y  ++  +       L      L DF WA  ++ SR+F    G  L L PL D+ N   G
Sbjct: 202 YGWIQSALARGGISGLQADSFDLSDFEWAVAVVLSRSFFAENG--LRLAPLLDMANRGEG 259

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            T E     +   G     ++    T      G++++I Y   KS  E   D+GF+    
Sbjct: 260 CTNEPQIGGLGIFGGKGLKVIADRDT----DKGQEIVISYG-PKSGIEFLEDHGFVPPPL 314

Query: 317 DRNAY-----TLTLEIS-ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRL 370
           + NA      +LT +IS E D F+ DK D+  T GL  +  FD+     + P MLQ+LR 
Sbjct: 315 EGNALVGGMCSLTFKISPEGDRFYDDKEDVMGTLGLPMAFSFDVRSDGDVDPEMLQFLRF 374

Query: 371 VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
           + LGG DAFLLE++FRN +W  ++LPVS ANE+ +  V+ DAC  AL+ F       ++L
Sbjct: 375 LKLGGKDAFLLEAVFRNEVWRFMELPVSEANEQAVDEVIIDACTQALANFAEAEVLTDEL 434

Query: 431 LQGGNLDP 438
           +Q   + P
Sbjct: 435 IQKEGVSP 442


>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 452

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 177/368 (48%), Gaps = 31/368 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----VALFLIREK 146
           GLGLVA   I + E +  VP+    + D+V AS +G +    +P +     +AL L+ E 
Sbjct: 92  GLGLVATAPIKQGETLATVPLNLCFSMDSVRASPLGKVIGEFEPALGDASLIALQLLYEA 151

Query: 147 K-KEDSPWRVYLDILP----ECTDSTVFWSEEELVELQGTQLLSTTLGVKE----YVQNE 197
                S + VY+  LP    +  D  +FWS  E    QG    S+T  + E     V  +
Sbjct: 152 HMGPKSKYAVYIKSLPRPGQDGFDHPLFWSTAE----QGVLAKSSTRNLGETLIDAVAED 207

Query: 198 YLKVEEEIILPN-KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           Y  ++  +       L      L DF WA  ++ SR+F    G  L L PL D+ N   G
Sbjct: 208 YGWIQSALARGGISGLQADSFDLSDFEWAVAVVLSRSFFAENG--LRLAPLLDMANRGEG 265

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            T E     +   G     ++    T      G++++I Y   KS  E   D+GF+    
Sbjct: 266 CTNEPQIGGLGIFGGKGLKVIADRDT----DKGQEIVISYG-PKSGIEFLEDHGFVPPPL 320

Query: 317 DRNAY-----TLTLEIS-ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRL 370
           + NA      +LT +IS E D F+ DK D+  T GL  +  FD+     + P MLQ+LR 
Sbjct: 321 EGNALVGGMCSLTFKISPEGDRFYDDKEDVMGTLGLPMAFSFDVRSDGDVDPEMLQFLRF 380

Query: 371 VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
           + LGG DAFLLE++FRN +W  ++LPVS ANE+ +  V+ DAC  AL+ F       ++L
Sbjct: 381 LKLGGKDAFLLEAVFRNEVWRFMELPVSEANEQAVDEVIIDACTQALANFAEAEVLTDEL 440

Query: 431 LQGGNLDP 438
           +Q   + P
Sbjct: 441 IQKEGVSP 448


>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloropl [Ectocarpus
           siliculosus]
          Length = 460

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 134/430 (31%), Positives = 200/430 (46%), Gaps = 34/430 (7%)

Query: 71  LRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS 130
           L D  V++ +SP+         GLVA + I   + VL +P    +    + +S I     
Sbjct: 26  LTDNAVLAGRSPLAGER-----GLVAAKAIETGQSVLAIPQSLGLTATGLKSSGIAQYVE 80

Query: 131 GLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTD--STVFWSEEELVELQGTQL 183
           G + W      +AL ++ E+ + E S    ++ +LP+  +    +FW E +L        
Sbjct: 81  GFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKEGELEMPLFWGEADLTLADA--- 137

Query: 184 LSTTLGVKEYVQN---EYLKVEEEIILPNKQLFPR-PITLDDFLWAFGILRSRAFSRLRG 239
            S+T G+  +V +   ++  + E     + ++FP       DF WA G+  SR+F     
Sbjct: 138 -SSTRGISGFVADVDEDFAWLSENAFAKHPKVFPADKFGPGDFRWAVGVALSRSF--FVD 194

Query: 240 QNLVLIPLADLINHSP--GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
             L L PL D  NHS   G++           GLF    +  LR     + GE+  + Y 
Sbjct: 195 GELRLTPLVDFANHSSLRGVSEPTGGT----TGLFGSKAVV-LRAGKNYEEGEEFFVSYG 249

Query: 298 LNKSNAELALDYGFIESKSDRNAYT-LTLEISESDPFFGDKLDIAETNGLGESAYFDIVL 356
             K  A    + GF+   S       L   I E D FF DK DI E  GL  S+ FD+  
Sbjct: 250 -PKGAAGYLEENGFVPPVSGSEVTCELEFSIPEDDKFFDDKEDILERAGLRTSSTFDLT- 307

Query: 357 GRTLPPA-MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
              LP A ++++LRL+ + G DAFLLE IFRN +W  ++ PVS  NEE +  ++   C+ 
Sbjct: 308 AVGLPDAELVRFLRLLCVSGDDAFLLEGIFRNEVWDFMNEPVSRPNEEAVNELLATRCEE 367

Query: 416 ALSGFHTTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYY 474
            L  F  T +E+E ++ G      R R+  GVR GE+  L+      +     LD  EYY
Sbjct: 368 ELKAFFGTAKEEEGIVSGKAEASERQRLCAGVRQGERMALEMTRSWCESDSKALDRKEYY 427

Query: 475 QERRLKDLGL 484
           QERRLKDL L
Sbjct: 428 QERRLKDLQL 437


>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
          Length = 430

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 113/418 (27%), Positives = 196/418 (46%), Gaps = 31/418 (7%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           A V    +WL           IR     EG G+ A+R +   +V L+VP K  +N ++ A
Sbjct: 5   AVVTRLLEWLEAHGAADSLLDIRYLGKLEGHGVFAKRALTSGQVTLQVPFKLTMNTESAA 64

Query: 122 ASEIGSLCSGLKPWIS----VALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELV 176
            S++  +     P I     +AL L+ E+ K  +S +  ++  +P   D  VFW+E EL 
Sbjct: 65  TSDLAPVLEKY-PQIPDDEVLALHLMHERSKGGESFFAPFIASMPTTFDLPVFWTEAELN 123

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR--PITLDDFLWAFGILRSRAF 234
           EL+GT +L  T  +K++++ ++  + + +      +F     +T+DD++WA  ++ SRAF
Sbjct: 124 ELKGTNVLLLTQLMKQHLERDFENIHQAVAADFPDIFASLPTLTIDDYMWAMSVIWSRAF 183

Query: 235 SRLRGQNL--VLIPLADLINHSPGITT--EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
              +G     VL P  D+ NH   +    +D+              + +   P  V AG 
Sbjct: 184 GVSKGGKYLHVLCPAMDMFNHDVTVRKPLDDF------VSFNEEKQMMTHHVPEDVAAGS 237

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
            V I Y    SNA+L   YGF+  ++ R      ++I  SDP+F  K  + ++N L +  
Sbjct: 238 AVHISYG-QYSNAKLLYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQ 296

Query: 351 YFDI---VLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICR 407
            +D    +L   +   +L  LR++ +   +    +  F ++I       +S  NE  +  
Sbjct: 297 TYDFHGTLLSNDVDERLLATLRVILMNEQEIRQYKKAFESSI-------LSVRNELAVYE 349

Query: 408 VVRDACKSALSGFHTTIEEDEQLLQGGNLD--PRLRIAVGVRAGEKRVLQQIDEIFKE 463
            ++  C+  LS + TT+EEDE +L     +  PRL  AV VR  +K+V   + E  ++
Sbjct: 350 NLQSTCRRKLSNYATTLEEDEAILAETETESKPRLAFAVRVRMEDKQVTTSVIETLEQ 407


>gi|397563943|gb|EJK44003.1| hypothetical protein THAOC_37500 [Thalassiosira oceanica]
          Length = 595

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 141/477 (29%), Positives = 218/477 (45%), Gaps = 97/477 (20%)

Query: 92  LGLVAQRDIAKNEVVLEVPMK------FWINPDTVAASEIGS-LCSGLKPWIS----VAL 140
           LGLVA    +K ++V  +P          + P+  A       L  G   W      +A+
Sbjct: 128 LGLVATGSASKGDLVASLPFNEGDGSGLALEPNFAAKIVYDKVLPEGYDGWTGDVGLLAM 187

Query: 141 FLIREKKKED--------------------SPWRVYLDILPECTD-STVFWSE--EELVE 177
            L+ E  ++D                    S W   L    E T+   + W+E  +E+++
Sbjct: 188 LLLNEMARQDADSPKGVDLPRRKEGVQALMSAWIASLPSFEEMTEMHPLLWNEDEQEIMQ 247

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI--------------TLDDFL 223
           L  T+ +   L   + ++++   ++E +   +++ FP  +              T+D F 
Sbjct: 248 LSSTKKIYRLL---DDIEDDSSWLDENVWSADREKFPETVKIRVGDADEERQCFTVDGFR 304

Query: 224 WAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY-AYEIKGAGL-----FSRDLL 277
           +A  ++RSR+F       L L+P  D  NH      +DY + E+ G G+      S+ +L
Sbjct: 305 YAVAVVRSRSF--FVDGALRLLPYVDYANH------DDYDSNELVGGGIGMLWGSSKGVL 356

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI----ESKSDRNAYT--LTLEISESD 331
             L++   ++ G++V I Y   K  A+  LD+GF+    +  +   A T  L+ E+ ESD
Sbjct: 357 --LKSGKALRVGDEVRISYG-PKGPADYILDHGFVPPMCQLSTQGGAITAELSFEVDESD 413

Query: 332 PFFGDKLDIAETNG-----LGESAYFDIVLGRTLP----PAMLQYLRLVALGGTDAFLLE 382
            F  DKLDI E        +  +  FD+V G        PAM+Q+LRLV L G DAFLLE
Sbjct: 414 RFRDDKLDILEFETYDLAPMEPAQVFDVVGGPGSTGEPDPAMIQFLRLVKLAGKDAFLLE 473

Query: 383 SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL-- 440
           SIFR  IWG +  PVS  NE+     V++AC++ALS  +    ED       +  P L  
Sbjct: 474 SIFRKEIWGFMSEPVSEDNEKASVTAVQEACEAALSDMNELESED------SSASPSLNE 527

Query: 441 RIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWEPK 497
           ++   VR  E+  L++     K+    LD  EYYQERRLK LGL  E      W P+
Sbjct: 528 QLCSMVRGSEREALERTLAYIKQEAEALDLKEYYQERRLKTLGLDSE------WTPE 578


>gi|224112199|ref|XP_002316117.1| predicted protein [Populus trichocarpa]
 gi|222865157|gb|EEF02288.1| predicted protein [Populus trichocarpa]
          Length = 92

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/91 (84%), Positives = 83/91 (91%)

Query: 321 YTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFL 380
           YTLTL ISESDPFFGDKLDIAETNGLGE+A FDIV G  LPPA+L YLRLV LGGTDAFL
Sbjct: 2   YTLTLRISESDPFFGDKLDIAETNGLGETADFDIVSGNPLPPALLPYLRLVTLGGTDAFL 61

Query: 381 LESIFRNTIWGHLDLPVSHANEELICRVVRD 411
           LESI+RNTIWGHL++PVS +NEELICRVVRD
Sbjct: 62  LESIYRNTIWGHLEVPVSRSNEELICRVVRD 92


>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Ectocarpus siliculosus]
          Length = 521

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 116/397 (29%), Positives = 200/397 (50%), Gaps = 40/397 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS--LCSGLKPWISVALFLIREKKK 148
           G G++A R+I + + +  +P+   +  D  A  E G+  +   L  +I++AL  + EK K
Sbjct: 132 GRGMIANREIKEGDELFTLPIDLLLTKD-AAKKEFGADVITEDLSEYIAIALLAVHEKAK 190

Query: 149 -EDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
            ++S W  Y+ +LP   +   T  W+EE+L  L+G+ +++ T  ++  ++ EY  VE ++
Sbjct: 191 GKESFWSSYIGVLPTVEEVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVENDL 250

Query: 206 ILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRL----RGQNLVLIPLADLINHSPGITTE 260
           +    ++ PR + T ++F WAF +L SRA  RL     G+ + L+P ADL NH+P   + 
Sbjct: 251 LDKFPEILPREVHTYEEFQWAFAMLFSRAI-RLGGLSTGEAVALVPYADLFNHNPFANS- 308

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
            Y    +    FS+     +      K  EQV I Y   K N++L L YGF   ++  N+
Sbjct: 309 -YIDARQQGLFFSKTDEVVVYADRSYKKMEQVYISYG-PKGNSDLLLLYGFSLDRNPYNS 366

Query: 321 YTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAF- 379
             +T+ + E+D  +  K       GL  +  F +   R  P  +LQYLRL+ L  TD   
Sbjct: 367 VDVTVSLDENDELYERKKAFLSEAGLPPTKAFPLYNDR-YPDELLQYLRLIQL-NTDQLR 424

Query: 380 --LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLD 437
              LE +         +   +  NE ++   + +ACK+ ++G+ TT E+D +L+     D
Sbjct: 425 GRTLEDL-------SFEKKQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKLMN----D 473

Query: 438 PRL--------RIAVGVRAGEKRVLQQ-IDEIFKERE 465
           P +        R+AV  R  EK +L++ I  + K++E
Sbjct: 474 PGMFRALSKTQRMAVKHRRQEKVILRRTIAAVTKDKE 510


>gi|224112201|ref|XP_002316118.1| predicted protein [Populus trichocarpa]
 gi|222865158|gb|EEF02289.1| predicted protein [Populus trichocarpa]
          Length = 137

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 70/131 (53%), Positives = 89/131 (67%), Gaps = 8/131 (6%)

Query: 35  PSFRLKSRAFAATCSLHSASATTNPPTAQ-----VETFWQWLRDQKVVSPKSPIRPATFP 89
           P F  +S +     +LH+  + TN  ++      V+TFWQWL D  VVS K+P +P   P
Sbjct: 5   PFFHQESISSLQETTLHNLFSVTNTTSSAAAAPVVQTFWQWLSDLGVVSSKTPAKPGVVP 64

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKE 149
           +GLGLVAQRDI++NEVVLE+P K WIN D V+ASEI ++CSG KPW+SVALFL REK  E
Sbjct: 65  QGLGLVAQRDISRNEVVLEIPKKLWINLDVVSASEIRNVCSGAKPWVSVALFLTREKLNE 124

Query: 150 DSPWRVYLDIL 160
           DS    YL+ L
Sbjct: 125 DS---TYLEAL 132


>gi|219110715|ref|XP_002177109.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411644|gb|EEC51572.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 531

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 114/357 (31%), Positives = 163/357 (45%), Gaps = 45/357 (12%)

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI--- 217
           PE  +  + WSEE+   LQ +   +    V + ++ +   ++  +   +   FP  I   
Sbjct: 178 PEDINHPLLWSEEDQEILQSSST-NRIYRVLDDIEEDVTWLKTNVFEKDGNRFPVSIPWN 236

Query: 218 -------TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAG 270
                  +L  F WA  + +SR+F       + L+PL D  NH+   T E  A  +   G
Sbjct: 237 GEEIPCFSLTGFKWAMALAQSRSF--FVDNAVRLLPLMDFCNHADEGTEEARAGFM---G 291

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT-LTLEISE 329
            F       L      + GE+V I Y   KS A+  L++ F   +S + A + L  E+  
Sbjct: 292 TFGTTKGAELVAGQSYEVGEEVFICYG-PKSAADYLLEHAFCPEQSWKTAVSELFFEVDP 350

Query: 330 SDPFFGDKLDIAETNGLGESAY-----FDIVL--GRTLPPA--MLQYLRLVALGGTDAFL 380
            D F+ DKLDI E      S       FD+V   GR   P   M+Q++RL  LG TDAFL
Sbjct: 351 KDRFYDDKLDILEFETYDASPMDPVQSFDVVSAPGRDGEPDPFMVQFVRLCKLGATDAFL 410

Query: 381 LESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL 440
           LESIFR  +WG ++LPVS  NE  +   + +AC+ AL  F    E       GG      
Sbjct: 411 LESIFRKEVWGFMELPVSETNERDVVDAIMEACQLALDDFSKCAE-------GGP----- 458

Query: 441 RIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIFWEPK 497
            I   +R  E + L +  +  +     LD  EYYQ+RRLKDLGL  E      W P+
Sbjct: 459 EICSKLRESESQALIRTRDFLRRDREALDLKEYYQQRRLKDLGLDSE------WSPE 509


>gi|224001788|ref|XP_002290566.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973988|gb|EED92318.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 595

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 104/304 (34%), Positives = 152/304 (50%), Gaps = 38/304 (12%)

Query: 215 RP-ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY-AYEIKGAG-- 270
           RP  + + F +A  ++RSR+F      +L L+P  D  NH      +D+ + E+ G G  
Sbjct: 293 RPCFSQEGFRYAVSLVRSRSF--FVDGSLRLLPYLDFANH------DDFDSLELVGGGIG 344

Query: 271 -LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT------- 322
            L+       +++   ++ G+++ I Y   K  A+  LD+GF+       + T       
Sbjct: 345 TLWGSAKGALMKSGKALEVGDEIRISYG-PKGPADYLLDHGFVPPMCQTTSGTGGAITAE 403

Query: 323 LTLEISESDPFFGDKLDIAETNGLGESAY-----FDIVLGRTLP----PAMLQYLRLVAL 373
           L+ EI +SD F  DKLD+ E      +       FD+  G        PAM+Q+LRL+ L
Sbjct: 404 LSFEIDDSDRFRDDKLDVLEYETYDLAPMEPLQVFDVTGGPGSTGEPDPAMIQFLRLINL 463

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
           GG DAFLLESIFR  IWG +  PVS ANE L    V +AC+ ALS     +  D      
Sbjct: 464 GGKDAFLLESIFRKEIWGFMSEPVSEANERLAVMAVVEACEKALSEMEGPVGGDIT---- 519

Query: 434 GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGE---QGD 490
            +L  ++R+   VR  E++ L++     K+    LD  EYYQERRLK LGL  E   + D
Sbjct: 520 -SLSEKMRLCNMVRESERQALERTLIYVKQEAEALDLKEYYQERRLKSLGLDSEWSPEND 578

Query: 491 IIFW 494
            + W
Sbjct: 579 DVGW 582


>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Galdieria sulphuraria]
          Length = 490

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 119/418 (28%), Positives = 200/418 (47%), Gaps = 39/418 (9%)

Query: 61  TAQVETFWQWLRDQKV-VSPKS-----PIRPATFPE--------GLGLVAQRDIAKNEVV 106
           ++++  F+ WL++  V +S K+     P R     E        G GL++ R +   E V
Sbjct: 77  SSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEETKDEGEYSGRGLLSSRSVNLGEKV 136

Query: 107 LEVPMKFWINPDTVAASEIGSLCSGLK-PWISVALFLIREKKKE-DSPWRVYLDILPECT 164
           LE+P K          +   S+ + ++  ++S+ L L+ EK K  DS ++ YLDILP   
Sbjct: 137 LEIPEKLMFTRKLALETFPTSIIASIEDEYVSIGLLLLYEKAKGFDSFFKPYLDILPTLD 196

Query: 165 --DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF 222
             +    WS ++L  LQG+  LS    +++ +  EY  + + II        +PI    F
Sbjct: 197 ELNPLFLWSNKDLDLLQGSPTLSACEQLRDKLLREYTYLGKNIIPQIPNFASKPIDFKQF 256

Query: 223 LWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
            WAFGIL SRA      + + L+P ADL+NHSP  +      +I      +  +++  R 
Sbjct: 257 QWAFGILFSRAICFPSSKRIALVPYADLLNHSPFCSAFIDEEKIPFGNGVTEAVVYVDRL 316

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAE 342
             P    EQV + Y   +SN EL L YGF   ++  +   +T+ + ++DP + +K  + E
Sbjct: 317 YEPY---EQVYVSYG-PRSNQELLLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLE 372

Query: 343 TNGLGESAYFDIVLGRTLPPAMLQYLRLVALGG-TDAFLLESIFRNTIWGHLDLPVSHAN 401
           + G      F + + R  P  M ++LR   +   TD   L++ F  TI       VS +N
Sbjct: 373 SYGKSPLQSFPLYMDR-YPVEMAEFLRFCCIDTETD---LQADF-GTI-------VSASN 420

Query: 402 EELICRVVRDACKSALSGFHTTIEEDEQLLQG----GNLDPRLRIAVGVRAGEKRVLQ 455
           EE     + +     L  + T++E+DE++++       L+   R+A+  R GEKR+L 
Sbjct: 421 EESALDKLLNYIVDQLRKYPTSLEDDEKIIRDRAMFQTLEKNQRMAIRQRLGEKRILH 478


>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
           africana]
          Length = 597

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 116/391 (29%), Positives = 182/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERANPNSFWQPYIQTLPSEYDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+G +L  R  +AV +R GEK +L++
Sbjct: 447 EDKAFLKGHDLSIRATMAVKLRLGEKEILER 477


>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
          Length = 1658

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 191/389 (49%), Gaps = 39/389 (10%)

Query: 59   PPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
            P       F +WL D  V  PK  I       G G+V  + + +NE V+ VP KF IN D
Sbjct: 1177 PTDVVYRKFEKWLTDGGVHFPKLQIANFNDSTGRGVVTTKKVEENECVVSVPRKFLINVD 1236

Query: 119  T-----VAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEE 173
                  V  S +    +GL     + LF+I EK+  +S WR + D LP    +++ ++  
Sbjct: 1237 CARKHPVLNSILFEEATGLNDDTILFLFVIYEKENPNSFWRPFFDTLPSYFPTSIHYTTT 1296

Query: 174  ELVELQGTQLLSTTLGVKEYVQN--EYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILR 230
            EL+EL+GT L   T+ +KE++++  E L  E     P+  +FP  + T+++FLWA  +  
Sbjct: 1297 ELLELEGTNLFEETIQIKEHLESIRELLFPELSNQYPD--VFPESLFTMENFLWARSLFD 1354

Query: 231  SRAFS-RLRGQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
            SRA   ++ G+ +  L+P+AD+INH       D A +I        +  F + +   + A
Sbjct: 1355 SRAIQLKIDGRIVNCLVPMADMINH------HDQA-QISQRYFDQENDCFRMISCCNIPA 1407

Query: 289  GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP--FFGDKLDIAETNGL 346
              Q+ +QY   +S  ELAL YGF+ S +  ++  +  ++ E D      +K  + + + L
Sbjct: 1408 TSQIFLQYGALQS-WELALYYGFVISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLL 1466

Query: 347  GESAYFDIVLGRT-LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL--PVSHANEE 403
                ++   L R+ +P  +L  LR VAL   D F            H+D+  P+S +NEE
Sbjct: 1467 TVDHHY---LHRSNIPSKLLASLR-VALLAEDEF----------NPHVDVWNPISRSNEE 1512

Query: 404  LICRVVRDACKSALSGFHTTIEEDEQLLQ 432
            ++   +       L  F +T +ED+QLL+
Sbjct: 1513 VVLYTLYSTVLMLLKQFSSTCDEDQQLLE 1541


>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
           porcellus]
          Length = 789

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 117/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK-----PWISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSILGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W  Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERANPNSFWLPYIQTLPSEYDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRA-------FSRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+G  L  R+++AV +R GEK +L++
Sbjct: 447 EDKAVLKGPELPTRMKMAVKLRLGEKEILER 477


>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
          Length = 589

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+ +LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHYTDPPVSAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIF------RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +       R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLRNHDLSVRATMAIKLRLGEKEILEK 477


>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 591

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+ +LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTDPPVSAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIF------RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +       R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLKNHDLSVRATMAIKLRLGEKEILEK 477


>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
          Length = 585

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+ +LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTDPPVSAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIF------RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +       R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLKNHDLSVRATMAIKLRLGEKEILEK 477


>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
          Length = 585

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+ +LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPVSAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIF------RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +       R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLKDHDLSVRAAMAIKLRLGEKEILEK 477


>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 588

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+ +LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHYTDPPVSAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIF------RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +       R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLRNHDLSVRATMAIKLRLGEKEILEK 477


>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Callithrix jacchus]
          Length = 610

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 119 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 178

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 179 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKV 238

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 239 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 295

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 296 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 345

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 346 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 403

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 404 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 461

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 462 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 492


>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
          Length = 597

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 106 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 165

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 166 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKV 225

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 226 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 282

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 283 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 332

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 333 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 390

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 391 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 448

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 449 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 479


>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Galdieria sulphuraria]
          Length = 546

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 125/485 (25%), Positives = 220/485 (45%), Gaps = 53/485 (10%)

Query: 29  TSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATF 88
            S T +P F    R    + +     AT    T ++E    WL D  V S K   +P   
Sbjct: 41  VSKTMRPQFFYVLRKKIRSSTWWRIKATFLEKTEELEN---WLFDNGVPSIKG--KPVLS 95

Query: 89  PEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVA-LFLIREK 146
           P       A+  +   E VL +P +FW+    ++   +G   S L    ++A L L+   
Sbjct: 96  PHNCRTFRAKIPLKLGEEVLAIPERFWLTKQ-LSEKLLGFHVSDLSDEEAIAALLLVETA 154

Query: 147 KKEDSPWRVYLDILPECTDSTVF--WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           +KE S W+ +++ LP   +   F  WS  E   L+ +      L ++E        V EE
Sbjct: 155 RKETSFWKPWIETLPSSDELHHFLVWSTAETQYLESSSTFEDILSLRETAS----LVFEE 210

Query: 205 IILPNKQLFPRPI---------TLDDFLWAFGILRSRA-FSRLRGQNLVLIPLADLINHS 254
           +   N +LFP+ +         TL  F WA  I++S   +  +    LV++P  + + + 
Sbjct: 211 L---NTELFPKFLYPQYDVKYFTLPYFTWALSIVQSFGLYDIMDSCPLVIVPGLEWLTYK 267

Query: 255 PGITTEDYAYE----------IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
             + TE+  +           I+    F+++    +     +K GE V + Y+ N S  +
Sbjct: 268 YSLITEESFFRQYFHISNVSLIRVGPFFTQERRLKITASEDLKVGEPVSLVYEGNVSLID 327

Query: 305 ------LALDYGFIESKS--DRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIV- 355
                   LD G ++ +      +Y ++  ++ +D FF DK DI +   L     F++  
Sbjct: 328 TFCRWGWKLDLGALDEEQLLKMGSYEISFAVTTTDQFFDDKEDILDAQRLELLQTFELRY 387

Query: 356 -LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACK 414
            + + L   +L +LRL+ L   D+F+LES+FR+ +W HL LP S+ NE+ +C +V   C+
Sbjct: 388 DMSKELLQRILPFLRLICLKDKDSFILESVFRSEVWSHLQLPFSYNNEKAVCELVIQTCE 447

Query: 415 SALSGFHTTIEEDEQLLQGGNLDP---RLRIAVGVRAGEKRVLQQIDEIFKERELELDEL 471
            +L  +H    E   +++ G  D    R ++   ++  E+ VL++  E F+     L+++
Sbjct: 448 ESLERWHPVSLE---IIERGMKDSPEIRKKMVSIIKWMEEAVLKRTIEYFERYLAGLEQI 504

Query: 472 EYYQE 476
           EYYQE
Sbjct: 505 EYYQE 509


>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
 gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
 gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
 gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
          Length = 594

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNHDLSVRAKMAIKLRLGEKEILEK 477


>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
 gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=Endothelial differentiation inhibitory protein D10;
           AltName: Full=SET domain-containing protein 3
 gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
 gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
 gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
 gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
 gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
 gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
          Length = 594

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 199/433 (45%), Gaps = 52/433 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E   L  Y+E  
Sbjct: 447 EDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAVNREYYRKHMEERAPLPRYEE-- 504

Query: 479 LKDLGLV-GEQGD 490
             DLGL+ G  GD
Sbjct: 505 -SDLGLLEGGVGD 516


>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
 gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
 gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
          Length = 595

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 194/421 (46%), Gaps = 48/421 (11%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERANPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R ++A+ +R GEK +L++         E ++++  E   L  Y+E  
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAAVNREFYRQQMEEKAPLPKYEESN 506

Query: 479 L 479
           L
Sbjct: 507 L 507


>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
           gorilla gorilla]
          Length = 594

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
          Length = 597

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 199/433 (45%), Gaps = 52/433 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 107 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 166

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 167 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV 226

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 227 IQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 283

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE  +  GF   
Sbjct: 284 NGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEFVIHSGFFFD 333

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 334 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPISAQLLAFLRVF 391

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 392 CM--TEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 449

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E   L  Y+E  
Sbjct: 450 EDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAVNREYYRKHMEERAPLPRYEE-- 507

Query: 479 LKDLGLV-GEQGD 490
             DLGL+ G  GD
Sbjct: 508 -SDLGLLEGGVGD 519


>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           troglodytes]
 gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
           troglodytes]
 gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
           paniscus]
 gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           paniscus]
 gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
 gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
 gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
 gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
          Length = 594

 Score =  128 bits (321), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
          Length = 595

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 194/421 (46%), Gaps = 48/421 (11%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERANPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R ++A+ +R GEK +L++         E ++++  E   L  Y+E  
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAAVNREFYRQQMEEKAPLPKYEESN 506

Query: 479 L 479
           L
Sbjct: 507 L 507


>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
          Length = 512

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 22  EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 81

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 82  ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 141

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 142 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 198

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 199 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 248

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 249 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 306

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 307 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 364

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 365 EDKSVLKNHDLSVRAKMAIKLRLGEKEILEK 395


>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Nomascus leucogenys]
 gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Nomascus leucogenys]
          Length = 595

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Pongo abelii]
 gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Pongo abelii]
          Length = 595

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
           griseus]
 gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
          Length = 577

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 125/433 (28%), Positives = 199/433 (45%), Gaps = 52/433 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FQAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E   L  Y+E  
Sbjct: 447 EDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAMNREYYRKHMEERAPLPRYEE-- 504

Query: 479 LKDLGLV-GEQGD 490
             DLGL+ G  GD
Sbjct: 505 -SDLGLLEGGVGD 516


>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
 gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
          Length = 595

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERANPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
 gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
          Length = 595

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 118/407 (28%), Positives = 188/407 (46%), Gaps = 40/407 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L +  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYAQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYK- 222

Query: 205 IILPNKQLFPRPI----TLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINH 253
           +I  + Q    P+    T +D+ WA   + +R         SR+    L LIPL D+ NH
Sbjct: 223 VIQTHPQAHKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNH 279

Query: 254 SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           + G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF  
Sbjct: 280 TNGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFF 329

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYLRL 370
             +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+
Sbjct: 330 DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPVSAQLLAFLRV 387

Query: 371 VAL--GGTDAFLL--ESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE 426
             +  G     LL   +I R    G+ + PVS  NE  +   + D     L  + TTIEE
Sbjct: 388 FCMTEGELKEHLLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEE 447

Query: 427 DEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ-IDEIFKERELELDELE 472
           D+  L+   L  R  +AV +R GEK +L++ ++     RE    +LE
Sbjct: 448 DKTFLKNHGLSVRATMAVKLRLGEKEILEKAVESAAANREFYRRQLE 494


>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 477


>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
          Length = 597

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 106 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 165

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 166 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 225

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 226 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 282

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 283 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 332

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 333 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 390

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 391 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 448

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 449 EDKSVLKNQDLSVRAKMAIKLRLGEKEILEK 479


>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
          Length = 594

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 182/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+   PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 447 EDKSVLKNHDLSVRAKMAIKLRLGEKEILEK 477


>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
          Length = 805

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 180/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK-----PWISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 283 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 342

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 343 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 402

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRA-------FSRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 403 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 459

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 460 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 509

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 510 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 567

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TT+E
Sbjct: 568 CM--TEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVE 625

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED   L+  +L  R  +AV +R GEK +L++
Sbjct: 626 EDRSFLRNHDLSVRAAMAVKLRLGEKEILER 656


>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
 gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
          Length = 594

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 199/433 (45%), Gaps = 52/433 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAESVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E   L  Y+E  
Sbjct: 447 EDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAVNREYYRKHMEERAPLPRYEE-- 504

Query: 479 LKDLGLV-GEQGD 490
             DLGL+ G  GD
Sbjct: 505 -SDLGLLEGGVGD 516


>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
           boliviensis boliviensis]
          Length = 513

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 183/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 22  EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 81

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 82  ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 141

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 142 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 198

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 199 NGLITTGYNLED------DRCECVALQD---FQAGEQIYIFYG-TRSNAEFVIHSGFFFD 248

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 249 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 306

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 307 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 364

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R ++A+ +R GEK +L++
Sbjct: 365 EDKFVLKNQDLSVRAKMAIKLRLGEKEILEK 395


>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
          Length = 488

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 111 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 170

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 171 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 230

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   P+    P     T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 231 IQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 287

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 288 SGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 337

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 338 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 395

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 396 CM--TEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 453

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 454 EDKSFLKNHDLSARATMAIKLRLGEKEILER 484


>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 199/433 (45%), Gaps = 52/433 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSILGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERR 478
           ED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E   L  Y+E  
Sbjct: 447 EDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAMNREYYRKHMEERAPLPRYEE-- 504

Query: 479 LKDLGLV-GEQGD 490
             DLGL+ G  GD
Sbjct: 505 -SDLGLLEGGVGD 516


>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3 [Ovis aries]
          Length = 596

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 180/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 111 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 170

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y      
Sbjct: 171 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV 230

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   P+    P     T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 231 IQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 287

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 288 SGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 337

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 338 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 395

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 396 CM--TEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIE 453

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +AV +R GEK +L++
Sbjct: 454 EDKSFLKNHDLSARATMAVKLRLGEKEILER 484


>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
          Length = 601

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 111 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 170

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 171 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 230

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   P+    P     T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 231 IQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 287

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 288 SGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 337

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 338 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 395

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 396 CM--TEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 453

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 454 EDKSFLKNHDLSARATMAIKLRLGEKEILER 484


>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
          Length = 497

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 179/391 (45%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K     ++   S +G L S  +       I++A  L+ 
Sbjct: 103 EGFGLRATRDIKAEELFLWVPRKLLXTVESAKNSVLGPLYSQDRILQAXGNIALAFHLLC 162

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 163 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 222

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D  NH+
Sbjct: 223 IQTHPHANKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRV---TLALIPLWDXCNHT 279

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 280 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 329

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 330 NNSHDRVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 387

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
               T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 388 CX--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 445

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R + A+ +R GEK +L++
Sbjct: 446 EDKSVLKNHDLSVRAKXAIKLRLGEKEILEK 476


>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
          Length = 594

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/391 (28%), Positives = 181/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   P+    P     T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 SGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E +  ++        G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLKNHDLSARATMAIKLRLGEKEILER 477


>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
          Length = 585

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 180/394 (45%), Gaps = 47/394 (11%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 93  EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 152

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 153 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQYAYFYKV 212

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 213 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 269

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 270 NGLITTGYNLED------DRCECVALRD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 319

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L    PP    +L +LR+ 
Sbjct: 320 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFMEPPISAQLLAFLRVF 377

Query: 372 ALG---------GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +          G +A  ++ IF     G+ + PVS  NE  +   + D     L  + T
Sbjct: 378 CMTEEELKDHLLGDNA--IDKIF---TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKT 432

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           T EED+  L+  +L  R R+A+ +R GEK +L++
Sbjct: 433 TSEEDKSFLKNHDLSVRARMAIKLRLGEKEILEK 466


>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
          Length = 595

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 182/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQSLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKFVLKNHDLSVRATMAIKLRLGEKEILEK 477


>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
          Length = 591

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 180/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATREIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    +V
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR      AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FHAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELREHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+   L  R  +AV +R GEK +L++
Sbjct: 447 EDKAVLRSPALSARAAMAVKLRLGEKEILEK 477


>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
           novemcinctus]
          Length = 589

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 180/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSMLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  L  TQ +       +    +Y    KV
Sbjct: 164 ERANPNSFWQPYIQSLPGEYDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLKNHDLSVRATMAIKLRLGEKEILEK 477


>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
           ferrumequinum]
          Length = 594

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/391 (28%), Positives = 180/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FQAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + T IE
Sbjct: 389 CM--TEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKSFLKNHDLSVRATMAIKLRLGEKEILEK 477


>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
           garnettii]
          Length = 595

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 182/391 (46%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQSLPSEYDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+ +L+  +L  R  +A+ +R GEK +L++
Sbjct: 447 EDKFVLKNHDLSVRATMAIKLRLGEKEILEK 477


>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
          Length = 1234

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/384 (28%), Positives = 179/384 (46%), Gaps = 30/384 (7%)

Query: 59   PPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
            P  A    F +WL    V  PK  I       G G+V  + + +NEV+++VP KF IN  
Sbjct: 732  PSDAVYRKFEKWLASDGVQCPKLQIANFQDSTGRGIVTTKKVEENEVIIKVPRKFLINVQ 791

Query: 119  TVAASEI-GSLC---SGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
                  I G +    SGL     + LF+I EK+  +S WR + D LP    +++ ++  E
Sbjct: 792  VAREHPILGRIFEEFSGLNDDTILFLFVIYEKENPNSFWRPFFDTLPSYFPTSIHYTSTE 851

Query: 175  LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRA 233
            L+EL+GT L + TL VKE++Q+    +  E+      +FP  + + ++FLWA  +  SRA
Sbjct: 852  LLELEGTNLFAETLQVKEHLQSIRDMLFPELSEQYPTIFPESLFSWENFLWARSLFDSRA 911

Query: 234  FSRLRGQNLV---LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
              +L+  + +   L+P+AD+INH        +  +I        D  F + +   V    
Sbjct: 912  I-QLKIDDKITNCLVPMADMINHH-------HNAQISQRFFDQTDQCFKMVSCCSVPPNA 963

Query: 291  QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP--FFGDKLDIAETNGLGE 348
            Q+ + Y   + N ELAL YGF+   +  ++  +  ++ + D       K D+   N L  
Sbjct: 964  QIFLHYGALQ-NRELALYYGFVIQDNPYDSMLIGFDLPDEDTPEITKSKQDLLAANLLSI 1022

Query: 349  SAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRV 408
              ++  +    +PP +L  LR VAL   +    E      IW     P++  NEE +   
Sbjct: 1023 DTHY--LRKSNIPPKVLAALR-VALMTPE----ECNPYVDIWN----PITRRNEESMLHT 1071

Query: 409  VRDACKSALSGFHTTIEEDEQLLQ 432
            +       L  F ++ EED+QL +
Sbjct: 1072 LYSTVLMFLKQFSSSWEEDKQLCE 1095


>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
           caballus]
          Length = 609

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 179/389 (46%), Gaps = 37/389 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 118 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 177

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    +V
Sbjct: 178 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQYAYFYRV 237

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 238 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 294

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 295 TGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 344

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 345 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 402

Query: 372 ALGG---TDAFLLE-SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
            +      D  L + +I R    G+ + PVS  NE  +   + D     L  + TT+EED
Sbjct: 403 CMTEEELKDHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEED 462

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
              L+  +L  R  +A+ +R GEK +L++
Sbjct: 463 RAFLKNSDLSVRATMAIKLRLGEKEILEK 491


>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 575

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 128/421 (30%), Positives = 202/421 (47%), Gaps = 52/421 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS--LCSGLKPWISVALFLIREK-- 146
           G GL+A+RDI   + +L +PM   +   + A   +G   L S +  ++++A  LI E+  
Sbjct: 126 GRGLLARRDINDGDELLRIPMALCMT-KSAARKAVGKDVLPSEINEYLAMACHLIYERNV 184

Query: 147 KKEDSPWRVYLDILPEC--TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           + E+SPW+ YLD+LP+    + T  W +E+L  L G+ +++ T  ++  ++ EY   L  
Sbjct: 185 RGEESPWKPYLDVLPDIDEVNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDALLGG 244

Query: 202 EEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFSRLR----GQNLVLIPLADLINHSPG 256
           E+ ++      FP        + WAF +L SRA  RLR    G+ L L+P ADLINHSP 
Sbjct: 245 EDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAI-RLRSLKQGETLALVPYADLINHSPF 303

Query: 257 ITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
                 A +  G  LF S D    L      +  EQ+ I Y   KSNAEL L YGF   +
Sbjct: 304 SQAYIDARQ-NGDWLFKSGDEEVILYADRGYRRMEQIYISYG-PKSNAELLLLYGFAVER 361

Query: 316 SDRNAYTLTLEISESDPFFGDKLD--------IAETNG-----LGESAYFDI-VLGRTLP 361
           +  N+  +T+ I+     F  +LD        +AE        +G  A  D        P
Sbjct: 362 NPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQVGRDATVDFPCYADRYP 421

Query: 362 PAMLQYLRLVALGGTDAF---LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALS 418
             ML+YLRL+ +   D     L E  +  TI        S  NE  +   V  A    LS
Sbjct: 422 VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTI--------SLGNEAAVLTSVITAVSRQLS 473

Query: 419 GFHTTIEEDEQLLQGGNLDPRL----RIAVGVRAGEKRVLQQ----IDEIFKERELELDE 470
            +  + E+D  L++  +L   L    R+AV  R  EKR+L++    +++  +++ L++++
Sbjct: 474 NYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTIAALEKQLQQQGLDVED 533

Query: 471 L 471
           L
Sbjct: 534 L 534


>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Strongylocentrotus purpuratus]
          Length = 682

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 127/464 (27%), Positives = 217/464 (46%), Gaps = 57/464 (12%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWI-NPDTVAAS 123
           ETF++WL    V +    ++ A F EG GL A +DI  ++ ++ +P K  + + + V + 
Sbjct: 81  ETFFKWLNTNGVTT--DAVKMAKFDEGYGLQATQDIKMDQELMNIPRKVMMTDQNAVDSP 138

Query: 124 EIGSLCSGLK-----PWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
            IG L  G +     P +S+A+F++ EK K DS W+ YLD+LP      ++++ +E+   
Sbjct: 139 TIGDLVRGDRLLKGMPNVSLAIFILSEKLKSDSFWKPYLDVLPSSYSLPLYFTPDEIQLF 198

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILP-NKQLFPRP-ITLDDFLWAFGILRSRA--- 233
           QG+ +    L   + +  +Y  + + + LP N +L  R   T D + WA   + +R    
Sbjct: 199 QGSTMYGECLKQHKNIARQYAYLFKLLNLPENSKLHIREYFTYDFYRWAVSTVMTRQNQI 258

Query: 234 -FSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD--LLFSLRTPVPVKAGE 290
                +G +L LIPL D+ NH+ G    D+  E        RD  +  +LR       GE
Sbjct: 259 PAKDGKGMSLSLIPLWDMCNHANGEMKTDFIEE--------RDSCVNMALRD---FSVGE 307

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLG-ES 349
           Q+ I Y   +S+A+L L  GF+   +  +   + L +S SD  +  K  +     LG  S
Sbjct: 308 QIFICYG-RRSSADLLLYSGFVYPGNVYDGMAIQLGLSSSDRLYAMKAQLCSVMKLGVPS 366

Query: 350 AYFDIVLGRT-LPPAMLQYLRLVA----------LGGTDAFLLESIF-RNTIWGHLDLPV 397
             + I  G+  +   +L +LR+            LG   A  L S+  R+ I       +
Sbjct: 367 QNYHISAGKEPVTLELLTFLRIFCMQDLELRDRLLGDNRAQALFSLVDRSQI-------I 419

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           S  NE   C  +          + T+I+EDE+ L+ GNL  + R A+ +   EK  L+ +
Sbjct: 420 SKLNELRTCVYLATRVTLLQRQYKTSIQEDEEKLKDGNLSAQERSALQLLLIEKCTLENV 479

Query: 458 DE--IFKERELEL--DELEYYQERRLKDLGLVGEQGDIIFWEPK 497
            E   F   ++E   ++   ++ +R +D G+V E+     WE +
Sbjct: 480 LEYCTFWREKIEALPEDATAFEVKREEDEGVVVEK-----WEDR 518


>gi|412991387|emb|CCO16232.1| predicted protein [Bathycoccus prasinos]
          Length = 622

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 71/218 (32%), Positives = 113/218 (51%), Gaps = 10/218 (4%)

Query: 286 VKAGEQVLIQY------DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLD 339
           ++ GE++ + Y      D    + +  +DYGF++       Y L + I E+DP   DK+D
Sbjct: 397 IQPGEEIFMDYYASINFDERPLDGKTCVDYGFVDPVERNGGYELRIGIPENDPNRDDKID 456

Query: 340 IAETN----GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL 395
           I + +    G G+   F +       P +  + RL+ + G+DAFLLE++FRN  W  +  
Sbjct: 457 ICDVSQQIIGGGDDPTFYLKAYEDPDPNLRVFSRLLNIQGSDAFLLEALFRNNAWELISE 516

Query: 396 PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           PVS  NE   C  +   CK AL  + T+IE+DE +L   ++     +++ V  GEK+ L+
Sbjct: 517 PVSKENETQACETMIAGCKDALLQYETSIEQDEDMLSIQDISDEKLLSIRVLLGEKKALR 576

Query: 456 QIDEIFKERELELDELEYYQERRLKDLGLVGEQGDIIF 493
                F+     +D LEYYQERRL+ L L+ E G   +
Sbjct: 577 NTLRYFETIRASVDRLEYYQERRLRQLNLLDEDGSSTY 614



 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 86/178 (48%), Gaps = 23/178 (12%)

Query: 100 IAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK----------PWISVALFLIRE-KKK 148
           I KNE +L+V    W+  +       G     LK           W+ ++++L+ E +K 
Sbjct: 156 IKKNESILKVGDSVWMTAEKAREDADGKCGKILKRLAAQGEAAPAWVELSVYLVCELEKG 215

Query: 149 EDSPWRVYLDILPECT--DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           E S +  YL  L E T  +S +FWS E++  + G+QLL    G   YV+  Y    E + 
Sbjct: 216 ESSFYAPYLSYLREATVLESPLFWSTEDVNAIAGSQLLDDAAGYDSYVRGTY----ESLN 271

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSRLR-GQNLVLIPLADLINHSPGITTEDYA 263
           L N       +  D FLWAFGILRSRA   +R G  + L+P  D++NH    +  D+ 
Sbjct: 272 LSND-----GVPEDTFLWAFGILRSRAQQPMRDGSEVTLVPGLDMLNHKARYSENDFG 324


>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 499

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 133/462 (28%), Positives = 211/462 (45%), Gaps = 44/462 (9%)

Query: 41  SRAFAATCSLHS-ASATTNPPTAQVE--------TFWQWLRDQKVVSPKSPIRPATFPEG 91
           SR     CS+ + A+A TNP     E           +WL +  +   K  I      E 
Sbjct: 37  SRKHQVQCSVSAGAAAQTNPVAWGCEIDSLENSSALQRWLSESGLPPQKMGIERVEVGER 96

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLC--SGLKPWISVALFLIREKK- 147
            GLVA ++I K E +L VP    I PD+  +  E G +   + +  W  +A +LI E   
Sbjct: 97  -GLVALKNIRKGEKLLFVPPSLVITPDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASL 155

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
            E S W  Y+  LP    S ++W++ EL   L+ +Q+    +     V   Y  +   I 
Sbjct: 156 MESSRWSNYISALPRQPYSLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRLRIF 215

Query: 207 LPNKQLFPRPI-TLDDFLWAFGILRSRAFSRL--RGQNLVLIPLADLINHSPGITT-EDY 262
                LFP  +  ++ F W+FGIL SR   RL   G N+ L+P AD++NHS  + T  DY
Sbjct: 216 SKYPDLFPDEVFNIESFKWSFGILFSR-LVRLPSMGGNVALVPWADMLNHSCDVETFLDY 274

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNA 320
               KG       ++F+  T  P + GEQV I Y   KSN EL L YGF+  E  +  ++
Sbjct: 275 DKTSKG-------IVFT--TDRPYQPGEQVFISYG-KKSNGELLLSYGFVPKEGANPSDS 324

Query: 321 YTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTLPPAMLQYLRLVALGGTDAF 379
             L+L + +SD  + +KL++ +  GL  S  F I + G  L      YL +        F
Sbjct: 325 VELSLSLKKSDASYKEKLELLKNYGLSASQCFPIQITGWPLELMAYAYLAVSPSSMRGDF 384

Query: 380 -LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE----DEQLLQGG 434
             + +   N      DL      E+ + + + D+C+S++S ++  ++     D  +    
Sbjct: 385 EEMAAAASNNTTSKKDLRYPEIEEQAL-QFILDSCESSISKYNKFLQASGSLDLDVTSPK 443

Query: 435 NLDPRL---RIAVGVRAGEKRVLQQIDEIFKE--RELELDEL 471
            L+ RL   ++AV +   E+R+L +   I +   R++   EL
Sbjct: 444 QLNRRLFLKQLAVDLCTSERRILFRAQYILRRKLRDMRAGEL 485


>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
           guttata]
          Length = 593

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 176/389 (45%), Gaps = 37/389 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +GSL S  +       I++A  L+ 
Sbjct: 104 EGFGLKATREIKAEELFLWVPRKLLMTVESAKNSVLGSLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+    S W  Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 164 ERANPHSFWLPYIQTLPSEYDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     KAGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FKAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T P     +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPAISAQLLAFLRVF 388

Query: 372 ALGGTD--AFLL--ESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
            +   +    L+   +I +    G+ D PVS  NE  +   +       L  + TT+E D
Sbjct: 389 CMSEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVD 448

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           +  L+  +L P   +A+ +R GEK +L++
Sbjct: 449 KSFLETHDLTPHAIMAIKLRLGEKEILEK 477


>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
          Length = 492

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 176/382 (46%), Gaps = 41/382 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 389 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVR 447
           ED+ +L+  +L  R ++A+ +R
Sbjct: 447 EDKSVLKNQDLSVRAKMAIKLR 468


>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
 gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
          Length = 449

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 120/432 (27%), Positives = 203/432 (46%), Gaps = 42/432 (9%)

Query: 46  ATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEV 105
            T S  +A++++N    ++    +WL +     PK  +      E   ++AQ  I+  E 
Sbjct: 3   GTMSADTAASSSN---QKLSNLLRWLEEGGARFPKLQLVRREDGER-AVLAQAPISAGET 58

Query: 106 VLEVPMKFWINPDTVAASEIG-SLCSGLKPW---ISVALFLIREKKKEDSPWRVYLDILP 161
           VL+VP    +  +    S+IG ++  GL P    + +A FL++EK +E S W+ Y+D LP
Sbjct: 59  VLQVPRTHMLTLELARESDIGRAIAEGLDPDNEDLYLASFLLQEKHREGSFWKPYIDSLP 118

Query: 162 EC-TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLD 220
           E  +   +F+  EE   L+G   L+      + ++ +YL + + +  P  + F    T  
Sbjct: 119 ESYSQMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYLSLCQNV--PGYERF----TPG 172

Query: 221 DFLWAFGILRSRAFSRLRGQNL--VLIPLADLINHS--PGI---TTEDYAYEIKGAGLFS 273
           +F+WA   + SR FS  +G  L   L+P+AD++NH   P +   TTED            
Sbjct: 173 EFVWARLSVSSRLFSLKKGGFLGQTLVPMADMLNHRRPPDVLWETTEDGES--------- 223

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF 333
               F ++    V AG++V   Y   KSN  + L +GF+   ++ +   L L I + DP 
Sbjct: 224 ----FVMKANNAVAAGDEVHDSYG-AKSNDLMLLHFGFVTDDNEHDEAFLGLRILDGDPL 278

Query: 334 FGDKLDIAETNGLGESAYFDIV--LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG 391
              K  +        +  F I      T       +LR+ A    D   +E I    + G
Sbjct: 279 AATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRIAAAVPND---IEDISSRVMSG 335

Query: 392 HLDL-PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGE 450
              L P+S  NEE +  ++   C++ LS F T++ +DE+LL+G +L P  R  V VR  E
Sbjct: 336 ERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLRGESLSPNARNCVLVRRAE 395

Query: 451 KRVLQQIDEIFK 462
           K++++   E+ +
Sbjct: 396 KQLIEDYLEMTR 407


>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 431

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 100/304 (32%), Positives = 144/304 (47%), Gaps = 18/304 (5%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLC--SGLKPWISVALFLIREKK 147
           G GLVA + + + E +L VP    I  D+    +E G +   +GL  W  +A+FLI E  
Sbjct: 39  GRGLVATQSLRQGERLLFVPSGLLITADSEWGCAETGRIIKEAGLPEWPMLAIFLISEAS 98

Query: 148 KEDSP-WRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
           +E+S  W  Y   LP+   S + W+EEE+   L  + +    L     V   Y  +   I
Sbjct: 99  REESSRWFPYFATLPKTPSSILQWTEEEVNTWLTASPVREKALECIRDVTETYRDLRATI 158

Query: 206 ILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITT-EDYA 263
            L + ++FP  + TL  F WAFGIL SR         L L+P AD++NHSP + +  D+ 
Sbjct: 159 FLKHPEVFPSQVYTLAAFKWAFGILFSRLVRLPSVGKLALVPWADMLNHSPQVDSFLDFD 218

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD-RNAYT 322
                + +   D  +        ++GEQV I Y   +S+ EL L YGFI S+ +  ++  
Sbjct: 219 QNNAKSVVTVTDRAY--------QSGEQVFISYG-KRSSGELFLAYGFIPSELNVHDSVE 269

Query: 323 LTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLE 382
           L +EI   DP F  KL  A   GL     F  V     P  +L Y RL+A   +D   L 
Sbjct: 270 LEMEIDSDDPSFEAKLRAANEQGLSSPQRFP-VRKDGFPAQLLAYARLIASRTSDPAQLS 328

Query: 383 SIFR 386
            I R
Sbjct: 329 RIAR 332


>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
           str. Neff]
          Length = 408

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 182/406 (44%), Gaps = 61/406 (15%)

Query: 100 IAKNEVVLEVPMKFWINPDT-VAASEIGSLCSGLKPWIS-------VALFLIREKKKEDS 151
           +  +E +LEVP    ++    + A ++GS+ + +KP +        +ALF++ E +K DS
Sbjct: 2   VLASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRKPDS 61

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            WR Y D LP   +  +FW++E++  L G+ L +  L  K+  ++ +     E I+P  +
Sbjct: 62  FWRPYFDALPSRVNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWH----TEHIVPIVR 117

Query: 212 LFPRPITLDD-------------FLWAFGILRSRAFSRLRGQNL---VLIPLADLINHS- 254
            +PRP  + D             F W   ++ SRAF     ++     ++P+ADLINHS 
Sbjct: 118 RYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPMADLINHSL 177

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
                  Y ++ K          F +    P   GEQV I Y    SN EL   Y  +  
Sbjct: 178 TNDNVSKYTFDDKTQT-------FIVHVQQPYAEGEQVFITY-CTDSNFELLKTYAMM-V 228

Query: 315 KSDRNAYT-LTLEISESDPFFGDKLD-IAET-----NGLGESAYFDIVLGRTLPPAMLQY 367
           + + N YT + L+ +       D+++ + +T      GL +  Y   V     P  ++Q 
Sbjct: 229 EDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRGLAKQTY--PVKSEEFPLDLVQA 286

Query: 368 LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
           LRL  L  TD+    + F          PVS  NE ++   +    K  LS +  T +ED
Sbjct: 287 LRLYHLPLTDSHTESTCFETD-------PVSVQNELMVYDTIAGCVKELLSQYPITAQED 339

Query: 428 EQLLQGGNLDPRL----RIAVGVRAGEKRVLQQIDEIFKERELELD 469
             +L     DPRL    R+AV  R  +K  L ++  +F E   ELD
Sbjct: 340 AAMLAH---DPRLSATARLAVAYRREDKLFLTEVGSVFAEMRKELD 382


>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
          Length = 648

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 124/403 (30%), Positives = 193/403 (47%), Gaps = 52/403 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG--SLCSGLKPWISVALFLIREK-- 146
           G GL+A+R I   + +L++P+   +   + A  E+G  +L  G+  +++VA  LI EK  
Sbjct: 205 GRGLLARRSINDGDELLKIPLDLCLTRKS-ARRELGKDALQEGINEYLAVACQLIHEKFV 263

Query: 147 KKEDSPWRVYLDILPEC--TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           K EDS +  Y+ +LPE    + T  W +E+L  L+G+ +++ T  ++  ++ EY   L  
Sbjct: 264 KGEDSFYAAYMGVLPEVDEVNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRREYDDLLGG 323

Query: 202 EEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFSRLR----GQNLVLIPLADLINHSPG 256
            + ++      FP    T +++ WAF +L SRA  RLR    G+ L ++P ADLINHS  
Sbjct: 324 PDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAI-RLRNLQVGERLAMVPYADLINHSAF 382

Query: 257 ITTEDYAYEIKGAGLFSRD----LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
                 A E  G  LF       +L++ R     +  EQV I Y   KSNAEL L YGF 
Sbjct: 383 SQAFIDARE-SGDWLFKSGEEEVILYADRG---YRQMEQVYISYG-QKSNAELLLLYGFA 437

Query: 313 ESKSDRNAYTLTLEIS-------------ESDPFFGDKLDIAETNGLGESAYFDIVLGRT 359
             ++  N+  +T+ I+             E DP   +KL+   + G  ++  F     R 
Sbjct: 438 LERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDFPCYADR- 496

Query: 360 LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP--VSHANEELICRVVRDACKSAL 417
            P  ML+YLRL+ +   D        R       D    +S ANE  + R V  A K  L
Sbjct: 497 YPVEMLEYLRLMMMTPEDT-------RGKPLSDFDYSRTISSANEASVLRSVVAAVKYQL 549

Query: 418 SGFHTTIEEDEQLLQGGNL----DPRLRIAVGVRAGEKRVLQQ 456
             F  T E+D  +++   +        R+AV  R  EKR+L++
Sbjct: 550 GLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKR 592


>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
 gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
          Length = 481

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 108/402 (26%), Positives = 178/402 (44%), Gaps = 28/402 (6%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IG 126
           +WL++Q    P+ P+  ++F + GLG  A RD+   +  L +P  F +    VA    I 
Sbjct: 52  KWLQEQGF--PQQPLLVSSFEDKGLGCCATRDLQAGDAALSIPENFTVTAVDVANHPVIS 109

Query: 127 SLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDS-TVFWSEEELVELQGTQLL 184
           S   G    + +AL+L+ E+++ +DSPW  YL + P  T S  ++  EE+   L+G+  L
Sbjct: 110 SAAEGRDELVGLALWLMYEQERSQDSPWYPYLKVFPASTLSPLLWEQEEQEELLRGSSAL 169

Query: 185 STTLGVKEYVQN--EYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL 242
           +    VK+ + +  +     ++ +  NK       T   F  AF ++ SRA      +  
Sbjct: 170 AK---VKDQLTSLRQTFDALKDTLKDNKDFPMEKFTFSAFKAAFSVVLSRAVYLPSAELF 226

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            L+P  DLINH    +  DY  E +            L      K G+QV   Y  N ++
Sbjct: 227 ALVPFGDLINHESSRSLLDYDIEEQKV---------KLAVDKRYKKGDQVFASYAQNLTS 277

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP 362
           A+  + YGF++   + +   + + +   D     K +I +  GL     F + L R  P 
Sbjct: 278 ADFLIRYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNR-FPT 336

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +L Y RL  +  +  F         I    DL V   NE     ++   C++ L  F  
Sbjct: 337 QLLTYTRLARIQDSGLF-------AKITFEKDLIVCQTNEYETLMLLMADCRTKLLSFSD 389

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           T+E+D Q L+  NL  + R+A  +R  EKR+L       + R
Sbjct: 390 TMEDDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTMSALRNR 431


>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Chroomonas mesostigmatica
           CCMP1168]
          Length = 464

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 187/381 (49%), Gaps = 22/381 (5%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KE 149
           G GL+A R I + E ++E+P    I   ++  +    L S L  + S+A+  I+E+   E
Sbjct: 95  GRGLLAFRKIQQGEKLIEIPENL-ILKKSLKENRSEDL-SFLNEYDSLAIKAIQERAIGE 152

Query: 150 DSPWRVYLDILPECTD-STVF-WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            S W+VY +ILP+  D + VF W   ++V L+G+++L+ +  +KE ++ ++L++E+ I  
Sbjct: 153 KSKWKVYYEILPKEKDLNLVFRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKTIFS 212

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            N+ ++P  I  L  + WA  +L SRA      + + L+P AD INH+P  T+  Y    
Sbjct: 213 KNRLVYPEKIFNLQSWEWAISLLLSRAIFLQNMKKIALVPYADFINHNPFSTS--YINSK 270

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           K A  FS +    +         +Q+   Y   K+N EL + YGFI  ++  ++  L + 
Sbjct: 271 KIA--FSENNEIVMYADKDYNKFDQIFTTYG-QKTNLELLVLYGFIIERNPFDSIELRVA 327

Query: 327 ISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLE-SIF 385
           +S  D  +  K          E   F +   +  P  + +++RL   G  D F  E S  
Sbjct: 328 LSTKDELYNKKEKFINDCEKTEQITFPVFYYK-YPKELYEFMRLCLSGPRDFFGEEFSNL 386

Query: 386 RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN---LDPRLRI 442
             T   + +L      E++I + V  ACK  L  ++ TI E++ L    N   L    + 
Sbjct: 387 NFTDEENFNL------EKIIRKTVIFACKKNLKAYNKTINEEKILNNLSNIIVLTKNQKT 440

Query: 443 AVGVRAGEKRVLQQIDEIFKE 463
           ++  R  EK++LQ++ E  K+
Sbjct: 441 SIKQRKCEKKILQRLSENLKK 461


>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
 gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
          Length = 434

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 120/409 (29%), Positives = 193/409 (47%), Gaps = 64/409 (15%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG--SLCSGLKPWISVALFLIREKK- 147
           G GL+A+R I   + +L++PM   I   + A   +G  +L  G+  ++++A  LI EK  
Sbjct: 45  GRGLLARRSINDGDELLKIPMDLCITRKS-ARKALGKDALQDGINEYLAIACQLIHEKYV 103

Query: 148 -KEDSPWRVYLDILPEC--TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
             ++S W  Y+ +LPE    + T  W +E+L  L G+ +++ T  ++  ++ EY   L  
Sbjct: 104 LGDESEWDAYMGVLPEVEEVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDALLGG 163

Query: 202 EEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFSRLR----GQNLVLIPLADLINHSPG 256
           ++ +I      FP    T ++++WAF +L SRA  RLR    G+ L ++P ADLINHS  
Sbjct: 164 QDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAI-RLRNLQVGERLAMVPYADLINHSA- 221

Query: 257 ITTEDYAYEIKGAGLFSR---DLLFS-------LRTPVPVKAGEQVLIQYDLNKSNAELA 306
                      GA + +R   D LF        L      +  EQV I Y   KSNAEL 
Sbjct: 222 ---------FSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISYG-QKSNAELL 271

Query: 307 LDYGFIESKSDRNAYTLTLEIS-------------ESDPFFGDKLDIAETNGLGESAYFD 353
           L YGF   ++  N+  +T+ I+             E DP   +K++   + G  ++  F 
Sbjct: 272 LLYGFALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVEFLASVGRDQTVDFP 331

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP--VSHANEELICRVVRD 411
               R  P  ML++LRL+ +   D        R       D    +S ANE  +   V +
Sbjct: 332 CYADR-YPVEMLEFLRLMMMTPEDT-------RGKPLADFDYSRTISPANEAAVLSSVVE 383

Query: 412 ACKSALSGFHTTIEEDEQLLQGGNLDPRL----RIAVGVRAGEKRVLQQ 456
           A K  L+ +  + E+D  +++   L   L    R+AV  R  EKR+L++
Sbjct: 384 AVKYQLNLYPQSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLLKR 432


>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
          Length = 541

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 116/409 (28%), Positives = 181/409 (44%), Gaps = 28/409 (6%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F  WL+   V +    I+      G G+ A RD  + E+ L +P    +  DT   S +G
Sbjct: 80  FMGWLKSNSVDAEAVEIQHFDVG-GYGIKATRDFKEGELFLAIPRSVMMTTDTAKNSALG 138

Query: 127 SLCSGLK-----PWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +L +  +     P I +AL ++ E     S W  YL ILP    S ++++ E+L  L+ +
Sbjct: 139 ALIADNRILQTMPNILLALHVLCELCSPASFWLPYLKILPHSYSSPLYFNPEDLQLLKAS 198

Query: 182 QLLSTTLGVKEYVQNEY---LKVEEEIILPNK-QLFPRPITLDDFLWAFG--ILRSRAFS 235
             LS  +     +  +Y     + +   L +K  +  + I  DD+ WA    + R     
Sbjct: 199 PTLSEMINQFRNITRQYAYFFNLFQGHELASKLPIQVKNICYDDYRWAVSSVMTRQNQIP 258

Query: 236 RLRGQNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
            L GQ ++  LIPL D+ NH+ G  T D++ +        R   FSL   V   AG QV 
Sbjct: 259 TLDGQRMISALIPLWDMCNHTNGQITTDFSLK------NDRSECFSLEGTV---AGAQVF 309

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           I Y  ++SNAEL +  GF+  ++  +  T+ L IS++DP F  K ++     +  S  F 
Sbjct: 310 IFYG-SRSNAELLIHNGFVYPQNHSDRLTIRLGISKNDPLFSMKSEVLSRLSMQASRLFS 368

Query: 354 IVLG-RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP--VSHANEELICRVVR 410
           +  G   +    L +LR+V +   D        R  I    D    VS  NE      + 
Sbjct: 369 LHCGVNPVDSDTLAFLRVVVM-TEDDLRTALACRQQISKLRDFDDFVSEDNERKAWAFLA 427

Query: 411 DACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDE 459
                 L  + T+ +ED  LLQG +L    R+AV +R  EK +L    E
Sbjct: 428 TRVLLLLKAYPTSAQEDATLLQGNDLSTHARLAVQLRHCEKNILMSTHE 476


>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
           3-like [Meleagris gallopavo]
          Length = 593

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 106/392 (27%), Positives = 178/392 (45%), Gaps = 43/392 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++  +S +GSL S  +       I++A  L+ 
Sbjct: 104 EGFGLKATREIKAEELFLWVPRKLLMTVESAKSSVLGSLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W  Y+  LP   D+ +++ E+E+  L+ TQ +       +    +Y    + 
Sbjct: 164 ERANPNSFWLPYIQTLPNEYDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     KAGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQ---DFKAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L    PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSIEPPISAQLLAFLRVF 388

Query: 372 ALGGTD-------AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTI 424
            +   +          ++ IF     G+ + P+S  NE  +   +       L  + TT+
Sbjct: 389 CMNEEELKEHLIGEHAIDKIF---TLGNSEFPISWDNEVKLWTFLEARASLLLKTYKTTV 445

Query: 425 EEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           E+D+  L+  +L     +A+ +R GEK++L++
Sbjct: 446 EDDKLFLETHDLTSHATMAIKLRLGEKKILEK 477


>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
          Length = 593

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/392 (27%), Positives = 177/392 (45%), Gaps = 43/392 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP +  +  ++   S +GSL S  +       I++A  L+ 
Sbjct: 104 EGFGLKATREIKAEELFLWVPRRLLMTVESAKNSVLGSLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W  Y+  LP   ++ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 164 ERANPNSFWLPYIQTLPSEYNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     KAGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQ---DFKAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPISAQLLAFLRVF 388

Query: 372 ALGGTD-------AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTI 424
            +   +          ++ IF     G+ + PVS  NE  +   +       L  + TT+
Sbjct: 389 CMSEEELKEHLIGEHAIDKIF---TLGNSEFPVSWDNEVKLWTFLEARASLLLKTYKTTV 445

Query: 425 EEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           E+D+  L+  +L     +A+ +R GEK +L++
Sbjct: 446 EDDKSFLETHDLTSHAIMAIKLRLGEKEILEK 477


>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
          Length = 592

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 178/389 (45%), Gaps = 37/389 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +GSL S  +       I++A  L+ 
Sbjct: 104 EGFGLKATRDIKAEELFLWVPRKLLMTVESAKNSILGSLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W  Y+  LP   ++ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 164 ERYNPNSFWLPYIQTLPNEYNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAFSRLRGQN-----LVLIPLADLINHSPG 256
           +   PN    P     T DD+ WA   + +R  +++  ++     L LIPL D+ NH+ G
Sbjct: 224 VQTHPNASKLPLKDSFTYDDYRWAVSSVMARQ-NQIPAEDGSRVTLALIPLWDMCNHTNG 282

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
           + T  Y           +D           KAGEQ+ I Y   +SNAE  +  GF    +
Sbjct: 283 LITTGYNL---------KDDRCECVALQDFKAGEQIYIFYG-TRSNAEFVIHSGFFFDNN 332

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLVAL 373
             +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+  +
Sbjct: 333 SHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPISAQLLAFLRVFCM 390

Query: 374 GGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
             T+  L E      +I R    G+ + PVS  NE  +   +       L  + TTI +D
Sbjct: 391 --TEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEVKLWTFLEARASLLLKTYKTTIHDD 448

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           + +L+  +L     +A+ +R GEK +L++
Sbjct: 449 KFILETQDLTHNATMAIKLRLGEKEILEK 477


>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 638

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 117/445 (26%), Positives = 193/445 (43%), Gaps = 29/445 (6%)

Query: 25  LKLLTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIR 84
           LK+L S T     R   R   A CS+   S  +      +    +WL  Q   + +  I 
Sbjct: 48  LKMLGSRTVWNEGRRGLRG-VARCSMSGNSMQS----MALHQLSEWLSKQGFPT-QDVIL 101

Query: 85  PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP-DTVAASEIGSLCSGLKPWISVALFLI 143
                EG+GL A RD  + EV L++P  + +   D V    + +  +G    I + L+L+
Sbjct: 102 TGFGEEGVGLAAGRDFKEGEVALKIPENYTVTGVDVVNHPVVAAPAAGRGDVIGLTLWLM 161

Query: 144 REKK-KEDSPWRVYLDILPECTDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNEYLKV 201
            E+   E S W  YL   P  T S + W+ EE  +L +G+  L         ++ EY  +
Sbjct: 162 YERSLGEKSVWYPYLQTFPSTTLSPILWTAEEQQKLLKGSPALEEVQQRSAALEGEYEDL 221

Query: 202 EEEIILPNKQLFPRP-ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTE 260
           +      + Q FP+   +L+ F  AF ++ SRA          L+P AD +NH       
Sbjct: 222 QS-YFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADLFALVPYADALNHRADSQAY 280

Query: 261 -DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
            DY+ E + A +F  D  +        K GEQV   Y   +SNA+L + YGF++  +  +
Sbjct: 281 LDYSMEDQ-AVVFPVDRNY--------KEGEQVFTSYGRERSNADLLITYGFVDENNAMD 331

Query: 320 AYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAF 379
              L + + + D     K  I +   L     F + L R  P  +L Y+RL  L   D  
Sbjct: 332 YLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDR-FPTQLLTYMRLSRL--QDPA 388

Query: 380 LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPR 439
           L   I  +      D+ +  ANE    +++   C++ L  +   ++++ +LL+   +  R
Sbjct: 389 LFPKIVFDK-----DIMLDQANEYECLQLLMGECRTKLGNYEGGVDDEIRLLKNKKISQR 443

Query: 440 LRIAVGVRAGEKRVLQQIDEIFKER 464
            R+A  +R  EK++L       + R
Sbjct: 444 ERVAAQLRLCEKKILTSTMTALRNR 468


>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 444

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/386 (26%), Positives = 171/386 (44%), Gaps = 31/386 (8%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L +P K  +  ++   S +G L +  +       +++AL L+ 
Sbjct: 22  EGYGLRATRDIKAEELFLWIPRKMLMTVESAKKSVLGPLYTQDRILQAMDNVTLALHLLC 81

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+    S W  Y+  LP+  D+ +F+ ++++  L GTQ +   L        +Y    K+
Sbjct: 82  ERADPASFWLPYIRTLPQEYDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQYAYFYKL 141

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRA----FSRLRGQNLVLIPLADLINHSPGI 257
            +     +K       T DD+ WA   + +R         R   L LIPL D+ NH  G+
Sbjct: 142 VQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLALIPLWDMCNHRNGL 201

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
            T  Y  E        R    +L+     K  EQ+ I Y   +SNAE  +  GF   ++ 
Sbjct: 202 ITTGYNLED------DRCECVALQD---YKKNEQIYIFYG-TRSNAEFVIHNGFFYQENA 251

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLVALG 374
            +   + L IS+S+  +  K ++    G+  S+ F +      PP    +L +LR+  + 
Sbjct: 252 HDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCNE--PPISAQLLAFLRVFCMM 309

Query: 375 GTDA----FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
             +     F  ++I R    G ++ PVS  NE  +   +       L  + TT EED   
Sbjct: 310 EEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALLLKAYKTTAEEDSST 369

Query: 431 LQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           L   +L P  R+AV +R  EK +L++
Sbjct: 370 LDKTDLSPHSRMAVQLRLAEKAILEK 395


>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
 gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
           gallus]
 gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
          Length = 593

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 106/392 (27%), Positives = 176/392 (44%), Gaps = 43/392 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +GSL S  +       I++A  L+ 
Sbjct: 104 EGFGLKATREIKAEELFLWVPRKLLMTVESAKNSVLGSLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W  Y+  LP   D+ +++ E+E+  L+ TQ +       +    +Y    + 
Sbjct: 164 ERANPNSFWLPYIQTLPSEYDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     KAGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQ---DFKAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L    PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSIEPPISAQLLAFLRVF 388

Query: 372 ALGGTD-------AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTI 424
            +   +          ++ IF     G+ + P+S  NE  +   +       L  + TT+
Sbjct: 389 CMNEEELKEHLIGEHAIDKIF---TLGNSEFPISWDNEVKLWTFLEARASLLLKTYKTTV 445

Query: 425 EEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           E+D+  L+  +L     +A+ +R GEK +L++
Sbjct: 446 EDDKSFLETHDLTSHATMAIKLRLGEKEILEK 477


>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
 gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 114/416 (27%), Positives = 185/416 (44%), Gaps = 44/416 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A +DI   E+ L +P K  +  ++   S +G L S  +       +++AL L+ 
Sbjct: 104 EGYGLKATKDIKAEELFLWIPRKMLMTVESAKNSVLGPLYSQDRILQAMGNVTLALHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+    SPW  Y+  LP   D+ +++ EEE+  L  TQ +   L   +    +Y    + 
Sbjct: 164 ERANPSSPWLPYIKTLPSEYDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     K GEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALKD---YKEGEQIYIFYG-TRSNAEFVIHNGFFFE 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+ +  +  K ++    G+  S+ F   L  + PP    +L +LR+ 
Sbjct: 331 DNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIF--ALHCSEPPISAQLLAFLRVF 388

Query: 372 ALGGTD--AFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
            +   +   +L+     N I+  G+ + PVS  NE  +   +       L  + T  EED
Sbjct: 389 CMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEED 448

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKEREL----ELDE---LEYYQE 476
             +L+  +L    RIA+ +R  EK +L+      + + L    +LDE   L  Y+E
Sbjct: 449 RSMLEKPDLSLHSRIAIKLRLAEKEILEHAVSCGRAKRLHFQKQLDEGAPLPLYEE 504


>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 114/416 (27%), Positives = 185/416 (44%), Gaps = 44/416 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A +DI   E+ L +P K  +  ++   S +G L S  +       +++AL L+ 
Sbjct: 104 EGYGLKATKDIKAEELFLWIPRKMLMTVESAKNSVLGPLYSQDRILQAMGNVTLALHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+    SPW  Y+  LP   D+ +++ EEE+  L  TQ +   L   +    +Y    + 
Sbjct: 164 ERANPSSPWLPYIKTLPSEYDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     K GEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALKD---YKEGEQIYIFYG-TRSNAEFVIHNGFFFE 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+S+  +  K ++    G+  S+ F   L  + PP    +L +LR+ 
Sbjct: 331 DNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIF--ALHCSEPPISAQLLAFLRVF 388

Query: 372 ALGGTD--AFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
            +   +   +L+     N I+  G+ + PVS  NE  +   +       L  + T  EED
Sbjct: 389 CMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEED 448

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKEREL----ELDE---LEYYQE 476
             +L+  +L    RI + +R  EK +L+      + + L    +LDE   L  Y+E
Sbjct: 449 RSMLEKPDLSLHSRITIKLRLAEKEILEHAVSCGRAKRLHFQKQLDEGAPLPLYEE 504


>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
           rubripes]
          Length = 598

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 103/384 (26%), Positives = 173/384 (45%), Gaps = 27/384 (7%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L +P K  +  ++   S +G L +  +       +++AL L+ 
Sbjct: 104 EGYGLRATRDIKAEELFLWIPRKMLMTVESAKKSVLGPLYNQDRILQAMDNVTLALHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+    S W  Y+  LP+  D+ +F+ ++E+  LQGTQ +   L        +Y    K+
Sbjct: 164 ERANPASFWLPYIRTLPQEYDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQYAYFYKL 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRA----FSRLRGQNLVLIPLADLINHSPGI 257
            +     +K       T DD+ WA   + +R         R   L LIPL D+ NH  G+
Sbjct: 224 IQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLALIPLWDMCNHRNGL 283

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
            T  Y  E        R    +L+     K  EQ+ I Y   +SNAE  +  GF   ++ 
Sbjct: 284 ITTGYNLED------DRCECVALQD---YKKNEQIYIFYG-TRSNAEFVIHNGFFYQENA 333

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG-RTLPPAMLQYLRLVALGGT 376
            +   + L IS+S+  +  K ++    G+  S+ F +    + +   +L +LR+  +   
Sbjct: 334 HDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCNEQPISAQLLAFLRVFCMKEE 393

Query: 377 DA--FLL--ESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
           +   +LL   +I +    G ++ PVS  NE  +   +       L  + TT EED   L+
Sbjct: 394 ELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEEDSSTLE 453

Query: 433 GGNLDPRLRIAVGVRAGEKRVLQQ 456
              L P  R+A+ +R  EK +L++
Sbjct: 454 KSELSPHSRMAIQLRLAEKWILEK 477


>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Monodelphis domestica]
          Length = 595

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 175/391 (44%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +G+L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATREIKAEELFLWVPRKLLMTVESAKNSVLGALYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+    S W  Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 164 ERANPSSFWLPYIQTLPSEYDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+       GEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALQD---FNVGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   +       L  + TTIE
Sbjct: 389 CM--TEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEVKLWTFLEARASLLLKTYKTTIE 446

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L   +L     +A+ +R GEK +L++
Sbjct: 447 EDKSFLATHDLTFHATMAIKLRLGEKEILEK 477


>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 441

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 108/417 (25%), Positives = 193/417 (46%), Gaps = 28/417 (6%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++ +  QWLR + V +     + +   EG+G+ A + + K E+ +E+P    I+  T   
Sbjct: 8   KIASLLQWLRSKSVTTDSLHFQKSDGHEGVGVYAAKSLQKGEITMEIPFHLTISKVTAMQ 67

Query: 123 SEIGSLCSG---LKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
           S++  +      L     VALFL+ E+ K  DS +  ++  LP   D  +FW++ +  EL
Sbjct: 68  SDLRQILQDKNELDQDEIVALFLMIERFKSSDSFFEPFIQSLPSQFDLPIFWNDSDFAEL 127

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ---LFPRPITLDDFLWAFGILRSRAFS 235
           +GT +      +++ ++ ++  +   ++   ++   L    I++ D+ WA  I+ +RAF 
Sbjct: 128 EGTNVALLAKIMRKQIEADFQAIHIPLLRAYEERLNLRTSEISISDYEWALSIIWTRAFG 187

Query: 236 RLR-GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
             R G+ L VL P  D+ NHS  +  E     IK   +  +D+L      +   A +   
Sbjct: 188 ITRYGEYLRVLCPALDMFNHSV-LVQEPLDEFIKYDHM--KDVLAHC-VVMETSANDPFY 243

Query: 294 IQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAY 351
           I Y  + S+A+L   YGF+    K+  N   L + +  +DP F  K  I E N       
Sbjct: 244 ISYG-SYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAATRDQT 302

Query: 352 FDI---VLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRV 408
           +D    +    +    L   R++ L   +    E  F +TI       VS  NE  +   
Sbjct: 303 YDFRGTIHLDDVDERFLASFRIILLSQEEFREYEKAFDSTI-------VSVRNELAVYAA 355

Query: 409 VRDACKSALSGFHTTIEEDEQLLQGGNLDPRLR--IAVGVRAGEKRVLQQIDEIFKE 463
           + D C+  L+ F T++E+D + L    ++  LR   A+ VR  +K++LQ +  + KE
Sbjct: 356 IHDVCEKRLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVCRLMKE 412


>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
           harrisii]
          Length = 602

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 175/391 (44%), Gaps = 41/391 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +G+L S  +       I++A  L+ 
Sbjct: 111 EGFGLRATREIKAEELFLWVPRKLLMTVESAKNSVLGALYSQDRILQAMGNITLAFHLLC 170

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+    S W  Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    + 
Sbjct: 171 ERANPSSFWLPYIQTLPSEYDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQYAYFYKV 230

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 231 IQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 287

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+       GEQ+ I Y   +SNAE  +  GF   
Sbjct: 288 NGLITTGYNLED------DRCECVALQ---DFNVGEQIYIFYG-TRSNAEFVIHSGFFFD 337

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 338 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 395

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   +       L  + TTIE
Sbjct: 396 CM--TEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEVKLWTFLEARASLLLKTYKTTIE 453

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           ED+  L   +L     +A+ +R GEK +L++
Sbjct: 454 EDKSFLATHDLTFHATMAIKLRLGEKEILEK 484


>gi|428183325|gb|EKX52183.1| hypothetical protein GUITHDRAFT_84758 [Guillardia theta CCMP2712]
          Length = 213

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 100/187 (53%)

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
           + KS + + LD+G      +   + ++  +S  D FF DK DI E  GL     F +   
Sbjct: 1   MGKSPSSMLLDHGVANPNKEEGEFKVSFAVSSLDRFFDDKADILEQEGLTTEMTFTLKAD 60

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
            TL   +L ++RLV +   DAFLLES+FR  +WG ++LPVS  NE+L+   +    + AL
Sbjct: 61  GTLDEYLLPFMRLVCIQSFDAFLLESVFRQEVWGFVNLPVSKDNEKLMLETLIATFEGAL 120

Query: 418 SGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQER 477
               ++  ED  +++  +   R   A  VR GE+  L++   + ++   E+D  EYYQER
Sbjct: 121 DDIGSSESEDMSIVRDASSTYRQVQAAYVRIGERSALKKTIYLLEQEMEEMDSKEYYQER 180

Query: 478 RLKDLGL 484
           RLK L L
Sbjct: 181 RLKSLNL 187


>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
 gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
          Length = 600

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 108/402 (26%), Positives = 179/402 (44%), Gaps = 28/402 (6%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IG 126
           +WL++Q    P+ P+  ++F + GLG  A RD+   +  L +P  F +    VA    I 
Sbjct: 52  KWLQEQGF--PQQPLLVSSFEDKGLGCCATRDLQAGDAALSIPENFTVTAVDVANHPVIS 109

Query: 127 SLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVEL-QGTQLL 184
           S   G    + +AL+L+ E+++ +DSPW  Y+ + P  T S + W +EE  EL +G+  L
Sbjct: 110 SAAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPASTLSLLLWEQEEQEELLRGSSAL 169

Query: 185 STTLGVKEYVQN--EYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL 242
           +    VK+ + +  +     ++ +  NK       T   F  AF ++ SRA      +  
Sbjct: 170 AK---VKDQLTSLRQTFDALKDTLKDNKDFPMEKFTFSAFKTAFSVVLSRAVYLPSAELF 226

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            L+P  DLINH    +  DY  E +            L      K G+QV   Y  N ++
Sbjct: 227 ALVPFGDLINHESSRSLLDYDIEEQKV---------KLAVDKRYKKGDQVFASYAQNLTS 277

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP 362
           A+  + YGF++   + +   + + +   D     K +I +  GL     F + L R  P 
Sbjct: 278 ADFLIRYGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR-FPT 336

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +L Y RL  +  +  F         I    DL VS  NE     ++   C++ L     
Sbjct: 337 QLLTYTRLARIQDSGLFA-------KITFEKDLIVSQTNEYETLMLLMADCRTKLLSSSD 389

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           T+E++ Q L+  NL  + R+A  +R  EKR+L       + R
Sbjct: 390 TMEDEMQTLRRKNLSYKQRVAAQLRLKEKRILTDTMSALRNR 431


>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
 gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
          Length = 494

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 191/416 (45%), Gaps = 33/416 (7%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL    + S K  I      E  GLVA  +I K E +L VP +  I PD+  +  E G 
Sbjct: 69  KWLSQSGLPSQKMSIDKVDVGER-GLVALNNIRKGEKLLFVPPQLVITPDSEWSCPEAGE 127

Query: 128 LC--SGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQL 183
           +   + +  W  +A +LI E    + S W  Y+  LP    S ++WS+ EL   L+ +Q+
Sbjct: 128 VLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQPYSLLYWSQAELDRYLEASQI 187

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRA--FSRLRGQ 240
               +     V   Y  +   I       FP  +  ++ F W+FGIL SR      + G+
Sbjct: 188 RERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLPSMDGK 247

Query: 241 NLVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
           N  L+P AD++NHS  + T  DY    KG       ++F   T  P + GEQV I Y   
Sbjct: 248 N-ALVPWADMMNHSCEVETFLDYDKSSKG-------IVFP--TDRPYQPGEQVFISYG-K 296

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VL 356
           KSN EL L YGF+  E  +  ++  L+L + +SD  + +KL++ +  GL  S  F I V 
Sbjct: 297 KSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIRVT 356

Query: 357 GRTLPPAMLQYLRLVALGGTDAF-LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
           G  L      YL +        F  + +   N      DL      E+ + + + D+C+S
Sbjct: 357 GWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQAL-QFILDSCES 415

Query: 416 ALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVLQQIDEIFKER 464
           ++S ++  ++     D  +     L+ RL   ++AV +   E+R+L +   I + R
Sbjct: 416 SISKYNKFLQVSGSLDLDVTSPKQLNRRLFLKQLAVDLCNSERRILFRAQYILRRR 471


>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
 gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
 gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
          Length = 497

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 191/416 (45%), Gaps = 33/416 (7%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL    + S K  I      E  GLVA  +I K E +L VP +  I PD+  +  E G 
Sbjct: 72  KWLSQSGLPSQKMSIDKVDVGER-GLVALNNIRKGEKLLFVPPQLVITPDSEWSCPEAGE 130

Query: 128 LC--SGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQL 183
           +   + +  W  +A +LI E    + S W  Y+  LP    S ++WS+ EL   L+ +Q+
Sbjct: 131 VLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQPYSLLYWSQAELDRYLEASQI 190

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRA--FSRLRGQ 240
               +     V   Y  +   I       FP  +  ++ F W+FGIL SR      + G+
Sbjct: 191 RERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLPSMDGK 250

Query: 241 NLVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
           N  L+P AD++NHS  + T  DY    KG       ++F   T  P + GEQV I Y   
Sbjct: 251 N-ALVPWADMMNHSCEVETFLDYDKSSKG-------IVFP--TDRPYQPGEQVFISYG-K 299

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VL 356
           KSN EL L YGF+  E  +  ++  L+L + +SD  + +KL++ +  GL  S  F I V 
Sbjct: 300 KSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIRVT 359

Query: 357 GRTLPPAMLQYLRLVALGGTDAF-LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
           G  L      YL +        F  + +   N      DL      E+ + + + D+C+S
Sbjct: 360 GWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQAL-QFILDSCES 418

Query: 416 ALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVLQQIDEIFKER 464
           ++S ++  ++     D  +     L+ RL   ++AV +   E+R+L +   I + R
Sbjct: 419 SISKYNKFLQVSGSLDLDVTSPKQLNRRLFLKQLAVDLCNSERRILFRAQYILRRR 474


>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 481

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 184/385 (47%), Gaps = 33/385 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISV------ALFLI 143
           G G+ AQR  +  E +L +P +  W      A   +G +   L+P +SV       + L+
Sbjct: 25  GRGVQAQRAFSAGERILTIPAQCLWTVEHAYADRLLGPVLRALQPPLSVEDTLALHILLV 84

Query: 144 REKKKED------SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
           R +++ D      +  R ++D+LP+    ++F+S+EE+   +G+ L + T  ++  + ++
Sbjct: 85  RARRRPDDDGAYEAGRRSHVDVLPDRYTMSIFFSDEEMQVCKGSSLYTLTTQLRGRIGDD 144

Query: 198 YLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNL-VLIPLADLINH 253
           Y K+   +++ ++ LFP     ++ + WA   + SR   F+   G +L +L P AD++NH
Sbjct: 145 YKKLLTRVLMRHRNLFPLSKFGIEHYKWALCTVWSRGMDFTVSEGNSLRLLAPFADMLNH 204

Query: 254 SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           S  +  + +AY+     L       S+        G+QV I Y    +N  L L YGF+ 
Sbjct: 205 SSDV-KQCHAYDPTTGDL-------SILASKDYNVGDQVFIYYGPVPNNRLLRL-YGFVL 255

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
            ++  ++Y L L+ S   P +  K  + +  GL  +    +     LP ++L+YLR+  L
Sbjct: 256 PENPHDSYDLVLQTSPMAPLYEQKERLWKLAGLDTACTIPLTANDPLPRSVLRYLRIQRL 315

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
              D  LL ++      G  D  +S  +E LI + + D+  + L GF   ++     L  
Sbjct: 316 ---DESLLGAMTMQIATG-ADEKISDDSETLILQFLIDSISAILEGFSIPLDILTAQLAA 371

Query: 434 GNLDP---RLRIAVGVRAGEKRVLQ 455
           G++ P       A  V  GE+R+L+
Sbjct: 372 GDVYPVGGNAWAAAQVSVGEQRILR 396


>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 466

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 105/388 (27%), Positives = 180/388 (46%), Gaps = 33/388 (8%)

Query: 86  ATFP-EGLGLVAQRDIAKNEVVLEVPMKF-WINPDTVAASEIGSLCSGLKP------WIS 137
           A FP  G G+ A+R   + E +L +P    W      A S +G      +P       ++
Sbjct: 19  ADFPVTGRGVKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLA 78

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
           V +  +R ++   +  R ++  LP    S++F++E+EL    GT L + T  +K+ ++++
Sbjct: 79  VYVLFVRSRESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRIEDD 138

Query: 198 YLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNL-VLIPLADLINH 253
           Y  +   ++ P   LFP    T+  + WA   + SRA  F    G ++ +L P AD++NH
Sbjct: 139 YKDLIARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMDFELYDGSSMRLLAPFADMLNH 198

Query: 254 SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           S   + + + Y+     L       S+      +AG+QV I Y  +  N+ L   YGF+ 
Sbjct: 199 S-SESKQCHVYDASTGNL-------SILAGKDYEAGDQVYIHYG-SIPNSRLLRLYGFVI 249

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
             +  ++Y L L      PFF  K  +    GL  +    + L   LP ++L YLR+  L
Sbjct: 250 PDNPNDSYDLVLATHPMAPFFEQKQKLWALAGLDATCTISLTLANPLPKSVLCYLRIQRL 309

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE-DEQLLQ 432
             +D  ++     NT        +S++NE  + + + ++  S L  F T +E+ +EQL +
Sbjct: 310 DESDLAVINLQQSNTDTAFEK--ISNSNEVQVLQFLVESITSLLDSFGTQLEKLEEQLAK 367

Query: 433 -----GGNLDPRLRIAVGVRAGEKRVLQ 455
                GGN       A  V  GE+RVL+
Sbjct: 368 GIYPSGGN----AWAAAHVSLGEQRVLR 391


>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
 gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
          Length = 445

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 195/409 (47%), Gaps = 42/409 (10%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++ +  +W+     + PK  I      E   ++A+ DIA+ EVVL++P       +   A
Sbjct: 11  KLSSLLRWMEQGGALFPKMHIVRQADGE-RSVLARTDIAEGEVVLQIPTTHLFTLERAKA 69

Query: 123 SEIGS-LCSGLKP---WISVALFLIREKKK-EDSPWRVYLDILPECTDST-VFWSEEELV 176
           S+IG  + S L+P   ++ +A +L+ EK +  DS W+ ++D LPE      +F+SE+E  
Sbjct: 70  SDIGRRIQSQLQPDNDFLYLASWLLEEKHRGADSFWKPFVDSLPEAYPHVPLFYSEQERA 129

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR 236
            ++G+QL       ++  + EY ++ E+  LP  + F      ++++WA   L SR FS 
Sbjct: 130 RMKGSQLERLVEVQRQSFEQEYAQLREK--LPEYERF----GFEEYVWARISLYSRLFS- 182

Query: 237 LRG--QNLVLIPLADLINHS--PGI---TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           L+G  Q   L+PL+D+ NH   P +   T+ED                F +     V AG
Sbjct: 183 LKGGLQGPSLVPLSDMFNHRQPPDVLWSTSED-------------GQTFRMIAQRAVPAG 229

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRN-AYTLTLEISESDPFFGDKLDIAETNGLGE 348
            ++   Y    S+  L L  GF+   ++ N    L++ +   DP    K  +        
Sbjct: 230 TEIHTHYGAKSSDVFL-LHSGFVPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLASATA 288

Query: 349 SAYFDIVL-GRTLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL-PVSHANEELI 405
              F +   G+ L   ++  +LR+ A    D FL  ++    + G   + PVS A EE +
Sbjct: 289 KHPFKVSRQGKYLASWSVFSFLRM-AHASPDEFL--ALSNRLLSGTKTIAPVSVACEERV 345

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
              +  AC+  L  F TT+EEDE+LL+ G L P  R  V +R  EKR+L
Sbjct: 346 LGTLAAACEERLKAFPTTLEEDERLLREGPLSPNERSCVLLRRQEKRLL 394


>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 431

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/419 (26%), Positives = 186/419 (44%), Gaps = 37/419 (8%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWIN--- 116
           T +   F  W  +Q   S  + ++  T P  G  +VA  DIA  E +L VP    ++   
Sbjct: 26  TQRWAAFEAWFAEQGG-SRHAGVQLRTLPTFGRSVVAAHDIATGETLLSVPFSLVVDSAD 84

Query: 117 -PDTVAASEIGSLCSGLKPWISV---ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSE 172
            P   AA EI  +     P  +    AL L+  K   +SPW+ Y+D+LP    +T+F+S+
Sbjct: 85  APLATAAPEIRRILDEEFPLSATNENALLLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSD 144

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRS 231
           +EL  L+G+ L       +  ++++Y  +   + +   + F P   +LD + WA  ++ S
Sbjct: 145 DELSYLEGSSLHHFARQRRRAIESQYDTIFTPLFVDYPEHFAPEQFSLDAWKWALSVIWS 204

Query: 232 RAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQ 291
           R+F    G+   L+P AD+ N +P   TE     +         L++S R+  P+K GEQ
Sbjct: 205 RSFVVDEGKR-GLVPWADMFNMAP--ETEQVKVAVDAV---DHHLIYSARS--PIKKGEQ 256

Query: 292 VLIQYDLNK--SNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGES 349
           + + Y  ++  SNA+L +DYGF+   +  +A    +  S S      K  +   + L   
Sbjct: 257 IFVAYGQSRQMSNAQLLMDYGFVLENNPHDAVVFPMTHSSSAS--PRKRGLLRAHDLDRD 314

Query: 350 AYF--DIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP--VSHANEELI 405
            +F     LG   P  +L   R+      +   L+++   +  G   LP  VS  NE   
Sbjct: 315 QFFVGPPALGE-FPEHLLAAFRVTVATEQE---LDALLEQSAQGRQRLPSRVSRRNELAA 370

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
            R++    +   + +        + +    L    R A+ V  GEKR+  Q+    +ER
Sbjct: 371 LRLLESNVRKLRASY-------PEHVMSRALGANARAALAVCQGEKRMWDQLLTQIQER 422


>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 582

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 179/395 (45%), Gaps = 44/395 (11%)

Query: 88  FPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALF 141
           FPE G GL A R+I   E+ L VP K  +  ++   S +G L S  +       I++A  
Sbjct: 101 FPEEGFGLKATREIKAEELFLWVPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAFH 160

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           L+ E+   +S W  Y+  LP   D+ ++++E+E+  LQ TQ +       +    +Y   
Sbjct: 161 LLCERADPNSFWLPYIKTLPNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYF 220

Query: 202 EEEI-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLI 251
            + I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ 
Sbjct: 221 YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMC 277

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH+ G+ T  Y  E        R    +L+     K+GEQ+ I Y   +SNAE  +  GF
Sbjct: 278 NHTNGLITTGYNLED------DRCECVALQD---FKSGEQIYIFYG-TRSNAEFVIHNGF 327

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYL 368
               +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +L
Sbjct: 328 FFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHVTEPPISAQLLAFL 385

Query: 369 RLVALGGTD-------AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
           R+  +   +          ++ IF     G+ + PVS  NE  +   +       L  + 
Sbjct: 386 RVFCMNEDELKGHLIGDHAIDKIF---TLGNSEFPVSWENEIKLWTFLEARASLLLKTYK 442

Query: 422 TTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           TT+E+D ++L+  ++     +A+ +R  EK +L++
Sbjct: 443 TTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEK 477


>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
           tropicalis]
 gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 581

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 179/395 (45%), Gaps = 44/395 (11%)

Query: 88  FPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALF 141
           FPE G GL A R+I   E+ L VP K  +  ++   S +G L S  +       I++A  
Sbjct: 100 FPEEGFGLKATREIKAEELFLWVPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAFH 159

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           L+ E+   +S W  Y+  LP   D+ ++++E+E+  LQ TQ +       +    +Y   
Sbjct: 160 LLCERADPNSFWLPYIKTLPNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYF 219

Query: 202 EEEI-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLI 251
            + I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ 
Sbjct: 220 YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMC 276

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH+ G+ T  Y  E        R    +L+     K+GEQ+ I Y   +SNAE  +  GF
Sbjct: 277 NHTNGLITTGYNLED------DRCECVALQD---FKSGEQIYIFYG-TRSNAEFVIHNGF 326

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYL 368
               +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +L
Sbjct: 327 FFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHVTEPPISAQLLAFL 384

Query: 369 RLVALGGTD-------AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
           R+  +   +          ++ IF     G+ + PVS  NE  +   +       L  + 
Sbjct: 385 RVFCMNEDELKGHLIGDHAIDKIF---TLGNSEFPVSWENEIKLWTFLEARASLLLKTYK 441

Query: 422 TTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           TT+E+D ++L+  ++     +A+ +R  EK +L++
Sbjct: 442 TTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEK 476


>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
 gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 131/488 (26%), Positives = 211/488 (43%), Gaps = 70/488 (14%)

Query: 24  TLKLLTSLTKKPSFR---LKSRAFAATCSLHSASATTNPPTA--------------QVET 66
           +L  L    KK S+    L  R  A  CS+ ++S T                      E 
Sbjct: 16  SLHSLHKTHKKVSYSQPFLHKRHPAIQCSISTSSDTKAAAKVSETVPWGCDIDSLENAEA 75

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEI 125
             +WL D  +   K  I+     E  GLVA ++I K E++L VP    I  D+  +  E 
Sbjct: 76  LQKWLSDSGLPPQKMAIQKVEVGER-GLVALKNIRKGEMLLFVPPSLVIAADSEWSCPEA 134

Query: 126 GSLCS--GLKPWISVALFLIREKKKEDSP-WRVYLDILPECTDSTVFWSEEEL-VELQGT 181
           G +     +  W  +A +LI E   E S  W  Y+  LP    S ++W+  EL   L+ +
Sbjct: 135 GEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDTYLEAS 194

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQ 240
           Q+    +     V   Y  +   I      LFP  +  ++ F W+FGIL SR   RL   
Sbjct: 195 QIRERAIERITNVTGTYNDLRLRIFSKYPHLFPEEVFNMETFKWSFGILFSR-LVRLPSM 253

Query: 241 N--LVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           +  + L+P AD++NHS  + T  DY    KG  +F+ D         P + GEQV I Y 
Sbjct: 254 DGRVALVPWADMLNHSSEVETFLDYDKSSKGV-VFTTDR--------PYQPGEQVFISYG 304

Query: 298 LNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI- 354
             KSN EL L YGF+  E  +  ++  L+L + +SD  + +KL+  + +GL  S  F + 
Sbjct: 305 -RKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLSVSQCFPLQ 363

Query: 355 VLGRTL-----------PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEE 403
           V G  L           PP+M +    +A   ++        + T    +  P     EE
Sbjct: 364 VTGWPLELMAYAYLAVSPPSMSRQFEEMAAAASN--------KTTTNKKITYP---DIEE 412

Query: 404 LICRVVRDACKSALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVLQQ 456
              + + D+C+ ++S +   ++     D  +     L+ RL   ++AV + + E+R+L +
Sbjct: 413 QALQFILDSCELSISKYTKFLQASGSMDLDVTSPKQLNRRLFLKQLAVDLCSSERRILFR 472

Query: 457 IDEIFKER 464
              + + R
Sbjct: 473 AQYVLRRR 480


>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
           Y34]
          Length = 464

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/406 (26%), Positives = 184/406 (45%), Gaps = 41/406 (10%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPM-KFWINPDTVAASEIGS 127
           WL++   V     +  A FP  G G+   R   + E +L +P    W      A S +G 
Sbjct: 4   WLKETGAVGLDD-LELADFPITGRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLLGP 62

Query: 128 LCSGLKPWISVA------LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
               ++P +SV       +  +R ++      R ++  LP    S++F++ EEL    GT
Sbjct: 63  ALRSIRPPLSVEDILATYILFVRSRESGYDGLRSHVAALPSSYSSSIFFAGEELEVCAGT 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFS-RLRG 239
            L + T  +++ ++++Y  +   +++ ++ LFP    T++D+ WA   + SRA    L G
Sbjct: 123 SLYTITKQLEQRIEDDYRALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPG 182

Query: 240 QNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
            N +  L P AD++NHS  +  + +AY+       S     S+      +AG+QV I Y 
Sbjct: 183 GNSIRLLAPFADMLNHSDNVK-QCHAYD-------SSSKTLSVLAGKDYEAGDQVFIYYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
              SN+ L   YGF+   +  + Y L L      PFF  KL +  +  L  ++   + L 
Sbjct: 235 -PVSNSRLLRLYGFVLPGNSNDNYDLVLATHPEAPFFARKLKLWASARLDSTSTISLTLT 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL--PVSHANEELICRVVRDACKS 415
             LP  +L+YLR+   G ++               +D    +S +NE  + R + ++   
Sbjct: 294 DPLPNDVLRYLRIQRSGASE-------LAGMACQRIDATEKISDSNEVEVLRFLVESLGG 346

Query: 416 ALSGFHTTIEE-DEQLLQ-----GGNLDPRLRIAVGVRAGEKRVLQ 455
            L  F T +E+ +EQL +     GGN       A  V  GE+RVL+
Sbjct: 347 LLKNFGTQLEKLEEQLAEGVYASGGN----AWAAAHVSLGEQRVLR 388


>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
 gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
          Length = 464

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 185/383 (48%), Gaps = 34/383 (8%)

Query: 89  PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK 148
           P G GL+A + I + E ++E+P    ++ D       G+    L  + S+ +FLI++   
Sbjct: 98  PCGRGLLAFKKIQQGEKLIEIPENLILSVDRDQIKNEGN--DFLNEYDSLGIFLIQQMAM 155

Query: 149 ED-SPWRVYLDILPECTDSTV--FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
            D S W++Y DILP   D  +   W+  ++V L+G++ L+ +L +KE ++ ++L++E+ I
Sbjct: 156 GDKSKWKIYFDILPREEDLNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRLEKTI 215

Query: 206 ILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
              N+  +P  I  L  + WA  IL SRA      + + L+P AD +NH+P  T+   + 
Sbjct: 216 FSKNRLKYPVSIFNLAQWEWALSILLSRAIFLQNLKKVSLVPYADFMNHNPFSTSYINSK 275

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
           +I     FS++    +         +Q+   Y   K+N EL L YGFI  ++  ++  L 
Sbjct: 276 KIS----FSKNHEIVMYADKDYNKFDQIFTTYG-QKTNLELLLLYGFILERNPFDSIELR 330

Query: 325 LEISESDPFFGDKLD-IAETNGLGESAYFDIVLGRTLPPAMLQYLRLV-----ALGGTDA 378
           + +S+ D FF  K   + E     E  +   +     P  + ++LR        LG TD 
Sbjct: 331 ISLSDKDSFFEKKKQFMIECEKTSEITF--PIFYYKYPKELYEFLRFCISNQEELGSTDL 388

Query: 379 FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN--- 435
               S F      + +   ++  E++I ++V  +C+  L  +   + E E++L   N   
Sbjct: 389 ----SDF------NFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKVSE-EKILNSLNSNF 437

Query: 436 -LDPRLRIAVGVRAGEKRVLQQI 457
            +    ++A+     EK+++Q++
Sbjct: 438 LISKNQKMALKQSKCEKKIIQRL 460


>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
          Length = 466

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 47/382 (12%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           I++A  L+ E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +   
Sbjct: 27  IALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTA 86

Query: 196 NEY---LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLI 245
            +Y    KV +     NK       T +D+ WA   + +R         SR+    L LI
Sbjct: 87  RQYAYFYKVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALI 143

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           PL D+ NH+ G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE 
Sbjct: 144 PLWDMCNHTNGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEF 193

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP--- 362
            +  GF    +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP   
Sbjct: 194 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHSTEPPISA 251

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSA 416
            +L +LR+  +  T+  L E +  ++        G+ + PVS  NE  +   + D     
Sbjct: 252 QLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLL 309

Query: 417 LSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELD 469
           L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E  
Sbjct: 310 LKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAVNREYYRKHMEERA 369

Query: 470 ELEYYQERRLKDLGLV-GEQGD 490
            L  Y+E    DLGL+ G  GD
Sbjct: 370 PLPRYEE---SDLGLLEGGVGD 388


>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           terrestris]
          Length = 484

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/426 (24%), Positives = 201/426 (47%), Gaps = 52/426 (12%)

Query: 56  TTNPPTAQ-VETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKF 113
           T +P  +Q +  F  WL+        + +  A FP   LGL A+R+  +NE++L +P + 
Sbjct: 73  TESPKRSQGIGRFINWLKQNGANVYGASV--AEFPGYDLGLKAERNFLENELILRIPREL 130

Query: 114 WINPDTVAASEIGSLCSG----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVF 169
             +    AA E+ +L +     L P +++A+ L+ EK KE S W+ YLDILP    + ++
Sbjct: 131 IFSIHN-AAPELVALQNDPLLQLMPQVALAIALLIEKHKEYSKWKPYLDILPTTYTTVLY 189

Query: 170 WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK---QLFPRPITLDDFLWAF 226
            +  ++ EL+G+  L   L     +  +Y    +     N     +     T + + WA 
Sbjct: 190 MTAADMNELKGSPTLEAALKQCRNIARQYAYFNKLFQKNNNAVSAILRDVFTYEKYCWAV 249

Query: 227 GILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDYAYEIKGAGLFS-R 274
             + +R       QN++           LIP+ D+ NH     T D+   +     ++ R
Sbjct: 250 STVMTR-------QNIIPSKDGSLMIHALIPMWDMCNHEDSKITTDFNATLNCCECYALR 302

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFF 334
           D           K  EQ+ I Y   ++N++  +  GF+   ++++ + L L IS++DP  
Sbjct: 303 DF----------KKAEQIFISYG-PRTNSDFFVHSGFVYMDNEQDGFKLRLGISKADPLH 351

Query: 335 GDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLVALGGTD-AFLLESIFRNTIW 390
            +++++   N L   A  + +L     P    +L +LR+ ++   + A  ++S   N + 
Sbjct: 352 KERVEL--LNKLDLPAVGEFLLKPGTEPISDTLLAFLRVFSMRKEELAHWIQSDRVNDL- 408

Query: 391 GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL-RIAVGVRAG 449
            H+D  +    EE + + +    +  ++ + TT++ED QLL+     PR+ ++A+ +R  
Sbjct: 409 KHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLLETT--LPRIKKLAIQLRVT 466

Query: 450 EKRVLQ 455
           EKR+LQ
Sbjct: 467 EKRILQ 472


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 105/404 (25%), Positives = 184/404 (45%), Gaps = 29/404 (7%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAAS 123
           T   WL++   V     +  A FP+ G G+   R   + E +L +P    W      A  
Sbjct: 9   TMESWLKESGAVGLDG-LELADFPDTGRGVKTLRPFKEGEKILTIPSSILWTVEHAYADP 67

Query: 124 EIG-SLCSGLKPW-----ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
            +G +LCS   P      ++  L  +R ++      R ++  LP    S++F++EEEL  
Sbjct: 68  LLGPALCSVQPPLSPEDTLTTYLLFVRSRESGYDGQRSHVAALPTSYSSSIFFTEEELEV 127

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--F 234
             GT L + T  +++ +++++  +  ++ + ++ LFP    +++D+ WA   + SR   F
Sbjct: 128 CAGTSLYTITKQLEQSIEDDHRALVMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDF 187

Query: 235 SRLRGQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
               G+++ +L P AD++NHS       + Y++    L       S+      + G+QV 
Sbjct: 188 QLRDGKSMRLLAPFADMLNHS-SEAKPCHVYDVSSGNL-------SVLAGKDYEPGDQVF 239

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           I Y  +  N+ L   YGF+   +  + Y L L      PF+  K  +  + GL  ++   
Sbjct: 240 INYG-SVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIP 298

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDAC 413
           + L   LP  +L+YLR+     +D  L     +N      D  VS +NE  I + + ++ 
Sbjct: 299 LTLTDPLPKNVLRYLRIQRADASD--LAAMALQN---AKADEKVSDSNEVEILQFLVESF 353

Query: 414 KSALSGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQ 455
              L GF T +E+ E+ L  G   P      A  V  GE+RVL+
Sbjct: 354 GHLLGGFGTPLEKLEEQLAQGVYSPGGNAWAAAHVSLGEQRVLR 397


>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
 gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
          Length = 500

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 185/417 (44%), Gaps = 36/417 (8%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +WL D  V S    I       G GL A +DI   E+ + +P K  +  +T   S +G
Sbjct: 95  FLKWLEDHGVKSDAVTIEKFEVG-GYGLKAVKDIKAEELFITIPRKLMLTTETARESSLG 153

Query: 127 SLCSG-----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
            L        +   +S+AL ++ EK   +S W  Y++I P    + +++ E E++ LQG+
Sbjct: 154 PLIKKDRILQVMANVSLALHVLCEKYSSNSFWAPYINIFPGTYTTPLYFEEGEMLHLQGS 213

Query: 182 QLLSTTLGVKEYVQNEY------LKVEEEII-LPNKQLFPRPITLDDFLWAFGILRSR-- 232
              S  L   + +  +Y       + + E   LP K+ F    T D++ WA   + +R  
Sbjct: 214 LNFSDVLNQYKSIARQYAYFYKLFQTQPEAAGLPLKECF----TFDEYRWAVSTVMTRQN 269

Query: 233 AFSRLRGQNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                 G++L+  LIP+ D+ NHS G  + ++       G  S + L     P       
Sbjct: 270 QVPTSDGRHLITALIPMWDMCNHSNGEVSTEF-----NLGSDSAECLAMREFPT----DS 320

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
           QV I Y + +SNAE  +  GF+  ++  +   + L +S++D  F  K ++    G+  S 
Sbjct: 321 QVYIFYGM-RSNAEFLIHNGFVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHAST 379

Query: 351 YFDIVLGRT-LPPAMLQYLRLVALGGTDAF-LLESIFRN---TIWGHLDLPVSHANEELI 405
            F +  G+  +PP +L +LR+  +   D   LL S  ++   +  G  D  V+   E   
Sbjct: 380 SFQVHCGKDPIPPELLVFLRVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKA 439

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFK 462
              +       +  + T+I++ E  LQ  ++    R A+ ++  E ++L    E  K
Sbjct: 440 WAFLETRLSLLIRSYRTSIKDVETELQAPDMTYHSRAALQLKLAEMQILSNAAEYAK 496


>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           impatiens]
          Length = 484

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 106/426 (24%), Positives = 201/426 (47%), Gaps = 52/426 (12%)

Query: 56  TTNPPTAQ-VETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKF 113
           T +P  +Q +  F  WL+        + +  A FP   LGL A+R+  +NE++L +P + 
Sbjct: 73  TESPKRSQGIGRFINWLKQNGANVYGASV--AEFPGYDLGLKAERNFLENELILRIPREL 130

Query: 114 WINPDTVAASEIGSLCSG----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVF 169
             +    AA E+ +L +     L P +++A+ L+ EK KE S W+ YLDILP    + ++
Sbjct: 131 IFSIHN-AAPELVALQNDPLLQLMPQVALAIALLIEKHKEYSKWKPYLDILPTTYTTVLY 189

Query: 170 WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK---QLFPRPITLDDFLWAF 226
            +  ++ EL+G+  L   L     +  +Y    +     N     +     T + + WA 
Sbjct: 190 MTAADMNELKGSPTLEAALKQCRNIARQYAYFNKLFQKNNNAVSAILRDVFTYEKYCWAV 249

Query: 227 GILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDYAYEIKGAGLFS-R 274
             + +R       QN++           LIP+ D+ NH     T D+   +     ++ R
Sbjct: 250 STVMTR-------QNIIPSKDGSLMIHALIPMWDMCNHENSKITTDFNATLNCCECYALR 302

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFF 334
           D           K  EQ+ I Y   ++N++  +  GF+   ++++ + L L IS++DP  
Sbjct: 303 DF----------KKAEQIFISYGA-RTNSDFFVHSGFVYMDNEQDGFKLRLGISKADPLQ 351

Query: 335 GDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLVALGGTD-AFLLESIFRNTIW 390
            +++++   N L   A  + +L     P    +L +LR+ ++   + A  ++S   N + 
Sbjct: 352 KERVEL--LNKLDLPAVGEFLLKPGTEPISDTLLAFLRVFSMRKEELAHWIQSDRVNDL- 408

Query: 391 GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL-RIAVGVRAG 449
            H+D  +    EE + + +    +  ++ + TT++ED QLL+     PR+ ++A+ +R  
Sbjct: 409 KHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLLETT--LPRIKKLAIQLRVT 466

Query: 450 EKRVLQ 455
           EKR+LQ
Sbjct: 467 EKRILQ 472


>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
          Length = 450

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 47/382 (12%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           I++A  L+ E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +   
Sbjct: 9   IALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTA 68

Query: 196 NEY---LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLI 245
            +Y    KV +     NK       T +D+ WA   + +R         SR+    L LI
Sbjct: 69  RQYAYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALI 125

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           PL D+ NH+ G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE 
Sbjct: 126 PLWDMCNHTNGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEF 175

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP--- 362
            +  GF    +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP   
Sbjct: 176 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISA 233

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSA 416
            +L +LR+  +  T+  L E +  ++        G+ + PVS  NE  +   + D     
Sbjct: 234 QLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLL 291

Query: 417 LSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELD 469
           L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E  
Sbjct: 292 LKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAMNREYYRKHMEERA 351

Query: 470 ELEYYQERRLKDLGLV-GEQGD 490
            L  Y+E    DLGL+ G  GD
Sbjct: 352 PLPRYEE---SDLGLLEGGVGD 370


>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
          Length = 468

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 47/382 (12%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           I++A  L+ E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +   
Sbjct: 27  IALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTA 86

Query: 196 NEY---LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLI 245
            +Y    KV +     NK       T +D+ WA   + +R         SR+    L LI
Sbjct: 87  RQYAYFYKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALI 143

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           PL D+ NH+ G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE 
Sbjct: 144 PLWDMCNHTNGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEF 193

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP--- 362
            +  GF    +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP   
Sbjct: 194 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISA 251

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSA 416
            +L +LR+  +  T+  L E +  ++        G+ + PVS  NE  +   + D     
Sbjct: 252 QLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLL 309

Query: 417 LSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELD 469
           L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++         E +++   E  
Sbjct: 310 LKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAAMNREYYRKHMEERA 369

Query: 470 ELEYYQERRLKDLGLV-GEQGD 490
            L  Y+E    DLGL+ G  GD
Sbjct: 370 PLPRYEE---SDLGLLEGGVGD 388


>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 507

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 106/395 (26%), Positives = 178/395 (45%), Gaps = 44/395 (11%)

Query: 88  FPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALF 141
           FPE G GL A R+I   E+ L VP K  +  ++   S +G L S  +       I++A  
Sbjct: 26  FPEEGFGLKATREIKAEELFLWVPRKLLMTVESAKGSVLGPLYSQDRILQAMGNITLAFH 85

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           L+ E+   +S W  Y+  LP   D+ ++++E+E+  LQ TQ +       +    +Y   
Sbjct: 86  LLCERADPNSFWLPYIKTLPNEYDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQYAYF 145

Query: 202 EEEI-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLI 251
            + I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ 
Sbjct: 146 YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMC 202

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH+  + T  Y  E        R    +L+     K+GEQ+ I Y   +SNAE  +  GF
Sbjct: 203 NHTNSLITTGYNLED------DRCECVALQD---FKSGEQIYIFYG-TRSNAEFVIHNGF 252

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYL 368
               +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +L
Sbjct: 253 FFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHVTEPPISAQLLAFL 310

Query: 369 RLVALGGTD-------AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
           R+  +   +          ++ IF     G+ + PVS  NE  +   +       L  + 
Sbjct: 311 RVFCMNEDELKGHLIGDHAIDKIF---TLGNSEFPVSWENEIKLWTFLEARASLLLKTYK 367

Query: 422 TTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           TT+E+D ++L+  ++     +A+ +R  EK +L++
Sbjct: 368 TTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEK 402


>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 567

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 143/527 (27%), Positives = 211/527 (40%), Gaps = 141/527 (26%)

Query: 42  RAFAATCSLHSASATTNPPTAQVETFWQWLRDQKV-VSPKSPIRPATFP----------- 89
           R   A   +  AS T++    ++E F +WL   +V VS K+    A+ P           
Sbjct: 56  RELRAVAEMPGASKTSS----RLEAFVKWLAANEVFVSDKATWGRASHPLVVAEQTLIES 111

Query: 90  --EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL----CSGLKPWISVALFLI 143
              G G +A+RDI   EV+ +VP       D VA     +L     +  + + ++A  L+
Sbjct: 112 EPAGRGFLARRDIQAGEVLFQVPFHLCFTKD-VAVRRFAALNVPELADEEEFFALATLLL 170

Query: 144 RE-------KKKEDSP---WRVYLDILP-----------------ECTDSTVFWSEEELV 176
            E       KK    P   W  YLDILP                 +  D+   W+E+E+ 
Sbjct: 171 YERGLDESWKKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQ 230

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF--PRPITLDDFLWAFGILRSRAF 234
            LQG+  L +   ++  V+ EY +  E +   +  +F       L+ FLWAFG+L SRA 
Sbjct: 231 WLQGSPTLLSARALRSKVEREYAEACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAV 290

Query: 235 SRLRGQN--LVLIPLADLINHSP------GITTEDYAYEIKGAGLFSRDLLFS-LRTPVP 285
           S L  +N  L L+P ADL NHS          T  + Y  + +    R   +     P  
Sbjct: 291 S-LPAENGMLALVPYADLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNS 349

Query: 286 VKAG-----------------------------EQVLIQYDLNKSNAELALDYGFIESKS 316
             AG                             EQV + Y   KSNAEL L YGF+   S
Sbjct: 350 DDAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYG-QKSNAELLLLYGFV---S 405

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETNGLG---ESAYFDIVLGR-------------TL 360
           DRN Y  ++E+  S       L  +E  G G       F +  GR               
Sbjct: 406 DRNPYN-SVEVCVS-------LSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRY 457

Query: 361 PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL-------PVSHANEELICRVVRDAC 413
           P  ++Q LR  +L   DA            G+ DL       PV+  NE      +  AC
Sbjct: 458 PLELMQLLRFASLTEQDAA-----------GYSDLEQIDVAQPVNRENEIAAKSALLQAC 506

Query: 414 KSALSGFHTTIEEDEQLLQGGN----LDPRLRIAVGVRAGEKRVLQQ 456
           K AL  + T+ +ED+  L+  +    L  + R++V +R  EKR+L++
Sbjct: 507 KIALQAYPTSADEDDAALKDKSMAQLLSRKQRLSVRLRRSEKRILER 553


>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
           2508]
 gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 459

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 183/400 (45%), Gaps = 28/400 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL+    V   S +  A FP+ G G+   R   + E +L +P    W      A   +G 
Sbjct: 4   WLKQSGAVGLDS-LELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYADPLLGP 62

Query: 128 LCSGLKPWISV----ALFLIREKKKEDS--PWRVYLDILPECTDSTVFWSEEELVELQGT 181
                +P +SV    A +++  K +E      R ++  LP    S++ ++E++L    GT
Sbjct: 63  ALRSAQPPLSVEDTLATYILFVKSRESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGT 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +++ +++++  +   + + +  LFP    T++D+ WA   + SRA  F    
Sbjct: 123 SLYTITKQLEQSIEDDHRALVVRLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDFVLAD 182

Query: 239 GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           G ++ +L P AD++NH+  +  + + Y+     L       S+      +AG+QV I Y 
Sbjct: 183 GNSIRLLAPFADMLNHTSEVK-QCHVYDPSSGNL-------SVLAGKDYEAGDQVFINYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
               N+ L   YGF+   +  ++Y L L      PFF  K  +  + GL  +A   + L 
Sbjct: 235 -PVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDSTATIPLTLT 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +L+YLR+  L   DA  L  I R       D  +S +NE  I R + ++    L
Sbjct: 294 DPLPKKVLRYLRIQRL---DASGLAVIARQQT-DATDGKISDSNEVEILRFLVESFNYLL 349

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQ 455
            GF T +E+ EQ L  G   P      A  V  GE+RVL+
Sbjct: 350 DGFGTPLEKLEQQLAEGVYSPGGNAWAAAHVSLGEQRVLR 389


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 183/400 (45%), Gaps = 29/400 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIG- 126
           WL++   V     +  A FP+ G G+   R   + E +L +P    W      A   +G 
Sbjct: 4   WLKESGAVGLDG-LELADFPDTGRGVKTLRPFKEGEKILTIPSSILWTVEHAYADPLLGP 62

Query: 127 SLCSGLKPW-----ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +LCS   P      ++  L  +R ++      R ++  LP    S++F++EEEL    GT
Sbjct: 63  ALCSVQPPLSPEDTLTTYLLFVRSRESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGT 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +++ +++++  +  ++ + ++ LFP    +++D+ WA   + SR   F    
Sbjct: 123 SLYTITKQLEQSIEDDHRALVMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRD 182

Query: 239 GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           G+++ +L P AD++NHS       + Y++    L       S+      + G+QV I Y 
Sbjct: 183 GKSMRLLAPFADMLNHS-SEAKPCHVYDVSSGNL-------SVLAGKDYEPGDQVFINYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
            +  N+ L   YGF+   +  + Y L L      PF+  K  +  + GL  ++   + L 
Sbjct: 235 -SVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPLTLT 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +L+YLR+     +D  L     +N      D  VS +NE  I + + ++    L
Sbjct: 294 DPLPKNVLRYLRIQRADASD--LAAMALQN---AKADEKVSDSNEVEILQFLVESFGHLL 348

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQ 455
            GF T +E+ E+ L  G   P      A  V  GE+RVL+
Sbjct: 349 GGFGTPLEKLEEQLAQGVYSPGGNAWAAAHVSLGEQRVLR 388


>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
           latipes]
          Length = 606

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 106/400 (26%), Positives = 177/400 (44%), Gaps = 31/400 (7%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL   RDI   E+ L VP K  +  ++   S +G + S  +       +++AL L+ 
Sbjct: 104 EGYGLRTTRDIKAEELFLWVPRKMLMTVESAQNSVLGPIYSQDRILQAMGNVTLALHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+    S W  Y+  LP+  D+ +++ +E++  L GTQ +   L   +    +Y    K+
Sbjct: 164 ERGDPASFWSPYIRSLPQEYDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQYAYFYKL 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFG--ILRSRAFSRLRGQ--NLVLIPLADLINHSPGI 257
            +     +K       + DD+ WA    + R      + G    L LIPL D+ NH+ G+
Sbjct: 224 VQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHTNGL 283

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
            T  Y  E        R    +L+     K  EQ+ I Y   +SNAE  +  GF    + 
Sbjct: 284 ITTGYNLED------DRCECVALQD---YKKNEQIYIFYG-TRSNAEFVIHNGFFFQDNA 333

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLVALG 374
            +   + L +S+S+  +  K ++    G+  S  F   L    PP    +L +LR+  + 
Sbjct: 334 HDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVF--ALHCNDPPISAQLLAFLRVFCMT 391

Query: 375 GTD--AFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
             +   +LL     N I+  G+ D PVS  NE  +   +       L  + TT EED  +
Sbjct: 392 EEELKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSI 451

Query: 431 LQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDE 470
           L+  +L    R+AV +R  EK++L+      + + L+  E
Sbjct: 452 LEKPDLSLHTRLAVQLRLAEKQILENASASGRAKRLQFQE 491


>gi|414885392|tpg|DAA61406.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 171

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/121 (48%), Positives = 75/121 (61%), Gaps = 5/121 (4%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMK 112
           A+A+T    + +E F +WL           I PA  PEGLGLVA RD+ + EVV EVP K
Sbjct: 44  AAASTTATVSAMEDFRRWLASHGAGDGGKAI-PAAVPEGLGLVAARDLPRGEVVAEVPKK 102

Query: 113 FWINPDTVAASEIGSLC---SGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTV 168
            W++ D VAAS+IG  C    GL+PW++VAL L+ E  +  DSPW  YL ILP  TDST+
Sbjct: 103 LWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGADSPWAPYLAILPRQTDSTI 162

Query: 169 F 169
           F
Sbjct: 163 F 163


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 85/291 (29%), Positives = 141/291 (48%), Gaps = 20/291 (6%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISV------ALFLI 143
           G G+   R   + E +L +P    W      A S +G      +P +SV       +  +
Sbjct: 25  GRGVKTLRHFKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFV 84

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           R ++   +  R +L  LP+   S++F++E+EL    GT L + T  +   +Q++Y  +  
Sbjct: 85  RSRESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKALVV 144

Query: 204 EIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNL-VLIPLADLINHSPGITT 259
            +++ ++ LFP    T++D+ WA   + SRA  F    G+++ +L P AD++NHS  +  
Sbjct: 145 RLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFADMLNHSSDV-R 203

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
           + +AY+     L       S+      KAG+QV I Y    +N  L L YGFI   +  +
Sbjct: 204 QCHAYDPLSGNL-------SILAGKDYKAGDQVFIYYGSIPNNRLLRL-YGFIIPSNPND 255

Query: 320 AYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRL 370
            Y L LE     PFF  K  + E+ GL  ++   + L   LP  +LQYLR+
Sbjct: 256 NYELVLETHPMAPFFEQKHKLWESAGLDLTSTISLTLTDPLPKNVLQYLRI 306


>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
 gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 459

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 183/400 (45%), Gaps = 28/400 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL+    V   S +  A FP+ G G+   R   + E +L +P    W      A   +G 
Sbjct: 4   WLKQSGAVGLDS-LELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYADPLLGP 62

Query: 128 LCSGLKPWISV----ALFLIREKKKEDS--PWRVYLDILPECTDSTVFWSEEELVELQGT 181
                +P +SV    A +++  K +E      R ++  LP    S++ ++E++L    GT
Sbjct: 63  ALRSAQPPLSVEDTLATYILFVKSRESGYDGQRSHIAALPASYSSSILFAEDDLEACAGT 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +++ +++++  +   + + +  LFP    T++D+ WA   + SRA  F    
Sbjct: 123 SLYTITKQLEQSIEDDHRALVVRLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDFVLAD 182

Query: 239 GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           G ++ +L P AD++NH+  +  + + Y+     L       S+      +AG+QV I Y 
Sbjct: 183 GNSIRLLAPFADMLNHTSEVK-QCHVYDPSSGTL-------SVFAGKDYEAGDQVFINYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
               N+ L   YGF+   +  ++Y L L      PFF  K  +  + GL  +A   + L 
Sbjct: 235 -PVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDSTATIPLTLT 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +L+YLR+  L   DA  L  I R       D  +S +NE  I R + ++    L
Sbjct: 294 DPLPKKVLRYLRIQRL---DASGLAVIARQQT-DATDGKISDSNEVEILRFLVESFNYLL 349

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQ 455
            GF T +E+ EQ L  G   P      A  V  GE+RVL+
Sbjct: 350 DGFGTPLEKLEQQLAEGVYSPGGNAWAAAHVSLGEQRVLR 389


>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
 gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
          Length = 499

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 115/425 (27%), Positives = 192/425 (45%), Gaps = 76/425 (17%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLC--SGLKPWISVALFLIREKKKE 149
           GLVA ++I K E +L VP    I  D+     E+G +   + +  W  +A +LI E   E
Sbjct: 97  GLVALKNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLE 156

Query: 150 DSP-WRVYLDILPECTDSTVFWSEEEL--------VELQGTQLLSTTLGVKEYVQNEYLK 200
            S  W  Y+  LP    S ++W+  EL        +  +  Q ++  +G        Y  
Sbjct: 157 GSSRWSSYIAALPRQPYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGT-------YND 209

Query: 201 VEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRL-----RGQNLVLIPLADLINHS 254
           + + I   +  LFP  +  ++ FLW+FGIL    FSRL       + + L+P AD++NHS
Sbjct: 210 LRDRIFSRHSDLFPEEVYNIETFLWSFGIL----FSRLVRLPSMDEKVALVPWADMLNHS 265

Query: 255 PGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI- 312
           P + T  D+    +G  +F+ D  +        + GEQV I Y   KS+ EL L YGF+ 
Sbjct: 266 PEVETFLDFDKSSQGI-VFTTDRSY--------QPGEQVFISYG-KKSSGELLLSYGFVP 315

Query: 313 -ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTL---------- 360
            E  +  ++  L + + +SD  + +KL   + NGL ES  F + V G  +          
Sbjct: 316 KEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNGLSESESFPLRVTGWPVELMAYAFLVV 375

Query: 361 -PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
            PP M Q+   +A+  ++        +++    L  P     EE   + + D C+  ++ 
Sbjct: 376 SPPDMSQHFEEMAVAASN--------KSSSKPRLSYP---DLEEQALQFILDCCEPNIAK 424

Query: 420 FHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI--DEIFKERELELDELEYYQER 477
           +        + L+GG   P + +    +A    +L+Q+  D    ER + L   +Y   R
Sbjct: 425 Y-------TKYLEGGGGSPEVSMNAK-QANRTLLLKQVARDLCISERRI-LYRTQYILRR 475

Query: 478 RLKDL 482
           RL+D+
Sbjct: 476 RLRDM 480


>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 504

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 118/442 (26%), Positives = 195/442 (44%), Gaps = 49/442 (11%)

Query: 48  CSLHSASATTNPPTAQVETFW--------------QWLRDQKVVSPKSPIRPATFPEGLG 93
           CS+ +   TT       E  W               WL D  +   K  I      E  G
Sbjct: 44  CSVSAGETTTRSVEEAPEISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGER-G 102

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAAS-EIGSLCS--GLKPWISVALFLIREKK-KE 149
           LVA +++ K E +L VP    I+ D+   + E G +     +  W  +A +LI E   ++
Sbjct: 103 LVASQNLRKGEKLLFVPPSLVISADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQK 162

Query: 150 DSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
            S W  Y+  LP    S ++W+  EL + L+ +Q+    +     V   Y  +   I   
Sbjct: 163 SSRWYNYISALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSK 222

Query: 209 NKQLFPRPITLDD-FLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGITT-EDYAY 264
           +  LFP+ +  D+ F W+FGIL SR   RL   +    L+P AD++NH+  + T  DY  
Sbjct: 223 HPHLFPKEVFNDETFKWSFGILFSR-LVRLPSMDGRFALVPWADMLNHNCEVETFLDYDK 281

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYT 322
             KG  +F+ D         P + GEQV I Y  NKSN EL L YGF+  E  +  ++  
Sbjct: 282 SSKGV-VFTTDR--------PYQPGEQVFISYG-NKSNGELLLSYGFVPREGTNPSDSVE 331

Query: 323 LTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG---GTDAF 379
           L L + ++D  + +KLD  + +GL     F + +    P  ++ Y  LV      G +  
Sbjct: 332 LALSLRKNDKCYKEKLDALKKHGLSTPQCFPVRIT-GWPMELMAYAYLVVSPPDMGNNFE 390

Query: 380 LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE----DEQLLQGGN 435
            +     N      DL      E+ + + + D+C++++S +   ++E    D  +     
Sbjct: 391 EMAKAASNKTSTKTDLKYPEIEEDAL-QFILDSCETSISKYSRFLKESGSMDLDITSPKQ 449

Query: 436 LDPRL---RIAVGVRAGEKRVL 454
           L+ +    ++AV +   E+R+L
Sbjct: 450 LNRKAFLKQLAVDLSTSERRIL 471


>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 455

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/399 (26%), Positives = 181/399 (45%), Gaps = 28/399 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL+D         I  A F   G G+ A R   + E +L +P    W      A   +G 
Sbjct: 4   WLKDSGARGVDG-IEVANFAVTGSGVKALRSFKEGERILTIPSACLWTVEKAYADPLLGP 62

Query: 128 LCSGLKPWISVA------LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +    +P +SV       L  +R +       R ++  +P+   +++F++E+EL    G+
Sbjct: 63  VLRSAQPPLSVEDALAVYLLFVRSRTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGS 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFS-RLRG 239
            L + T  +++ V+++Y ++   ++  ++ LFP    T++D+ WA   + SRA    + G
Sbjct: 123 SLYALTRQLEQRVRDDYRQLLVPLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSG 182

Query: 240 QNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
              V  + PLAD++NHSP +  + +AY+     L       S+      + G+QV I Y 
Sbjct: 183 TTSVRLVAPLADMLNHSPDV-KQCHAYDPTSGDL-------SILAAKDYQVGDQVFIYYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
              +N  L L YGF+   +  ++Y L L+ S   P +  K  +    GL  +    + + 
Sbjct: 235 SVPNNRLLRL-YGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTVK 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +L+YLR+  L   D   +  I    + G  D  VS  NE  + + + D+  S L
Sbjct: 294 DPLPNNVLRYLRIQRL---DESNITDITLQLVNG-TDGKVSDGNEMQVLQFLVDSIGSLL 349

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVL 454
            GF   +E+ E  L  G+         A  V AGE+RVL
Sbjct: 350 EGFGIPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVL 388


>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
 gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
          Length = 403

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 110/408 (26%), Positives = 180/408 (44%), Gaps = 57/408 (13%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS---GLK--PWISVALFLIR 144
           +GLGL A  D+ +N+V + VP K  ++  T   S +G L S   GL+  P + +AL ++ 
Sbjct: 20  QGLGLRATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLC 79

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG-----VKEYVQNEYL 199
           E+  EDS W  YL+ILP    + +++S ++++ LQG+  +   L      VK+YV   + 
Sbjct: 80  ERLHEDSTWAPYLNILPRSYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQYVY--FF 137

Query: 200 KVEEEIILPNKQLFP--RPITLDDFLWAFGILRSRAFSRLRGQN---LVLIPLADLINHS 254
           ++ +  I P     P     T DDF WA   + +R        N     LIP+ D+ NH 
Sbjct: 138 RLVQ--INPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALIPMWDMCNHC 195

Query: 255 PGITT---EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
            G  T   +D   E+K        L F      P +AG+QV I Y   ++NA+     GF
Sbjct: 196 NGPFTTGFDDSTKEVKS-------LAFK-----PTRAGDQVFIFYG-RRNNADRLFHNGF 242

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP--PAMLQYLR 369
           + ++++ +   + L +S++D  +  K  I    GL  S     VL    P  P +  +LR
Sbjct: 243 VYTEAEEDWVNIQLGVSKNDRLYAMKAQILAMVGLDASGRSYRVLRGPEPISPELRIFLR 302

Query: 370 LVALGGTDAFLLESIFRNTIWGHLDLPV-------------SHANEELICRVVRDACKSA 416
           + ++   +        +  ++    LPV             S  NE  +        +  
Sbjct: 303 VFSMNTGE-------LKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLI 355

Query: 417 LSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           L  + TT +EDE LL   +     R  + +R  E+ +L    E  K R
Sbjct: 356 LGQYKTTKQEDEALLSRDDNTLHTRNCIRLRMSERDILVSALEHAKAR 403


>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
          Length = 504

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 113/407 (27%), Positives = 188/407 (46%), Gaps = 35/407 (8%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
            WL D  +   K  I      E  GLVA +++ K E +L VP    I+ D+    +E G 
Sbjct: 79  NWLSDSGLPPQKMAIDRVDIGER-GLVASQNLRKGEKLLFVPPSLVISADSEWTNAEAGE 137

Query: 128 LCS--GLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQL 183
           +     +  W  +A +LI E   ++ S W  Y+  LP    S ++W+  EL + L+ +Q+
Sbjct: 138 VMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQPYSLLYWTRTELDMYLEASQI 197

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD-FLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  +   I   + QLFP+ +  D+ F W+FGIL SR   RL   + 
Sbjct: 198 RERAIERITNVVGTYEDLRSRIFSKHPQLFPKEVFNDETFKWSFGILFSR-LVRLPSMDG 256

Query: 242 -LVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
              L+P AD++NH+  + T  DY    KG  +F+ D         P + GEQV I Y  N
Sbjct: 257 RFALVPWADMLNHNCEVETFLDYDKSSKGV-IFTTDR--------PYQPGEQVFISYG-N 306

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
           KSN EL L YGF+  E  +  ++  L L + ++D  + +KLD  + +GL     F + + 
Sbjct: 307 KSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVRIT 366

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFR---NTIWGHLDLPVSHANEELICRVVRDACK 414
              P  ++ Y  LV          E + +   N      DL      E+ + + + D+C+
Sbjct: 367 -GWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPEIEEDAL-QFILDSCE 424

Query: 415 SALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVL 454
           +++S +   ++E    D  +     L+ +    ++AV +   E+R+L
Sbjct: 425 TSISKYSRFLKESGSMDLDITSPKQLNRKAFLKQLAVDLSTSERRIL 471


>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 504

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 113/407 (27%), Positives = 188/407 (46%), Gaps = 35/407 (8%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
            WL D  +   K  I      E  GLVA +++ K E +L VP    I+ D+    +E G 
Sbjct: 79  NWLSDSGLPPQKMAIDRVDIGER-GLVASQNLRKGEKLLFVPPSLVISADSEWTNAEAGE 137

Query: 128 LCS--GLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQL 183
           +     +  W  +A +LI E   ++ S W  Y+  LP    S ++W+  EL + L+ +Q+
Sbjct: 138 VMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQPYSLLYWTRTELDMYLEASQI 197

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD-FLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  +   I   + QLFP+ +  D+ F W+FGIL SR   RL   + 
Sbjct: 198 RERAIERITNVVGTYEDLRSRIFSKHPQLFPKEVFNDETFKWSFGILFSR-LVRLPSMDG 256

Query: 242 -LVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
              L+P AD++NH+  + T  DY    KG  +F+ D         P + GEQV I Y  N
Sbjct: 257 RFALVPWADMLNHNCEVETFLDYDKSSKGV-VFTTDR--------PYQPGEQVFISYG-N 306

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
           KSN EL L YGF+  E  +  ++  L L + ++D  + +KLD  + +GL     F + + 
Sbjct: 307 KSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVRIT 366

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFR---NTIWGHLDLPVSHANEELICRVVRDACK 414
              P  ++ Y  LV          E + +   N      DL      E+ + + + D+C+
Sbjct: 367 -GWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPEIEEDAL-QFILDSCE 424

Query: 415 SALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVL 454
           +++S +   ++E    D  +     L+ +    ++AV +   E+R+L
Sbjct: 425 TSISKYSRFLKESGSMDLDITSPKQLNRKAFLKQLAVDLSTSERRIL 471


>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
          Length = 1753

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 184/404 (45%), Gaps = 28/404 (6%)

Query: 65   ETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAA 122
            E+   WL+    V   + I  A FP  G G+ A R   K E +L +P    W      A 
Sbjct: 883  ESMEAWLKHSGAVGVDA-IEVADFPVTGRGVKALRSFKKGERILTIPSACLWTAEAARAD 941

Query: 123  SEIGSLCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELV 176
              +G +    +P +SV       L  ++ +       R+++  +P+   +++F++E+EL 
Sbjct: 942  PLLGPVLRSAQPPLSVEDTLAIHLLFVKSRTAGYEGQRLHIAAMPQRHSASIFFAEDELQ 1001

Query: 177  ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA-- 233
              +G+ L + T  +++ VQ+++ ++  +++  ++ LFP    T++D+ WA   + SRA  
Sbjct: 1002 VCEGSSLHTLTTQLEQRVQDDFRQLLVQLLSQHRDLFPLDQFTIEDYKWALCTIWSRAMD 1061

Query: 234  FSRLRGQNLVLI-PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
            F+     ++ L+ PLAD++NHS  +  + +AY+     L       S+      + G+Q+
Sbjct: 1062 FAVSDTTSVRLVAPLADMLNHSLDV-KQCHAYDPTSGDL-------SILAAKDYQVGDQI 1113

Query: 293  LIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYF 352
             I Y    +N  L L YGF+   +  ++Y L L+ S   P +  K  +    GL  +   
Sbjct: 1114 FIYYGSVPNNRLLRL-YGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTI 1172

Query: 353  DIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDA 412
             +     LP  +L+YLR   L   D   +     N   G     V+  NE  + + + D+
Sbjct: 1173 PLTAKHPLPKNVLRYLRTQRLDAADVADMTLQLLNGTDGK----VNDGNEIQVLQFLIDS 1228

Query: 413  CKSALSGFHTTIEEDEQLLQGG--NLDPRLRIAVGVRAGEKRVL 454
              S L GF   +E+ E  L GG          A  V AGE+ +L
Sbjct: 1229 LGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGIL 1272


>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
          Length = 516

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 132/452 (29%), Positives = 206/452 (45%), Gaps = 48/452 (10%)

Query: 62  AQVETFWQWLRDQKV---------VSPKS-PIRPATFPE------GLGLVAQRDIAKNEV 105
           A+ +    WL D  V         V P S  +   TF E      G GL+A+R I ++  
Sbjct: 77  ARNDALMAWLTDNDVWVSELSGWNVPPHSMALATTTFDELEGEDSGRGLLARRAITQDAE 136

Query: 106 VLEVPMKFWIN-PDTVAASEI-GSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPE 162
           ++ +P++  +     + A E+ GSL      +I++AL LI E+ K   S W  Y+ ILP 
Sbjct: 137 LIRLPVRLCMTKASALKARELRGSLNDDTNEYIAIALLLILERSKGSRSFWSEYIAILPT 196

Query: 163 CTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLD 220
             D  +T  W  EEL  L+G+   S T  +   ++ E+  V E     N  L P   T +
Sbjct: 197 NEDVGATFTWPAEELAYLEGSPAASATASMMAKLRAEHAAVLE----GNSALDPEIFTFE 252

Query: 221 DFLWAFGILRSRAFSRLR----GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD- 275
            + WAF  L SRA  RL+    G+ L ++P  D INHSP  ++   A E+  A  +    
Sbjct: 253 AWQWAFTNLFSRAI-RLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKE 311

Query: 276 ---LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
              +LF+ R     K  EQV I Y   KSNA+L L YGF   ++  N+  L +  S+ D 
Sbjct: 312 DEVVLFADRA---YKKFEQVFISYG-PKSNADLLLLYGFALDRNPFNSVDLAVGASKDDA 367

Query: 333 FFGDKLDIAETNGLG-ESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG 391
            +  K   A   G    SA F +   R  P  ++Q+LR+     T+  L      +    
Sbjct: 368 LYDAKERFARGAGRDVSSAAFPLYADR-FPDELVQFLRMAC--ATEDHLGARPLDDPD-N 423

Query: 392 HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEK 451
           ++D+ +S  NE  +   +RDAC +A++ +     +D   +    L    R+A  +   EK
Sbjct: 424 YVDI-LSLDNELAVLDTIRDACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEK 479

Query: 452 RV-LQQIDEIFKERELELDELEYYQERRLKDL 482
           R+ L+ I  + ++    L    +  ERR  D+
Sbjct: 480 RILLKTIAAVERKSNELLAAPSFAFERRQPDV 511


>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 119/422 (28%), Positives = 192/422 (45%), Gaps = 70/422 (16%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLC--SGLKPWISVALFLIREKKKE 149
           GLVA ++I K E +L VP    I  D+     E+G +   + +  W  +A +LI E   E
Sbjct: 101 GLVALKNIRKGENLLFVPPSLVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLE 160

Query: 150 DSP-WRVYLDILPECTDSTVFWSEEEL--------VELQGTQLLSTTLGVKEYVQNEYLK 200
            S  W  Y+  LP    S ++W+  EL        +  +  Q ++  +G        Y  
Sbjct: 161 GSSRWISYIAALPRQPYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGT-------YND 213

Query: 201 VEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGI 257
           + + I   +  LFP  +  ++ FLW+FGIL SR   RL   +  +VL+P AD++NHSP +
Sbjct: 214 LRDRIFSRHPDLFPEEVYNIETFLWSFGILFSR-LVRLPSMDGRVVLVPWADMLNHSPEV 272

Query: 258 TT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ES 314
            T  D+    +G  +F+ D  +        + GEQV I Y   KS+ EL L YGF+  E 
Sbjct: 273 ETFLDFDKSSRGI-VFTTDRSY--------QPGEQVFISYG-KKSSGELLLSYGFVPKEG 322

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTL-----------PP 362
            +  ++  L + + +SD  + +KL   + NGL ES  F + V G  +           PP
Sbjct: 323 TNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESESFPLRVTGWPVELMAYAFLVVSPP 382

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            M Q    +A   ++        + +    L+ P     EE   + + D C+S       
Sbjct: 383 DMSQCFEEMAAAASN--------KTSSKPGLNYP---DLEEQALQFILDCCES------- 424

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI--DEIFKERELELDELEYYQERRLK 480
            IE+  + L+GG   P + +    +A    +L+Q+  D    ER + L   +Y   RRL+
Sbjct: 425 NIEKYTKYLEGGAGSPEVPMNAK-QANRTLLLKQLARDLCISERRI-LYRTQYILRRRLR 482

Query: 481 DL 482
           D+
Sbjct: 483 DM 484


>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
           florea]
          Length = 483

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/411 (24%), Positives = 191/411 (46%), Gaps = 45/411 (10%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F  WL++       + +  A FP   LGL A+R+  +NE++L +P     +    AA E+
Sbjct: 84  FINWLKENGANVDGASV--AEFPGYDLGLKAERNFLENELILRIPRGLIFSIHN-AAPEL 140

Query: 126 GSLCSG----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
            +L +       P +++A+ L+ E+ KE+S W+ YLDILP    + ++ +  +++EL+G+
Sbjct: 141 ITLQNDPLIQHMPQVALAIALLIERHKENSKWKPYLDILPTTYTTVLYMTAADMIELKGS 200

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNK---QLFPRPITLDDFLWAFGILRSRAFSRLR 238
             L   L     +  +Y    +     N     +     T + + WA   + +R      
Sbjct: 201 PTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAILRDVFTYERYCWAVSTVMTR------ 254

Query: 239 GQNLV-----------LIPLADLINHSPGITTEDYAYEIKGAGLFS-RDLLFSLRTPVPV 286
            QNL+           LIP+ D+ NH  G  T D+         ++ RD           
Sbjct: 255 -QNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNATSNYCECYALRDF---------- 303

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           K GEQ+ I Y   ++N++  +  GF+  ++ ++ + L L IS++D    +++++     L
Sbjct: 304 KKGEQIFISYG-PRTNSDFFVHSGFVYMENKQDGFKLRLGISKADSLQKERIELLNKLDL 362

Query: 347 GESAYFDIVLG-RTLPPAMLQYLRLVALGGTD-AFLLESIFRNTIWGHLDLPVSHANEEL 404
                F + LG   +   +L +LR+ ++   + A  + S   N +  H+D  +    EE 
Sbjct: 363 PTVGEFLLKLGTEPISDLLLAFLRVFSMRKAELAHWIRSDRVNDL-KHMDCALETVVEEN 421

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           + + +    +  ++ + TT++ED QLL+   L    ++ + +R  EKR+LQ
Sbjct: 422 VRKFLLTRLQLLIANYPTTLKEDLQLLE-TTLPQIKKLTIQLRVTEKRILQ 471


>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
          Length = 581

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 118/442 (26%), Positives = 198/442 (44%), Gaps = 60/442 (13%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDT---- 119
           +T  +WL+  K   PK  I+   + E   G+ A++ I   E++L +P    I  +     
Sbjct: 137 KTLLEWLKHGKAQFPK--IKIECYSESYRGVNAKQKINAKELILFIPKSHMITLEMAKET 194

Query: 120 -VAASEIGSLCSGLKPWIS-VALFLIREKKKEDSPWRVYLDILPECTDS-TVFWSEEELV 176
            VA   I      L P  S ++ FL++EK + +S W+ YLDILP+   S  +F++  +L 
Sbjct: 195 PVAKKMIQFRLDLLSPKHSFLSTFLLQEKSRPNSFWKPYLDILPQSYPSFPIFFNNYDLE 254

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR 236
            LQG+  L         ++ +Y  +    + P    +    +  +F WA    R  A SR
Sbjct: 255 WLQGSPFLKQINDKLSDLKKDYNDICN--VAPEFSQY----SFYEFCWA----RMTASSR 304

Query: 237 LRGQNL------VLIPLADLINHS-PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           + G N+        +PLAD++NH  P +T+  Y+ E +G         F + T   +  G
Sbjct: 305 IFGINIKGVKTDAFVPLADMLNHKRPKLTSWCYSEEKQG---------FIIETDEKIDRG 355

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLD---------- 339
           + +   Y   K N+   L+YGF+   +D N   +T+    +DP    K D          
Sbjct: 356 QMIFDSYG-RKCNSRFLLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPK 414

Query: 340 ----IAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDA--FLLESIFRNTIWGHL 393
               I +T+G+ E  +F   L  T+   M  Y+R + +       FLL       I    
Sbjct: 415 TFRLIMDTDGINEITHF---LEATVMEFM-SYIRFLVIRDQTQLQFLLNERESKYIKPTK 470

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRV 453
             P+   NE  +  ++R  C  +LS + TT+E+D+++LQ  +L    R  + +R GEK +
Sbjct: 471 IQPLGIHNELDMWDLIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEI 530

Query: 454 LQ---QIDEIFKERELELDELE 472
           L+   Q  E  K+     ++LE
Sbjct: 531 LKFYYQFSEKMKQLLSNFNQLE 552


>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
          Length = 643

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 106/414 (25%), Positives = 197/414 (47%), Gaps = 50/414 (12%)

Query: 52  SASATTNPPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVP 110
           S S +T P    +++F QWL ++   ++P   I     P G  +VA  +I K+E+++E+P
Sbjct: 199 STSISTTP--EDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRSMVANTNIKKDEILVEIP 256

Query: 111 MKFWINPDTVAASEIGSLCSGLKPWI---------SVALFLIREKKKEDSPWRVYLDILP 161
               + P ++    I +L   +  W+           A+ +I     EDS W  Y+ ILP
Sbjct: 257 KGIMMTPKSM----IKNLPRFIIDWMDEMKISRTDQQAIAIIYSILHEDSYWYEYVSILP 312

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTT----LGVKEYV--------------QNEYLKVEE 203
           +   +TV+++ EE+ +LQ + +   T     GV  +               +++  K + 
Sbjct: 313 KQFTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDSTKTKT 372

Query: 204 EIILPNKQLFPRP-ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY 262
           +  L   + F     TLD F WA G + SRAFS L  ++  ++PLAD+ N    I+    
Sbjct: 373 KSQLDAMKEFKDDRYTLDQFKWALGCVWSRAFS-LSEEDGGMVPLADMFNADTVISRSKV 431

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS--NAELALDYGFI-ESKSDRN 319
             +I  +   S  L+++      ++AGEQ+   Y + K+  + ++ +DYGFI E  S  +
Sbjct: 432 HPKISAS---SPSLVYTASQ--DIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSAD 486

Query: 320 AYTLTLE-ISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDA 378
           +  +T+  I  S+P +  K  + ++NG+ ES  F I   + L   +  + R+ ++   ++
Sbjct: 487 STIVTVAPIPPSEPLYDLKRHLMQSNGI-ESEEFTITKNK-LAKELFLFARIKSINKKES 544

Query: 379 FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
               + F +T    +  P    NE+   R++ +     L  + TTI++D Q+L+
Sbjct: 545 DQASAHFMSTQRHSMLNP---RNEKAALRLLSNLISRHLDAYQTTIDQDNQILK 595


>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 590

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 110/422 (26%), Positives = 188/422 (44%), Gaps = 42/422 (9%)

Query: 55  ATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKF 113
           A  +P      T   WL D K  S    ++   + +   G+ A+R +   E +L +P   
Sbjct: 142 AEDDPELVNHNTMINWLLDGK--SEFDNLKLQWYSKNYRGVHARRKVYNKETILFIPKSH 199

Query: 114 WINPDT-----VAASEIGSLCSGLKPWIS-VALFLIREKKKEDSPWRVYLDILP-ECTDS 166
            I  +      VA   I +  + L P  S ++ FL++E+K ++S W+ YLDILP +    
Sbjct: 200 LITLEMAKETDVAKKIIAAKLNLLSPKHSFLSTFLLQERKNKESKWKPYLDILPSDYNQF 259

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAF 226
            +F+SE++L  L+G+   +     K  ++ +Y  +    + P    +    T +DF WA 
Sbjct: 260 PIFFSEDDLSWLKGSPFQNQVREKKADIKRDYDDICS--VAPEFAEY----TFEDFCWAR 313

Query: 227 GILRSRAFS-RLRGQNL-VLIPLADLINH-SPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
               SR F  ++  Q     +PLAD++NH  P  T+  Y  + +G         F ++  
Sbjct: 314 MTASSRVFGLQINEQKTDAFVPLADMLNHRRPKQTSWQYDDQREG---------FVIQAL 364

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAET 343
             +  GEQV   Y   K N+   L+YGFI   +D N   L L     DP    K ++   
Sbjct: 365 EDIPRGEQVYDSYG-RKCNSRFFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM-G 422

Query: 344 NGLGESAYFDIV--LGRTLPPAMLQYLRLVAL-GGTDAFLLESIFRNTIWGHLDL----- 395
             + E   + I+           + YLR + +   +   +L S+         +L     
Sbjct: 423 GDVPEFKVYRILENYQEQNVSEFMSYLRFILIRDNSKLLMLSSLHEQQTENSENLSGYKP 482

Query: 396 ----PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEK 451
               P+S  NE  +   + + C++++S ++TT++ED++LL   NL    R  V +R+GEK
Sbjct: 483 QKTPPISIQNETDMWVRISNMCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEK 542

Query: 452 RV 453
            V
Sbjct: 543 EV 544


>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
 gi|194699272|gb|ACF83720.1| unknown [Zea mays]
 gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 118/422 (27%), Positives = 191/422 (45%), Gaps = 70/422 (16%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLC--SGLKPWISVALFLIREKKKE 149
           GLVA ++I K E +L VP    I  D+     E+G +   + +  W  +A +LI E   E
Sbjct: 101 GLVALKNIRKGEKLLFVPPSLVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLE 160

Query: 150 DSP-WRVYLDILPECTDSTVFWSEEEL--------VELQGTQLLSTTLGVKEYVQNEYLK 200
            S  W  Y+  LP    S ++W+  EL        +  +  Q ++  +G        Y  
Sbjct: 161 GSSRWISYIAALPRQPYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGT-------YND 213

Query: 201 VEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGI 257
           + + I   +  LFP  +  ++ FLW+FGIL SR   RL   +  + L+P AD++NHSP +
Sbjct: 214 LRDRIFSRHPDLFPEEVYNIETFLWSFGILFSR-LVRLPSMDGRVALVPWADMLNHSPEV 272

Query: 258 TT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ES 314
            T  D+    +G  +F+ D  +        + GEQV I Y   KS+ EL L YGF+  E 
Sbjct: 273 ETFLDFDKSSRGI-VFTTDRSY--------QPGEQVFISYG-KKSSGELLLSYGFVPKEG 322

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTL-----------PP 362
            +  ++  L + + +SD  + +KL   + NGL ES  F + V G  +           PP
Sbjct: 323 TNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESESFPLRVTGWPVELMAYAFLVVSPP 382

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            M Q    +A   ++        + +    L+ P     EE   + + D C+S       
Sbjct: 383 DMSQCFEEMAAAASN--------KTSSKPGLNYP---DLEEQALQFILDCCES------- 424

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI--DEIFKERELELDELEYYQERRLK 480
            IE+  + L+GG   P + +    +A    +L+Q+  D    ER + L   +Y   RRL+
Sbjct: 425 NIEKYTKYLEGGAGSPEVPMNAK-QANRTLLLKQLARDLCISERRI-LYRTQYILRRRLR 482

Query: 481 DL 482
           D+
Sbjct: 483 DM 484


>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
           24927]
          Length = 468

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 105/400 (26%), Positives = 186/400 (46%), Gaps = 28/400 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL++   V     ++ A FP  G GL + R   + E +L +P    W      A S I  
Sbjct: 4   WLKESGAVGLDD-LKLADFPATGRGLGSLRHFKEGERILTIPSSILWTVEHAYADSIIRP 62

Query: 128 LCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +   ++  +SV       +  +R ++   +  R +++ LP    S++F++++EL    G+
Sbjct: 63  VLQSMQGALSVDDTLAIYILFVRSRESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGS 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +K+ +Q++Y  + E +      +F     T++D+ WA   + SRA  F +  
Sbjct: 123 SLYTITKQLKQQIQDDYRTLVERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQPD 182

Query: 239 GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           G+++ +L P AD++NHS  +  + + Y+     L       S+      + G+QV I Y 
Sbjct: 183 GKSIRLLAPFADMLNHSSDVK-KCHVYDTSSGDL-------SILAGKDYEPGDQVFINYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
              +N  L L YGF+   +  ++Y L L      PFF  K  +  + GL   +   + L 
Sbjct: 235 SIPNNRLLRL-YGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISLSLN 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP ++LQYLR+     +D   L  I    I    D  +S++NEE + + + ++    L
Sbjct: 294 DPLPKSVLQYLRIQRADESD---LAIIALQQI-DATDKILSNSNEEKVLQALIESFCELL 349

Query: 418 SGFHTTIEEDEQLLQGGNLDPRLRI--AVGVRAGEKRVLQ 455
             F T +E+ ++ L  G   P      A  V  GE+RVL+
Sbjct: 350 HNFGTQMEKLDEQLAEGFYPPGTNAWSAAHVSLGEQRVLR 389


>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 454

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 179/379 (47%), Gaps = 28/379 (7%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKF-WINPDTVAASEIGSLCSGLKPWISV-------ALFL 142
           G G+ A R   + E +L +P    W      A   +G L S  +P +SV        LF+
Sbjct: 25  GRGIRALRRFEEKEKILTIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLATYILFI 84

Query: 143 IREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
              K   D P + ++  LP    S++F+++ EL    G+ L +TT  +   ++ +Y  + 
Sbjct: 85  RARKSGYDGP-QSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDLV 143

Query: 203 EEIILPNKQLFPR-PITLDDFLWAFGILRSRAFS-RLR-GQNLVLI-PLADLINHSPGIT 258
             +   ++ +FP    T+DD+ WA   + SRA   +LR G+++ L+ P AD++NHSP + 
Sbjct: 144 ARLFGRHRDVFPSDKFTIDDYKWALCTVWSRAMDFKLRDGESIRLMAPFADMLNHSPDVG 203

Query: 259 TEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR 318
            + + Y+ +   L       S+      + G+QV I Y    +N  L+  YGF+   +  
Sbjct: 204 -QCHVYDPQSGNL-------SILAGKSYEPGDQVFINYGPIPNN-RLSRLYGFVVPGNPN 254

Query: 319 NAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDA 378
           ++Y L L      PFF  K  +    GL  ++   + L   LP ++L+YLR+  L  TD 
Sbjct: 255 DSYDLVLSTHPMAPFFEQKHKLWIAAGLDSTSTVSLTLTDPLPRSVLRYLRIQRLNETDL 314

Query: 379 FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
             + +   +  +      +S +NE  +   + ++  + L GF   +E+ E+ L+ G   P
Sbjct: 315 AAVGTRQSDVAFEK----ISDSNETEVLTFLVESISALLDGFTLQLEQLEEQLEAGLYTP 370

Query: 439 --RLRIAVGVRAGEKRVLQ 455
                +A  V  GE+RVL+
Sbjct: 371 GENTWLAAHVSLGEQRVLR 389


>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
 gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
          Length = 455

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 105/399 (26%), Positives = 187/399 (46%), Gaps = 28/399 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL+D   +     I  A F   G G+ A R   + E +L +P    W      A   +G 
Sbjct: 4   WLKDSGAIGVDG-IEVADFAVTGRGVKALRSFKEGERILTIPSACLWTVKKAYADPLLGP 62

Query: 128 LCSGLKPWISV----ALFL--IREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +    +P +SV    AL+L  ++ +       R ++  +P+   +++F++++EL   +G+
Sbjct: 63  VLRAAQPPLSVEDSLALYLLFVKSRTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGS 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +++ V ++Y ++   ++  ++ LFP    T++D+ WA   + SRA  F+   
Sbjct: 123 SLYALTPQLEQRVHDDYRQLLVALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSE 182

Query: 239 GQNLVLI-PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
             ++ L+ PLAD++NHSP +  + +AY+     L       S+      + G+Q+ I Y 
Sbjct: 183 TASVRLVAPLADMLNHSPDV-KQCHAYDPTSGDL-------SILAAKDYQVGDQIFIYYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
              +N  L L YGF+   +  ++Y L L+ S   P +  K  +    GL  +    + + 
Sbjct: 235 SVPNNRLLRL-YGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTVK 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +L+YLR+  L   D   +  I    + G  D  V+  NE  + + + D+  S L
Sbjct: 294 DPLPNNVLRYLRIQRL---DESNITDITLRLVNG-TDGKVNDGNEIQVLQFLVDSIGSLL 349

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVL 454
            GF   +E+ E  L  G+         A  V AGE+RVL
Sbjct: 350 EGFGIPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVL 388


>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
 gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
          Length = 555

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 173/388 (44%), Gaps = 35/388 (9%)

Query: 86  ATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISVA---- 139
           A FP  G G+   R   K E +L +P    W      A   +G +    +P +SV     
Sbjct: 106 ADFPVTGRGVKTLRRFKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLA 165

Query: 140 --LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
             +  IR ++      R ++   P    S++F++EEEL    GT L + T  +   ++++
Sbjct: 166 TYILFIRSRESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRSIEDD 225

Query: 198 YLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNLVLI-PLADLINH 253
           Y  +   ++  ++ LFP    +++D+ WA   + SRA  F    G ++ L+ P AD++NH
Sbjct: 226 YRTLVVRVLAQSRDLFPLDKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPFADMLNH 285

Query: 254 SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           S  +    + Y+     L       S+      +AG+Q  I Y  +  N+ L   YGF+ 
Sbjct: 286 SSEVE-PCHIYDASSGNL-------SVLAGKDYEAGDQAFIYYG-SIPNSRLLRLYGFVM 336

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
             +  ++Y L +    S PFF  K  +  + GL  +    + L   LP  +L+YLR+  L
Sbjct: 337 PGNPNDSYDLVISTHPSAPFFERKQKLWASAGLDSACTISLTLTDPLPKNVLRYLRIQRL 396

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL-- 431
             +D     +I    +    D  ++ +NE  + R + ++    L GF T +E+ E  L  
Sbjct: 397 DESD---FAAIAHRQL-AAADEKINDSNEVEVLRFLVESFCQLLDGFGTQLEKLEAQLAE 452

Query: 432 ----QGGNLDPRLRIAVGVRAGEKRVLQ 455
                GGN       A  V  GE+RVL+
Sbjct: 453 GVYPSGGN----AWAAAHVNLGEQRVLR 476


>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 190/416 (45%), Gaps = 53/416 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL D  +   K  I      E  GLVA ++I K E +L VP    I  D+  + +E G 
Sbjct: 72  KWLSDSGLPPQKMGIERVEVGER-GLVALKNIRKGEKLLFVPPSLVITADSEWSCTEAGE 130

Query: 128 LC--SGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQL 183
           +   + +  W  +A +LI E    + S W  Y+  LP    S ++W+  EL + L+ +Q+
Sbjct: 131 VLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQPYSLLYWTRAELDKYLEASQI 190

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRL--RGQ 240
               +     V   Y  +   I   +  LFP  +  ++ F W+FGIL SR   RL    +
Sbjct: 191 RERAIERINDVTGTYNDLRLRIFSKHPHLFPEEVFNMETFKWSFGILFSR-LVRLPSMDE 249

Query: 241 NLVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
            + L+P AD++NHS  + T  DY    +G  +F+ D     RT  P    EQV I Y   
Sbjct: 250 KIALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTD-----RTYQP---SEQVFISYG-K 299

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VL 356
           KSN EL L YGF+  E  +  +   L L + +SD  + +K +  + +GL  S  F I + 
Sbjct: 300 KSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKHGLSTSQCFPIQIT 359

Query: 357 GRTL-----------PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELI 405
           G  L           PP+M Q+   +A          ++  N      D+      E+ +
Sbjct: 360 GWPLELMAYAYLVVSPPSMSQHFEEIA----------AVASNKTTSKKDIRYPELEEQAL 409

Query: 406 CRVVRDACKSALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVL 454
            + + D+C++++S +   ++E    D  +     L+ R+   ++AV +   E+R+L
Sbjct: 410 -QFILDSCEASISKYSKFLQESGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRIL 464


>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 508

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 116/420 (27%), Positives = 192/420 (45%), Gaps = 41/420 (9%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL +  +   K  I      E  GLVA ++I K E +L VP    I  D+  +  E G 
Sbjct: 83  RWLSNNGLPDQKMAIDKVEVGER-GLVALKNIRKGEKLLFVPPSLVITADSEWSCPEAGE 141

Query: 128 LCS--GLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQL 183
           +     +  W  +A++LI E   ++ S W  Y+  LP    S ++W+  EL   L+ +Q+
Sbjct: 142 VLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQPYSLLYWTRAELDRYLEASQI 201

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  +   I      LFP  +  L+ F W+FGIL SR   RL   + 
Sbjct: 202 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNLETFKWSFGILFSR-LVRLPSMDG 260

Query: 242 -LVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
            + L+P AD++NHS  + T  DY    +G  +F+ D  +        + GEQV I Y   
Sbjct: 261 KVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------EPGEQVFISYG-K 310

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VL 356
           KSN EL L YGF+  E  +  ++  L+L + +SD  + +KL+  + +G   S  F + V 
Sbjct: 311 KSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKHGFSASQCFPVRVT 370

Query: 357 GRTLPPAMLQYLRLVALGGTDAFLLESIF-----RNTIWGHLDLPVSHANEELICRVVRD 411
           G   P  +L Y  L     + +   E +      + TI   +  P     EE   + + D
Sbjct: 371 G--WPVELLAYAYLAVSPPSMSSKFEELAAAASNKTTIKKDVGFP---EIEEQALQFILD 425

Query: 412 ACKSALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVLQQIDEIFKER 464
           +C+S++S +   ++     D  +     L+ RL   ++AV +   E+R+L +   + + R
Sbjct: 426 SCESSISKYTKFLQASGSMDLDVTSPKQLNRRLFLKQLAVDLCNSEQRILFRAQNVLRRR 485


>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
           norvegicus]
          Length = 416

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 162/361 (44%), Gaps = 41/361 (11%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 43  EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSILGPLYSQDRILQAMGNIALAFHLLC 102

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y    KV
Sbjct: 103 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV 162

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 163 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 219

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +L+     +AG+Q+ I Y   +SNAE  +  GF   
Sbjct: 220 NGLITTGYNLED------DRCECVALQD---FQAGDQIYIFYG-TRSNAEFVIHSGFFFD 269

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 270 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 327

Query: 372 ALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + TTIE
Sbjct: 328 CM--TEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIE 385

Query: 426 E 426
           +
Sbjct: 386 Q 386


>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
           23]
          Length = 462

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 108/404 (26%), Positives = 184/404 (45%), Gaps = 38/404 (9%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKF-WINPDTVAASEIG- 126
           WL +   V     +  A FP  G G+ A+R   + E +L +P    W        S +G 
Sbjct: 4   WLEESGAVGLDG-LEVADFPLTGRGVKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGP 62

Query: 127 SLCSGLKPW-----ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +LCS   P      ++V +  +R ++      R +++ LP    S++F++++EL    G 
Sbjct: 63  ALCSAQPPLSVEDTLAVHILFVRSRESGYDGLRSHVERLPASYSSSIFFTDDELEVCAGA 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +++ ++++Y  +   +++    LFP    TL  + WA   + SRA  F    
Sbjct: 123 SLYTITKQLQQRIEDDYRDLVVRVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSD 182

Query: 239 GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           G ++ +L P AD++NHS   + + + Y+     L       S+      +AG+QV I Y 
Sbjct: 183 GSSIRLLAPFADMLNHSSE-SKQCHVYDASSGDL-------SVLAGKDYEAGDQVYIHYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
            +  N  L   YGFI   +  ++Y L L      PFF  K  +    GL  +    + L 
Sbjct: 235 -SIPNHRLLRLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWALAGLDSTCTISLTLT 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +++YLR+  L  +D   L SI    +    D  +S++NE  + + + ++  S L
Sbjct: 294 DPLPKNVIRYLRIQRLDESD---LASI---ALGQAADEKISNSNEVQVLQSLVESIASLL 347

Query: 418 SGFHTTIEEDEQLLQ------GGNLDPRLRIAVGVRAGEKRVLQ 455
             F T +E+ E+ L       GGN       A  V  GE+RVL+
Sbjct: 348 GSFGTRLEKLEEQLATGVYPVGGN----AWAAAHVSLGEQRVLK 387


>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
 gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
          Length = 1339

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 83/262 (31%), Positives = 123/262 (46%), Gaps = 18/262 (6%)

Query: 59   PPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
            P     + F  WL+   V  PK  I   T   G G+V  + + +NE V+ VP K+ IN D
Sbjct: 752  PSDEVYKRFENWLKAGGVQFPKLQIANFTDSTGRGVVTTKKVDENEAVVVVPKKYLINVD 811

Query: 119  TVAASEI-GSLCSGLKPWISVALFL--IREKKKEDSPWRVYLDILPECTDSTVFWSEEEL 175
               A  I G +   L       LFL  I EK   +S WR + D LP    +++ +S  EL
Sbjct: 812  VAKAHPILGPIFEELHLNDDTILFLFVIYEKGNANSFWRPFYDTLPSYFTTSIHYSATEL 871

Query: 176  VELQGTQLLSTTLGVKEYVQN--EYLKVEEEIILPNKQLFPRP-ITLDDFLWAFGILRSR 232
            +EL+GT L   TL  K+ + +  +YL  E     P+  +FP    + ++FLWA  +L SR
Sbjct: 872  LELEGTNLFEETLHTKQQLNSFRDYLFPELSKQYPD--IFPESQFSWENFLWARSLLDSR 929

Query: 233  AFS-RLRGQ-NLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
            A   ++ G     L+P+AD+INH       +  ++            F + +   + A  
Sbjct: 930  AIQLKIDGSIKSCLVPMADMINHHTNAQISERFFDHDSQS-------FKMISSCNIPANN 982

Query: 291  QVLIQYDLNKSNAELALDYGFI 312
            Q+ + Y     N ELAL YGFI
Sbjct: 983  QIFLHYGA-LQNWELALYYGFI 1003


>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 406

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 178/384 (46%), Gaps = 29/384 (7%)

Query: 86  ATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISV----A 139
           A FPE G G+ AQR   ++E +L +P    W      A    G +   ++P +SV    A
Sbjct: 19  ADFPETGRGVKAQRPFKEDERILTIPANCLWTVKGAYADPLFGPVLQSVQPPLSVEDTLA 78

Query: 140 LFLIREKKKEDSPW----RVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           L+++  + + + P     + ++ +LP     ++++++EEL    G+ L + T  ++  V 
Sbjct: 79  LYILFVRSRGEDPAYAERQTHVAMLPSEYTLSMYFTDEELRVCAGSSLYTLTTHLRGRVG 138

Query: 196 NEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNLVLI-PLADLI 251
           ++Y K+   + + ++ LFP    +   + WA   + SR   F+   G ++ L+ P AD++
Sbjct: 139 DDYKKLLTGVFMRHRDLFPLDKFSFQHYKWALSSIWSRGMDFTISEGNSVRLMAPFADML 198

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH+     + +AY+     L     + + R     + G+QV I Y  N SN+ L   YGF
Sbjct: 199 NHASD-AKQCHAYDPSTGSL----TVLACRD---YEVGDQVFIYYG-NVSNSRLLRLYGF 249

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLV 371
           +   +  + Y L L+ S   P +  K  + +  GL E +   + L   LP ++L+YLR+ 
Sbjct: 250 VLPDNPNDNYELVLQTSSMAPLYEQKQRLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQ 309

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL 431
            L  +D   +        +      +S  NE  I   +  + ++ L GF  ++E+ E  L
Sbjct: 310 RLDASDLGTMTMQIATESYTK----ISDENESQILLFLSQSIEALLEGFEISLEKLETQL 365

Query: 432 QGGNLDP--RLRIAVGVRAGEKRV 453
             G   P      A  V  GE+RV
Sbjct: 366 AEGLYAPGSNAWAAAQVSLGEQRV 389


>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
          Length = 591

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 107/394 (27%), Positives = 172/394 (43%), Gaps = 57/394 (14%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK-----PWISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+ +LQ TQ +       +    +Y    KV
Sbjct: 164 ERADPNSFWQPYIQTLPSEYDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFL----------WAFGILRSRAFSRLRGQNLVLIPLADLI 251
            +     NK       T +D+           WA G+      +R               
Sbjct: 224 IQTHPHANKLPLKDAFTYEDYRLGLVSLALGRWALGLECGVGIARC-------------- 269

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
              P ITT    Y ++      R    +L+     +AGEQ+ I Y   +SNAE  +  GF
Sbjct: 270 -GKPQITT---GYNLED----DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVIHSGF 317

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA---MLQYL 368
               +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +L
Sbjct: 318 FFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPVSAQLLAFL 375

Query: 369 RLVALGGTDAFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
           R+  +  T+  L E      +I R    G+ + PVS  NE  +   + D     L  + T
Sbjct: 376 RVFCM--TEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKT 433

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           TIEED+  L+  NL  R  +A+ +R GEK +L++
Sbjct: 434 TIEEDKAFLKNHNLSVRATMAIKLRLGEKEILEK 467


>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 273

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 71/240 (29%), Positives = 122/240 (50%), Gaps = 18/240 (7%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPD----TVAASEIGSLCS---GLKPWISVALFLI 143
           G  +VA  DIA  E +L VP    ++        +A EI  +      L P    AL L+
Sbjct: 42  GRSVVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENALLLL 101

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
             K   +SPW+ Y+D+LP    +T+F+S++EL  L+G+ L       +  ++++Y  +  
Sbjct: 102 VHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELSYLEGSSLHYFARQRRRAIESQYDTIFT 161

Query: 204 EIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY 262
            + +   + F P   +LD + WA  ++ SR+F    G++  L+P AD+ N +P   TE  
Sbjct: 162 PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKS-GLVPWADMFNMAP--ETEQV 218

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK--SNAELALDYGFIESKSDRNA 320
              +         L++S R+  P+K GEQ+ + Y  ++  SNA+L +DYGF+   +  +A
Sbjct: 219 KVAVDAV---DHHLIYSARS--PIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHDA 273


>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
           carboxylase/oxygenase large subunit N-methyltransferase,
           chloroplastic-like [Cucumis sativus]
          Length = 497

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 109/406 (26%), Positives = 192/406 (47%), Gaps = 33/406 (8%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL +  +   K  I+     E  GLVA +++ K E +L VP    I+ ++  +  E G 
Sbjct: 72  KWLSESGLPDQKMSIQRVNVGER-GLVALKNVRKGEKLLFVPPSLVISAESEWSCPEAGE 130

Query: 128 LC--SGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQL 183
           +   + +  W  +A +LI E    + S W  Y+  LP    S ++W+ EEL   L+ +++
Sbjct: 131 VLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPYSLLYWTREELDRYLEASEI 190

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  +   +   + +LFP  +  ++ F W+FGIL SR   RL   + 
Sbjct: 191 RERAIERITNVVGTYNDLSIRVFSKHPELFPEEVFNIETFKWSFGILFSR-LVRLPSMDG 249

Query: 242 -LVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
            + L+P AD++NH+  + T  DY    +G  +F+ D  +        + GEQV I Y   
Sbjct: 250 KVALVPWADMLNHNCEVETFLDYDKASQGV-VFTTDRAY--------QPGEQVFISYG-K 299

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VL 356
           KSN EL L YGF+  E  +  ++  L L + +SD  + +KL+  + +GL  S  F I V 
Sbjct: 300 KSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQVT 359

Query: 357 GRTLPPAMLQYLRLVALGGTDAF-LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
           G  L      YL +     ++ F  + +   N      DL      EE + + + D+C++
Sbjct: 360 GWPLELKAFAYLAVSPPSLSNQFDEMAAAASNKSTAKKDLNYPDIEEEAL-QFILDSCET 418

Query: 416 ALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVL 454
           ++S ++  ++     D  +     L+ R+   ++AV +   E+R+L
Sbjct: 419 SISKYNKFLQASGSMDLDVTSPKQLNRRVFLKQLAVDLCTSERRIL 464


>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 437

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 185/404 (45%), Gaps = 36/404 (8%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIG- 126
           WL++   V   + +  A FP  G G+   +   + E +L +P    W      A S +G 
Sbjct: 4   WLKESGAVGLDN-LELADFPITGRGVRTLKCFKEGENILTIPSGILWTVEHAYADSILGP 62

Query: 127 -----SLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
                SL   ++  +++ +  +R +K      R ++  LP    S++F+ E++L    GT
Sbjct: 63  VLRSTSLPLSVEDTLAIYILFVRSRKSGYDGPRNHVAALPASYSSSIFFMEDQLEVCAGT 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLR 238
            L + T  +++ ++++Y  +   ++     LFP    T++D+ WA   + SRA  F    
Sbjct: 123 SLYTITKQLEQRIEDDYRGLVVRMLGQYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPD 182

Query: 239 GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           G+++ +L P AD++NHS     + + Y+     L       S+      +AG+QV I Y 
Sbjct: 183 GKSIRLLAPFADMLNHSSE-AKQCHVYDASSGNL-------SVLAGKDYEAGDQVFINYG 234

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
              +N  L L YGF+   +  ++Y L L      PFF  K  +  + GL  +    +   
Sbjct: 235 PMPNNRLLRL-YGFVVPGNPNDSYDLVLATHPMAPFFKQKQKLWASAGLDSTTTITLTFT 293

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             LP  +L+YLR+  L  +D  +L     NT     D  +S +NE  + R + ++    L
Sbjct: 294 DPLPKDVLRYLRVQRLDESDVAVLALQQTNTT----DAKISDSNEVEVLRFLVESIGGLL 349

Query: 418 SGFHTTIEEDEQLLQ------GGNLDPRLRIAVGVRAGEKRVLQ 455
           + F T +E+ E+ L+      GGN       A  V  GE+RVL+
Sbjct: 350 NNFGTHVEKLEEQLKEGVYSSGGN----AWAAAHVSLGEQRVLR 389


>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
          Length = 471

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 110/404 (27%), Positives = 183/404 (45%), Gaps = 32/404 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL+    V   S +  A FP+ G G+   R   + E +L +P    W      A   +G 
Sbjct: 12  WLKQSGAVGLDS-LELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYADPLLGP 70

Query: 128 LCSGLKPWISV----ALFLIREKKKEDS--PWRVYLDILPECTDSTVFWSEEELVELQGT 181
                +P +SV    A +++  K +E      R ++  LP    S++ ++E++L    GT
Sbjct: 71  ALRSAQPPLSVEDTLATYILFVKSRESGYDGQRSHIAALPASYSSSILFAEDDLEACAGT 130

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDD----FLWAFGILRSRA--F 234
            L + T  +++ +++++  +   + + +  LFP    T++D    + WA   + SRA  F
Sbjct: 131 SLYTITKQLEQSIEDDHRALVVRLFVQHPDLFPLDKFTVEDVGLHYKWALCTVWSRAMDF 190

Query: 235 SRLRGQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
               G ++ +L P AD++NH+  +  + + Y+     L       S+      +AG+QV 
Sbjct: 191 VLADGNSIRLLAPFADMLNHTSEVK-QCHVYDPSSGTL-------SVFAGKDYEAGDQVF 242

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           I Y     N+ L   YGF+   +  ++Y L L      PFF  K  +  + GL  +A   
Sbjct: 243 INYG-PVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDSTATIP 301

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDAC 413
           + L   LP  +L+YLR+  L   DA  L  I R       D  +S +NE  I R + ++ 
Sbjct: 302 LTLTDPLPKKVLRYLRIQRL---DASGLAVIARQQT-DATDGKISDSNEVEILRFLVESF 357

Query: 414 KSALSGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQ 455
              L GF T +E+ EQ L  G   P      A  V  GE+RVL+
Sbjct: 358 NYLLDGFGTPLEKLEQQLAEGVYSPGGNAWAAAHVSLGEQRVLR 401


>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 507

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 112/421 (26%), Positives = 188/421 (44%), Gaps = 58/421 (13%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-EIGSLCS--GLKPWISVALFLIREKKKE 149
           GLVA +++   E +L VP    I  D+   + E+G +     +  W  +A +LI E   E
Sbjct: 104 GLVALKNVRNGEKLLFVPPTLVITADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLE 163

Query: 150 DSP-WRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            S  W  Y+D LP    S ++W+  E+   L  + +    +     V   Y  + + I  
Sbjct: 164 GSSRWSSYIDALPRQPYSLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRDRIFS 223

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLR--GQNLVLIPLADLINHSPGITT-EDYA 263
            +  LFP  +  +++F W+FGIL SR   RL   G  + L+P AD++NHSP +    DY 
Sbjct: 224 KHPDLFPEKVYNMENFRWSFGILFSR-LVRLESMGGKVALVPWADMLNHSPEVDAFLDYD 282

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAY 321
              +G  +F+ D  +        + GEQV I Y   KS+ EL L YGF+  E  +  ++ 
Sbjct: 283 KSSQGI-VFTTDRSY--------QPGEQVFISYG-KKSSGELLLSYGFVPKEGTNPNDSV 332

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTL-----------PPAMLQYLR 369
              + + +SD  + +KL   + +GL ES  F + V G  +           PP M+Q   
Sbjct: 333 EFLVSLKKSDECYKEKLQALKKHGLSESESFPLRVTGWPVELMAYAFLVVSPPEMIQRFE 392

Query: 370 LVALGGTDAFLLESIFRNTIWGHLDLPVSHAN-EELICRVVRDACKSALSGFHTTIEEDE 428
            +A+  ++             G     V++   +E   + + D C+S++  +        
Sbjct: 393 EMAVAASNK------------GSSKPAVNYPELDEQALQFILDCCESSIKRY-------T 433

Query: 429 QLLQGGNLDPRLRIAVGVRAGEKRVLQQI--DEIFKERELELDELEYYQERRLKDLGLVG 486
           + L+G      + I    +A    +L+Q+  D    ER + L   +Y   RRL+D+   G
Sbjct: 434 KYLEGAKGSAEVSINTK-QANRTLLLKQLARDLCISERRI-LYRSQYILRRRLRDMRAGG 491

Query: 487 E 487
           E
Sbjct: 492 E 492


>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
          Length = 470

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 179/398 (44%), Gaps = 49/398 (12%)

Query: 88  FP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISVA------ 139
           FP  G G+   R   + E++L +P    W      +   +G     + P +SV       
Sbjct: 21  FPVTGRGVKTLRRFKQGEMILTIPSDVLWSVEHAYSDPNLGPALRSVMPPLSVEDILATY 80

Query: 140 LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
           +  +R ++      R ++  LP    S++F+SE EL    GT L + T  +++ ++++Y 
Sbjct: 81  ILFVRSRESGYDGLRTHVSALPGIYSSSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYR 140

Query: 200 KVEEEIILPNKQLFP-RPITLDD----------FLWAFGILRSRA--FSRLRGQNL-VLI 245
           ++   +   +  LFP +  T++D          + WA   + SR+  F+   G ++ +L 
Sbjct: 141 QLAVRLFAQHPDLFPLQKFTIEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLA 200

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           P AD++NHS  +  + +AY++K   L       S+      + G+QV I Y    +N  L
Sbjct: 201 PFADMLNHSSEV-KQCHAYDVKSGDL-------SVFAGKDYEIGDQVYIYYGPIPNNRLL 252

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAML 365
            L YGF+   +  ++Y L L      PF+  K  +  + GL  +    + L   LP  +L
Sbjct: 253 RL-YGFVIPDNPNDSYDLVLTTHPMAPFYEQKQKLWVSAGLDSTTTVALTLTDPLPKNIL 311

Query: 366 QYLRL--VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTT 423
           +YLR+    + G  A  L+ I         D  +S +NE  I R + ++  S L+ F T 
Sbjct: 312 RYLRIQRADVSGLAAITLQQI------DGTDEKISDSNEMEILRFLEESISSLLNCFATP 365

Query: 424 IEEDEQLLQ------GGNLDPRLRIAVGVRAGEKRVLQ 455
           +E  E+ L       GGN       A  V  GE+RVL+
Sbjct: 366 LERLEKQLAEGVYPVGGN----AWAAAHVSVGEQRVLR 399


>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
          Length = 442

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 138/302 (45%), Gaps = 45/302 (14%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPA-TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++ F +WL   K ++ +S +  A T   G GL+A+R     E +L VP +  I PD    
Sbjct: 48  MKDFLEWL-TAKGMNFQSQVDVAITNGTGRGLLARRSFMPGETMLAVPPELLITPDMARR 106

Query: 123 SEIGSL--------CSG-------LKPWISVALFLIREKKKED-SPWRVYLDILPECTDS 166
           SE+G          CSG         P +++ L ++   +  D  PW   + ILP    +
Sbjct: 107 SEVGRAFREHGLDDCSGGEDSTYECMPLLAMHLTVLYYNESHDFHPW---MKILPRKLTT 163

Query: 167 TVFWSEEELVELQGTQLLST----TLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDD 221
            +FWS++E  ELQG+ L +     T+ V++  ++    + +  + P+    P+ I +L +
Sbjct: 164 PLFWSDKEREELQGSNLYNMLDGWTMNVEKLHRSTARVLGQHNVFPD---LPKAIYSLKE 220

Query: 222 FLWAFGILRSRAFS------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           F WA+  + +RAF          G+  ++ P+ADL NH  G     Y +           
Sbjct: 221 FKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNH--GDVKTSYTFNAASGH----- 273

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFG 335
             F L T      GEQ+ + YD +K+NAE  L YGF+   +  +   +   I    PF+ 
Sbjct: 274 --FELFTQQFFSRGEQIFMNYD-SKNNAEFLLQYGFVIESNPHDYVGIAASIGNDQPFYR 330

Query: 336 DK 337
           DK
Sbjct: 331 DK 332


>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
          Length = 417

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 108/423 (25%), Positives = 197/423 (46%), Gaps = 43/423 (10%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA--- 122
           TF QW+ D+ +    S         G  ++A   I + +V++ VP    ++   +     
Sbjct: 13  TFKQWMDDEGIYLNPSLDIVKLEDYGRSIIANTLIKEGDVLIRVPRNVMMSRTGIELHIP 72

Query: 123 SEIGSLCSGLKPWISV----ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
            EI S+    +  I      A++L+     +DS W  Y  ILP+   +++++ ++E+ EL
Sbjct: 73  KEIRSIIDSNRDDIGSTDGQAVYLMYSLLNKDSYWHQYTSILPKQFTTSIYFDQDEMKEL 132

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR 238
           Q ++L   T      ++  Y  + +++   N +   +  T + F WA   + SRAFS L 
Sbjct: 133 QLSKLRYFTESRLSGIERHYNVIFKKLSSLNDEFKKKEYTFELFKWALSCIWSRAFS-LS 191

Query: 239 GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
             +  ++PLAD+ N       E    +++      + + ++ +    ++ GEQV   Y +
Sbjct: 192 SDDGGMVPLADMFN-----AIEKAKSKVRPDSRADQLIYYASK---DIERGEQVFTPYGV 243

Query: 299 NKS--NAELALDYGF-IESKSDRNAYTLTLEISESDPFFGD-KLDIAETNGLGESAYFDI 354
            K+  NA++ +DYGF  +  S+ +   LTL+    D  + D K+D+ E          DI
Sbjct: 244 YKTIGNAQMLMDYGFAFDDPSEGDTIQLTLDNFSDDELYIDTKIDLLEQ--------LDI 295

Query: 355 V----LGRT-LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
           V    L R  LP  +L Y R+  L   +  L +  +RN    + + PVS  NE+   R +
Sbjct: 296 VREFNLKRNQLPQELLIYARVKNLKENELQLAKEHYRND--DNRNKPVSRRNEKTALRYL 353

Query: 410 RDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIA------VGVRAGEKRVLQQ-IDEIFK 462
            +     L  + TT+ +D +LL+  N + ++ ++      + +R GEK +L+Q ID   K
Sbjct: 354 SNYLSRYLDSYETTLSDDLELLE-KNKEKKITLSYNMLNIIRIRKGEKEILKQLIDSFDK 412

Query: 463 ERE 465
            RE
Sbjct: 413 IRE 415


>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
          Length = 489

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 81/336 (24%), Positives = 155/336 (46%), Gaps = 24/336 (7%)

Query: 52  SASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPM 111
           S  +T       +E +  WL +         I       G GL A +D ++  ++L VP 
Sbjct: 71  SNKSTNRQRKLYIENYVSWLHEHGAEFEGVEISEFD-GYGFGLKATKDFSEGSLILTVPG 129

Query: 112 KFWINPDTVAASEIGSLCS-----GLKPWISVALFLIREKKKEDSPWRVYLDILPECTDS 166
           K  ++     AS++    +        P +++ALFL+ EK   +S W+ Y+D+LPE   +
Sbjct: 130 KVMMSEKDPKASDLSEFINIDPLLQNMPNVTLALFLLLEKNNPNSFWKPYIDVLPEKYST 189

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII---LPNKQLFPRPITLDDFL 223
            ++++ EEL EL+ + +  ++L +   +  +Y     +I    LP  +      T D++ 
Sbjct: 190 VLYFNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTIDLPVLKNLQDIFTFDNYR 249

Query: 224 WAFGILRSRAFSRLRGQNLVL----IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
           WA   + +R  + ++G    L    IPL D+ NH  G  T D+  E+      +R   ++
Sbjct: 250 WAVSTVMTRQNNIVQGTAFTLTNAFIPLWDMCNHKHGKITTDFNLEL------NRGECYA 303

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLD 339
           L+     +  EQ+ I Y   + N++L L  GF+   +D ++ ++ L IS +D     K++
Sbjct: 304 LQ---DYRRDEQIFIFYGA-RPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVN 359

Query: 340 IAETNGLGESAYFDIVLGRT-LPPAMLQYLRLVALG 374
           +    GL     F +  G + +   +L ++R+  + 
Sbjct: 360 LLNKLGLSGVTNFSLYKGASPISVELLAFIRIFNMN 395


>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
          Length = 504

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 185/407 (45%), Gaps = 35/407 (8%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
            WL D  +   K  I      E  GLVA +++ K E +L V     I  D+    +E G 
Sbjct: 79  NWLSDSGLPPQKMAIDRVDIGER-GLVASQNLRKGEKLLFVSPSLVICADSEWTNAEAGE 137

Query: 128 LCS--GLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQL 183
           +     +  W  +A +LI E   ++ S W  Y+  LP    S ++W+  EL + L+ +Q+
Sbjct: 138 VMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQPYSLLYWTRTELDMYLEASQI 197

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD-FLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  +   I   + QLFP+ +  D+ F W+FGIL SR   RL   + 
Sbjct: 198 RERAIERITNVVGTYEDLRSRIFSKHPQLFPKEVFNDETFKWSFGILFSR-LVRLPSMDG 256

Query: 242 -LVLIPLADLINHSPGITT-EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
              L+P AD++NH+  + T  DY    KG  +F+ D         P + GEQV I Y  N
Sbjct: 257 RFALVPWADMLNHNCEVETFLDYDKSSKGV-VFTTDR--------PYQPGEQVFISYG-N 306

Query: 300 KSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
           KSN EL L YGF+  E  +  ++  L L + ++D  + +KLD  + +GL     F + + 
Sbjct: 307 KSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVRIT 366

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFR---NTIWGHLDLPVSHANEELICRVVRDACK 414
              P  ++ Y  LV          E + +   N      DL      E+ + + + D+C+
Sbjct: 367 -GWPMELMAYAYLVVSPPDMRNNFEEMAKRASNKTSTKNDLKYPEIEEDAL-QFILDSCE 424

Query: 415 SALSGFHTTIEE----DEQLLQGGNLDPRL---RIAVGVRAGEKRVL 454
           +++S     ++E    D  +     L+ +    ++AV +   E+R+L
Sbjct: 425 TSISKCSRFLKESGSMDLDITSPKQLNRKAFLKQLAVDLSTSERRIL 471


>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/420 (22%), Positives = 176/420 (41%), Gaps = 38/420 (9%)

Query: 70  WLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC 129
           WL  +      + +      +G  L++ R I + E VL V  +  I P+ + +    SL 
Sbjct: 40  WLESRGETEALTSLTIGNTNQGRALLSIRHIKRGEQVLRVSRELMITPNRLPSCVEESLS 99

Query: 130 SGLKPWISVALF-LIREKKKEDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLLST 186
             +  W  +ALF L+ +   + SPW  Y+  LP      +TVFW +EEL  L+ + +   
Sbjct: 100 EDVNEWSRLALFQLLHKHAGKASPWEPYIRCLPPLRGLQNTVFWRDEELELLRQSNVYDQ 159

Query: 187 TLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIP 246
           T   K  + N++  V+  ++    +LF   +TL+ F  A+ +  SR++      ++ ++P
Sbjct: 160 TEHRKTLISNQFDLVQA-VVNKYPELFGETVTLESFKHAYCVASSRSWGVEALGSITMVP 218

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
             D+ NH          YE +G      D  ++         G QV+I Y     N+ LA
Sbjct: 219 FVDMFNHDSSARALLAYYEEEGYAEVVADKDYN--------QGSQVVITYG-TLPNSSLA 269

Query: 307 LDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD------------- 353
           LD+GF    +  +   + +E    DP   +KL +   +G+    + D             
Sbjct: 270 LDFGFTLPDNPHDEVQIWMEAPSGDPLRAEKLKLLRDHGIATDPFCDGTESGGAWFGLRE 329

Query: 354 ----IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL-DLPVSHANEELICRV 408
                  G+ +P A+  ++R+++   T      +       G L   P+    E    ++
Sbjct: 330 ISSPAARGKGIPRALRTFVRVISASTTKELEAMAEDAKRRQGRLAQRPLKDGKEARALKL 389

Query: 409 VRDACKSALSGFHTTIEEDEQLLQGGNL----DPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           + D  +  +S   + +   +++L+  +L      R R+A  V  GE RVL+ +    + R
Sbjct: 390 LLDNIEQCVSSHRSAL---KKVLKAQDLPQIQAERARMAGHVLDGELRVLRSVGAWLRPR 446


>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 474

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 114/434 (26%), Positives = 190/434 (43%), Gaps = 55/434 (12%)

Query: 62  AQVETFWQWLRDQKVV-SPK-SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           A+  +F+ WL    V  SPK + +   +   G GLVAQ DI ++EV+  +P    +N  T
Sbjct: 8   AKTASFFAWLGRIGVHHSPKVALVDLRSAGRGRGLVAQSDIGEDEVLFTIPRDAVLNTTT 67

Query: 120 VAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
              S        +  W+++   ++ E ++EDS W  YL +LP   DS VFWSE EL+ELQ
Sbjct: 68  ALGSADNPAILEMPCWLALTAIILTEGQQEDSKWAPYLALLPSRLDSLVFWSESELLELQ 127

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQL-FPRPITLDDFLWAFGILRSRAF---- 234
            + +++  +G +   +  +L+    + L N        +      +AF I   +      
Sbjct: 128 ASTVVN-KIG-RASAEQLFLEHISPLGLSNTNTEMCHKVASVVMAYAFDIPEKKGHDDPE 185

Query: 235 SRLRGQNLV------------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           S   G +LV            +IPLAD++N            +   A L   +    +R+
Sbjct: 186 SPEDGDDLVSDNEEEENTILSMIPLADMLNADA---------DGNNARLCCDNEELEMRS 236

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESK------SDRNAYTLTLEISESDPFFG- 335
             P+  GE++L  Y      ++L   YG+I  K      ++ +  +L   +S   P    
Sbjct: 237 IKPISKGEEILNDYG-QLPRSDLLRRYGYISDKYAAYDVAELSTQSLLASLSTEQPLLAG 295

Query: 336 ------------DKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLES 383
                        ++++A+  G+ E +Y     G   P    + L L+ +   D   L +
Sbjct: 296 GTLQPLSREKLEQRVELAQREGVYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAA 355

Query: 384 IFRNTIWGHLDLPV-SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRI 442
           I  +    H  LP  S     L+ +++    +S    + TTIE D+ +LQ  NL  R R+
Sbjct: 356 IETS----HASLPSRSKLATSLVGQILTKILESRKQEYATTIEADQAILQADNLPSRKRM 411

Query: 443 AVGVRAGEKRVLQQ 456
           AV VR GEK VL++
Sbjct: 412 AVEVRLGEKLVLEK 425


>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 789

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 103/402 (25%), Positives = 177/402 (44%), Gaps = 36/402 (8%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
           +WL D    S  + I+   +    G+ A++ I K E +L +P+   I  +      I  L
Sbjct: 353 KWLSD--TSSEFNKIKMVYYNNYRGVHARQKIKKGECILFIPVDNMITLELSKELPICQL 410

Query: 129 CSG-----LKPWIS-VALFLIREKKKEDSPWRVYLDILP-ECTDSTVFWSEEELVELQGT 181
                   L P  + +++++I EKK   S W+ +LDILP E T   + +++EEL  L+G+
Sbjct: 411 IESKNIRLLSPKHTFLSIYIIIEKKNHKSFWKPFLDILPVEYTTFPILYTDEELFWLKGS 470

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL--RG 239
             L+     +E +  +Y  +  +I  P    F +  TLD+F WA  +  SR +     + 
Sbjct: 471 PFLNQVKERRECITQDYQAIVSKI--PE---FAKLCTLDEFAWARMMAASRIYGLFINKK 525

Query: 240 QNLVLIPLADLINHS-PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
           +    +PLAD+ NH  P  T   +  E KG         F L+    ++ G+Q+      
Sbjct: 526 RTDAFVPLADMFNHRRPAYTNWGFC-EDKGG--------FMLKASEDIRRGDQIYYSCG- 575

Query: 299 NKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
            K N+   L+YGF+   ++ N   L ++  + D     KL +       ES  F I +  
Sbjct: 576 RKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKRK-PESLIFRIHINY 634

Query: 359 TLPPAM--LQYLRLVALGGTDAFLLESIFRNTIWGHLDL----PVSHANEELICRVVRDA 412
                +    +LR V +   D  +LE     +     D     P S  NE+ +   +   
Sbjct: 635 EEKSVLEFFGFLRFVLI--RDYIVLEKFHEMSEGKEFDPLRTPPFSIENEKQMWTEIHKI 692

Query: 413 CKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           C   +  + TT++ED+++L+   L    +  V +R GEK +L
Sbjct: 693 CAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEIL 734


>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
           206040]
          Length = 463

 Score = 98.6 bits (244), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 89/327 (27%), Positives = 158/327 (48%), Gaps = 24/327 (7%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           ++V L  +R ++      R ++  +P    S++F++E+EL    GT L + T  ++E ++
Sbjct: 77  LAVYLLFVRLREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIE 136

Query: 196 NEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNL-VLIPLADLI 251
           ++Y  +   +   +  L P   I++ D+ WA   + SRA  F    G+ L VL P AD+I
Sbjct: 137 DDYRVLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDFVLPNGKPLRVLAPFADMI 196

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NHSP +  + +AY+     L       S+      + G+Q+ I Y    +N  L L YGF
Sbjct: 197 NHSPEVK-QCHAYDPSSGNL-------SVLAGKDYEIGDQIYISYGSIPNNRLLRL-YGF 247

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLV 371
           +  ++  ++Y L L      PF+  K  +  + GL  ++   + L   LP ++L+YLR+ 
Sbjct: 248 VIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPLTLIDPLPKSVLRYLRIQ 307

Query: 372 ALGGTD--AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQ 429
            L  +D  A  L+ +  N         +S++ E  I + + ++  + L GF+T  E+ ++
Sbjct: 308 RLDASDLAAIALQKLDTNE-------KISNSKEVEILQFLVESISALLDGFNTPFEKLQK 360

Query: 430 LLQGG--NLDPRLRIAVGVRAGEKRVL 454
            L  G   L      A  V   E+RVL
Sbjct: 361 QLAKGVYPLGGNAWAAAHVSLSEQRVL 387


>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 107/417 (25%), Positives = 185/417 (44%), Gaps = 45/417 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPD-----TVAA 122
           +WL+  K + PK  I+   + E   G+ A++ I   E++L +P    I  +     TVA 
Sbjct: 141 EWLKIGKAIFPK--IKIECYSEDYRGVNAKQTINAKELILFIPKSHMITLEMAKETTVAK 198

Query: 123 SEIGSLCSGLKPWIS-VALFLIREKKKEDSPWRVYLDILPECTDS-TVFWSEEELVELQG 180
             +      L P  S ++ FL++EK + +S W+ Y+DILP    S  +F++  +L  L+G
Sbjct: 199 KMMQFRLDLLSPKHSFLSTFLLQEKFRPNSFWKPYIDILPSSYPSFPIFYNNSDLEWLKG 258

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQ 240
           +  L         +Q +Y  +    ++P    +       +F WA    R  A SR+ G 
Sbjct: 259 SPFLKQIKDKLADLQKDYNDICN--VVPEFTQY----QFHEFCWA----RMTASSRIFGI 308

Query: 241 NL------VLIPLADLINHS-PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
           N+        +PLAD++NH  P +T+  Y+ E +G         F + T   ++ G+ + 
Sbjct: 309 NINGVKTDAFVPLADMLNHKRPKLTSWCYSDEKQG---------FIIETDEKIERGQMIF 359

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
             Y   K N+   L+YGF+   +D N   L +E  ++DP    K + A    L     F 
Sbjct: 360 DSYG-RKCNSRFFLNYGFVVEGNDANEVNLAVEADQNDPLLQLK-EQAIKESLQWPKNFK 417

Query: 354 IVL--GRTLPPAMLQYLRLVALGGTD--AFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
           +++    T     + ++R + +        LL              P+   NE  + +++
Sbjct: 418 LLMDTDETAVIDFMSHIRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMI 477

Query: 410 RDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ---QIDEIFKE 463
              CK  L  + TT E+D+++LQ   L    R  + +R GEK +L+   Q  E  KE
Sbjct: 478 GRICKKTLKQYPTTFEQDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKE 534


>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
 gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
          Length = 467

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 108/424 (25%), Positives = 194/424 (45%), Gaps = 43/424 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLCSG--LKPWISVALFLIREKK 147
           G GLV++R + K + +L +P    I  ++  A +E G +     L  W  +A +LI E  
Sbjct: 64  GRGLVSKRMLYKGDRLLFLPATLAITTESEWACAEAGKVIRAKDLPEWPFLACYLISEAS 123

Query: 148 -KEDSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
             + SPW  Y+  LP    S + W+  ++   L  T +    L     V++ +  + +++
Sbjct: 124 LGKSSPWYPYIAALPRRPGSILLWTALDVETHLSATSIKDRALQCVREVEDTFNDLNKQV 183

Query: 206 ILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLR--GQNLVLIPLADLINHSPGITTEDY 262
            + N++ FP  +  L  F WAFGIL SR   RL   GQ L LIP  D++NH   +TT   
Sbjct: 184 FMKNREEFPPEVFNLKSFKWAFGILFSR-LVRLPSLGQKLALIPFGDMLNHDTEVTT--- 239

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAY 321
            +    +G  S+ +  +L      ++  +V I Y   +SN EL + YGF+ S K+  ++ 
Sbjct: 240 -FLDFDSG--SKSITCTLDR--GYESNREVFISYG-KRSNGELLVAYGFVPSGKNSEDSV 293

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLL 381
           ++TL +  +D  +  KL   + +GL     + I L +  P  +  + RL+    +     
Sbjct: 294 SITLGLDPADEMYEAKLGTLKEHGLSPQQSYPIKL-KGWPVQLTAFARLITSPPSQLHRY 352

Query: 382 ESIFRNTI--WGHLDLPVSHANEELIC-RVVRDACKSALSGFHTTIEEDEQLL------- 431
             +        G    PV    E++    ++  ACK A++     +E ++ ++       
Sbjct: 353 SELASAATEEQGRRMQPVFTTEEQMKAYELILSACKQAIAASKNYLEVEQGIVLFRLIKE 412

Query: 432 --QGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQG 489
             + G+L    ++A  +   E ++L + + I +    EL  L     R  K+ GL+G+ G
Sbjct: 413 VPRHGSLP---KLASALCESELKILYRNEYILRS---ELRNL-----RSGKNSGLLGKVG 461

Query: 490 DIIF 493
            + F
Sbjct: 462 SMFF 465


>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
          Length = 459

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 179/402 (44%), Gaps = 52/402 (12%)

Query: 89  PEG-LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG---LKPW--ISVALFL 142
           P+G  GLVA+  I ++   L +PMK  +   +   S++G L      +K    +++A+FL
Sbjct: 67  PDGDYGLVAEEKIEESMQFLGIPMKLVMTTASARKSKLGPLLRDDPIMKSMSNVALAIFL 126

Query: 143 IRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL-- 199
           I E    E S W  Y+ +LP+  ++ ++++ EEL  L G+ +L   L +   +  +Y   
Sbjct: 127 ILELSAGESSFWHPYISVLPDSFNTVLYFNIEELELLSGSAVLDEALKLHRSIARQYAYF 186

Query: 200 -KVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-------RLRGQNL--------- 242
            K+     L     F    T D + WA   + +R  +        L G ++         
Sbjct: 187 HKIFRTHPLAKSLPFKDCFTYDLYRWAVSAVMTRQNAVPWTESDGLGGDDVEIDGTAAVT 246

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFS-RDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
            L+PL D+ NHS G    DY         ++ RD             GE+V I Y   ++
Sbjct: 247 ALVPLWDMCNHSDGKVLTDYDSSASMVRCYAMRDF----------DKGEEVTIFYG-KRT 295

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVL-GRTL 360
           NAE  +  GF+   +  +A  + L +S+ DP F  K  + E + L  S  F +V   R +
Sbjct: 296 NAEFFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCEDHDLSLSGTFALVARDRPV 355

Query: 361 PPAMLQYLRLVALGGT---DAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
              +  +LR++ L      +AF  E I  ++         S+A + L   VVR   +  L
Sbjct: 356 SEDLSTFLRILVLKDASQPEAFSAEHILTSSD--------SNARDALTFLVVR--IELLL 405

Query: 418 SGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDE 459
             F  + EE E +++ G  + R+++A  +R  E +VL  + E
Sbjct: 406 KAFPKSDEEYEDIIKDGASNARVKMAARLRLLESKVLASVLE 447


>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
 gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
          Length = 1335

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 134/262 (51%), Gaps = 19/262 (7%)

Query: 60   PTAQV-ETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
            PT ++   F  WL+   V  PK  I   T   G G+V  + + ++EVV+ VP K+ IN D
Sbjct: 798  PTDEIYRRFENWLKQGGVQFPKLQIANFTDSTGRGVVTTKKVDEDEVVVSVPRKYLINVD 857

Query: 119  TVAASEI-GSLCS--GLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEEL 175
               ++ I G +     L     + LF+I EK+  ++ WR + D LP    +++ +S  EL
Sbjct: 858  VAKSNPILGPIFEELHLNDETILFLFVIYEKENPNTFWRPFYDTLPSYFTTSIHYSSTEL 917

Query: 176  VELQGTQLLSTTLGVKEYVQ--NEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSR 232
            +EL+GT L + TL VK+ +Q   +YL  E     P+  +FP  + + ++FLWA  +L SR
Sbjct: 918  LELEGTNLFAETLAVKQQLQAFRDYLFPELSNQYPD--IFPESVFSWENFLWARSLLDSR 975

Query: 233  AFS-RLRGQ-NLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
            A   ++ G+    L+P+AD+INH       +  ++            F + +   + A  
Sbjct: 976  AIQLKIDGKIKSCLVPMADMINHHTNAQISERHFDQDSN-------CFRMVSSCNIPANN 1028

Query: 291  QVLIQYDLNKSNAELALDYGFI 312
            Q+ + Y     N++LAL YGF+
Sbjct: 1029 QIFLHYGA-LQNSDLALYYGFV 1049


>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
 gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
          Length = 389

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 74/258 (28%), Positives = 118/258 (45%), Gaps = 14/258 (5%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLI-REKKKEDS 151
           GL A R I   E +L V     I P+ +       L   +  W  +ALFL+  +KKKE S
Sbjct: 2   GLFASRPIHTGECMLHVSHDLMITPEKLPEEVTKLLSKDVSAWAKLALFLLAHQKKKETS 61

Query: 152 PWRVYLDILPE--CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            W  Y+  LP      ST+FW+++ELV L+ + +   T+  K+ V+ E+   E  ++L  
Sbjct: 62  AWAPYISCLPPFGSMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENALLL-C 120

Query: 210 KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
             +F   ++  +F  A+  + SRA+     ++L L+P  D  NH           E +  
Sbjct: 121 PHIFGSRVSALEFKHAYATVCSRAWGIETIKSLALVPFVDFFNHDANCRAMLSYDEDRHC 180

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF-IESKSDRNAYTLTLEIS 328
                D  ++         G+QV+I Y    SNA LALD+GF +          + L +S
Sbjct: 181 AEVVSDRDYA--------TGDQVVISYG-QLSNATLALDFGFALPFNPHDQVAGIWLSLS 231

Query: 329 ESDPFFGDKLDIAETNGL 346
           E DP    KL +  ++ +
Sbjct: 232 EKDPLRDSKLKLLHSHNM 249


>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
 gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
          Length = 467

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 107/424 (25%), Positives = 196/424 (46%), Gaps = 43/424 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLCSG--LKPWISVALFLIREKK 147
           G GLV++R + K + +L +P    I  ++  A +E G +     L  W  +A +LI E  
Sbjct: 64  GRGLVSKRMLYKGDRLLFLPATLAITTESEWACAEAGKVIRAKDLPEWPFLACYLISEAS 123

Query: 148 -KEDSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
             + SPW  Y+  LP    S + W+  ++   L  T +    L     V++ +  + +++
Sbjct: 124 LGKSSPWYPYIAALPRRPGSILLWTALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQV 183

Query: 206 ILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLR--GQNLVLIPLADLINHSPGITTEDY 262
            + N++ FP  +  L+ F WAFGIL SR   RL   GQ L LIP  D++NH   +TT   
Sbjct: 184 FMKNREEFPPEVFNLESFKWAFGILFSR-LVRLPSLGQKLALIPFGDMLNHDTEVTT--- 239

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAY 321
            +    +G  S+ +  +L      ++ ++V I Y   +SN EL + YGF+ S K+  ++ 
Sbjct: 240 -FLDFDSG--SKSITCTLDR--GYESNKEVFISYG-KRSNGELLVAYGFVPSGKNSEDSV 293

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLL 381
           ++TL +  +D  +  KL   + +GL     + I L +  P  +  + RL+    +     
Sbjct: 294 SITLGLDPADEMYEAKLGALKEHGLSPQQSYPIKL-KGWPVQLTAFARLITSPPSQLHRY 352

Query: 382 ESIFRNTI--WGHLDLPVSHANEELIC-RVVRDACKSALSGFHTTIEEDEQLL------- 431
             +        G    PV    E++    ++  ACK A++     +E ++ ++       
Sbjct: 353 SELTSAATEKQGRRMQPVFTTEEQMKAYEMILSACKQAIAASKNYLEVEQGIVSFQLIKE 412

Query: 432 --QGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLGLVGEQG 489
             + G+L    ++A  +   E ++L + + I +    EL +L     R  K+ GL+G+  
Sbjct: 413 VPRHGSLP---KLASALCESELKILYRNEYILRS---ELRDL-----RSGKNSGLLGKVS 461

Query: 490 DIIF 493
            + F
Sbjct: 462 SMFF 465


>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 966

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 106/420 (25%), Positives = 182/420 (43%), Gaps = 65/420 (15%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           A +E+F QWL    + +    I+     +  GLG+ + R I K E ++++P+K  ++ DT
Sbjct: 556 ASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIFSTRQIHKGECLVKIPLKLILSNDT 615

Query: 120 VAASEIGSLCSG-----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
            A   + S+          P + + + L++E     S W+ Y D+LP      V  S ++
Sbjct: 616 SAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYFDLLPRVFTIPVLGSAQD 675

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFL-----WAFGIL 229
           L    GT ++   +     +  +YL ++       K +   PI L DF      WA  I+
Sbjct: 676 LAAYTGTSIIDEVVHDMIALMRQYLYLQHIF----KSIPEPPIPLADFTFAAFSWARAIV 731

Query: 230 RSR---------AFSRLRGQNLVLIPLADLINHSPGITTEDY-AYEIKGAGLFSRDLLFS 279
            +R         + S ++ Q L LIPL D+ NH PG +T  +   E     + S D    
Sbjct: 732 STRQNEICYANPSTSEMQ-QFLCLIPLFDMFNHKPGNSTTQFDTKEYCSETIASCD---- 786

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS-DRNAYTLTLEISESDPFFGDKL 338
                 V  GEQ+ I Y   +SN E+ L  GF++  + + +   L++ I +SDP    ++
Sbjct: 787 ------VSPGEQIFIHYG-KRSNQEMLLYSGFVDPTNIEYDHIKLSVSIPQSDPIRNQRV 839

Query: 339 DIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFR-----------N 387
            + +   L   +  ++  G  +     Q +  + +   DA  + +I             +
Sbjct: 840 QLLKLFNLSSESRLNLC-GPPVLETSSQLILFLKVLTMDAVTINAILATSDPVEKMQSDH 898

Query: 388 TIWGHLDLPVSHANE--ELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP--RLRIA 443
           TI+    L V+ A E   L C +++ A          TIE D+  LQ GN D   +LR+A
Sbjct: 899 TIYS---LSVTKALEWVRLRCMILKRA-------LQKTIETDKTTLQSGNADNPVKLRVA 948


>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 567

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 105/426 (24%), Positives = 177/426 (41%), Gaps = 41/426 (9%)

Query: 59  PPTAQ-VETFWQWLRDQKVVSPKSPIR-PATFPEGLGLVAQRDIAKNEVVLEVPMKFWIN 116
           P  AQ V  F +WL+ +     K  ++      EG GLVA +DI + E  +E+P   +I 
Sbjct: 64  PTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTGLVATQDIKEGEDFVEIPSNLFIT 123

Query: 117 PDTVAASEIGS--------LCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTV 168
              VA   +G         L   + P I +++FL++E     S W  Y+ +LP+  ++  
Sbjct: 124 -TAVAFQGLGKPPILENDRLIQSI-PGILLSIFLVKELSNPTSEWGPYIKLLPKQYNTVY 181

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFG 227
           +W  +E  + +G+  L   +        +Y  +   I      + P    T D F+WA  
Sbjct: 182 YWGLKEFTQFRGSPNLEYAMRYVRGAMRQYCYLYSMIDRTQSNIMPISSFTWDAFVWAIS 241

Query: 228 ILRSRAFSRLRGQN----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
            ++SR      G      + LIP  D  NHS   +     Y +    + S  +       
Sbjct: 242 TVQSRQNPVYAGNGNGSIMALIPFWDFCNHSSTGSKITSFYHMDSNCMTSGAI------- 294

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP-FFGDKLDIAE 342
              K GEQV + Y   + N +L +  GF    +  ++Y   L + E +     DK+ + E
Sbjct: 295 KDFKKGEQVYMFYGP-RDNTQLLMHAGFATKTNLHDSYPFELHLLEGNHEIRHDKVHLLE 353

Query: 343 TNGLGESAYFDIVLGRT---LPPAMLQYLRLVALGGTDAFLLESIF----RNTIWGH-LD 394
             G+ +    ++    T   LP  ++ + R+ AL   +   +         N   GH L+
Sbjct: 354 ERGIRDGVVVNLNQNPTSNELPLELIPFYRIYALSEQETRAIAPPQVPGEHNHHHGHQLE 413

Query: 395 LP------VSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRA 448
           L       ++  NEE     +  A K  L+ + TT+EEDEQ L+  N     R  +  + 
Sbjct: 414 LKPLAFKIITQENEEKAYSNLVQALKGKLASYPTTLEEDEQELK-KNPPANQRFILYTKI 472

Query: 449 GEKRVL 454
            EK++L
Sbjct: 473 NEKKIL 478


>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 487

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 179/384 (46%), Gaps = 33/384 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISV----ALFLIRE 145
           G G+   R   + E +  +P    W      A S +G      +P +SV    A++++  
Sbjct: 43  GRGVRTLRCFKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFV 102

Query: 146 KKKE---DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           + +E   D P R +L  LP+   S++F++++EL    G+ L + T  +   ++++Y  + 
Sbjct: 103 RSRESGYDGP-RSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRALV 161

Query: 203 EEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNLVLI-PLADLINHSPGIT 258
             +++ ++ LFP    T++D+ WA   + SRA  F    G+++ L+ P AD++NHS  + 
Sbjct: 162 VRLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFADMLNHSSEVR 221

Query: 259 TEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQ-----VLIQYDLNKSNAELALDYGFIE 313
            + +AY+     L       ++      +AG+Q     V I Y    +N  L L YGF+ 
Sbjct: 222 -QCHAYDPLSGNL-------TILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRL-YGFVM 272

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
             +  ++Y L LE     PFF  K  + +  G   ++   I L   LP  +L YLR+   
Sbjct: 273 PGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISITLTDPLPKNVLGYLRIQRS 332

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
             +D   L SI R  I    +  +S +NE  + + + ++    L  F T +E  E+ L  
Sbjct: 333 DESD---LASIARQRIDPKYE-KISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE 388

Query: 434 GNLDPRLR--IAVGVRAGEKRVLQ 455
           G    R     A  V  GE++VL+
Sbjct: 389 GVYPSRGNAWAAAHVSLGEQQVLR 412


>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
           74030]
          Length = 483

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 123/471 (26%), Positives = 203/471 (43%), Gaps = 109/471 (23%)

Query: 62  AQVETFWQWLRDQKV-VSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           A  E F +WL    V ++PK  ++   +   G G+VA  D  ++EVV  +P    +N + 
Sbjct: 8   ATTEAFLEWLSKIGVRINPKMTLKDLKSEGRGRGVVAAADFEEDEVVFCIPRTAVLNVNN 67

Query: 120 V-AASEIGSLCSGL---KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEEL 175
           V A  + G+    L     W+++   ++ E ++ DS W  YL +LP+  DS VFWSEEEL
Sbjct: 68  VFAGQDSGASKEALLQMPNWLALTATMMSEGQQSDSRWAPYLAVLPQKLDSLVFWSEEEL 127

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR---PITLDDF---------- 222
            ELQ +  ++  +G                    +++F +   P+ L +F          
Sbjct: 128 AELQASS-VAKKIGRSSA----------------EEMFTKHISPLGLGEFNVELCHQVAS 170

Query: 223 ---LWAFGILRSRAFSRLRGQN------------------LVLIPLADLINHSPGITTED 261
               +AF I       +  G                    L +IPLAD++N         
Sbjct: 171 VIMAYAFDIPEEEPAKQENGGAEGETDDLVSDDGEDEKTILSMIPLADMLNADA------ 224

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFIESKSDRNA 320
              E   A ++  +    +RT  P+ AGE++   Y  L +S+  L   YG++   ++  A
Sbjct: 225 ---ERNNARIYYENEDLEMRTIKPIMAGEEIFNDYGQLPRSD--LLRRYGYV---TENYA 276

Query: 321 YTLTLEISES-------------------DPFFG----DKLDIAETNGLGESAYFDIVLG 357
               +EIS +                   DP       +++ +A+  G+ E +Y D+ + 
Sbjct: 277 QYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVALADREGILEDSY-DVNIA 335

Query: 358 ----RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPV-SHANEELICRVVRDA 412
               R +P    + L L+ L   D   LE+I    +     LP  S    EL+ +V+   
Sbjct: 336 NAEERAIPD---ELLALLYLFLLDNENLEAI----VTSQSALPSRSKLATELVGKVLVKV 388

Query: 413 CKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ-QIDEIFK 462
            +   + + TT+EEDE+LLQ  NL  R  +A+ VR GEKRVL+  ++E  K
Sbjct: 389 LRHREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRLAVEEAMK 439


>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
          Length = 394

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 136/302 (45%), Gaps = 33/302 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATREIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    +V
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G+ T  Y  E        R    +LR      AGEQ+ I Y   +SNAE  +  GF   
Sbjct: 281 NGLITTGYNLED------DRCECVALRD---FHAGEQIYIFYG-TRSNAEFVIHSGFFFD 330

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLV 371
            +  +   + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+ 
Sbjct: 331 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVF 388

Query: 372 AL 373
            +
Sbjct: 389 CM 390


>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
 gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
          Length = 540

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 110/453 (24%), Positives = 195/453 (43%), Gaps = 48/453 (10%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPA-TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F +WL++      KS ++      EG GLV+  DI + E  LE+P K +I+  T   
Sbjct: 70  VSNFMEWLKNSGFDETKSKVKIGRNLAEGSGLVSTCDIKEGEEFLEIPEKLFIDIMTALK 129

Query: 123 SEIGSLCSGLK-----------PWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
           S  G   SG             P + +AL+LI+E    DS    YL +LP+   +  +W 
Sbjct: 130 S-FGQ--SGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSIAPYLKVLPKTYSTIGYWG 186

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS 231
            E+  +L+G+ +  T +        +Y    +        L     T + F+WA   ++S
Sbjct: 187 IEDFKQLEGSPVFQTAVNYTRGSMRQYCYFYQLFDNNPGILQTSNFTYEAFIWAVATVQS 246

Query: 232 RAFSRLRGQNLVLIPLADLINHSP-GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           R      GQ + LIP  D  NHS  G     +   +K         + +       K GE
Sbjct: 247 RQNPVGGGQEMALIPFWDFCNHSSHGGKITTFIDPVKH--------VLTCSAAKSYKKGE 298

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD--PFFGDKLDIAETN-GLG 347
           QV + Y   + N++  L  GF    +  + Y+  +++   D      DK+ I E   GL 
Sbjct: 299 QVYMYYG-PRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLR 357

Query: 348 ESAYFDIVLG---RTLPPAMLQYLRLVALGGTDAFLL----ESIFRNTIWGHLDLP---- 396
                 +        LP  ++ + R+ AL   +   L    E    +   G +D+     
Sbjct: 358 VGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAF 417

Query: 397 --VSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
             +S  NE+   +++ D+ K+ LSG+ TT+ +DEQ ++  N   + R  + +   EK++L
Sbjct: 418 NIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQEMK-NNPTTQRRYVLYILINEKKIL 476

Query: 455 QQ----IDEIFKEREL--ELDELEYYQERRLKD 481
           ++    ++++ ++  +  EL ++E+ +E    D
Sbjct: 477 ERNIKYVEQLIEKGVMNAELKKIEFSEEESCGD 509


>gi|449016899|dbj|BAM80301.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Cyanidioschyzon merolae
           strain 10D]
          Length = 515

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 100/211 (47%), Gaps = 38/211 (18%)

Query: 307 LDYGFIESKSDRNA----------YTLTLEISESDPFFGDKLDIAETNGLG----ESAYF 352
           L YG+ E  S+ ++           TL  EIS  D F+ DK+DI +         ++   
Sbjct: 289 LQYGYFEEHSNPDSARKLRECVGLVTLAFEISRLDRFYDDKVDILQQQQQQQQRIDATSE 348

Query: 353 DIVLGRTLP------------------PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLD 394
           D + G+T                     AM+Q+LRL+ L G+DAFLLE +FR+ +W  L 
Sbjct: 349 DELAGQTFVVAASDLTSEALQAAENPLSAMVQFLRLLCLSGSDAFLLEGLFRDEVWEFLA 408

Query: 395 LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVG-VRAGEKRV 453
           LPVS  NE ++C +V  +C+  +        ++ Q L   N  P L+ A   V+  E+  
Sbjct: 409 LPVSEGNERMVCELVLASCEERMQAL-----DEAQALCEENFSPELKQAAQLVQEAERTT 463

Query: 454 LQQIDEIFKERELELDELEYYQERRLKDLGL 484
           L+ I   F+     L E EYY ERRL+ L L
Sbjct: 464 LRAIMTYFRNELTSLREKEYYHERRLRALDL 494


>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
 gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
          Length = 377

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 81/277 (29%), Positives = 132/277 (47%), Gaps = 17/277 (6%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F  WL  +      S +R  T  E G GL A R +   E VLE+ +   I P  +     
Sbjct: 42  FMSWLASRNDDGDLSAVRIGTSQESGRGLFASRPVRAGERVLEISLDLMIAPSDLPDELS 101

Query: 126 GSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECT--DSTVFWSEEELVELQGTQ 182
             L S +KPW  +AL ++ E+ K + S W  Y+  LP+    D+T  W + EL  L+ + 
Sbjct: 102 MVLPSTVKPWTKLALIVLMERYKGQSSVWAPYISCLPQPAELDNTFLWEDTELSYLKASP 161

Query: 183 LLSTTLGVKEYVQNEYLKVEEEI-ILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN 241
           L   T    E +  E+ +V+  + + P  QLF + ++L+DF   +  + SR+ +      
Sbjct: 162 LYGKTRERLEMITTEFGQVQNALNVWP--QLFGK-VSLEDFKHVYATVFSRSLAIGEDST 218

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
           LV+IP+ D  NH+       +A ++   GL +  ++ + R        +Q+ I Y  + S
Sbjct: 219 LVMIPMLDFFNHN----ATSFA-KLSFNGLLNYAVVTADRA---YTENDQIWINYG-DLS 269

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKL 338
           NAELALDYGF   ++  +   L  +  E +    D+L
Sbjct: 270 NAELALDYGFTVPENPYDETDLLTQFPEMNTILKDQL 306


>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
          Length = 495

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 103/446 (23%), Positives = 190/446 (42%), Gaps = 62/446 (13%)

Query: 59  PPTAQVET---FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFW 114
           PP  + +T   F +W+ +  V     P+     P  G  + A + ++ +E++  +P K  
Sbjct: 69  PPVKRSDTLPPFLEWMTNHDV--KMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLM 126

Query: 115 INPDTVAASEIGSLCSG-----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVF 169
           ++ +T  +S IG   +        P +++A  ++ E     S W+ YLD LP   D+ ++
Sbjct: 127 LSNETANSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPSSYDTVMY 186

Query: 170 WSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPITLDDFLWAF 226
           ++ +E+ EL+G+      L +   +  +Y     + ++ + P         T +D+ WA 
Sbjct: 187 FTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYRWAV 246

Query: 227 GILRSRAFSRLRGQNLV-------------------LIPLADLINHSPGITTEDYAYEIK 267
             + +R       QNL+                   LIPL D  NH  G  + ++  E +
Sbjct: 247 STVMTR-------QNLIPSQEEISGNDKDQLPPVNALIPLWDFCNHQDGQFSTEFQLESR 299

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
                 R +  + R   P   GEQV I Y   ++ AE  +  GF++  +  +A TL + +
Sbjct: 300 ------RTVCQAGRDFGP---GEQVFIFYG-TRTCAEQFIHNGFVDINNAHDALTLKVGL 349

Query: 328 SESDPFFGD------KLDIAETNGLGESAYFDIVLG-RTLPPAMLQYLRLVAL--GGTDA 378
           S+SDP  G       KL I     +     F +  G + +   +L +LRL  +     D 
Sbjct: 350 SKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAGPQPVDGKLLAFLRLFCMTKDSLDR 409

Query: 379 FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
           +L      N +  H +  +    ++     ++  C+  L  + TT E D ++L+  +L  
Sbjct: 410 WLQSDNASNLM--HEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLKMLE-EDLSS 466

Query: 439 RLRIAVGVRAGEKRVLQQIDEIFKER 464
             R+ V +R  EKR+L    E   +R
Sbjct: 467 HRRMCVLLRLAEKRILLSAIECAAQR 492


>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
          Length = 605

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/407 (25%), Positives = 189/407 (46%), Gaps = 48/407 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWIN----PDT-VAA 122
           +WL+  + + PK  I+   + E   G+ A++ I+  EV+L VP    I      DT VA 
Sbjct: 166 EWLKSGQALFPK--IKIECYAEDYRGVNARKAISSKEVILFVPRSHMITLEMAKDTPVAK 223

Query: 123 SEIGSLCSGLKPWIS-VALFLIREKKKEDSPWRVYLDILPEC-TDSTVFWSEEELVELQG 180
             I      L P  S ++ FL++EKK +DS W+ YLD+LP+  ++  +F+++ +L  L+G
Sbjct: 224 KIIQYRLDLLSPKHSFLSTFLLQEKKIQDSFWKPYLDVLPKSYSNFPIFFNDSDLEWLKG 283

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI--TLDDFLWAFGILRSRAFSRLR 238
           +  L         ++ +Y  +         Q+ P  +  + D+F WA    R  A SR+ 
Sbjct: 284 SPFLKQVKDKITDLKKDYCDI--------CQVAPEFLQNSFDEFCWA----RMTASSRIF 331

Query: 239 GQNL------VLIPLADLINHS-PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQ 291
           G N+        +PLAD++NH  P +T+  Y+ E +G         F + T   ++ G+ 
Sbjct: 332 GINIKGVKTDAFVPLADMLNHKRPKLTSWCYSDERQG---------FIIETDENIEKGQM 382

Query: 292 VLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIA-ETNGLGESA 350
           +   Y  +K N+   L+YGF+   ++ N   + +E   +      K  ++ ET    +S 
Sbjct: 383 IFDSYG-SKCNSRFLLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFPKSF 441

Query: 351 YFDIVLGRTLPPAMLQYLRLVALGGTDAF---LLESIFRNTIWGHLDLPVSHANEELICR 407
              I          + ++R + +     F   L ++ +      H    +S  NE     
Sbjct: 442 RLVIDPNDVSFSDFMSFIRFILIQEEKEFANLLGKNSYIKPTKIHF---ISIQNELATWN 498

Query: 408 VVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           ++ + C  AL+ + TT+E+D ++L+   L    R  + +R GEK++L
Sbjct: 499 LIENICIRALNQYPTTLEQDLEILKICELTTNQRNCLILRMGEKKIL 545


>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 568

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/412 (25%), Positives = 175/412 (42%), Gaps = 55/412 (13%)

Query: 86  ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS-----GLKPWISVAL 140
           A  PEG G+VA++D  K E  L+VP K       +  + +G L          P + +AL
Sbjct: 92  ANLPEGSGVVAKKDFKKGEPFLQVPRKLMFTCQAMQNTPLGQLLKVDKFLAQSPSLCLAL 151

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
            L+ EK    S W  Y+  LP+   + ++++ EEL  L+G+   ++ + V   V  +Y  
Sbjct: 152 HLLVEKHNHSSFWTPYIKTLPKSYGTCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTY 211

Query: 201 VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR-GQNLV----LIPLADLINHSP 255
           + +   +    L     T D+F+WA   + SR     + G N +    LIP  D+ NH  
Sbjct: 212 IHDLFQIRKDILHINAFTWDEFIWAMSAVGSRQNQVPQWGHNALSEYALIPAWDMCNHDH 271

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
           G       +++      S     ++R     K GEQV I Y   + N++L L  GF+   
Sbjct: 272 GDLQT--FWDVNSDSTESH----AMRA---YKKGEQVYIFYG-PRPNSDLLLHAGFVYEN 321

Query: 316 SDRNAYTLTLEISESDPFFGDKLDIAETNGLG-ESAYFDIVLGRTLPPAMLQYLRLVA-- 372
           +  +A  + + ++       DKL +   N +  +S Y+   LG  L   ++ +LR+ A  
Sbjct: 322 NRFDALAIRVRLAPDAEHIKDKLRLLHLNNMKMDSQYYLYGLG--LAVDLMAFLRIHAMN 379

Query: 373 -------LGGTDAFLLESIFRN----------TIWGHLD--LPVSHANEELICRVVRDAC 413
                  LG  D    E+   N             G  D  + ++  NE    ++    C
Sbjct: 380 EQELQQVLGAYDQ--QEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNELAALQLAEAKC 437

Query: 414 KSALSGFHTTI---------EEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
            S LS + TT+         +ED+  L+  +L P +R    +R  EK +L +
Sbjct: 438 LSLLSLYPTTLQVANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEILNR 489


>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1188

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 111/430 (25%), Positives = 184/430 (42%), Gaps = 55/430 (12%)

Query: 60  PTAQVETFWQWLRDQKV-VSPKSPIRPATFPEG-LGLVAQRDIAKNEVVLEVPMKFWINP 117
           P  ++  F  W RD    V  K  +R  T PEG  G  A  D+A  + +  +P+   I+ 
Sbjct: 126 PETRMTNFLSWARDMAGGVFDKIELR-TTGPEGDRGFFATCDLAPGDELASMPIATIISE 184

Query: 118 DTVAASEIG-SLCS-------GLKPWIS---VALFLIREKKKEDSPWRVYLDILPECTDS 166
              + S +G ++ S       G+ P      +  +LI  + K DSP+  Y++ILP+    
Sbjct: 185 QLASRSPVGMAMLSSPMLKRRGVTPIPGRTLICAYLIANRGKLDSPFYHYINILPQTYSD 244

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWA 225
            ++W++ EL  L GT +       +  V+N++L V   +      LFP+ + T + +LWA
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPVLSREQPALFPKDVFTYEAYLWA 304

Query: 226 FGILRSRAF-------------SRLRGQNL------VLIPLADLINHSPGITTEDYAYEI 266
           F    SRAF             S   G  +       L+PL D++NH  G +        
Sbjct: 305 FSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT------ 358

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
                F+ +      T   V+ GEQV   Y   KSN EL + YGF    ++ +   + L 
Sbjct: 359 ----WFTDETSVRFFTGAKVRKGEQVYNNYG-PKSNEELLMGYGFCLPNNEADHVKIQLT 413

Query: 327 ISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFR 386
           +  +DP    KL I   +GL  + +      R++P  +   LR++ +   +      ++ 
Sbjct: 414 VG-NDPDGEAKLAILRWHGLSLTHFLH---NRSVPVELFSALRVLVMTPAEI----CMYS 465

Query: 387 NTIWGHLDLPVSHANEELICRVVRDACKSALS--GFHTTIEEDEQLLQGGNLDPRLRIAV 444
                 L   VS ANE  + R +     + L   G  +    D  LL   +LD  +R+A+
Sbjct: 466 TPQQADLTDRVSLANETRMLRTLSTLLLTRLVQLGGESRDGFDRGLLLQPSLDYHVRLAL 525

Query: 445 GVRAGEKRVL 454
             R G++ +L
Sbjct: 526 TYRVGQRSIL 535


>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
 gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 629

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/423 (24%), Positives = 191/423 (45%), Gaps = 81/423 (19%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWI------SVALFLIREK 146
            +V++R I  +E V+ +P +  I  D  ++S+   + +     +        ALFL++E+
Sbjct: 178 SIVSKRIIQADETVISIPQEQVITLDVASSSDFCKILTEKNTQLVQQKHAYFALFLLQEQ 237

Query: 147 KKED-SPWRVYLDILPECTDSTVF---WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           KK+D S ++ Y+D LP  TD + F   +SEEEL  L+GT  L      KE ++ +Y  + 
Sbjct: 238 KKKDASHYKAYIDSLP--TDLSSFPALFSEEELQYLEGTAALKLVQEQKEDIKTDYESIS 295

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRG-QNLVLIPLADLINHSPGITTE 260
           +  ++P    F    + + F WAF    SR F  +++G +  V++PLAD++NH      E
Sbjct: 296 Q--VIPE---FKSEFSFEQFRWAFLCSHSRVFGIKVKGVKTSVMVPLADMLNHKHS-GQE 349

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
           D  +    A        F+++    ++  +Q+   Y  +K N++L L+YGF+    D   
Sbjct: 350 DSEWVFDDAT-----NCFTVKALKKIQRNQQIHFSYG-SKCNSKLFLNYGFV----DVQD 399

Query: 321 YTLTLEISESDPFFGDKLDIAETNGLGESAYFD---------------------IVLGRT 359
           +T T++  + +   G +L   +  G+    YF+                      ++G +
Sbjct: 400 HTETIKKLKEEERPGRQLIQNDYRGIN---YFNELPVVFCFDQAQDDPTLPIKAKLIGGS 456

Query: 360 LP----------------PAMLQYLRLVALGGTDAF--LLES---IFRNTIWGHLDL--- 395
           +P                   L YLR + +   +    L +S       T   +L     
Sbjct: 457 IPVCCLTRINEDFETQNSTGFLGYLRFICIRDPNQLGKLFDSHKKYIEQTNQSNLPFLPQ 516

Query: 396 ---PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKR 452
              P+S  NE  + + V++     L  + TT+ +D++LL+ G+L    R  V +R GEK 
Sbjct: 517 KTPPLSVENEHKMWQRVQELLIPLLLKYKTTVVKDKELLESGSLSENQRNCVLIRLGEKN 576

Query: 453 VLQ 455
           +++
Sbjct: 577 IIE 579


>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
          Length = 479

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/423 (24%), Positives = 183/423 (43%), Gaps = 52/423 (12%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL+    V   + I  A FP  G G+ A R   K E +L +P    W      A   +G 
Sbjct: 4   WLKHSGAVGVDA-IEVADFPVTGRGVKALRSFKKGERILTIPSACLWTAEAARADPLLGP 62

Query: 128 LCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +    +P +SV       L  ++ +       R+++  +P+   +++F++E+EL   +G+
Sbjct: 63  VLRSAQPPLSVEDTLAIHLLFVKSRTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGS 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-------------------RPITLDD- 221
            L + T  +++ VQ+++ ++  +++  ++ LFP                   R I+L + 
Sbjct: 123 SLHTLTTQLEQRVQDDFRQLLVQLLSQHRDLFPLDQFTIEDVSYIAAFPRPTRSISLMNL 182

Query: 222 -----FLWAFGILRSRA--FSRLRGQNLVLI-PLADLINHSPGITTEDYAYEIKGAGLFS 273
                + WA   + SRA  F+     ++ L+ PLAD++NHS  +  + +AY+     L  
Sbjct: 183 YFPFQYKWALCTIWSRAMDFAVSDTTSVRLVAPLADMLNHSLDV-KQCHAYDPTSGDL-- 239

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF 333
                S+      + G+Q+ I Y    +N  L L YGF+   +  ++Y L L+ S   P 
Sbjct: 240 -----SILAAKDYQVGDQIFIYYGSVPNNRLLRL-YGFVLLDNPNDSYDLVLQTSPMAPL 293

Query: 334 FGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL 393
           +  K  +    GL  +    +     LP  +L+YLR   L   D   +     N      
Sbjct: 294 YEQKERLWALAGLDSTCTIPLTAKHPLPKNVLRYLRTQRLDAADVADMTLQLLNGT---- 349

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG--NLDPRLRIAVGVRAGEK 451
           D  V+  NE  + + + D+  S L GF   +E+ E  L GG          A  V AGE+
Sbjct: 350 DGKVNDGNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQ 409

Query: 452 RVL 454
            +L
Sbjct: 410 GIL 412


>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
 gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
          Length = 459

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 113/436 (25%), Positives = 200/436 (45%), Gaps = 65/436 (14%)

Query: 67  FWQWLRDQKVV-SPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F +WL + KV  +PK  I+      G  +VA++ I KNE ++ VP         +  S +
Sbjct: 38  FNKWLINNKVYKNPKIEIKVLE-KYGRSIVAKQSIKKNEKLISVP-------KLIIMSNM 89

Query: 126 GSLCSGL-----KPWISV--------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSE 172
           G     L     +P IS+        A+FL+  K  + S W  Y+ +LP+   +++++SE
Sbjct: 90  GGFSHHLPNEIYEPSISIGISPTNLQAIFLMYCKLNDKSFWYPYVSVLPKEFTTSIYFSE 149

Query: 173 EELVELQGTQLLSTTL----GVKEYVQNEYLKVEEEII-----LPNKQLFPRPITLDDFL 223
           EEL ELQ ++L   T+    G++ +  + + ++    I        + L  +  TL+ F 
Sbjct: 150 EELDELQSSKLKEFTIIRKDGIERHYNSTFTRLSNRGIAEFSPTSTQTLQQKGYTLELFT 209

Query: 224 WAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR--DLLFSLR 281
           WA   + SRAFS L   +  ++PLAD+ N          A EI  + +  +  D      
Sbjct: 210 WALSCVWSRAFS-LSDSDGGMVPLADMFN----------AEEISKSKVQPKVTDSTLDYY 258

Query: 282 TPVPVKAGEQVLIQYDLNK--SNAELALDYGFI---ESKSDRNAYTLTLEISESDPFFGD 336
               ++ GEQ+   Y + K  S++++ +DYGF+    + SD  A ++ +       F  D
Sbjct: 259 ASDDIEIGEQIFTPYGVYKPLSSSQMLMDYGFVFDHGTPSDNVAISVPI-------FHPD 311

Query: 337 KLDIAETNGL-----GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG 391
           + +I     +      E+  F +     LP  +L Y R+  L   +    +  F +  + 
Sbjct: 312 EPNIQVKQSILEENEVENEVFLLTKKIPLPNDLLLYARVKNLITKETKYAKIHFLS--YQ 369

Query: 392 HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL-QGGNLDPRLRIAVGVRAGE 450
             D P++  NE++  R + +     L  + TT+E D+ L+ Q  NL      ++ +R  E
Sbjct: 370 TKDQPLNIRNEKVSLRFLENLIHRYLDSYETTLEFDKILIQQQNNLSFNSLNSIKIRISE 429

Query: 451 KRV-LQQIDEIFKERE 465
           K + L  ++EI K+RE
Sbjct: 430 KEILLLTLNEIEKKRE 445


>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 389

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 149/308 (48%), Gaps = 34/308 (11%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-EIGSLCSG--LKPWISVALFLIREKKKE 149
           GLVA  ++   E +L VP    I+ D+  ++ E+G +     +  W  +A +LI E   E
Sbjct: 12  GLVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLE 71

Query: 150 DSP-WRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            S  W  Y+D LP    S ++W+  E+   L  + +    +     V   Y  + + I  
Sbjct: 72  GSSRWSSYIDALPRQPYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDLRDRIFS 131

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGITT-EDYA 263
            + +LFP  +  +++F W+FGIL SR   RL   +  + L+P AD++NH+P +    D+ 
Sbjct: 132 KHPELFPEEVYNMENFRWSFGILFSR-LVRLPSMDGKVALVPWADMLNHNPEVDAFLDFD 190

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAY 321
              +G  +F+ D  +        + GEQV I Y   KS+ EL L YGF+  E  +  ++ 
Sbjct: 191 KSSQGI-VFTTDRSY--------QPGEQVFISYG-KKSSGELLLSYGFVPKEGTNPNDSV 240

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTL-----------PPAMLQYLR 369
             ++ +++SD  + +KL   + +GL ES  F + V G  +           PP M+Q   
Sbjct: 241 EFSVSLNKSDDCYREKLQALKRHGLSESESFPLRVTGWPVELMAYAFLVVSPPDMIQRFE 300

Query: 370 LVALGGTD 377
            +A+  ++
Sbjct: 301 EMAVAASN 308


>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
 gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
          Length = 512

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 111/440 (25%), Positives = 194/440 (44%), Gaps = 48/440 (10%)

Query: 52  SASATTNPPTAQVE--------TFWQWLRDQKVVSPKSPIRPATFPEGLG--------LV 95
           S+S    PP   V+         F  WLR + +   K  IR    P  L         + 
Sbjct: 55  SSSEARAPPAPAVDPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVA 114

Query: 96  AQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLCSG--LKPWISVALFLIREKKK-EDS 151
           A  D+   +V  EVPM   +  + V   E I  L +   L     +AL+L+ EKK+ +DS
Sbjct: 115 AGVDLQAGDVAFEVPMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174

Query: 152 PWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
            W  Y+  L           +S + W+E EL  L G+ L    +   E ++ EY +++  
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234

Query: 205 IILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGITT 259
             +        P  +    + F I + +AF  ++      Q + L     L+   P + T
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 293

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
               Y+     + + D   S+R  V  P KAGE ++I +   ++N+ L L+YGF++  + 
Sbjct: 294 ----YKSNCKAMLTADG-DSVRLVVDRPYKAGEPIII-WCGPQTNSRLVLNYGFVDEDNP 347

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYLRLVALGG 375
            +   +   ++  DP + +K  +A+ NG      F++ +G+       ML YLRL  +  
Sbjct: 348 FDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISD 407

Query: 376 TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
            D   ++SI  +   G    P+S   E  +   +    +S L+G+ TT++EDE +L  G+
Sbjct: 408 PDE--MQSILSSE--GDT-CPLSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGS 462

Query: 436 LDPRLRIAVGVRAGEKRVLQ 455
           L+P+  +A  +   EK+++ 
Sbjct: 463 LEPKKEVATRLVRLEKKMIH 482


>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
          Length = 502

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 117/438 (26%), Positives = 192/438 (43%), Gaps = 55/438 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL D  +   +  I+     E  GLVA ++I K E +L VP    I  D+     E+G+
Sbjct: 77  RWLTDSGLPEQRLGIQRVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSEWGCPEVGN 135

Query: 128 LC--SGLKPWISVALFLIREKKKEDSPWRV-YLDILPECTDSTVFWSEEEL-VELQGTQL 183
           +   + +  W  +A +LI E   E S     Y+  LP    S ++W+  EL   L  + +
Sbjct: 136 VLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQPYSLLYWTRPELDAYLVASPI 195

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  + + I   +  LFP  +  L+ F W+FGIL SR   RL   + 
Sbjct: 196 RERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSFGILFSR-LVRLPSMDG 254

Query: 242 -LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
            + L+P AD++NHSP + T     +  G  +F+ D  +        + GEQV I Y   K
Sbjct: 255 RVALVPWADMLNHSPEVETFLDYDKSSGGIVFTTDRSY--------QPGEQVFISYG-KK 305

Query: 301 SNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLG 357
           S+ EL L YGF+  E  +  ++  L + +++SD  + +KL   + NGL E   F + V G
Sbjct: 306 SSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESFPLRVTG 365

Query: 358 RTL-----------PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
             +           PP M Q    +A+  ++        ++     L+ P     EE   
Sbjct: 366 WPVELMAYAFLVVSPPEMSQRFEEMAVAASN--------KSPSKPGLNYP---ELEEQAL 414

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI--DEIFKER 464
           + + D C+S ++ +   +E     LQ              +A    +L+Q+  D    ER
Sbjct: 415 QFILDCCESNIAKYTKFLEGSSGSLQLS--------TNSKQANRTLLLKQLARDLCISER 466

Query: 465 ELELDELEYYQERRLKDL 482
            + L   +Y   RRL+D+
Sbjct: 467 RI-LYRTQYILRRRLRDM 483


>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
          Length = 502

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 117/438 (26%), Positives = 192/438 (43%), Gaps = 55/438 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGS 127
           +WL D  +   +  I+     E  GLVA ++I K E +L VP    I  D+     E+G+
Sbjct: 77  RWLTDSGLPEQRLGIQRVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSEWGCPEVGN 135

Query: 128 LC--SGLKPWISVALFLIREKKKEDSPWRV-YLDILPECTDSTVFWSEEEL-VELQGTQL 183
           +   + +  W  +A +LI E   E S     Y+  LP    S ++W+  EL   L  + +
Sbjct: 136 VLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQPYSLLYWTRPELDAYLVASPI 195

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN- 241
               +     V   Y  + + I   +  LFP  +  L+ F W+FGIL SR   RL   + 
Sbjct: 196 RERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSFGILFSR-LVRLPSMDG 254

Query: 242 -LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
            + L+P AD++NHSP + T     +  G  +F+ D  +        + GEQV I Y   K
Sbjct: 255 RVALVPWADMLNHSPEVETFLDYDKSSGGIVFTTDRSY--------QPGEQVFISYG-KK 305

Query: 301 SNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLG 357
           S+ EL L YGF+  E  +  ++  L + +++SD  + +KL   + NGL E   F + V G
Sbjct: 306 SSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESFPLRVTG 365

Query: 358 RTL-----------PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
             +           PP M Q    +A+  ++        ++     L+ P     EE   
Sbjct: 366 WPVELMAYAFLVVSPPEMSQRFEEMAVAASN--------KSPSKPGLNYP---ELEEQAL 414

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI--DEIFKER 464
           + + D C+S ++ +   +E     LQ              +A    +L+Q+  D    ER
Sbjct: 415 QFILDCCESNIAKYTKFLEGSSGSLQLST--------NSKQANRTLLLKQLARDLCISER 466

Query: 465 ELELDELEYYQERRLKDL 482
            + L   +Y   RRL+D+
Sbjct: 467 RI-LYRTQYILRRRLRDM 483


>gi|226499862|ref|NP_001146209.1| uncharacterized protein LOC100279779 [Zea mays]
 gi|219886187|gb|ACL53468.1| unknown [Zea mays]
 gi|219888379|gb|ACL54564.1| unknown [Zea mays]
 gi|413948639|gb|AFW81288.1| hypothetical protein ZEAMMB73_810642 [Zea mays]
          Length = 494

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 105/419 (25%), Positives = 182/419 (43%), Gaps = 62/419 (14%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVA-ASEIGSLCSGLKP 134
           P+G+G+ A RD+    +  V++E+P++  +           PD +     I  +     P
Sbjct: 109 PDGMGVYASRDVEPLRRARVIMEIPLELMLTITQKKPWMFFPDIIPLGHPIFDVIESTDP 168

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     ED+ W++Y D LP   ECT S +   +E+L+EL+   L S  
Sbjct: 169 ETDWDLRLACLLLYAFDIEDNFWQLYSDFLPSVDECT-SLLLAPKEDLMELEDEDLASQM 227

Query: 188 LGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL---- 242
           L  +E   + + K  ++ I    +   R    + FLWA  I++SR+ + +LR        
Sbjct: 228 LKHQERAIDFWQKHWDKPIPLKLKRLAR--DHERFLWALSIVQSRSVNLKLRMGAFIQDA 285

Query: 243 -VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
            VL P AD++NHSP      + +  K      R L   ++    +K G+++ I Y ++  
Sbjct: 286 NVLAPYADMLNHSPNANCFLH-WRFK-----DRMLEVMIKAGQAIKKGDEMTIDY-MSGV 338

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           N++    YGF    S  N + L    S +       L +    GL +  Y +  L     
Sbjct: 339 NSKFMERYGF---SSPTNPWELINFSSPATIHMDSFLSVFNIAGLHDELYHNSALTS--- 392

Query: 362 PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
                   +V     D  ++ +      W   D+P   + E    + +++ C+  L  F 
Sbjct: 393 --------VVETDFVDGAVVAAARALPTWSDGDVPAIPSVERKSAQALQEECRQMLDSFP 444

Query: 422 TTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
           TTIE+D+Q+L    ++     IA+  R   K +LQ+I          +D L+ YQ+R L
Sbjct: 445 TTIEQDQQILDSDAHISKTREIAIKYRLHRKMLLQKI----------MDALDIYQDRIL 493


>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
 gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
          Length = 389

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 75/259 (28%), Positives = 117/259 (45%), Gaps = 18/259 (6%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLI-REKKKEDS 151
           GL A R I   E +L V     I P+ +       L   +  W  +ALFL+  +KKKE S
Sbjct: 2   GLFASRPIHTGECMLHVSHDLMITPEKLPEEVTKLLSKDVSAWAKLALFLLAHQKKKETS 61

Query: 152 PWRVYLDILPE--CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            W  Y+  LP      ST+FW+++ELV L+ + +   T+  K+ V+ E+   E   +L  
Sbjct: 62  AWAPYISCLPPFGSMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENVCMLMQ 121

Query: 210 K-QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKG 268
           + +LF     L D++     + SRA+     ++L L+P  D  NH           E + 
Sbjct: 122 QVKLFVCSRILTDYI----TVCSRAWGIETIKSLALVPFVDFFNHDANCRAMLSYDEDRH 177

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF-IESKSDRNAYTLTLEI 327
                 D  ++         G+QV+I Y    SNA LALD+GF +          + L +
Sbjct: 178 CAEVVSDRDYA--------TGDQVVISYG-QLSNATLALDFGFALPFNPHDQVAGIWLSL 228

Query: 328 SESDPFFGDKLDIAETNGL 346
           SE DP    KL +  ++ +
Sbjct: 229 SEKDPLRDSKLKLLHSHNM 247


>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
          Length = 458

 Score = 91.7 bits (226), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 102/392 (26%), Positives = 174/392 (44%), Gaps = 42/392 (10%)

Query: 86  ATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISVA---- 139
           A FP  G G+   R   + E +L +P    W      A + +G +    +P +SV     
Sbjct: 18  AHFPAIGRGVRTLRCFKQGERILTIPSGCLWTVEHAYADAVLGPVLRSAQPPLSVEDTLA 77

Query: 140 --LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
             +  +R ++      R ++  LP    S++F+ ++EL    G+ L + T  +++ ++ +
Sbjct: 78  IYILFVRSRESGYDGLRSHVAALPASYSSSIFFEDDELEVCAGSSLYTITRQLEQRIEED 137

Query: 198 YLKVEEEIILPNKQLFP-RPITLDD--FLWAFGILRSRA--FSRLRGQNL-VLIPLADLI 251
           Y  +   +   +  LFP    T+++  + WA   + SRA  F    G  L +L P AD++
Sbjct: 138 YRGLVVRVFGLHLDLFPLNKFTIENVGYKWALCTVWSRAMDFVLPNGNPLRLLAPFADMV 197

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NHSP +  + + Y+     L       S+      +A +QV I Y     N+ L   YGF
Sbjct: 198 NHSPEVK-QCHVYDASSGNL-------SILAGKDYEAEDQVFIYYG-PMPNSRLLRLYGF 248

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLV 371
           +   +  ++Y L L      PF+  K  +  + GL  +    + L   LP  +L+YLR+ 
Sbjct: 249 VIPDNPNDSYDLVLSTHPLAPFYEQKQKLWASAGLNSTCTISLTLDDPLPKNVLRYLRIQ 308

Query: 372 ALGGTD--AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE-DE 428
            L  +D  A  L+ I  N         +S + E  I R + ++  S L  F T +E+  E
Sbjct: 309 RLDESDLAAIALQKIDTNE-------KISDSKEVEILRFLVESIGSLLDSFGTRLEKLQE 361

Query: 429 QLLQ-----GGNLDPRLRIAVGVRAGEKRVLQ 455
           QL +     GGN       A  V  GE++VL+
Sbjct: 362 QLAKGVYPVGGN----AWAAAHVSLGEQKVLR 389


>gi|219886215|gb|ACL53482.1| unknown [Zea mays]
          Length = 413

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 105/419 (25%), Positives = 182/419 (43%), Gaps = 62/419 (14%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G+G+ A RD+    +  V++E+P++  +           PD +     I  +     P
Sbjct: 28  PDGMGVYASRDVEPLRRARVIMEIPLELMLTITQKKPWMFFPDIIPLGHPIFDVIESTDP 87

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     ED+ W++Y D LP   ECT S +   +E+L+EL+   L S  
Sbjct: 88  ETDWDLRLACLLLYAFDIEDNFWQLYSDFLPSVDECT-SLLLAPKEDLMELEDEDLASQM 146

Query: 188 LGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL---- 242
           L  +E   + + K  ++ I    +   R    + FLWA  I++SR+ + +LR        
Sbjct: 147 LKHQERAIDFWQKHWDKPIPLKLKRLAR--DHERFLWALSIVQSRSVNLKLRMGAFIQDA 204

Query: 243 -VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
            VL P AD++NHSP      + +  K      R L   ++    +K G+++ I Y ++  
Sbjct: 205 NVLAPYADMLNHSPNANCFLH-WRFK-----DRMLEVMIKAGQAIKKGDEMTIDY-MSGV 257

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           N++    YGF    S  N + L    S +       L +    GL +  Y +  L     
Sbjct: 258 NSKFMERYGF---SSPTNPWELINFSSPATIHMDSFLSVFNIAGLHDELYHNSALTS--- 311

Query: 362 PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
                   +V     D  ++ +      W   D+P   + E    + +++ C+  L  F 
Sbjct: 312 --------VVETDFVDGAVVAAARALPTWSDGDVPAIPSVERKSAQALQEECRQMLDSFP 363

Query: 422 TTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
           TTIE+D+Q+L    ++     IA+  R   K +LQ+I          +D L+ YQ+R L
Sbjct: 364 TTIEQDQQILDSDAHISKTREIAIKYRLHRKMLLQKI----------MDALDIYQDRIL 412


>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Oreochromis niloticus]
          Length = 607

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 105/396 (26%), Positives = 177/396 (44%), Gaps = 44/396 (11%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L +P K  +  ++   S +G L S  +       +++AL L+ 
Sbjct: 104 EGYGLRATRDIKAEELFLWIPRKMLMTVESAQNSILGPLYSQDRILQAMGNVTLALHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK---- 200
           E+    S W  Y+  LP+  D  +++ +E++  L GTQ +   L   +    +Y      
Sbjct: 164 ERANPASFWLPYIRSLPQEYDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKL 223

Query: 201 VEEEIILPNKQ--LFPR--PITLDDFL--WAFGILRSRAF-------SRLRGQNLVLIPL 247
           V+++ +L + +  LF    P+        WA   + +R         SR+    L LIPL
Sbjct: 224 VQDKGMLGSVELRLFASLTPVMGGKLFDQWAVSSVMTRQNQIPTEDGSRV---TLALIPL 280

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
            D+ NH+ G+ T  Y  E        R    +L+     K  EQ+ I Y   +SNAE  +
Sbjct: 281 WDMCNHTNGLITTGYNLED------DRCECVALQD---YKENEQIYIFYG-TRSNAEFVI 330

Query: 308 DYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AM 364
             GF       +   + L +S+S+  +  K ++    G+  S  F   L    PP    +
Sbjct: 331 HNGFFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVF--ALHCNEPPISAQL 388

Query: 365 LQYLRLVALGGTD--AFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGF 420
           L +LR+  +   +   +LL     N I+  G+ + PVS  NE  +   +       L  +
Sbjct: 389 LAFLRVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLKTY 448

Query: 421 HTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
            TT EED  +L+  +L    R+A+ +R  EK++L++
Sbjct: 449 KTTSEEDRSMLEKPDLSLHSRMAIQLRLAEKQILEK 484


>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
 gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
          Length = 400

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 148/361 (40%), Gaps = 42/361 (11%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-EDS 151
           GL A R I   E ++ +P +  +  + +    +  L S    W  + L ++ E+ K E S
Sbjct: 11  GLFAARSIRAGEQIVRIPHELVLTAEKLDDC-VKKLLSTEYDWCPLTLLILAEQHKGEAS 69

Query: 152 PWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            W  Y+  LP   D  ST+FW +EEL  L+ T+    T   +E + +EY  V +++I   
Sbjct: 70  RWAPYVSCLPSFGDHHSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSV-KDVISSC 128

Query: 210 KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
             +F   I+L  F  A+  + SRA++      + + P  D  NH P           K A
Sbjct: 129 PHVFGEDISLFQFAHAYATVVSRAWNGALSSEISMRPFVDFCNHDPVSHATVSHDTCKDA 188

Query: 270 GLFS-RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS 328
            + + RD             GE+V I Y   +SNA LA+DYGF+   +  +   L +EI 
Sbjct: 189 TIIAERDY----------TKGEEVFISYG-KRSNAVLAVDYGFVLPNNLSDQAELWMEIP 237

Query: 329 ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNT 388
            +DP    KL++     +      D      LP                  +LE +  + 
Sbjct: 238 WNDPLREKKLELMRAFNMPTLQNADGTESGELP-----------------VILEEVLDSA 280

Query: 389 IW-GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVR 447
            W   + LP + A   +IC         +L    + I+E + L       PR  +A   R
Sbjct: 281 DWEDGVVLPQARAFSRIICL-------DSLEDIESMIQESDTLQAPLAYQPRKDVAKETR 333

Query: 448 A 448
           A
Sbjct: 334 A 334


>gi|242089045|ref|XP_002440355.1| hypothetical protein SORBIDRAFT_09g030160 [Sorghum bicolor]
 gi|241945640|gb|EES18785.1| hypothetical protein SORBIDRAFT_09g030160 [Sorghum bicolor]
          Length = 495

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 104/419 (24%), Positives = 182/419 (43%), Gaps = 63/419 (15%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVA-ASEIGSLCSGLKP 134
           P+G+G+ A RD+    +  V++E+P++  +           PD +     I  +     P
Sbjct: 111 PDGMGVYASRDVEPLRRARVIMEIPLELMLTITQRKPWMFFPDIIPLGHPIFDIIESTDP 170

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     ED+ W++Y D LP   ECT S +   +E+L+EL+   L S  
Sbjct: 171 ETDWDLRLACLLLYAFDIEDNFWQLYSDFLPSADECT-SLLLAPKEDLMELEDEDLASQM 229

Query: 188 LGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL---- 242
           L  +    + + K  ++ I    +   R    + FLWA  I++SR+ + +LR        
Sbjct: 230 LQHQRRAIDFWQKHWDKPIPLKLKRLAR--DHERFLWALSIVQSRSVNLKLRMGAFIQDA 287

Query: 243 -VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
            VL P AD++NHSP      + +  K      R L   ++    +K G+++ I Y ++  
Sbjct: 288 NVLAPYADMLNHSPNANCFLH-WRFK-----DRMLEIMIKAGHAIKKGDEMTIDY-MSGV 340

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           N++    YGF    S  N + L    S +       L +    GL +  Y +     TLP
Sbjct: 341 NSKFMERYGF---SSPTNPWELINFSSPATIHMDSFLSVFNIAGLHDELYHN----STLP 393

Query: 362 PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
                ++    +    A           W   D+P   + E    + +++ C+  L  F 
Sbjct: 394 SVETDFVDGAVVAAARAL--------PTWSDGDVPAIPSVERKSAQALQEECRQMLDSFS 445

Query: 422 TTIEEDEQLLQGG-NLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
           TTIE+D+Q+L    +++    IA+  R   K +LQ+I          +D L+ YQ++ L
Sbjct: 446 TTIEQDQQILDSDVHINKTREIAIKYRLHRKMLLQKI----------MDALDIYQDKIL 494


>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
          Length = 320

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 93/174 (53%), Gaps = 10/174 (5%)

Query: 86  ATFPEGLG--LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLI 143
           A + EG+G  L A + I+  E  L+VP    + P  + AS +  L  G +    + L L+
Sbjct: 20  AVWREGMGWALRASKRISPGETFLKVPRHLLLGPHQLRASSLDRLLEGWQLPDCMLLLLM 79

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
            E     S +R YLD+LP+  D+ + WS+EE  EL G+ +L   + ++  +   + ++++
Sbjct: 80  CESVNSSSFFRPYLDLLPDTVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQEMKD 139

Query: 204 EIILPNKQLFPRPI--TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP 255
           ++       FP P+  + + + WA+ ILRSRAF      N  L+PL DL+NH P
Sbjct: 140 KVFDKYPDRFP-PLLFSYERYQWAYSILRSRAFG-----NYTLMPLIDLMNHHP 187


>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
 gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
          Length = 660

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 114/437 (26%), Positives = 185/437 (42%), Gaps = 100/437 (22%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG----------------LKP 134
           G G+VA RDI     +  +P    IN   V  S++  L  G                L P
Sbjct: 38  GRGIVATRDIPAETTLFTIPRNAIIN---VETSDLARLLPGIFDGTLNDAEDEKAEPLDP 94

Query: 135 WISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEY 193
           W S+ L ++RE    E S W+ Y+DILP   D+ +FW+++EL EL+GT L +  +G  E 
Sbjct: 95  WASLILVMLREYLHGEQSYWKPYIDILPTSFDTPIFWTQDELKELEGTVLTAEKIGKSE- 153

Query: 194 VQNEYLKVEE-EIILPNKQLF-PR---PITLDDFL------------WAFGILRSRAFSR 236
             +E L+     I+  N   F P+   P+  +D L            +AF +   +  S 
Sbjct: 154 -SDEMLRTHVLPIVTQNPTAFCPKGAIPLNEEDLLALAHRIGSTIMSYAFDLDDDKEESD 212

Query: 237 LRGQNLV----------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPV 286
              +  V          ++P+AD++N        D+   +          + SLR+   +
Sbjct: 213 AEEEGWVEDRDGLTMLGMVPMADVLN-----ANADFNAHVNHGEKLE---VTSLRS--DI 262

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESDPF----FGDKLDIA 341
           +AG ++L  Y    S +EL   YG++  +  R +   ++ E+  S         D +  A
Sbjct: 263 RAGTEILNYYGPLPS-SELLRRYGYVTPEHHRYDVAEVSWELVRSTLVAHLELSDGILQA 321

Query: 342 ETNGLGESAYFD-IVLGRT------------------LPPAMLQYLR--LVALGGTDAFL 380
               LGE    D  VL R                   +P  + + L+  L AL      L
Sbjct: 322 VETQLGEDELEDYFVLERDSGEPSDEGRLVQPPQTCEVPTELSEQLKTILKALKKQQPEL 381

Query: 381 LESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL 440
           + S  R               +E++  V+ +A    LS + T++EEDE+LL+  +L  R 
Sbjct: 382 IGSAVR--------------RDEILHAVIGEALNRKLSEYATSVEEDEELLENSSLTKRH 427

Query: 441 RIAVGVRAGEKRVLQQI 457
           R+A+ VR GEKR+L ++
Sbjct: 428 RLAIEVRLGEKRLLHEL 444


>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 127/263 (48%), Gaps = 27/263 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-E 149
           G  L A + I   + +L+VP    I PD ++     SL   +     +A  LIREKKK +
Sbjct: 70  GRSLFASKVIHAGDCMLKVPFNVQITPDELSPDIRVSLTDEVGNIGKLAAVLIREKKKGQ 129

Query: 150 DSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            S W  Y+  LP+  +  ST+FW E+E   ++ + +   T+  K  ++ E+  V +    
Sbjct: 130 KSRWVPYISRLPQPAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFSFVAQAF-- 187

Query: 208 PNKQLFPRPIT---LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH---SPGITTED 261
             KQ +P  I    L+DF++A+ ++ SRA+   +G  + LIP AD +NH   S  I   D
Sbjct: 188 --KQHYPMVIERPYLEDFMYAYALVGSRAWETSKG--ISLIPFADFMNHDGLSASIVLSD 243

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
              ++      + D  +S         G++V I+Y    SNA L LD+GF    +  +  
Sbjct: 244 EDNQLSEV---TADRNYS--------PGDEVFIKYG-EFSNATLMLDFGFTVPYNIHDEV 291

Query: 322 TLTLEISESDPFFGDKLDIAETN 344
            + +++   DP    KL + +T+
Sbjct: 292 QIQMDVPNDDPLRDMKLGLLQTH 314


>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
          Length = 505

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 110/417 (26%), Positives = 186/417 (44%), Gaps = 40/417 (9%)

Query: 67  FWQWLRDQKVVSPKSPI--RPA-TFPEGLGL-----VAQRDIAKNEVVLEVPMKFWINPD 118
           F  WLR+  +   K  I  RP   F EG  L      A +D+   +V  EVPM   +  +
Sbjct: 71  FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130

Query: 119 TVAASE-IGSL--CSGLKPWISVALFLIREKKK-EDSPWRVYLDILPE-------CTDST 167
            V   E +  L   + L     +AL+L+ EKK+ +DS W  Y+  L           +S 
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190

Query: 168 VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG 227
           + W+E EL  L+G+ +    +   E ++ EY +++    +        P  +    + F 
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250

Query: 228 ILRSRAFSRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           I + +AF  ++      Q + L     L+   P + T  Y    K       D   S+R 
Sbjct: 251 IFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT--YKSNCKAMLTAVGD---SVRL 304

Query: 283 PV--PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
            V  P KAGE +++ +   + N+ L L+YGFI+  +  +   +   ++  DP F +K  +
Sbjct: 305 VVDRPYKAGEPIIV-WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMV 363

Query: 341 AETNGLGESAYFDIVLGRTLP--PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS 398
           A+ NG      F + +G+       ML YLRL  +   D   ++SI  +   G    PVS
Sbjct: 364 AQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDE--MQSILSSE--GDT-CPVS 418

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
              E  +   +    +S L+ + TT++ED+ +L  GNL+P+  +A  +   EK++L 
Sbjct: 419 PCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLH 475


>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 451

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 81/293 (27%), Positives = 134/293 (45%), Gaps = 38/293 (12%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           +  F  WL ++   +  S +  +    G  L A + I   + +L+VP    + PD +   
Sbjct: 34  LHNFLPWLENKASSTISSSLSISNSSYGNSLFASKSIQTGDCILQVPYSLQLTPDNLPPE 93

Query: 124 EIGSLCSGLKPWIS---------VALFLIREKKKEDSPWRVYLDILPECTD--STVFWSE 172
                   +KP+IS           + LI +   +DS W  Y+  LP   +  +T+FW+E
Sbjct: 94  --------IKPFISEDVGNIAKLATVLLIHKNLGQDSEWHPYISCLPPQAEMHNTIFWNE 145

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSR 232
            EL  ++ + +   T+  K  ++ ++L++ + +  P  Q F    T  DF+ A  ++ SR
Sbjct: 146 SELEMIRQSSVYQETIYQKSQIEKDFLEI-KPVFQPFCQSFGD-FTWKDFMHACTLVGSR 203

Query: 233 AFSRLRGQNLVLIPLADLINH---SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           A+   +G  L LIP AD +NH   S  I   D   +           +FS R  VP   G
Sbjct: 204 AWGSTKG--LSLIPFADFLNHDGISEAIVMSDDDNKCSE--------VFSDRDYVP---G 250

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAE 342
           EQVLI+Y    SNA L LD+GF    +  +   +  +I + DP    KL++ +
Sbjct: 251 EQVLIRYG-KFSNATLMLDFGFTIPYNIYDQVQIQYDIPKYDPLRHTKLELLQ 302


>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
 gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
          Length = 505

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 103/409 (25%), Positives = 168/409 (41%), Gaps = 62/409 (15%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD-T 119
           +A  E   +WL    + + K  +  +   EG GLVA  +I + E +L V     I  +  
Sbjct: 61  SATREDLTRWLASNGLRAQKMTLE-SNLAEGRGLVATEEIKRGEALLGVDASCLITVERA 119

Query: 120 VAASEIGSLCSGLKPWISVALFLIREKKKEDS----PWRVYLDILPECTDSTVFWSEEEL 175
           +A +++G   + L+ W  +A FL ++    +S     +  Y+  LP  T S + W E+E+
Sbjct: 120 IAEAKLGPRHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTGSVLDWPEDEV 179

Query: 176 -VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF 234
              L+G+         +E V     ++        +  FP  IT     WAF IL    F
Sbjct: 180 ETLLKGSPSRLAAAERQESVNAAIAEI--------RSSFP-DITEGALRWAFDIL----F 226

Query: 235 SRL-----RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           SRL      G  L L+P AD++NH PG        ++ G+ +       +L T     AG
Sbjct: 227 SRLIRLDAMGGELALVPWADMLNHKPGCAA---FIDLNGSAV-------NLTTDRAYAAG 276

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDR--NAYTLTLEISESDPFFGDKLDIAETNGLG 347
           EQV   Y   + ++EL + YGF     +   + Y+LTL +  +DP+   K D+    GL 
Sbjct: 277 EQVWASYG-QRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRRMGLS 335

Query: 348 ESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS--------- 398
               F + L    P  +LQY   +         LE + R    G  +   S         
Sbjct: 336 PVETFPLRL-NGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVRGLA 394

Query: 399 --------------HANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
                            E  +  ++ D C  ALS +  ++E+D+ + +G
Sbjct: 395 QGQARGKQGVILGGVPGEIAVREMLADMCAEALSAYPNSLEKDKGIAEG 443


>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Vitis
           vinifera]
          Length = 480

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 79/252 (31%), Positives = 120/252 (47%), Gaps = 17/252 (6%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
            + F  WL  +  V   S +       G  L A + I   + +L+VP    I+PD V  S
Sbjct: 42  CDDFLPWLEQKAGVEISSVLSIGKSTYGRSLFASKSIQTGDCILKVPYNVQISPDNVP-S 100

Query: 124 EIGSLCSGLKPWISVALFLIRE--KKKEDSPWRVYLDILPECTD--STVFWSEEELVELQ 179
           +I SL       I+    +I    K  +DS W  Y++ LP+  +  ST+FWSE EL  +Q
Sbjct: 101 KINSLLGDEVGNIAKLAIVISVEWKMGQDSEWAPYINRLPQPGEMHSTIFWSEGELKMIQ 160

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG 239
            + +   T+  K  +Q ++L ++  +   ++ LF + I+L +F+ A  ++ SRA+   +G
Sbjct: 161 QSSVYQETINQKAQIQKDFLAIKPVLHHFSENLF-KDISLKEFMHACALVGSRAWGSTKG 219

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
             L LIP AD +NH     +     E K       D     R   P   GEQVLI+Y   
Sbjct: 220 --LSLIPFADFVNHDGFSDSVLLGDEDKQLSEVIAD-----RNYAP---GEQVLIRYG-K 268

Query: 300 KSNAELALDYGF 311
             NA L LD+GF
Sbjct: 269 FPNATLLLDFGF 280


>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
 gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
          Length = 509

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 110/417 (26%), Positives = 186/417 (44%), Gaps = 40/417 (9%)

Query: 67  FWQWLRDQKVVSPKSPI--RPA-TFPEGLGL-----VAQRDIAKNEVVLEVPMKFWINPD 118
           F  WLR+  +   K  I  RP   F EG  L      A +D+   +V  EVPM   +  +
Sbjct: 75  FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134

Query: 119 TVAASE-IGSL--CSGLKPWISVALFLIREKKK-EDSPWRVYLDILPE-------CTDST 167
            V   E +  L   + L     +AL+L+ EKK+ +DS W  Y+  L           +S 
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194

Query: 168 VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG 227
           + W+E EL  L+G+ +    +   E ++ EY +++    +        P  +    + F 
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254

Query: 228 ILRSRAFSRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           I + +AF  ++      Q + L     L+   P + T  Y    K       D   S+R 
Sbjct: 255 IFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT--YKSNCKAMLTAVGD---SVRL 308

Query: 283 PV--PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
            V  P KAGE +++ +   + N+ L L+YGFI+  +  +   +   ++  DP F +K  +
Sbjct: 309 VVDRPYKAGEPIIV-WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMV 367

Query: 341 AETNGLGESAYFDIVLGRTLP--PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS 398
           A+ NG      F + +G+       ML YLRL  +   D   ++SI  +   G    PVS
Sbjct: 368 AQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDE--MQSILSSE--GD-TCPVS 422

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
              E  +   +    +S L+ + TT++ED+ +L  GNL+P+  +A  +   EK++L 
Sbjct: 423 PCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLH 479


>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
          Length = 485

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 93/327 (28%), Positives = 142/327 (43%), Gaps = 40/327 (12%)

Query: 50  LHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEV 109
           L  A +T     A+ E    WL ++ +  PK     AT   G GLVA + I K E +L +
Sbjct: 44  LRGAGSTEIATDAEGEELKAWLIERGLPPPKLAA-AATPGSGRGLVAAQPIGKGESLLSI 102

Query: 110 PMKFWINPDTVAASEIGSLCSGLKP---------WISVALFLIREKKKEDSP-WRVYLDI 159
           P +  + P    A+ +   C  L+P         W  +AL+L  ++    +  W  Y+ +
Sbjct: 103 PQQLVLTP----AAALEQSC--LRPLLEEQPLPAWSVLALWLAEQRAAGSAGGWWPYVRL 156

Query: 160 LPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITL 219
           LPE T   + WSEEE+  L G+QL S  L ++   +  +   E + +L   +   R    
Sbjct: 157 LPERTGCVLEWSEEEVEWLCGSQLHSDALEIRAAAEASW--AEMQAVLAAAKAQGRAPAH 214

Query: 220 DDF-----LWAFGILRSRAFSRLR--GQNLVLIPLADLINHSPGITT-EDYAYEIKGAGL 271
             F      WAF +L SR   RL   G    L+P ADL+NH     +  D++        
Sbjct: 215 GAFGRAQLQWAFAVLLSR-LVRLAGLGDQEALLPWADLLNHDCAAASFLDWS-------- 265

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE--SKSDRNAYTLTLEISE 329
            + +    LR     +AGEQ+LI Y   K++ EL L YGF      +  +   L LE++ 
Sbjct: 266 -ATEAAVVLRAERRYRAGEQLLISYG-QKTSGELLLSYGFCPDLGSNPHDGCRLLLELAP 323

Query: 330 SDPFFGDKLDIAETNGLGESAYFDIVL 356
            D     K      +GL  S  F + +
Sbjct: 324 GDAARNWKAAALRQHGLAASQLFPLRM 350


>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
 gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
          Length = 463

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 144/328 (43%), Gaps = 37/328 (11%)

Query: 87  TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREK 146
           T P+  GL A RDI   E +L V     +  D +       L SG+  W  +AL L+ EK
Sbjct: 58  TGPDKRGLFAVRDIKAGECILRVSRDTMMTADRLPLEFQQLLSSGVSEWAQLALLLLFEK 117

Query: 147 KK-EDSPWRVYLDILPE--CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           +  E S W  Y+  LP      ST FW +EEL  +Q + L   T+  +  ++ E+ ++ +
Sbjct: 118 RAGEASIWAPYISCLPRWGTIHSTAFWRKEELAMIQESSLSYETMSRRAAIREEFNEM-Q 176

Query: 204 EIILPNKQLFPRPITLDDFLWAF--GILRSRAFSRLRG-QNLVLIPLADLINH---SPGI 257
            I    + +F  P++   F  A+    + SRA+ R+ G + L ++P AD +NH   S  +
Sbjct: 177 PIFQRYEHVFGGPVSYASFKHAYVTATVCSRAW-RIDGLEKLAMVPFADFMNHDWSSNAM 235

Query: 258 TTEDY---AYEIKGAGLFSRDL---LFS--LRTPVPVKAGEQVLIQYDLNKSNAELALDY 309
            T D    + E++   ++S  L   LF   L       AGEQV I +     NA LALD+
Sbjct: 236 LTYDTDNGSTEVEEVKVYSDCLDIALFCAQLFADKNYAAGEQVTISFG-PLCNASLALDF 294

Query: 310 GFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL--------GESAYFDIVL----- 356
           GF    +  +   L L IS  D    +KL     + +         +S      L     
Sbjct: 295 GFTVPYNPWDKVQLWLGISRRDSLRKEKLQYLHAHEMLTLTNPDGSDSGGMSFTLREVCD 354

Query: 357 ----GRTLPPAMLQYLRLVALGGTDAFL 380
               GR LP  +    R+V    ++A L
Sbjct: 355 SAGNGRELPGPLFTMARVVCATSSEALL 382


>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 512

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 109/422 (25%), Positives = 188/422 (44%), Gaps = 40/422 (9%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP-------EGLGLV-AQRDIAKNEVVLEVPMKF 113
           +  + F  WLR + +   K  IR    P       + L  V A  D+   +V  EV M  
Sbjct: 73  SATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVSAVVDLQAGDVAFEVSMSL 132

Query: 114 WINPDTVAASE-IGSLCSG--LKPWISVALFLIREKKK-EDSPWRVYLDILPE------- 162
            +  + V   E I  L +   L     +AL+L+ EKK+ +DS W  Y+  L         
Sbjct: 133 VVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQL 192

Query: 163 CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF 222
             +S + W+E EL  L G+ L    +   E ++ EY +++    +        P  +   
Sbjct: 193 AVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTE 252

Query: 223 LWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL 277
            + F I + +AF  ++      Q + L     L+   P + T    Y      + + D  
Sbjct: 253 AFPFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT----YRSNCKAMLTADG- 306

Query: 278 FSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFG 335
            S+R  V  P KAGE ++I +   ++N+ L L+YGF++  +  +   +   ++  DP + 
Sbjct: 307 DSVRLVVDRPYKAGEPIII-WCGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQ 365

Query: 336 DKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFLLESIFRNTIWGHL 393
           +K  +A+ NG      F++ +G+       ML YLRL  +   D   ++SI  +   G  
Sbjct: 366 EKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISNPDE--MQSILSSE--GDT 421

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRV 453
             PVS   E  +   +    +S L+G+ TT++EDE +L  GNL+P+  +A  +   EK++
Sbjct: 422 -CPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKM 480

Query: 454 LQ 455
           L 
Sbjct: 481 LH 482


>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
 gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
          Length = 508

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 105/396 (26%), Positives = 168/396 (42%), Gaps = 54/396 (13%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F  WLR + + +    +R     EG GLVA R +++ EV++++P    I  +  A     
Sbjct: 25  FQSWLRSEGLSTQPLLLRHCGR-EGRGLVASRSLSRGEVLVKLPDHLVITAERAAGE--- 80

Query: 127 SLCSGLKPWISVALFLIREKKK-----EDSP----WRVYLDILPECTDSTVFWSEEELVE 177
                   W  +AL L   K +       SP    W  Y+ +LP+   + + W  +E+ +
Sbjct: 81  --------WSLLALLLAEVKGRLAAGDRSSPAAARWGPYVAVLPQRPGTLLDWPAKEVQQ 132

Query: 178 L-QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-LFPR--PITLDDFLWAFGILRSRA 233
           L +G+ L      +       + ++E  I       L P   P++  D  WAFG+L SR 
Sbjct: 133 LLRGSPLQRLADSITSAASASWRELEPLIAQGRADGLVPEHVPLSKGDLEWAFGVLLSRC 192

Query: 234 FSRL--RGQNLVLIPLADLINHSPGITTE-----DYAYEIKG--------AGLFSRDLLF 278
             RL  RG   VL P AD +NH   ++ E     D+++++ G        AG  ++  L 
Sbjct: 193 I-RLPSRGDLQVLAPWADQLNHD--VSAEEGCHLDWSWDVAGPAVPGGDRAGGATKGALV 249

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS---ESDPFFG 335
            LR   P  AG+QV + Y   KS+ EL L YGF    +        L ++   + DP   
Sbjct: 250 -LRADRPYAAGQQVYVSYG-PKSSGELLLSYGFCPPPASNPHQDCRLRVAVDRQGDPLAD 307

Query: 336 DKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG---GTDAFLLESIFRNTIWGH 392
            K      +GL     F + L   +P  +LQYL  +        + F L S+   +  G 
Sbjct: 308 LKEQALARHGLPSELEFPLKL-EGIPEGLLQYLAFLDARPKVAQETFELASVLFES--GG 364

Query: 393 LDLPVSHANEELICRVVRDACKSALSGFHTTIEEDE 428
             L        L  R + + C +AL  + T++E D+
Sbjct: 365 FPLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400


>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 523

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 115/466 (24%), Positives = 196/466 (42%), Gaps = 46/466 (9%)

Query: 34  KPSFRLKSRAFAATC---SLHSASATTNPPT-------AQVETFWQWLRDQKVVSPKSPI 83
           +P  R  SR   A C   +L S+S    PP+       +    F  WL    +   K  I
Sbjct: 46  QPGSRSCSRLRLAACHADTLLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAI 105

Query: 84  RPATFPEGLG--------LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGL 132
                P   G        + A +D+   +V  EVPM   +  + V   E +  L   + L
Sbjct: 106 LERPVPCSRGGRDRPLHFVAAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKL 165

Query: 133 KPWISVALFLIREKKK-EDSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLL 184
                +AL+L+ EKK+  DS W  Y+  L           +S + W+E EL  L G+ + 
Sbjct: 166 SELACLALYLMYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMR 225

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG----- 239
              +   E ++ EY +++    +        P  +    + F I + +AF  ++      
Sbjct: 226 DEVVVRDEGIKKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK-QAFVAVQSCVVHL 284

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
           Q + L     L+   P + T  Y    K A L + D    L    P KAGE +++ +   
Sbjct: 285 QKVSLARRFALVPLGPPLLT--YKSNCK-AMLTAVDGSVRLLVDRPYKAGEPIIV-WCGP 340

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR- 358
           + N+ L L+YGF++  +  +   +   ++  DP + +K  +A+ NG      F + +G+ 
Sbjct: 341 QPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKE 400

Query: 359 -TLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
                 ML YLRL  +   D   ++ I  +        PVS  +E  +   +    KS L
Sbjct: 401 KQTISEMLPYLRLGYISDPDE--MQCILSSE---GDTCPVSPCSERAVLDQLVVYLKSRL 455

Query: 418 SGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKE 463
           +G+ T ++EDE +L  G+L+P+  +A  +   EK++L    E   E
Sbjct: 456 AGYPTNLDEDEAMLADGSLEPKKEVATRLVRLEKKMLHGCLEAANE 501


>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
 gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
          Length = 403

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 140/314 (44%), Gaps = 23/314 (7%)

Query: 42  RAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIA 101
           R F+ +  +H+ +         +E F  WLR +        +       G  L A R + 
Sbjct: 18  RRFSCSTRIHAYAQNLGSSRTHLE-FMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRPMC 76

Query: 102 KNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDIL 160
             E +++      + P+ +   E+ +L      +  V+L ++ EK+K ++S W  Y++ L
Sbjct: 77  AGECMIKFSQNLVLTPEKLPC-EVIALLDQANEFTRVSLLVMAEKRKGQNSAWAPYIECL 135

Query: 161 PECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPIT 218
           P   +  ST+FW  +EL  L+ + +   T      +Q+EY +V++ ++     L+   ++
Sbjct: 136 PSFGEIHSTIFWDPKELACLECSPIHRGTGERNALLQSEYREVKK-VVESCPHLYDPDVS 194

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP-GITTEDYAYEIKGAGLFSRDLL 277
           L+ F   +  + SRA+ +    ++ +IPL D  NH P   T   +A +     + SRD  
Sbjct: 195 LEQFKHEYATVSSRAWGQGPHSDMTMIPLVDFANHDPRSRTLFSHADDNCTVVVASRDY- 253

Query: 278 FSLRTPVPVKAGEQ-----VLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
                    + G++     V I Y  + SNA LALDYGF+   +  +   + LEI   DP
Sbjct: 254 ---------QTGDENFHLKVHICYG-DHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDP 303

Query: 333 FFGDKLDIAETNGL 346
               KL     N +
Sbjct: 304 LREIKLQYMAQNNM 317


>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
 gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
          Length = 336

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 140/314 (44%), Gaps = 23/314 (7%)

Query: 42  RAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIA 101
           R F+ +  +H+ +         +E F  WLR +        +       G  L A R + 
Sbjct: 18  RRFSCSTRIHAYAQNLGSSRTHLE-FMSWLRSRGEDMNSIAVAIGMSKHGRALFAHRPMC 76

Query: 102 KNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDIL 160
             E +++      + P+ +   E+ +L      +  V+L ++ EK+K ++S W  Y++ L
Sbjct: 77  AGECMIKFSQDLVLTPEKLPC-EVIALLDQANEFTRVSLLVMAEKRKGQNSAWAPYIECL 135

Query: 161 PECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPIT 218
           P   +  ST+FW  +EL  L+ + +   T      +Q+EY +V++ ++     L+   ++
Sbjct: 136 PSFGEIHSTIFWDPKELACLECSPIHRGTGERNALLQSEYREVKK-VVESCPHLYDPDVS 194

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP-GITTEDYAYEIKGAGLFSRDLL 277
           L+ F   +  + SRA+ +    ++ +IPL D  NH P   T   +A +     + SRD  
Sbjct: 195 LEQFKHEYATVSSRAWGQGPHSDMTMIPLVDFANHDPRSRTLFSHADDNCTVVVASRDY- 253

Query: 278 FSLRTPVPVKAGEQ-----VLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
                    + G++     V I Y  + SNA LALDYGF+   +  +   + LEI   DP
Sbjct: 254 ---------QTGDENFHLKVHICYG-DHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDP 303

Query: 333 FFGDKLDIAETNGL 346
               KL     N +
Sbjct: 304 LREIKLQYMAQNNM 317


>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 524

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 105/401 (26%), Positives = 169/401 (42%), Gaps = 62/401 (15%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD-TVAASEIGS 127
           +WL  +++   K  +      EG GLVA  +I + E +L VP    I  +  +A +++G 
Sbjct: 86  RWLEGRRLPGQKMALE-VNLAEGRGLVATEEIKRGEALLGVPRTTLITVERAIAEAKLGP 144

Query: 128 LCSGLKPWISVALFLIREKKKEDS----PWRVYLDILPECTDSTVFWSEEELVEL-QGTQ 182
             + L+ W  +A FL ++    +S     +  Y+  LP  T S + W E+E+ +L +G+ 
Sbjct: 145 KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTGSVLDWPEDEVDKLLKGS- 203

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL----- 237
              + L   E  Q+      +EI    +  FP  IT+    WAF IL    FSRL     
Sbjct: 204 --PSRLAAAER-QDSVNAAIDEI----RSYFPE-ITVGALRWAFDIL----FSRLIRLDA 251

Query: 238 RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
            G  L L+P AD++NH PG        ++ G  +       +L T      GEQV   Y 
Sbjct: 252 MGGELALVPWADMLNHKPGCAA---FIDLNGDAV-------NLTTDRSYVKGEQVWASYG 301

Query: 298 LNKSNAELALDYGFIESKSDR--NAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIV 355
             + ++EL + YGF     +   + Y LTL +  +DP    K  +    GL     F + 
Sbjct: 302 -QRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLR 360

Query: 356 LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS----------------- 398
           L    P  +LQY   +         L+ + ++   G  ++  S                 
Sbjct: 361 L-NGYPRQLLQYASFILCNPEKPSELKGLAQSAFTGSANIGQSIFDSVRGLTNGKARGKQ 419

Query: 399 ------HANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
                  A E  +  ++ D C  ALS +  T+E+D+ L QG
Sbjct: 420 GVILGGVAGEIAVREMLADLCAEALSAYPNTLEKDKGLAQG 460


>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
 gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
          Length = 266

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/227 (30%), Positives = 112/227 (49%), Gaps = 20/227 (8%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK---- 148
           GL A R +   E VLE+ +   I P  +       L S +KPW  +AL ++ E+ K    
Sbjct: 22  GLFASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERYKGQAK 81

Query: 149 -EDSPWRVYLDILPECT--DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
            + S W  Y+  LPE    D+T  W + EL  L+ + L   T    E +  E+ +V+  +
Sbjct: 82  LQSSAWAPYISCLPEPAELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQNAL 141

Query: 206 -ILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
            + P  QLF + ++L+DF   +  + SR+ +      LV+IP+ D  NH+       +A 
Sbjct: 142 DVWP--QLFGK-VSLEDFKHVYATVFSRSLAIGEDSTLVMIPMLDFFNHN----ATSFA- 193

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           ++   GL +  ++ + R        +Q+ I Y  + SNAELALDYGF
Sbjct: 194 KLSFNGLLNYAVVTADRD---YAENDQIWINYG-DLSNAELALDYGF 236


>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
 gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
          Length = 453

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 114/422 (27%), Positives = 180/422 (42%), Gaps = 58/422 (13%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F+QWL      +   P++PA FPE G GL+A + +   E++L +P +  I  D +  S +
Sbjct: 33  FFQWLHRNGCRN--VPLKPAVFPETGRGLMATKALKHEELILVIPKRLLITIDAIMDSYL 90

Query: 126 GSLC----SGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
                   S L P  ++A+FL+ EK ++E S WR Y+DILPE      F++EE+   L  
Sbjct: 91  APYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILPEEYTCPAFFTEEDFRLLPN 150

Query: 181 TQLLSTTLGVKEY-VQNEYLKVEEEIILPNKQLFP---RPITLDDFLWAFGILRSRAFS- 235
           +  L      K+Y    E++++     +    LFP         DF WA+  +++RAF  
Sbjct: 151 S--LRGKAKAKKYECHKEFMELAPFFKML-ADLFPDQEDAFNFKDFKWAWSAIKTRAFDV 207

Query: 236 --------RLRG----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
                   RLR      N  + PL D INH+       + Y  K   L S       RT 
Sbjct: 208 PLGGETCYRLRDSEDTSNPTMFPLVDSINHAAQAKIR-HRYNEKRRCLES-------RTE 259

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEI-----SESDPFFGD 336
              +   +V+  Y     N  L L++GF+   + +D   + L  ++      E++     
Sbjct: 260 TVYRRHAEVMNSYG-RADNDNLLLEFGFVVPGNPADTVTFHLVQDVLEYLQPENNELLER 318

Query: 337 KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL--D 394
           K+     N L      D+  G T     L  + +V L   D    E ++ +   G    D
Sbjct: 319 KIMFLARNNLIS----DLTCGVTGVSPQLLIVLMVFLCKQDD---EDVWEDLRTGRFEED 371

Query: 395 LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
            P+     ++  R++       L  + TTIEEDE  L+     P  R+AV      KR+L
Sbjct: 372 GPLDVRARKVAGRLL----SLRLRTYATTIEEDEGRLKEVT-SPCGRLAVYCGMERKRLL 426

Query: 455 QQ 456
           Q 
Sbjct: 427 QN 428


>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
          Length = 465

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 120/480 (25%), Positives = 197/480 (41%), Gaps = 111/480 (23%)

Query: 65  ETFWQWLR---------DQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWI 115
           E F QW +         D K+V  +   R A    G G++A RDI     +  +P K  I
Sbjct: 11  EKFLQWFKSLPGSTFSDDIKIVDLRD--RNA----GRGIIALRDIPAETTLFTIPRKGSI 64

Query: 116 NPDTV----AASEIGSL-------CSGLKPWISVALFLIREKKKEDSP-WRVYLDILPEC 163
           N +T        ++  L         GL  W S+ L +I E  + DS  W+ Y D+LP  
Sbjct: 65  NIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIMIYEYLRGDSSQWKSYFDVLPSS 124

Query: 164 TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFL 223
            D+ +FWSE EL +LQ +  +   +G K   +N + K    II  N  +F      D  L
Sbjct: 125 FDTPMFWSENELDQLQASH-MRHKIG-KADAENMFKKTLVPIIRSNPSIFNAENRSDSEL 182

Query: 224 -------------WAFGILRSRAFSRLR--------GQNLV-LIPLADLINHSPGITTED 261
                        +AF +                  G++++ ++P+AD++N         
Sbjct: 183 VEIAHRMGSTIMAYAFDLENDEEEEEETEEWVEDRDGKSMMGMVPMADILNADAEFN--- 239

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NA 320
                  A +   +   ++ +  P+KAGE++L  Y  +  N+EL   YG++  K  R + 
Sbjct: 240 -------AHVNHEEESLTVTSLRPIKAGEEILNYYGPH-PNSELLRRYGYVTEKHSRYDV 291

Query: 321 YTLTLEISES--------------------------DPFFGDKLDIAETNGLG---ESAY 351
             +  +I ES                          D F  ++ D  E N  G   E A 
Sbjct: 292 VEIPWDIVESVLTNFGISSKILKQIRGEFEGEEEFEDTFVLER-DTGEINSDGTFAEPAK 350

Query: 352 FDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRD 411
           F+      +P  + + L+    G      ++ +  +TI      P     +E+   V+  
Sbjct: 351 FE-----GMPEDLQEQLKSFLKG------IKKVQSDTI------PDKRKRDEIHQAVLVK 393

Query: 412 ACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER-ELELDE 470
             ++  + + T+I ED+ LL G NLD R R+A  VR GEK++LQ+    F E  E+ +D+
Sbjct: 394 TLEALAARYPTSISEDQTLLNGQNLDQRARMATVVRLGEKKLLQEAIATFSEDVEMTMDD 453


>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
 gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
          Length = 425

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 111/431 (25%), Positives = 176/431 (40%), Gaps = 71/431 (16%)

Query: 88  FPEGLGLVAQRDIAKNEVVLEVPMKFWINPD-TVAASEIGSLCSGLKPWISVALFLIREK 146
            PEG GLVA  ++ + E +L++P    I  +  +A S +G   + L+ W  +A FL  + 
Sbjct: 18  LPEGRGLVATEEVRRGESLLDIPESTLITVERAIAESNLGPAHANLQEWSVLAAFLAEQA 77

Query: 147 -----KKEDSPWRVYLDILPECTDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNEYLK 200
                  + S +  Y+  LP  T   + W EE++ EL  G+      +  +  V     +
Sbjct: 78  LAIDAGADGSRFATYVRALPRRTGGVLDWPEEDVKELLAGSPSQRAAMERQASVDAAIDE 137

Query: 201 VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL-----RGQNLVLIPLADLINHSP 255
           +        +  FP+ +T     WAF +L    FSRL     RG  L L+P AD++NH P
Sbjct: 138 I--------RASFPQ-LTPGALRWAFDVL----FSRLIRLPNRGGALALVPWADMLNHRP 184

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
           G   + Y  +  GA   S D  +        K GEQV   Y   + ++EL + YGF  + 
Sbjct: 185 GC--DAYIDDTGGAVCLSPDRRY--------KPGEQVYASYG-PRPSSELLISYGFAPAV 233

Query: 316 SDR--NAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLV-- 371
            +   + + + L I  +D     K D     GL     F + L    P  +LQY      
Sbjct: 234 GENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAFPLKL-NGYPKQLLQYASFALC 292

Query: 372 --------------ALGGTDAF--LLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
                         AL G  A   LL +I +       ++    A E  +  ++ D    
Sbjct: 293 DPDDPKELPGLAEKALIGNRAVGGLLGAITKGKRGTQGEVLGGVAGEIAVRELLADITSD 352

Query: 416 ALSGFHTTIEEDEQLLQGGNLDPRLRIAVG-----------VRAGEKRVLQQIDEIFKER 464
           ALS +  T+E+D ++       P  + A             VR  E+R+L + D    E 
Sbjct: 353 ALSKYPNTLEKDREIATSDPTKPAGKQAKKKKSFLAELRGPVRVSERRILAKTD---SEV 409

Query: 465 ELELDELEYYQ 475
            L+L +L+  Q
Sbjct: 410 RLQLRKLKASQ 420


>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
 gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 154/363 (42%), Gaps = 63/363 (17%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           ++ +TF  W R    +   + ++P+TF    GL A  DIA + +++EVP     +   V 
Sbjct: 61  SRTQTFMNWARGPASIR-FAGVKPSTFAGIRGLAASSDIANDALIVEVPR----HSAVVL 115

Query: 122 ASEIGSLCSGL--------KPWISV--ALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
           A +  + C G+         PW +   A+ L  +++   SP   ++  LP  T   + WS
Sbjct: 116 APKQRNSCPGMVNDEWWKNAPWFAKMGAMLLWHKRQGSQSPLAPWIAQLPADTGVPLNWS 175

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS 231
           +++L  LQ   L++    VKE  Q E+  + + +          P + ++F WA G++RS
Sbjct: 176 DKQLAALQYPYLVAQ---VKEQ-QREWTALYDTLRGSGMAAGAAPPSREEFWWAMGVVRS 231

Query: 232 RAFS----------RLRGQNLV---------------LIPLADLINHSPGITTEDYAYEI 266
           R FS          RLR   LV               + PL DL NH+    +E  +Y  
Sbjct: 232 RTFSGPYIGSTLSDRLRLAGLVAALVVILSRSLKQYAICPLIDLFNHTSAAQSE-VSYNY 290

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT-- 324
            G         +S+      K GEQV I Y   +SN  L   YGF E+ + ++ Y ++  
Sbjct: 291 FGDS-------YSVVASRDFKKGEQVFITYGA-QSNDSLMQYYGFAEADNPQDTYVISDV 342

Query: 325 -LEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT-LPPAMLQYLRLV------ALGGT 376
              +    P    ++   + + LG +   ++ + R   P   LQ LR +      A  G 
Sbjct: 343 LRWLQGFRPLPPGRVQALQGSSLGAACLSNVAVQRAGFPAEALQALRFLLASDAEAAAGV 402

Query: 377 DAF 379
            AF
Sbjct: 403 SAF 405


>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
 gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
          Length = 460

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 100/401 (24%), Positives = 181/401 (45%), Gaps = 57/401 (14%)

Query: 87  TFPE----GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-EIGSLCSGLKPWISVALF 141
           TF E    G GL+A R+I KNE ++E+        D    + EI S  +G   +  +A+ 
Sbjct: 91  TFDEEEFIGRGLIASRNILKNEKIIEISENLMF--DKFEHNLEINS--NGSDNYSDLAIK 146

Query: 142 LIREK-KKEDSPWRVYLDILPECTDSTVF--WSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           L+ E  K + S W  Y+ ILPE  D  +   W  +EL  ++G++L   +  +K+ ++ +Y
Sbjct: 147 LLVELFKNKKSFWFPYIGILPEEYDLKLLFRWPLKELFFIKGSRLSKASDYLKKKLKAQY 206

Query: 199 LKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
             V +E+   N+ L+P  I    ++ W+  IL SR  S    + +VLIP  DL+NH+P  
Sbjct: 207 EMVNKEVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQETKKVVLIPYIDLLNHNP-- 264

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN-------------KSNAE 304
                         FS   + S R  +P+   +++++  D N             KSN E
Sbjct: 265 --------------FSSSFI-SYR-KIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLE 308

Query: 305 LALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAM 364
           L   YGFI  ++  ++  + + +S  D FF +K     +N       + I L +  P  M
Sbjct: 309 LLNLYGFIAERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYK-YPDEM 367

Query: 365 LQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTI 424
           +++++ + L  T+         N    +L+   ++   ++I   +    + +L+  +   
Sbjct: 368 IEFIK-ICLFNTNI--------NDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDY 418

Query: 425 EEDEQLLQGGNL---DPRLRIAVGVRAGEKRVLQQIDEIFK 462
           E    ++   NL       +I++   A EK++L +  E  K
Sbjct: 419 ENLRNIMLKENLLHISDNQKISIKYNALEKKILNRFLEQIK 459


>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 117/223 (52%), Gaps = 13/223 (5%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS-VALFLIRE-KKKEDS 151
           L A + I   + +L+VP    I+PD V  S+I SL       I+ +A+ +  E K  +DS
Sbjct: 73  LFASKSIQTGDCILKVPYNVQISPDNVP-SKINSLLGDEVGNIAKLAIVISVEWKMGQDS 131

Query: 152 PWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            W  Y++ LP+  +  ST+FWSE EL  +Q + +   T+  K  +Q ++L ++  +   +
Sbjct: 132 EWAPYINRLPQPGEMHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFS 191

Query: 210 KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
           + LF + I+L +F+ A  ++ SRA+   +G  L LIP AD +NH     +     E K  
Sbjct: 192 ENLF-KDISLKEFMHACALVGSRAWGSTKG--LSLIPFADFVNHDGFSDSVLLGDEDKQL 248

Query: 270 GLFSRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
              S  L + + R   P   GEQVLI+Y     NA L LD+GF
Sbjct: 249 SESSSTLEVIADRNYAP---GEQVLIRYG-KFPNATLLLDFGF 287


>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 516

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/390 (25%), Positives = 175/390 (44%), Gaps = 33/390 (8%)

Query: 84  RPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVAL 140
           RP  F     + A +D+   +V  E+PM   +  + V   E +  L   + L     +AL
Sbjct: 112 RPLHF-----VAAGQDLEVGDVAFEMPMSLVVTLERVLGDESVAELLTTNKLSELACLAL 166

Query: 141 FLIREKKK-EDSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
           +L+ EKK+ +DS W  Y+  L           +S + W+E EL  L G+ +    +   E
Sbjct: 167 YLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDE 226

Query: 193 YVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPL 247
            ++ EY +++    +        P  +    + F I + +AF  ++      Q + L   
Sbjct: 227 GIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK-QAFVAVQSCVVHLQKVSLARR 285

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
             L+   P + T  Y    K A L + D    L    P KAGE +++ +   + N+ L L
Sbjct: 286 FALVPLGPPLLT--YKSNCK-AMLTAVDDSVRLVVDRPYKAGEPIIV-WCGPQPNSRLLL 341

Query: 308 DYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP--PAML 365
           +YGF++  +  +   +   ++  DP + +K  +A+ NG      F + +G+       ML
Sbjct: 342 NYGFVDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKETISEML 401

Query: 366 QYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            YLRL  +   D   ++ I  +        PVS  +E  +   +    KS L+G+ TT++
Sbjct: 402 PYLRLGYISDPDE--MQCILSSE---GDTCPVSPCSERAVLDQLVVYLKSRLAGYPTTLD 456

Query: 426 EDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           EDE +L  GNL+P+  +A  +   EK++L 
Sbjct: 457 EDEAMLADGNLEPKKEVATRLVRLEKKLLH 486


>gi|224112189|ref|XP_002316114.1| predicted protein [Populus trichocarpa]
 gi|222865154|gb|EEF02285.1| predicted protein [Populus trichocarpa]
          Length = 94

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/69 (66%), Positives = 56/69 (81%), Gaps = 2/69 (2%)

Query: 430 LLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQE-RRLKDLGLVGEQ 488
           L +   L+PRL IAVG+R GEK+VL+ IDE+ K+RE ELDELEYY+E RRLK  GLVG+Q
Sbjct: 27  LFRSTELNPRLEIAVGIRTGEKKVLR-IDEVLKQRESELDELEYYKERRRLKGPGLVGKQ 85

Query: 489 GDIIFWEPK 497
           G+IIFWE K
Sbjct: 86  GEIIFWEYK 94


>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 521

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 182/416 (43%), Gaps = 37/416 (8%)

Query: 70  WLRDQ-----KVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE 124
           W+ D      KV+  + P           + A  D+   +V   VP    +  + V  +E
Sbjct: 93  WMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVVTLERVLGNE 152

Query: 125 IGS---LCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPE-------CTDSTVFWSEE 173
             +     + L     +AL+L+ EKK+ + S W  Y+  L           +S + WSE+
Sbjct: 153 TVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSED 212

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK--QLFPRPITLDDFLWAFGILRS 231
           EL  L G+      L   E ++ EY +++    +     Q +P  I  + F  +F I + 
Sbjct: 213 ELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTEAF--SFEIFK- 269

Query: 232 RAFSRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPV 286
           +AF  ++      Q + L     L+   P +    Y    K A L + D    L    P 
Sbjct: 270 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA--YRSNCK-AMLTAVDGAVELVVDRPY 326

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           KAGE + + +   + N++L L+YGF++  +  +   +   ++  DP + DK  +A+ NG 
Sbjct: 327 KAGESIAV-WCGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQRNGR 385

Query: 347 GESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
                F +  G+       ML YLRL  +  T    ++S+  +   G +  PVS   E  
Sbjct: 386 LSIQAFYVYAGKEKEAVLDMLPYLRLGYV--THPSEMQSVISSQ--GPV-CPVSPCMERA 440

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEI 460
           +   V D  K  L+G+ TT+ EDE LL  GNL+P+ R+A  +   EK++L    E+
Sbjct: 441 MLEQVADYFKRRLAGYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEV 496


>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 563

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 114/440 (25%), Positives = 188/440 (42%), Gaps = 64/440 (14%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMK--------- 112
           A+V  F +WL +     P+  IR    PE  G+ A+ +I  +EV++E+P+K         
Sbjct: 74  AKVVAFERWLLENGARFPRLEIRRYD-PEVRGVHARGEIDPDEVIVEIPLKNKKQRRKRP 132

Query: 113 -FWI--NPDTVAAS---------EIGS--------LCSGLK----PWISVALFLIREKKK 148
             W    P+   ++         E+G         L SG+       I + LF++ + ++
Sbjct: 133 TGWSLQKPEKDGSASWRRCLVTVEMGKALPVGQAVLASGVSFDAPKHIFLMLFILTDMRR 192

Query: 149 EDSPWRVYLDILPEC-TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            D+ +R Y D+LP   ++  +FW+EEE+  LQG+ L++      + ++ +Y      +I 
Sbjct: 193 PDTFFRPYYDLLPTTLSNMPIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY-----GVIC 247

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAFS-RLRG-QNLVLIPLADLINHSPGITTEDYAYE 265
                F    TL++F WA   + SR F   + G +   L+P AD++NH     T+ + Y+
Sbjct: 248 DLYPPFRDVATLEEFKWARMCVCSRNFGLDINGLRTSALVPYADMLNHYRPRETK-WTYD 306

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF-IESKSDRNAYT-- 322
               G       F++ T   +  G QV   Y   K N    L+YGF IE+  + N +   
Sbjct: 307 NNRGG-------FTITTLHRILGGAQVYDSYG-QKCNHRFLLNYGFAIENNQEANGFCPN 358

Query: 323 ---LTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT-LPPAMLQYLRLVALGGTDA 378
              L   +   DP    K      +G  E     + +G T      L  LR++     DA
Sbjct: 359 EVPLLFRLDARDPLRQKKARFWRMDG-PEQRRVRLCVGDTDAVRGALSMLRVIV---ADA 414

Query: 379 FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
             + + +       +  P+S  NE      +      AL  + TT+EED   L+ G L+P
Sbjct: 415 AEMGARYMYRTVKDVRFPLSVRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEP 474

Query: 439 --RLRIAVGVRAGEKRVLQQ 456
               R A+    GEK VL  
Sbjct: 475 FSNRRHALIQVYGEKVVLAH 494


>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
          Length = 577

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 95/439 (21%), Positives = 188/439 (42%), Gaps = 52/439 (11%)

Query: 56  TTNPPTAQ-------VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLE 108
            ++PP  Q       +  F  WL++  V      I+  +  EG G++A +DI     ++ 
Sbjct: 65  VSDPPVVQPIDRTTAIPKFKSWLKEHGVEYSAIDIQEVSEEEGFGVIALQDIEIKCPLVT 124

Query: 109 VPMKFWINPDTVAASEIGSLCSG-----LKPWISVALFLIREKKKEDSPWRVYLDILPEC 163
           +P K  +  +   +S +  L  G     + P + +AL+L  E+   +S ++ Y+D++P+ 
Sbjct: 125 IPRKAMMTYEDAKSSYLAGLIEGNEVLSVMPNVCLALYLHCERFTLNSKYQPYIDMIPQE 184

Query: 164 TDSTVFWSEEELVELQGTQLLSTTLG-----VKEY-----VQNEYLKVEEEIILPNKQLF 213
            ++ +++   E+  L+GT  LS  +      V+++     V N   + E+   LP +   
Sbjct: 185 FNTILYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVFNGSHQKEDVEKLPLQA-- 242

Query: 214 PRPITLDDFLWAFGILRSR----------AFSRL-RGQNLVLIPLADLINHSPGITTEDY 262
               T D + W    + +R              L     L LIP+ D+ NH+ G  +  Y
Sbjct: 243 RNAFTFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAY 302

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT 322
               +G    +             K GEQV I Y   ++N++L +  GF+  +S  +   
Sbjct: 303 NALTRGIECLAMQ---------DFKTGEQVKICYGA-RTNSDLLIHNGFVMKESPFDKVR 352

Query: 323 LTLEISESDPFFGDKLDIAETNGLGESAYFDIV-LGRTLP--PAMLQYLRLVALGGTDAF 379
           + L +S+ DP +  K  + E   +  S  F +  +  +LP  P +L +LR+  +   +  
Sbjct: 353 IHLGVSQKDPLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELR 412

Query: 380 LLESIFRNTIWG----HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
                 +N +      ++   V   ++  +   + +  K  L GF    +  E++++  +
Sbjct: 413 SWLEKQKNELSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKS 472

Query: 436 LDPRLRIAVGVRAGEKRVL 454
           L  R ++A+  R  E R+L
Sbjct: 473 LTHRSKLALQFRIEEHRIL 491


>gi|320170264|gb|EFW47163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 938

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 108/402 (26%), Positives = 176/402 (43%), Gaps = 49/402 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS-------VALFL 142
           +G  L+A   +  ++ +  +P    I+ DT  A ++G   +    W         +ALFL
Sbjct: 63  QGRRLIADNPLKPDDRIAAIPTLLTISLDT--ALQVGLPRAFTTIWHESGSQDDLLALFL 120

Query: 143 IREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           +REK     S W  Y++ILP+   + +F+++ EL +LQ  QL+      K  +Q  +L +
Sbjct: 121 LREKALGARSAWAPYIEILPKKLSNLLFFNDGELAQLQNEQLVEQVSQQKSELQGRFLAL 180

Query: 202 --EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITT 259
              E  I   K      + L DFLWA  I+ SRAF+ +  +   LIP ADL+NH    T 
Sbjct: 181 RQHEADIFGGKA----ELVLSDFLWARAIVLSRAFT-IHARR-YLIPFADLLNHRFHPTR 234

Query: 260 E-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR 318
             D + E        ++ +F L    PV   E+V   Y  N SNA+    YGF+   +  
Sbjct: 235 GLDESGEFFYRHHDFQNGMFLLTCDRPVNENEEVEDDYG-NLSNAQFLQLYGFVPESNPH 293

Query: 319 NAYTLTLEISESDPFFGDK--LDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVA-LGG 375
                 +EI+ +D   G++  L +             + +G T PP++   L  +A +  
Sbjct: 294 EC----VEINLADLLHGEREALLLKSEYAFKLGIPHIVCIGATRPPSVTGALEAIAYIND 349

Query: 376 TDAFLLESI-------------FRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
             A  L +              F N + G  +L ++   E +I ++      S  + F T
Sbjct: 350 LHALKLRACIDEFSPDRSPVESFSNCVQGR-ELNMADTVERIIVKLT-----SIAASFAT 403

Query: 423 TIEEDEQLLQGGNLDPRL---RIAVGVRAGEKRVLQQIDEIF 461
           T+  DE  LQ     PRL   R+A+  R   KR++ ++   F
Sbjct: 404 TVAADELALQRTEDKPRLLHRRLALQYRIQRKRLVAELISTF 445


>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
 gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
          Length = 501

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 96/380 (25%), Positives = 173/380 (45%), Gaps = 28/380 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLCSGLK--PWISVALFLIREKKK-E 149
           + A  D+ K ++   VP    +  + V  +E I  L +  K      +AL+L+ EKK+ +
Sbjct: 102 VAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFSELACLALYLMYEKKQGK 161

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L+G+ L    +   E ++ EY +++
Sbjct: 162 KSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKDEIVKRIEGIRKEYNELD 221

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGI 257
               +        P  L    + F I + +AF+ ++      QN+ L     L+   P +
Sbjct: 222 TVWFMSGSLFQQYPYDLPTEAFPFEIFK-QAFAAVQSCVVHLQNVSLARRFALVPLGPPL 280

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
               Y    K A L + D    L    P KAG+ +++ +   + N +L  +YGF++  + 
Sbjct: 281 LA--YCSNCK-AMLTAVDGAVQLVVDRPYKAGDPIVV-WCGPQPNTKLLTNYGFVDEDNS 336

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTLPPAMLQYLRLVALGG 375
            +   + + +S  DP + DK  +A+ NG      F +  G  R     M+ Y+RL  +  
Sbjct: 337 NDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSDMIPYMRLGYV-- 394

Query: 376 TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
           +D   ++S+  +   G +  PVS   E  +   + D   + L+ + TT+ EDE +L  G+
Sbjct: 395 SDPSEMQSVISSQ--GPV-CPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDESMLTDGS 451

Query: 436 LDPRLRIAVGVRAGEKRVLQ 455
           L+P+ R+A  +   EK++L 
Sbjct: 452 LNPKRRVATQLVRLEKKMLH 471


>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
          Length = 440

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 138/308 (44%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     + + PA FP  G GL+++  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRKF--EDTNLMPAHFPGTGRGLMSKTSLQEGQMIISLPENCLLTTDTVIESYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D SPW+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDQSPWKPYLEILPKAYTCPVC-LEPEVVNLLPKP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRH 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +           A        + +RT    +  
Sbjct: 214 RRRECLSAEPDTCALAPYLDLLNHSPNVQVR--------AAFNEETRCYEIRTASSWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+  ++      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYG-HHDNQRLLLEYGFVSIQNPHACVYVSREILVKYLPSTDKQMNKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GFIENLTF 332


>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
          Length = 2165

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 108/440 (24%), Positives = 188/440 (42%), Gaps = 68/440 (15%)

Query: 58  NPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           +P   + ET  +WL        K  IR  T  +  G+ A RDI K E++L VP    I  
Sbjct: 141 HPERQRFETLLKWLEQGGSHFEKLKIRYYT-ADYRGVHAARDIKKGEIILYVPKHQIITL 199

Query: 118 DTVAASEIGSLC--SGLK-----PWIS-VALFLIREKKKEDSPWRVYLDILPEC-TDSTV 168
           +    S +G      GL+     P  S ++ ++++EK+K +S W++Y+DILP+  ++  +
Sbjct: 200 EMAMTSPVGKKMYEKGLRQRLISPKHSFLSTYIMQEKRKPESQWQIYIDILPKNFSNFPI 259

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGI 228
           F++EEE + L+G+  L   L   E ++ +Y  + +E+  P    FP        +  +  
Sbjct: 260 FFTEEERIWLKGSPFLDQILEKIEDIKADYDLICKEV--PEYVQFP--------IREYSE 309

Query: 229 LRSRAFSRLRGQNLV------LIPLADLINHS-PGITTEDYAYEIKGAGLFSRDLLFSLR 281
           +R    SR+ G  +        +  AD++NH  P  T+  Y  E +G         F + 
Sbjct: 310 IRMMVSSRIFGIQIEGVKTDGFVAYADMLNHKRPRQTSWTYTDEKQG---------FIIE 360

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIA 341
               ++ GEQV   Y   K N+   L+YGFI   +D N   + +     D     K D+ 
Sbjct: 361 AMEDIQRGEQVYDSYG-KKCNSRFFLNYGFINLNNDANEVPIKVYYHTDDQLKQVKQDMI 419

Query: 342 ETNGLGESAYFDIVLGRTLPPAMLQ----YLRLVALGGTDAFLLE----SIFRNTIWGH- 392
               +  S +    +   L   ++Q    +LR V        + +    +I +   +   
Sbjct: 420 ----VDHSEFKKFRVVENLEDRVMQEFFSWLRFVEYDENITLIYQYQGAAISKAQKYKRG 475

Query: 393 -----------------LDL-PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
                             DL P+S  NE+L+   + +  K A   +  T EED ++L+  
Sbjct: 476 DESDSDEADDPSKGFKAKDLPPLSIRNEKLVLERIINLAKEAYDKYERTYEEDLKILERE 535

Query: 435 NLDPRLRIAVGVRAGEKRVL 454
           +L    R  +   +GEK++L
Sbjct: 536 DLTFNHRNCLLYTSGEKKIL 555


>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
 gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
          Length = 511

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 70/294 (23%), Positives = 133/294 (45%), Gaps = 43/294 (14%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A++  F +W +   + S    I  A FP   +GL A +DI  ++ VL VP K   + + +
Sbjct: 85  AKIAAFSEWAKAGGIHSDGVEI--AIFPGYQMGLRATKDINADQQVLRVPRKKIFSEEQL 142

Query: 121 AASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
           + +E  S C+    +      ++ + +  DS W+ Y+D+LP   ++ ++++ E++  L+G
Sbjct: 143 SKTERESFCNFTTNFNLANALVVEKSRGADSIWKPYIDVLPSRYNTVLYFTVEQMRRLRG 202

Query: 181 TQLLSTTLGVKEYVQNEYLKV------EEEIILPNKQLFPRP-ITLDDFLWAFGILRSRA 233
           T + S+ L     +  +Y K+      +   + P+  LF +  +  + + WA   + +R 
Sbjct: 203 TSVCSSALRQCRMIARKYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMTR- 261

Query: 234 FSRLRGQNLV----------------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL 277
                 QNLV                LIP  D+ NH PG  T  Y          +++  
Sbjct: 262 ------QNLVPREIATKDDGNSPISALIPCWDMANHRPGKITSFYDSNAHQMECTAQEF- 314

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
                    KAG Q  I Y  ++ NA+L +  GF++  ++++   + L +S +D
Sbjct: 315 --------CKAGNQFFIYYG-DRPNADLLVHNGFVDPNNNKDFVNIRLGLSPTD 359


>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
          Length = 380

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 75/264 (28%), Positives = 121/264 (45%), Gaps = 33/264 (12%)

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHSPGITTED 261
           NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+ G+ T  
Sbjct: 17  NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHTNGLITTG 73

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
           Y  E         D           +AGEQ+ I Y   +SNAE  +  GF    +  +  
Sbjct: 74  YNLE---------DDRCECVALQDFRAGEQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRV 123

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP---AMLQYLRLVALGGTDA 378
            + L +S+SD  +  K ++    G+  S+ F   L  T PP    +L +LR+  +  T+ 
Sbjct: 124 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVF--ALHFTEPPISAQLLAFLRVFCM--TEE 179

Query: 379 FLLESIFRNT------IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
            L E +  ++        G+ + PVS  NE  +   + D     L  + TTIEED+ +L+
Sbjct: 180 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 239

Query: 433 GGNLDPRLRIAVGVRAGEKRVLQQ 456
             +L  R ++A+ +R GEK +L++
Sbjct: 240 NHDLSVRAKMAIKLRLGEKEILEK 263


>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
           pulchellus]
          Length = 485

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 104/424 (24%), Positives = 177/424 (41%), Gaps = 50/424 (11%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEG-LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F +W  D         I+    P+G  G VA   I ++   L VP+K  +       S++
Sbjct: 77  FLKWCSDNGAYLGSVSIK--DLPDGEYGFVADEHIEESNQFLGVPLKLMMTTAAAKKSKL 134

Query: 126 GSLCSGLKPWISV-----ALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           G L       +S+     A+FLI E    E S W  Y+  LP   ++ +++S EEL  L 
Sbjct: 135 GPLLRDDPIMMSMSNVALAMFLILEFCTGESSFWHPYISTLPASFNTVLYFSVEELELLH 194

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEI-------ILPNKQLFPRPITLDDFLWAFGILRSR 232
           G+ +L   L +   +  +Y    +          LP K  F    T D + WA   + +R
Sbjct: 195 GSTVLDEALKLHRSIARQYSYFHKIFRTHPLAKSLPYKDCF----TYDLYRWAVSAVMTR 250

Query: 233 A----FSRLRGQN--------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
                 +   G +          ++PL D+ NHS G    D  Y+I    L      +++
Sbjct: 251 QNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTD--YDISANML----RCYAM 304

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
           R     + G++V I Y   ++NAE  +  GF+  ++  ++  + L IS+ DP +  K  +
Sbjct: 305 RD---FEKGQEVTIFYG-RRTNAEFFIHNGFVFPENRHDSVDIKLGISKQDPLYAVKAKL 360

Query: 341 AETNGLGESAYFDIV-LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSH 399
            + + L  S  F +V   R +   +  +LR++ L   DA    S     I    D     
Sbjct: 361 CDDHELTPSGIFALVPRERPVCEDLSTFLRILVL--KDASQAASFTDEHIMVATDDNARE 418

Query: 400 ANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDE 459
           A   LI R+     +  L  F  + +E E ++     + RL++A  +R  E+++L  + E
Sbjct: 419 ALNFLIVRI-----QLLLRAFPKSDQEYENIIADEGSNGRLKMAAQLRLLERKILTAVLE 473

Query: 460 IFKE 463
              E
Sbjct: 474 TLNE 477


>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 514

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 139/312 (44%), Gaps = 41/312 (13%)

Query: 61  TAQVET---FWQWLRDQKVVSPKSPIRPAT----------FPEGLGLVAQRDIAKNEVVL 107
           TAQV+       W R  ++   +  IRP+T             GLGLV      + E ++
Sbjct: 43  TAQVQRIIDLVSWARRAEIEMARVDIRPSTDTSASAKFLFESRGLGLVLNAPARRGEAIV 102

Query: 108 EVP--MKFWINP-DTVAASEIGSLC----SGLKPWISVALFLIREKKKEDSPWRVYLDIL 160
            +P   +F +   D+   S I        + + P  ++AL L+ E+ + DSPWR +L +L
Sbjct: 103 TLPPRARFRVPAFDSALRSLIDEFNEQHDNAIDPMTALALGLMYERSRADSPWRAWLRML 162

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP-----NKQLFPR 215
           P+  +S + W++   VEL   + L      +E ++N    V E +I P        L   
Sbjct: 163 PDPIESMLEWND---VELWPVEQLYVKELREERIRN-LEAVYESVITPFIDTYESDLVGV 218

Query: 216 PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
             T++ F+WA  I ++R         L L+P+ D+INH          +    A + +  
Sbjct: 219 DFTIEAFVWAAVIAQTRGLHESEKNGLSLLPIVDMINH----------HREPNAVVVASG 268

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFG 335
               +RT   +KAGE++ I Y++  S+  L L YGF+E   + + Y + L     D  + 
Sbjct: 269 PNILVRTKTSLKAGEEITIDYEM--SSHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYP 326

Query: 336 DKLDIAETNGLG 347
            +L + E  GL 
Sbjct: 327 RRLRLLEGRGLS 338


>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 440

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 81/308 (26%), Positives = 138/308 (44%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRKFQD--SNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R Q  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQQECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHCYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+ + +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
 gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 470

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 112/437 (25%), Positives = 185/437 (42%), Gaps = 63/437 (14%)

Query: 62  AQVETFWQWLRDQKV-VSPK-SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           A+  TF  WL++  V  +PK + +       G G+VA  DI  +E++  +P    +N   
Sbjct: 8   ARTATFSSWLKEMGVRTNPKMALVDLRQEGRGRGVVATGDIDDDEIIFSIPRNAVLNAQN 67

Query: 120 VAASEIGS-LCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
           VA   +   L   +  W+ +   L+ E + E+S W  YL +LPE  DS VFWS+ EL EL
Sbjct: 68  VAPLPVSRRLFEKMPSWLVLTSILMTEAQMENSKWAPYLAVLPERLDSLVFWSDSELAEL 127

Query: 179 QGTQLLSTTLGVKE-------YVQNEYLKVEE--------EIILPNKQLFPRPITLDDFL 223
           Q + ++   +G K+       Y+  + LK            +I+      P P       
Sbjct: 128 QASAVVK-KIGKKDAEDMFKSYIAPQGLKHSSTEMCHKVASVIMAYAFDIPDPSDAPTSG 186

Query: 224 WAFGILRSRAFSRLRGQN----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
              G       S   G++    L +IPLAD++N            +   A L   +    
Sbjct: 187 GKGGEAGDDLVSD-DGEDEKTILSMIPLADMLNADA---------DRNNARLICDNEELE 236

Query: 280 LRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFIESKSDRNAYTLTLEISE--------- 329
           +R   P+  GE++   Y  L +S+  L   YG++      +AY +  EIS          
Sbjct: 237 MRAIKPISKGEEIFNDYGQLPRSD--LLRRYGYV--TDGYSAYDVA-EISAELIVSLFRN 291

Query: 330 ----------SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAF 379
                     +      +L++AE  G+ E + FD+V      P++   L           
Sbjct: 292 GKVHPSLHKLTQDGLKTRLELAEREGVYEDS-FDLVHSSPDEPSIPDELLAFL---YLLL 347

Query: 380 LLESIFRNTIWGHLDLPV-SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
           + ES  +  +     +P  S    EL  +V+    ++    + TT+EEDE LL+  +L  
Sbjct: 348 VDESHLKAILNSESSIPSRSKLTTELAGQVLATLLQAREKEYSTTLEEDEDLLKNADLPV 407

Query: 439 RLRIAVGVRAGEKRVLQ 455
           R  +A+ VR+GEK+VL+
Sbjct: 408 RHAMAIQVRSGEKKVLR 424


>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 308

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 129/311 (41%), Gaps = 39/311 (12%)

Query: 88  FPEGLGLVAQRDIAKNEVVLEVPMKFWINPD-TVAASEIGSLCSGLKPWISVALFLIR-- 144
            PEG GLVA+ D+ + E +LE+P    I  +  V  S++G   + L+ W  +A FL    
Sbjct: 19  LPEGRGLVAREDVKRGEPLLEIPDASLITVERAVKESKLGPKHAELQEWSLLAAFLAEQA 78

Query: 145 ---EKKKEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLK 200
              E   E   +  Y+  LP  T   + W EE++   L G+         +  V     +
Sbjct: 79  LDIENGDESGVFAAYVKALPRRTGGVLDWPEEDVKTLLAGSPSQRAAYERQASVDGAIEE 138

Query: 201 VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL-----RGQNLVLIPLADLINHSP 255
           +  E        FP+ +T     WAF +L    FSRL     RG  L L+P AD++NH P
Sbjct: 139 IRAE--------FPQ-LTPGALRWAFDVL----FSRLIRLPNRGGELALVPWADMLNHKP 185

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE-- 313
           G     Y  +  G      D  +        K GEQV   Y   + +AEL + YGF    
Sbjct: 186 GCNA--YIDDSGGKVCLQPDRAY--------KPGEQVFASYG-QRPSAELLISYGFAPEV 234

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
            ++  + Y +TL I  +D +   K    E  GL     F + L    P  +LQY      
Sbjct: 235 GENPDDEYEITLGIDPNDRYADAKAAALEKIGLRPVESFPLRL-NGYPKQLLQYASFALC 293

Query: 374 GGTDAFLLESI 384
              D   LE +
Sbjct: 294 DPDDPKELEGL 304


>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 440

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 101/416 (24%), Positives = 174/416 (41%), Gaps = 42/416 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE--- 124
           +WL+++     K  +RPA FPE G GLV  + +   E+++ +P K  +  DTV  S    
Sbjct: 38  KWLKEKGCNVNK--LRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTVLNSYLRE 95

Query: 125 -IGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
            I      + P I++  FLI EK  +E SPW+ YLD+LPE     V   E+++V L    
Sbjct: 96  YIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPEIYSCPVCL-EQKIVNLFPEP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSR------ 232
           L       ++ VQ  ++   ++     + LFP+ +        F WA+  + +R      
Sbjct: 155 LRRKAHEQRKLVQELFIS-SQQFFFSLQPLFPKDVASVFNYQAFKWAWCTINTRTVYMKH 213

Query: 233 ----AFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
                FSR       L P  DL+NH+P +  +        AG   +   + + T      
Sbjct: 214 SQRDCFSR-DTDTYALAPYLDLLNHNPTVQVK--------AGFNEKTKCYEITTVTQCHH 264

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGE 348
             +V I Y  +  N  L L+YGF+   +  ++  +  +    + F  DK    + + L E
Sbjct: 265 YNEVFICYGPH-DNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDKQRPKKLSILQE 323

Query: 349 SAYF-DIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICR 407
                D+  G   P   L     +     D F   + ++  + G +   VS  NE+    
Sbjct: 324 HKLLDDLTFGWDGPSWKLLTALKLLSLEADQF---TSWKKVVLGEV---VSEVNEKKSLD 377

Query: 408 VVRDACKSALSGFHTTIEEDEQLLQGGN-LDPRLRIAVGVRAGEKRVLQQIDEIFK 462
           +  + C S +      +++   L      L  +L +   +R  E  +LQ+  E+F+
Sbjct: 378 LAAEICMSLIKETQHALQKVSLLKHDCEYLRTQLNLVEALRTEELSILQRSLEMFQ 433


>gi|62320136|dbj|BAD94330.1| hypothetical protein [Arabidopsis thaliana]
          Length = 483

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 112/425 (26%), Positives = 180/425 (42%), Gaps = 75/425 (17%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +DI    +  VV+E+P++  I           PD V     I  + +   P
Sbjct: 99  PDGFGVYASKDIEPRRRARVVMEIPLELMITIRQKHPWMFFPDIVPIGHPIFDIINSTDP 158

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W I +A  L+    ++D  WR+Y D LP   EC+ S +  +EE+L ELQ   L+ST 
Sbjct: 159 EIDWDIRLACLLLFSFDRDDHFWRLYGDFLPAADECS-SLLLATEEDLAELQDPDLVSTI 217

Query: 188 LGVKEYVQNEYLKV-EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLV-- 243
              ++ + + + K     + L  K+L   P   + F+WA  + ++R  S + R   LV  
Sbjct: 218 RQQQKRILDFWEKNWHSGVPLKIKRLAEDP---ERFIWAVSMAQTRCISMQTRVGALVQE 274

Query: 244 ---LIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPV--PVKAGEQVLIQY 296
              +IP AD++NHS  P                  RD +  + +     +K GE++ I Y
Sbjct: 275 LNMMIPYADMLNHSFEPNCFLH----------WRPRDRMLEVMSNAGQDIKKGEEMTINY 324

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVL 356
              + N  L   YGF    +  +A   +    +S       L +    GL E  Y D  L
Sbjct: 325 MPGQKNNMLMERYGFSTPVNPWDAIKFS---GDSRIHLNSFLSVFNIYGLPEEYYHDSEL 381

Query: 357 GRTLPPAMLQYLRLVALGGT--DAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACK 414
            R               G T  D  ++ +      W  +DLP   + E    + ++D C+
Sbjct: 382 SR---------------GDTFVDGAVIAAARTLPTWSDIDLPPIPSAERKAVKELQDECR 426

Query: 415 SALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYY 474
             L+ + TT E+D++LL   + + R   A  V+    R +  I +I K        L+ Y
Sbjct: 427 KMLAEYPTTAEQDQKLLDSMS-EARTTFATAVKYRMHRKM-FIGKIIK-------ALDIY 477

Query: 475 QERRL 479
           QER L
Sbjct: 478 QERLL 482


>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
          Length = 350

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 121/262 (46%), Gaps = 29/262 (11%)

Query: 209 NKQLFPRPITLDDFLWAFGILRSRA-------FSRLRGQNLVLIPLADLINHSPGITTED 261
           NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+ G+ T  
Sbjct: 6   NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHTNGLITTG 62

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
           Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF    +  +  
Sbjct: 63  YNLED------DRCECVALQD---FRAGEQIYIFYG-TRSNAEFVVHSGFFFDNNSHDRV 112

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTLPPAMLQYLRLVALGGTDAFL 380
            + L +S+SD  +  K ++    G+  S+ F + V    +   +L +LR+  +  T+  L
Sbjct: 113 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHVTELPISAQLLAFLRVFCM--TEEEL 170

Query: 381 LE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
            E      +I R    G+ + PVS  NE  +   + D     L  + TTIEED+  LQ  
Sbjct: 171 REHLLGENAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSP 230

Query: 435 NLDPRLRIAVGVRAGEKRVLQQ 456
            L PR  +AV +R GEK +L++
Sbjct: 231 GLSPRAAMAVKLRLGEKEILEK 252


>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 100/413 (24%), Positives = 186/413 (45%), Gaps = 43/413 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS---- 123
           QWL+D K    K  I   +  EG   L A + I + E VL VP   +++ + V  S    
Sbjct: 45  QWLKDGKAEVSKVSIEVKS--EGYRTLRASQFIRQGEWVLFVPRTHYLSLEEVKKSCLIN 102

Query: 124 ----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDS--TVFWSEEELVE 177
               ++  + + ++ +      L++E ++++S W+ Y+D+LP+      T F +E++ + 
Sbjct: 103 RKMIQLNYIPNNIQTYF--VNHLLQENRRQNSFWKPYIDVLPKDVSGFPTNFDAEQDAL- 159

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL 237
           L+G+  L T +  ++  Q EY  ++E +    K+      T +DF+    +  SR+F   
Sbjct: 160 LKGSPTLFTVMNQRKTFQEEYDNLKEAV----KEFQRYGYTYNDFVKFRTLTISRSFPVY 215

Query: 238 RGQN---LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
            G+N    +L+PLAD INH      + Y Y     G F       ++    ++ GE++  
Sbjct: 216 IGENEQQQLLVPLADFINHDNNGFLQ-YGYSPDADGFF-------MQAVRNIQKGEELFY 267

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
            Y    SN    ++YGF    +  N +   + +  +D  F  K+++      G   + + 
Sbjct: 268 NYG-QWSNKYFFMNYGFASLTNPMNQFDFDICLDRNDRMFKMKVELTG----GNICWGNR 322

Query: 355 VLGRTLPPAMLQYL---RLVALGGTDAFL-LESIFR--NTIW-GHLDLPVSHANEELICR 407
           ++  T      Q L   R   +   D FL LE   +  N  W G    P +   E+   +
Sbjct: 323 LVNETDHDTFRQSLATVRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFK 382

Query: 408 VVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEI 460
            +RD   S L+ F +TIE+D++ L   +     R  + +   EK+++++  E+
Sbjct: 383 ALRDLLVSELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV 435


>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
 gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
          Length = 374

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 72/266 (27%), Positives = 117/266 (43%), Gaps = 37/266 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-E 149
           G  L A R +      LE+P    I P+ V  S++  L S   P   ++LFL+ EK K +
Sbjct: 5   GRALFATRRVPAGSRFLEIPRIAIITPENVP-SQVSHLLSTSNPKTRLSLFLLSEKHKAQ 63

Query: 150 DSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           +S W  YL  LP+  D  ST+FW  EEL  L+ +     T+   + +++E+      + L
Sbjct: 64  ESQWAPYLRCLPQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEF----HLLTL 119

Query: 208 PNKQLFP------RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTED 261
            NKQ+FP        ++L DF+ A+   +              IP AD  NH     T  
Sbjct: 120 ANKQVFPWCRDALGEVSLTDFMHAYSTDQ--------------IPFADFFNHDHNCQTRL 165

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
              + K   +   D  +        KAG+++ + Y  +  N+ LA+DYGF  + +     
Sbjct: 166 SYDKEKDCAVAVADQDY--------KAGDEIFLSYG-STPNSILAVDYGFAVASNPHEQV 216

Query: 322 TLTLEISESDPFFGDKLDIAETNGLG 347
            + + +S +DP    KL     + + 
Sbjct: 217 EVPMGVSLTDPLRDLKLQTLSRHNMS 242


>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
          Length = 504

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 99/197 (50%), Gaps = 10/197 (5%)

Query: 69  QWLRDQKVVSPKSPI-------RPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           QWLRD  VV  +S I        P    EGLG++A++D+  +E++  +P +  ++P T +
Sbjct: 12  QWLRDNCVVIDESKIEIVDTTTHPHVIVEGLGIIAKQDLKVDEIIAVIPKRCVLSPKTTS 71

Query: 122 ASEIGSLCSGLKPWISVALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQG 180
            + I      L+  ++ ++ L+ E  K   S W  Y+  +P   D  + W +E +  L G
Sbjct: 72  IAPILEKYE-LEEAVATSIALMYETSKGVQSKWYSYIQSMPTVIDLPILWDKESIEYLVG 130

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRG 239
           T L    +   E ++ +Y +  E II  + + F   I TL+ F  A  I+ SRAF+  + 
Sbjct: 131 TDLEEIVIENIETLEEQYREDVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQY 190

Query: 240 QNLVLIPLADLINHSPG 256
               L+PLAD+ NH  G
Sbjct: 191 HGESLVPLADIFNHKTG 207


>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Glycine max]
          Length = 503

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 178/382 (46%), Gaps = 31/382 (8%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS---LCSGLKPWISVALFLIREKKK-E 149
           + A +D+   +V   VP    +  + V  +E  +     + L     +AL+L+ EKK+ +
Sbjct: 103 VAASQDLQVGDVAFSVPNSLVVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGK 162

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WS+ EL  L G+ +    +  +E ++ EY +++
Sbjct: 163 KSFWYPYIRELDRQRGRGQLSVESPLLWSKSELDYLSGSPIKDEVIQREEAIRKEYKELD 222

Query: 203 EEIILPNK--QLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSP 255
               +     Q +P  I  + F  +F I + +AF+ ++      Q + L     L+   P
Sbjct: 223 TVWFMAGSLFQQYPYDIPTEAF--SFEIFK-QAFAAIQSCVVHLQKVSLARRFALVPLGP 279

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
            + +  Y    K A L + D    L    P KAG+ +++ +   + N++L ++YGF++  
Sbjct: 280 PLLS--YQSNCK-AMLTAVDGAVELAVDRPYKAGDPIVV-WCGPQPNSKLLINYGFVDEN 335

Query: 316 SDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGG 375
           +  +   +   ++  DP + DK  +A+ NG      F +  G+    A+L  LR + LG 
Sbjct: 336 NSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKER-EAVLDMLRYMRLGY 394

Query: 376 -TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL-QG 433
            +D   ++S+  +   G +  PVS   E      + D  K+ L+G+ T + EDE +L  G
Sbjct: 395 VSDPSEMQSVISSQ--GPV-CPVSPCMERAALDQLADYFKARLAGYPTILAEDESMLTDG 451

Query: 434 GNLDPRLRIAVGVRAGEKRVLQ 455
           GNL+P+ R+A      EK++L 
Sbjct: 452 GNLNPKKRVATQYVRLEKKMLH 473


>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
 gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
          Length = 465

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 102/407 (25%), Positives = 181/407 (44%), Gaps = 68/407 (16%)

Query: 61  TAQVET---FWQWL-RDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWIN 116
           T ++E+   F +WL  +   ++P   I       G  +VA++ I K + ++ +P    ++
Sbjct: 34  TKEIESLKEFKEWLVNNNAYINPNIDIELLD-KYGRSIVAKKSIKKQDKLISIPKDIIMS 92

Query: 117 -----PDTVAASEIGSLCS-GLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFW 170
                P  +       + S GL P    A+F++  K  E S W  Y+ +LPE   +++++
Sbjct: 93  NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMYSKLNEKSFWHPYVTVLPESFSTSLYF 152

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI--ILP---NKQLFPRPITLDDFLWA 225
           S+ EL ELQ +QL   T+  K+ ++  Y      +  ++P   N  L+ + +    F WA
Sbjct: 153 SDNELDELQASQLKEFTIIRKDGIERHYESTFSRLSKLVPEFSNLALYNQEL----FTWA 208

Query: 226 FGILRSRAFSRLRGQNLVLIPLADLINHS--------PGITTEDYAYEIKGAGLFSRDLL 277
              + SRAFS L   +  ++PLAD+ N          P +T     Y       ++ D  
Sbjct: 209 LSCVWSRAFS-LAENDGGMVPLADMFNAEDRSKSKVLPKVTDTTLDY-------YASD-- 258

Query: 278 FSLRTPVPVKAGEQVLIQYDLNK--SNAELALDYGFIESK---SDRNAYTLTLEISESDP 332
                   +  GEQ+   Y + K  S++++ +DYGFI  +   SD  A T+ +     +P
Sbjct: 259 -------DIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGTVSDNVAITVPV-FHNDEP 310

Query: 333 FFGDKLDIAETNGLGESAYFDIVLGRT--LPPAMLQYLRLVALGGTDAFLLESIF----- 385
               K +I E N +    +   +L +T  LP  +L Y R+  L   +    +  F     
Sbjct: 311 NLSTKQEILEENDIINEVF---LLQKTDPLPADLLLYARVKNLIAKECDQAKKHFLSPNT 367

Query: 386 RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
           RNT       P++  NE++  R + +     L  + T +E D+ LL+
Sbjct: 368 RNT-------PLNTRNEKVSLRFLENLIHRYLDSYGTNLESDKNLLK 407


>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
          Length = 440

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 101/430 (23%), Positives = 177/430 (41%), Gaps = 60/430 (13%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K       +RP  F   G GL+A + +   E+++ +P K  +  DTV  S +G 
Sbjct: 38  KWLKERKF--EDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTVIKSYLGD 95

Query: 128 LCSG----LKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +     + P +++  FLI E    + SPW+ YLDILP+  D T        +E Q  +
Sbjct: 96  YITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPK--DYTCL----VCLEPQVVR 149

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPN------KQLFPRPIT----LDDFLWAFGILRSR 232
           LL   L +K   Q +  +V+E  +         + LF   +        FLWA+  + +R
Sbjct: 150 LLPKPLKIK--AQEQKTQVQELFVSSRGFFSSLQSLFTEDVKHIFHYHAFLWAWCTINTR 207

Query: 233 AFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
                  Q            L P  DL+NHSPG+           A    +   + +RT 
Sbjct: 208 TVYMKHAQKKCLSAEPDVYALAPYLDLLNHSPGVQV--------NAAFNEKTRCYEIRTT 259

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE-----ISESDPFFGDKL 338
              K  E++ I Y  +  N  L L+YGF+   +  +A  ++++     +   D     KL
Sbjct: 260 SSCKKYEELFICYGPH-DNHRLLLEYGFVAINNPHSAVYVSIDSLVDHLPSVDTQMNKKL 318

Query: 339 DIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS 398
            + + +G  E    ++  G   P   L     +   G D F   + ++  + G +   +S
Sbjct: 319 SLLKEHGFSE----NLTFGWDGPSWRLLTALKLLCLGADEF---TCWKKVLLGEV---IS 368

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLL-QGGNLDPRLRIAVGVRAGEKRVLQQI 457
             NE     + R  C S +    T +++   +  +   L  +L +   +   E ++LQ  
Sbjct: 369 DTNEWKSLVLSRKICSSFIEETKTALQKISHMKDEETALKDQLTLVENLWLEELKLLQVS 428

Query: 458 DEIFKERELE 467
            +I K    E
Sbjct: 429 ADILKNVHQE 438


>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
 gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
          Length = 389

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 70/251 (27%), Positives = 109/251 (43%), Gaps = 26/251 (10%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-EDS 151
           GL A R I   E ++ +P    +  + +    +  L S    W  + L ++ E+ K E S
Sbjct: 11  GLFAARSIRAGEQIVRIPHDLVLTAEKLDDC-VKKLLSTEYDWCPLTLLILAEQHKGEAS 69

Query: 152 PWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            W  Y+  LP   D  ST+FW +EEL  L+ T+    T   +E + +EY+ V + +I   
Sbjct: 70  RWAPYVSCLPSFGDHHSTIFWEKEELKFLECTRAFRGTAERREMISDEYISV-KNVISSC 128

Query: 210 KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
             +F   I+L  F  A+  + SRA++      + + P  D  NH P              
Sbjct: 129 PHVFGEDISLFQFAHAYATVVSRAWNGALSSEISMRPFVDFCNHDP-----------VSH 177

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
              S D           K    V I Y   +SNA LA+DYGF+   +  +   L +EI  
Sbjct: 178 ATVSHD---------SCKDATVVFISYG-KRSNAVLAVDYGFVLPNNLSDQAELWMEIPW 227

Query: 330 SDPFFGDKLDI 340
           +DP    KL++
Sbjct: 228 NDPLREKKLEL 238


>gi|42566980|ref|NP_193746.3| plastid transcriptionally active 14 protein [Arabidopsis thaliana]
 gi|28393566|gb|AAO42203.1| unknown protein [Arabidopsis thaliana]
 gi|28973141|gb|AAO63895.1| unknown protein [Arabidopsis thaliana]
 gi|110740232|dbj|BAF02014.1| hypothetical protein [Arabidopsis thaliana]
 gi|332658878|gb|AEE84278.1| plastid transcriptionally active 14 protein [Arabidopsis thaliana]
          Length = 483

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 112/427 (26%), Positives = 181/427 (42%), Gaps = 79/427 (18%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +DI    +  V++E+P++  I           PD V     I  + +   P
Sbjct: 99  PDGFGVYASKDIEPRRRARVIMEIPLELMITIRQKHPWMFFPDIVPIGHPIFDIINSTDP 158

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W I +A  L+    ++D  WR+Y D LP   EC+ S +  +EE+L ELQ   L+ST 
Sbjct: 159 EIDWDIRLACLLLFSFDRDDHFWRLYGDFLPAADECS-SLLLATEEDLAELQDPDLVSTI 217

Query: 188 LGVKEYVQNEYLKV-EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLV-- 243
              ++ + + + K     + L  K+L   P   + F+WA  + ++R  S + R   LV  
Sbjct: 218 RQQQKRILDFWEKNWHSGVPLKIKRLAEDP---ERFIWAVSMAQTRCISMQTRVGALVQE 274

Query: 244 ---LIPLADLINHS--PGI----TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
              +IP AD++NHS  P        +D   E+       +D          +K GE++ I
Sbjct: 275 LNMMIPYADMLNHSFEPNCFLHWRPKDRMLEVMSNA--GQD----------IKKGEEMTI 322

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
            Y   + N  L   YGF    +  +A   +    +S       L +    GL E  Y D 
Sbjct: 323 NYMPGQKNNMLMERYGFSTPVNPWDAIKFS---GDSRIHLNSFLSVFNIYGLPEEYYHDS 379

Query: 355 VLGRTLPPAMLQYLRLVALGGT--DAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDA 412
            L R               G T  D  ++ +      W  +DLP   + E    + ++D 
Sbjct: 380 ELSR---------------GDTFVDGAVIAAARTLPTWSDIDLPPIPSAERKAVKELQDE 424

Query: 413 CKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELE 472
           C+  L+ + TT E+D++LL   + + R   A  V+    R +  I +I K        L+
Sbjct: 425 CRKMLAEYPTTAEQDQKLLDSMS-EARTTFATAVKYRMHRKM-FIGKIIK-------ALD 475

Query: 473 YYQERRL 479
            YQER L
Sbjct: 476 IYQERLL 482


>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
 gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
          Length = 432

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 171/390 (43%), Gaps = 43/390 (11%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           +VA  D+   ++ L VPM   +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 29  VVASEDLKPGDLALSVPMSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKRGK 88

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKV 201
           +S W  ++  L           +S + W+ EEL E   G+++    L   E ++ EY ++
Sbjct: 89  ESFWYPFIRELDRQRGRGQVAVESPLLWTSEELDEYFTGSRMKEVVLERLEGIKREYQEL 148

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRS------RAFSRLRGQNL----VLIPLAD-L 250
           +    +        P  +    ++F I +            L+G +L     L+PL   L
Sbjct: 149 DTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVAVQSCVVHLQGVSLPRRFALVPLGPPL 208

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
           + +            +K AG   R     L      K GEQ+L+ +   + N  L L+YG
Sbjct: 209 LAYKSNCKAM-----LKAAGDLVR-----LEVDRAYKKGEQILV-WCGPQPNTRLLLNYG 257

Query: 311 FIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYL 368
           F++  +  +  ++   ++  DPF+ +K  I + N       F I  GR       ML YL
Sbjct: 258 FVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYL 317

Query: 369 RLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDE 428
           RL  +  +D + +ES+F          PVS  NE  +   +    +  ++ + TTI+ED 
Sbjct: 318 RLGHV--SDPYHMESVFSAE---GPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDR 372

Query: 429 QLLQGG--NLDPRLRIAVGVRAGEKRVLQQ 456
            LL+ G  +++P+ R+A  +   EK +L  
Sbjct: 373 ALLEDGSSDINPKQRVATQLLLIEKEILHN 402


>gi|110740216|dbj|BAF02006.1| hypothetical protein [Arabidopsis thaliana]
          Length = 483

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 112/427 (26%), Positives = 181/427 (42%), Gaps = 79/427 (18%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +DI    +  V++E+P++  I           PD V     I  + +   P
Sbjct: 99  PDGFGVYASKDIEPRRRARVIMEIPLELMITIRQKHPWMFFPDIVPIGHPIFDIINSTDP 158

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W I +A  L+    ++D  WR+Y D LP   EC+ S +  +EE+L ELQ   L+ST 
Sbjct: 159 EIDWDIRLACLLLFSFDRDDHFWRLYGDFLPAADECS-SLLLATEEDLAELQDPDLVSTI 217

Query: 188 LGVKEYVQNEYLKV-EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLV-- 243
              ++ + + + K     + L  K+L   P   + F+WA  + ++R  S + R   LV  
Sbjct: 218 RQQQKRILDFWEKNWHSGVPLKIKRLAEDP---ERFIWAVSMAQTRCISMQTRVGALVQE 274

Query: 244 ---LIPLADLINHS--PGI----TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
              +IP AD++NHS  P        +D   E+       +D          +K GE++ I
Sbjct: 275 LNMMIPYADMLNHSFEPNCFLHWRPKDRMLEVMSNA--GQD----------IKKGEEMTI 322

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
            Y   + N  L   YGF    +  +A   +    +S       L +    GL E  Y D 
Sbjct: 323 NYMPGQKNNMLMERYGFSTPVNPWDAIKFS---GDSRIHLNSFLSVFNIYGLPEEYYHDS 379

Query: 355 VLGRTLPPAMLQYLRLVALGGT--DAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDA 412
            L R               G T  D  ++ +      W  +DLP   + E    + ++D 
Sbjct: 380 ELSR---------------GDTFVDGAVIAAARTLPTWSDIDLPPIPSAESKAVKELQDE 424

Query: 413 CKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELE 472
           C+  L+ + TT E+D++LL   + + R   A  V+    R +  I +I K        L+
Sbjct: 425 CRKMLAEYPTTAEQDQKLLDSMS-EARTTFATAVKYRMHRKM-FIGKIIK-------ALD 475

Query: 473 YYQERRL 479
            YQER L
Sbjct: 476 IYQERLL 482


>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
 gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
 gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
          Length = 514

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 99/379 (26%), Positives = 172/379 (45%), Gaps = 28/379 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           + A  D+ K +V   VP    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 174

Query: 150 DSPWRVYLDILPEC-------TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   E ++ EY +++
Sbjct: 175 KSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNELD 234

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGI 257
               +        P  +    ++F I + +AF  ++      QN+ L     L+   P +
Sbjct: 235 TVWFMAGSLFQQYPFDIPTEAFSFEIFK-QAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
               Y    K A L + D    L    P KAG+ +++ +   + NA+L L+YGF++  + 
Sbjct: 294 LA--YCSNCK-AMLTAVDGAVELVVDRPYKAGDPIVV-WCGPQPNAKLLLNYGFVDEDNP 349

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTLPPAMLQYLRLVALGG 375
            +   +   ++  DP + DK  +A+ NG      F + +G  R     ML YLRL  +  
Sbjct: 350 YDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYM-- 407

Query: 376 TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
           +D   ++S+  +   G +  P+S   E  +   + +     LSG+ TT +ED+ LL   +
Sbjct: 408 SDPSEMQSVISSQ--GPV-CPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPS 464

Query: 436 LDPRLRIAVGVRAGEKRVL 454
           L PR R+A  +   EK++L
Sbjct: 465 LSPRKRVATRLVQLEKKIL 483


>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 1 [Glycine max]
 gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 2 [Glycine max]
          Length = 502

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 95/381 (24%), Positives = 176/381 (46%), Gaps = 30/381 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS---LCSGLKPWISVALFLIREKKK-E 149
           + A +D+   +V   VP    +  + V  +E  +     + L     +AL+L+ EKK+ +
Sbjct: 103 VAASQDLQVGDVAFSVPNSLVVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGK 162

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + W + EL  L G+ +    +  +E ++ EY +++
Sbjct: 163 KSFWYPYIRELDRQRGRGQLSVESPLLWLKSELDYLSGSPIKDEVIQREEAIRKEYNELD 222

Query: 203 EEIILPNK--QLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSP 255
               +     Q +P  I  + F  +F I + +AF+ ++      Q + L     L+   P
Sbjct: 223 TVWFMAGSLFQQYPYDIPTEAF--SFEIFK-QAFAAIQSCVVHLQKVSLARRFALVPLGP 279

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
            + +  Y    K A L + D    L    P KAG+ +++ +   + N++L ++YGF++  
Sbjct: 280 PLLS--YQSNCK-AMLTAVDGAVELAVDRPYKAGDPIVV-WCGPQPNSKLLINYGFVDEN 335

Query: 316 SDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGG 375
           +  +   +   ++  DP + DK  +A+ NG      F +  G+    A+L  LR + LG 
Sbjct: 336 NSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKERE-AVLDMLRYMRLGY 394

Query: 376 -TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
            +D   +ES+  +   G +  PVS   E      + D  K+ L+G+ TT+ EDE +L   
Sbjct: 395 VSDPSEMESVISSQ--GPV-CPVSPCMERAALDQLADYFKARLAGYPTTLAEDESMLTDD 451

Query: 435 NLDPRLRIAVGVRAGEKRVLQ 455
           NL+P+ R+A      EK++L 
Sbjct: 452 NLNPKKRVATQYVRLEKKMLH 472


>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 495

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 116/428 (27%), Positives = 190/428 (44%), Gaps = 55/428 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG--LKPWISVALFLIRE--K 146
           G G+ A RD+A  E VL VP+   +N +  +AS +G +     L    ++A +LI E  +
Sbjct: 76  GRGVFALRDLAAGETVLRVPLSLLLNVEHASASPLGGILDDFRLSDAEAMAFWLIYELTR 135

Query: 147 KKEDSPWRVYLDILPECTDS-TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
            +  SPW  YL+ LP      T+F+   E+  LQ + +   T      ++N++ K  E+I
Sbjct: 136 PERASPWLPYLESLPASIKQLTMFYDPFEMKRLQASPVAEFTSRRTVKMRNKFGKYREQI 195

Query: 206 I------LPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL------------VLIPL 247
                  L   +     IT+DDFLWA  +     F+RL    +             L+PL
Sbjct: 196 SKHRPAHLAEIEFPVELITVDDFLWAMAV----QFTRLITVQVKHPADGEWERTKCLVPL 251

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK--SNAEL 305
           ADL+N +P     D         L S    F   T  PV  G+++L  Y   +  SN +L
Sbjct: 252 ADLLNTAPA----DQINVECATNLDSTH--FECATIRPVAEGQELLTPYGGAEQLSNGQL 305

Query: 306 ALDYG--FIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGR--TL 360
            +DYG  F  + SD  A  +  ++ E+   +  K+ +     L       + VL    ++
Sbjct: 306 IMDYGVTFRNNPSDLVALPIP-KLRETAVAYDSKMRLLMAMSLDRFDRLQLPVLDHFESI 364

Query: 361 PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGF 420
           P  +L + R+     +D   LE +    +  H    ++ +NE     ++       +  +
Sbjct: 365 PKELLAFARVYVSTPSDLSDLEHVL-ELMKEH--RAINPSNERRALELLLQLTNEMILKY 421

Query: 421 HTTIEEDEQLLQGGNLD--PRLRI--AVGVRAGEKRVL----QQID---EIFKERELELD 469
            TTIEEDE +L+  + +  P      AV +R GEKR+L    Q +D   E   E +  L+
Sbjct: 422 ITTIEEDETMLRELDAESVPNANAVNAVVLRLGEKRILSSLWQLLDSAIEALPESKQALE 481

Query: 470 ELEYYQER 477
           ++E  QE+
Sbjct: 482 DVEDDQEQ 489


>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
 gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
          Length = 448

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 103/415 (24%), Positives = 178/415 (42%), Gaps = 67/415 (16%)

Query: 83  IRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS---- 137
           +RPA FP+ G GL+  R I + + ++++P    ++  TV  S +G      +P ++    
Sbjct: 47  LRPALFPDTGRGLMVPRKIKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQA 106

Query: 138 VALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           +  FLI +K   E S W+ YLDILP      V++ EE+ + L  +  L   +  K   + 
Sbjct: 107 ITTFLIYQKHIGETSFWKPYLDILPNEYTHPVYFGEEDFLYLPHS--LRANIKAK---KQ 161

Query: 197 EYLKVEEEI---------ILPNKQLFPRPITLDDFLWAFGILRSRA-FSRLRGQNLV--- 243
           E +K  EE+         +LPN +      T D + WA+  +++R+ +   +G  ++   
Sbjct: 162 ECIKSYEELKPFFPSLEPLLPNWEGI---FTFDAYRWAWSTVKTRSLYVDDKGSTVLRNL 218

Query: 244 ---------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
                    L+P+ DL+NHS    T     +    G    D  +++      K G+QVL 
Sbjct: 219 DKSGLGVTSLVPMVDLLNHSHSARTGLLIKKSCKNG----DYFYTVTAEDDYKRGDQVLF 274

Query: 295 QYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEI---------SESDP-FFGDKLDIAE 342
            Y     N  L L+YGF+  ++  D   + L  +I          E DP F   K+ +  
Sbjct: 275 CYR-RADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVLLIA 333

Query: 343 TNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTI-WGHLDLPVSHAN 401
           T         D    R +   ++  +R++     D  L + +  + + W H   PVS   
Sbjct: 334 TKYTASDLTCDY---RGVSKTLMTVMRILVCDSEDRPLFKRVLADDLPWDH---PVSVQA 387

Query: 402 EELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
             L   ++     S        I +DE +L   +  P  R+ V +R  EK++LQ+
Sbjct: 388 RSLAQELLERRLLSY------GIMDDEAILAESD-TPVRRLVVAMRNEEKKILQK 435


>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
 gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
          Length = 439

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 97/418 (23%), Positives = 179/418 (42%), Gaps = 48/418 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K     + + PA+FP  G GL+++  + + +V++ +P    +  DTV  S +G 
Sbjct: 37  KWLKERKF--EDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIRSSLGP 94

Query: 128 LCSGLKPWIS----VALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FL+ EK     S W+ YLDILP+     V   E E+V+L  + 
Sbjct: 95  YIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKSYTCPVCL-EPEVVDLLPSP 153

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +           + LF  P+    +   FLWA+  + +RA + R 
Sbjct: 154 LKAKAEEQRARVQDLFTSA-RGFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRS 212

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R Q  +        L P  DL+NHSP +  +        A    +   + +RT    +  
Sbjct: 213 RRQECLSAEPDTCALAPFLDLLNHSPHVQVK--------AAFNEKTRCYEIRTASRCRKH 264

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           ++V I Y  +  N  L L+YGF+  ++      ++ ++       +D     K+ I + +
Sbjct: 265 QEVFICYGPH-DNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDH 323

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
           G       ++  G   P   L     +     + F   + ++  + G +   +S  NE+ 
Sbjct: 324 GFTG----NLTFGWDGPSWRLLTALKLLCLEAERF---TSWKKVLLGEV---ISDTNEKT 373

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQGG-NLDPRLRIAVGVRAGEKRVLQQIDEIF 461
              V +  C   +   H  + +   + +G  +L  +L +   +R  E R+LQ   EI 
Sbjct: 374 SLGVAQKICSDVIEETHAVLRKVSDMKEGTVSLRSQLSLVEALRMEELRILQASAEIL 431


>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
          Length = 1000

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 109/445 (24%), Positives = 185/445 (41%), Gaps = 96/445 (21%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-ASEIGSL----------CSGLKPWISVA 139
           G G++A +DI     +  +P K  IN +T     +I  +            GL  W S+ 
Sbjct: 575 GRGIIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLI 634

Query: 140 LFLIREKKKEDSP-WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           L +I E  + DS  W+ Y D+LP   D+ +FWSE EL +LQ +  +   +G K   ++ +
Sbjct: 635 LIMIYEYLQGDSSQWKSYFDVLPSSFDTPMFWSENELDQLQASH-MRHKIG-KADAEDMF 692

Query: 199 LKVEEEIILPNKQLFPRPITLDDFL-------------WAFGILRSRAFSRL-------- 237
            K    II  N  +F      D  L             +AF +                 
Sbjct: 693 KKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVEDR 752

Query: 238 RGQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
            G++++ ++P+AD++N                A +   +   ++ +  P+KAGE++L  Y
Sbjct: 753 EGKSMMGMVPMADILNADAEF----------NAHVNHEEESLTVTSLRPIKAGEEILNYY 802

Query: 297 DLNKSNAELALDYGFIESKSDR-NAYTLTLEISES------------------------- 330
             +  N+EL   YG++  K  R +   +  +I ES                         
Sbjct: 803 GPH-PNSELLRRYGYVTEKHSRYDVVEIPWDIVESVLTNFGISSKILEQIRGEFEEEEEF 861

Query: 331 -DPFFGDKLDIAETNGLG---ESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFR 386
            D F  ++ D  E N  G   E A F+      +P  + + L+    G      ++ +  
Sbjct: 862 EDTFVLER-DTGEVNSDGTFAEPAKFE-----GMPEDLQEQLKSFLKG------IKKLQS 909

Query: 387 NTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGV 446
           +TI      P     +E+   V+    ++  + + T+I ED+ LL G NLD R R+A  V
Sbjct: 910 DTI------PDKRKRDEIHQAVLVKTLEALAARYPTSISEDQILLNGQNLDQRARMATVV 963

Query: 447 RAGEKRVLQQIDEIFKER-ELELDE 470
           R GEK++LQ+    F E  E+ +D+
Sbjct: 964 RLGEKKLLQEAIATFSEDVEMTMDD 988


>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
          Length = 355

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 138/306 (45%), Gaps = 44/306 (14%)

Query: 38  RLKSRAFAATCSLHSASATTN----PPTAQVETFWQWLRDQKVVSPKSPIRPATFPE-GL 92
           R ++  +AA+ + H A   ++        +++ F +W++ QK+   K  ++  + P   +
Sbjct: 16  RQEAEEWAASKADHDADTCSDEQCISDARKIDAFEKWIQSQKLAVNKLEVK--SIPGFRM 73

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPD-------------------TVAASEIGSLCSGLK 133
           G  A+ DIA  E+ + +P    + P+                   +++  E   L S   
Sbjct: 74  GTTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKN 133

Query: 134 PWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEY 193
             +   L  +   KKE S W+ Y DI+P    S +FWSE+EL EL G+++ +     K+ 
Sbjct: 134 KVLMYFLLQMYNPKKE-SFWKPYFDIMPTNLTSPIFWSEDELQELAGSEVSNMARIEKKR 192

Query: 194 VQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRA--FSRLRGQNLV--LIPLA 248
           ++  Y ++ E I   +++ F  +  TL ++ WA G+  SR    +R  G   V   IPL 
Sbjct: 193 LRAMYDELRERIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQTGHGNVPTFIPLI 252

Query: 249 DLIN--HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
           D++N   S   T   Y  +++ A +++            V  G QV   Y  NKSN E  
Sbjct: 253 DMVNCIESQDKTFIQYDKKLRAAVMYA---------DRAVSRGVQVFESYG-NKSNYEYL 302

Query: 307 LDYGFI 312
           L  GF+
Sbjct: 303 LYNGFV 308


>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
 gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
          Length = 439

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 97/418 (23%), Positives = 179/418 (42%), Gaps = 48/418 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K     + + PA+FP  G GL+++  + + +V++ +P    +  DTV  S +G 
Sbjct: 37  KWLKERKF--EDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIRSSLGP 94

Query: 128 LCSGLKPWIS----VALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FL+ EK     S W+ YLDILP+     V   E E+V+L  + 
Sbjct: 95  YIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKSYTCPVCL-EPEVVDLLPSP 153

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +           + LF  P+    +   FLWA+  + +RA + R 
Sbjct: 154 LKAKAEEQRARVQDLFTSA-RGFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRS 212

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R Q  +        L P  DL+NHSP +  +        A    +   + +RT    +  
Sbjct: 213 RRQECLSAEPDTCALAPFLDLLNHSPHVQVK--------AAFNEKTRCYEIRTASRCRKH 264

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           ++V I Y  +  N  L L+YGF+  ++      ++ ++       +D     K+ I + +
Sbjct: 265 QEVFICYGPH-DNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDH 323

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
           G       ++  G   P   L     +     + F   + ++  + G +   +S  NE+ 
Sbjct: 324 GFTG----NLTFGWDGPSWRLLTALKLLCLEAERF---TSWKKVLLGEV---ISDTNEKT 373

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQGG-NLDPRLRIAVGVRAGEKRVLQQIDEIF 461
              V +  C   +   H  + +   + +G  +L  +L +   +R  E R+LQ   EI 
Sbjct: 374 SLGVAQKICSDVIEETHAVLRKVSDMKEGTVSLRNQLSLVEALRMEELRILQASAEIL 431


>gi|357132366|ref|XP_003567801.1| PREDICTED: uncharacterized protein LOC100845588 [Brachypodium
           distachyon]
          Length = 494

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 111/435 (25%), Positives = 183/435 (42%), Gaps = 95/435 (21%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVA-ASEIGSLCSGLKP 134
           P+G+G+ A RD+    +  V++E+P++  +           PD +     I  +     P
Sbjct: 110 PDGMGVYASRDVEPLRRARVIMEIPLELMLTITKKHPWMFFPDIIPLGHPIFDVIESTNP 169

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     + + W++Y D LP   ECT S +   +E+L+EL+   L S  
Sbjct: 170 ETDWDLRLACLLLYAFDVQGNFWQLYSDFLPSGDECT-SLLLAPKEDLMELEDQDLSSEM 228

Query: 188 LGVKEYVQNEYLKV-EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL--- 242
           L +++   + + K   + I L  K+L P     + FLWA  I++SR+F+ +LR       
Sbjct: 229 LKLQKRAVDFWQKHWHKAIPLKLKRLAP---DHERFLWALSIVQSRSFNMKLRMGAFMQD 285

Query: 243 --VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
             +L P AD++NHSP      + +  K      R L   ++    VK G+++ I Y ++ 
Sbjct: 286 ANILAPYADMLNHSPDANCFLH-WRFK-----DRMLEVMIKAGHAVKKGDEMTINY-MSG 338

Query: 301 SNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL------------GE 348
            N+ L   YGF    S  N + L    S +       L +    GL            GE
Sbjct: 339 INSMLMQRYGF---SSPTNPWELINFSSPAKIHLDSFLSVFNIAGLHDELCHNAALTSGE 395

Query: 349 SAYFD---IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELI 405
           S + D   +   RTLP                            W   DLP   + E   
Sbjct: 396 SNFVDGGVVAAARTLP---------------------------TWSDGDLPAIPSVERKS 428

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLR-IAVGVRAGEKRVLQQIDEIFKER 464
            + +++  +     F TTI++DEQ+L       + R IA+  R   K +L++I       
Sbjct: 429 AQALQEELRKMSESFSTTIQQDEQILDTDEPIRKTREIAIKYRLHRKLLLRKI------- 481

Query: 465 ELELDELEYYQERRL 479
              +D LE YQ+R L
Sbjct: 482 ---IDSLEIYQDRIL 493


>gi|413948641|gb|AFW81290.1| hypothetical protein ZEAMMB73_810642 [Zea mays]
          Length = 401

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/354 (25%), Positives = 155/354 (43%), Gaps = 44/354 (12%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
           + +A  L+     ED+ W++Y D LP   ECT S +   +E+L+EL+   L S  L  +E
Sbjct: 81  LRLACLLLYAFDIEDNFWQLYSDFLPSVDECT-SLLLAPKEDLMELEDEDLASQMLKHQE 139

Query: 193 YVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL-----VLIP 246
              + + K  ++ I    +   R    + FLWA  I++SR+ + +LR         VL P
Sbjct: 140 RAIDFWQKHWDKPIPLKLKRLAR--DHERFLWALSIVQSRSVNLKLRMGAFIQDANVLAP 197

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
            AD++NHSP      + +  K      R L   ++    +K G+++ I Y ++  N++  
Sbjct: 198 YADMLNHSPNANCFLH-WRFK-----DRMLEVMIKAGQAIKKGDEMTIDY-MSGVNSKFM 250

Query: 307 LDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ 366
             YGF    S  N + L    S +       L +    GL +  Y +  L          
Sbjct: 251 ERYGF---SSPTNPWELINFSSPATIHMDSFLSVFNIAGLHDELYHNSALTS-------- 299

Query: 367 YLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE 426
              +V     D  ++ +      W   D+P   + E    + +++ C+  L  F TTIE+
Sbjct: 300 ---VVETDFVDGAVVAAARALPTWSDGDVPAIPSVERKSAQALQEECRQMLDSFPTTIEQ 356

Query: 427 DEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
           D+Q+L    ++     IA+  R   K +LQ+I          +D L+ YQ+R L
Sbjct: 357 DQQILDSDAHISKTREIAIKYRLHRKMLLQKI----------MDALDIYQDRIL 400


>gi|145537195|ref|XP_001454314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422069|emb|CAK86917.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 97/410 (23%), Positives = 184/410 (44%), Gaps = 37/410 (9%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS---- 123
           QWL+D K    K  I   +  EG   L A + I + E VL +P   +++ + V  S    
Sbjct: 45  QWLKDGKAEVSKVQIEVKS--EGYRTLRASQFIRQGEWVLFIPRTHYLSLEEVKKSCLIN 102

Query: 124 ----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILP-ECTDSTVFWSEEELVEL 178
               ++  + + ++ +      L++E ++++S W+ Y+D+LP + +    ++  E+   L
Sbjct: 103 RKMIQLNYIPNNIQTYF--VNHLLQENRRQNSFWKPYIDVLPKDVSGFPTYFDAEQDALL 160

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILR-SRAFSRL 237
           +G+  L T +  ++  + EY  ++E +    K+      T +DF+  F IL  SR+F   
Sbjct: 161 KGSPTLFTVMNQRKIFREEYDNLKEAV----KEFQRYGYTYNDFI-KFRILTISRSFPVY 215

Query: 238 RGQN---LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
            G+N    +L+PLAD +NH      + Y Y     G F       ++    ++ GE++  
Sbjct: 216 IGENEQQQLLVPLADFVNHDNNGFLQ-YGYSPDADGFF-------MQAVRNIQKGEELFY 267

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
            Y    SN    ++YGF    +  N +   + +  +D  F  K+D+   N    +   + 
Sbjct: 268 NYG-QWSNKYFFMNYGFASLTNPMNQFDFDVCLDRNDRLFNLKVDLTGGNICWGNRLVNE 326

Query: 355 VLGRTLPPAMLQYLRLVALGGTDAFL-LESIFR--NTIW-GHLDLPVSHANEELICRVVR 410
               T   A L  +R   +   D FL LE      N  W G    P + A E+   +  +
Sbjct: 327 TDHDTFRQA-LATVRFAQISKLDDFLQLEEDVENYNQFWPGWHTTPKTIALEKATFKAFK 385

Query: 411 DACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEI 460
           +   S L+ F ++IE+D++ L   +     R  + +   EK+++++  E+
Sbjct: 386 ELLVSELANFASSIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV 435


>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Vitis vinifera]
          Length = 509

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 173/382 (45%), Gaps = 32/382 (8%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           + A  D+   +V   VP    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 110 IAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 169

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   E ++ EY +++
Sbjct: 170 KSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNELD 229

Query: 203 EEIILPNK--QLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSP 255
               +     Q +P  I  + F   F I + +AF  ++      Q + L     L+   P
Sbjct: 230 TVWFMAGSLFQQYPYDIPTEAF--PFEIFK-QAFVAIQSCVVHLQKVSLARRFALVPLGP 286

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
            +    Y    K A L + D    L    P KAGE +++ +   + N++L L+YGF++  
Sbjct: 287 PLLA--YRSNCK-AMLAAVDGSVQLVVDRPYKAGESIVV-WCGPQPNSKLLLNYGFVDED 342

Query: 316 SDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTLPPAMLQYLRLVAL 373
           +  +   +   ++  DP + DK  +A+ NG      F + +G  R     ML YLRL  +
Sbjct: 343 NSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYV 402

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
             +D   ++S+  +   G +  PVS   E  +   + D  +  L+G+ TT+ EDE LL  
Sbjct: 403 --SDPSEMQSVISSQ--GPI-CPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLAD 457

Query: 434 GNLDPRLRIAVGVRAGEKRVLQ 455
            NL+P+  +A  +   EK++L 
Sbjct: 458 SNLNPKKLVATQLVRLEKKMLN 479


>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
          Length = 456

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 84/295 (28%), Positives = 130/295 (44%), Gaps = 38/295 (12%)

Query: 68  WQWLRDQKVVSPKS----PIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           WQ+++ Q+ +  +      +R A F + G GL+A RD+   E+++ +P    I  +TV  
Sbjct: 31  WQYIQLQRWLKGRGFQGRHLRAAEFADTGRGLMATRDLKPGELIIALPETCLITTETVLQ 90

Query: 123 SEIGSLCSGLKPWIS----VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           S +G      +P +S    +  FLI E+   E S W+ YLD++P      V+W E E+V 
Sbjct: 91  SYLGKYIRLWRPHVSPLLALCTFLIAERFAGERSQWKPYLDVIPSTYSCPVYW-ELEIVH 149

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-LFPRPI----TLDDFLWAFGILRSR 232
           L    L    L  K  VQ   L  E      + Q LF   +    T D   WA+  + +R
Sbjct: 150 LLPAPLRQKALEQKTEVQE--LHTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTR 207

Query: 233 AF-------SRLRGQNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRD-LLFSLRT 282
                     RL  Q  V  L P  DL+NHSP +  E           FS+D   + +RT
Sbjct: 208 TVYMKHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVE---------AEFSKDRRCYEIRT 258

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDK 337
               +  +Q  I Y  +  N  L L+YGF+ + +   +  +T +   +    GDK
Sbjct: 259 NSGCRKHDQAFICYGPH-DNQRLLLEYGFVAANNPHRSVYVTKDAILAHLSPGDK 312


>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
 gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
          Length = 432

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 96/390 (24%), Positives = 170/390 (43%), Gaps = 43/390 (11%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           +VA  D+   ++ L VPM   +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 29  VVASEDLKPGDLALSVPMSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKRGK 88

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKV 201
           +S W  ++  L           +S + W+ EEL E   G+++    L   E ++ EY ++
Sbjct: 89  ESFWYPFIRELDRQRGRGQVAVESPLLWTSEELDEYFTGSRMKEVVLERLEGIKREYQEL 148

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRS------RAFSRLRGQNL----VLIPLAD-L 250
           +    +        P  +    ++F I +            L+G +L     L+PL   L
Sbjct: 149 DTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVAVQSCVVHLQGVSLPRRFALVPLGPPL 208

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
           + +            +K AG   R     L      K GEQ+L+ +   + N  L L+YG
Sbjct: 209 LAYKSNCKAM-----LKAAGDLVR-----LEVDRAYKKGEQILV-WCGPQPNTRLLLNYG 257

Query: 311 FIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYL 368
           F++  +  +  ++   ++  DPF+ +K  I + N       F I  GR       ML YL
Sbjct: 258 FVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYL 317

Query: 369 RLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDE 428
           RL  +  +D + +ES+F          PVS  NE  +   +    +  ++ + TTI+ED 
Sbjct: 318 RLGHV--SDPYHMESVFSAE---GPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDR 372

Query: 429 QLLQ--GGNLDPRLRIAVGVRAGEKRVLQQ 456
            LL+    +++P+ R+A  +   EK +L  
Sbjct: 373 ALLEDCSSDINPKQRVATQLLLIEKEILHN 402


>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
 gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
          Length = 510

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 104/436 (23%), Positives = 167/436 (38%), Gaps = 93/436 (21%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           + + F  W R    +   S +RP+TF    GL A  DI  + +V+EVP +  +    V A
Sbjct: 59  RTQAFMSWARGPASIR-FSGVRPSTFGGVRGLAASSDIPDDALVVEVPRRSAV----VLA 113

Query: 123 SEIGSLCSGL--------KPWIS--VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSE 172
            +  + C G+         PW +   A+ L  +++   SP   ++  LP  T   V W E
Sbjct: 114 PKQRNSCPGMVTDDWWKSAPWFAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTGVPVLWDE 173

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSR 232
            ++  LQ   L++    VKE  Q E+ ++  +++        +  + +DF WA   +RSR
Sbjct: 174 RQIAALQYPYLIAQ---VKEQ-QREWQQLYGDLVRSGTPAGVQAPSREDFFWAMSCVRSR 229

Query: 233 AFS----------RLRGQNLVLI------------------------------------- 245
            FS          RLR   LV +                                     
Sbjct: 230 TFSGPYIGSTLQDRLRTAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRS 289

Query: 246 -------PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
                  PL DL NHS  + +E  AY   G         +S+      K GEQV I Y  
Sbjct: 290 LKQYAICPLIDLFNHSSAVQSE-VAYNYFGDS-------YSVVASREFKKGEQVFISYG- 340

Query: 299 NKSNAELALDYGFIESKSDRNAYTLT---LEISESDPFFGDKLDIAETNGLGESAYFDIV 355
            +SN  L   YGF E+ + ++ Y +T     ++        +LD  + + L  S     +
Sbjct: 341 AQSNDSLMQYYGFAEANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAI 400

Query: 356 LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
                P   LQ +R +    ++A    S F  +  G  D       E  +  VV +  + 
Sbjct: 401 QRAGFPSETLQAVRFLLAADSEAGADVSSFSKS--GSPD------QEAQLAEVVAEVVRR 452

Query: 416 ALSGFHTTIEEDEQLL 431
            L    ++++ED  LL
Sbjct: 453 ELGHLGSSLQEDLALL 468


>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
          Length = 486

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 104/414 (25%), Positives = 177/414 (42%), Gaps = 44/414 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS- 127
           QWL      + K  ++    PE  G+ +++ +A  E +L +P K  I  +    ++IG  
Sbjct: 47  QWLEGNGADTKKLALQEYA-PEVRGVHSRKVLAPGERILVIPKKCLITVEMGKQTDIGRK 105

Query: 128 -LCSGLK----PWISVALFLIRE-KKKEDSPWRVYLDILPEC-TDSTVFWSEEELVELQG 180
            L   +       I + +FL+ + ++ E S +R Y   LP   ++  +FWS+EEL  L+G
Sbjct: 106 LLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLPSTLSNMPIFWSDEELGWLKG 165

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRG 239
           + ++      K  ++ +Y     ++I      F R  +LD F WA  I+ SR F   + G
Sbjct: 166 SYIIQQIQERKAAIRKDY-----DVICRVDPAFAR-FSLDRFSWARMIVCSRNFGLTIDG 219

Query: 240 -QNLVLIPLADLINH-SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
            +   L+P AD++NH  P  T+  +   I           F++ +   +  G QV   Y 
Sbjct: 220 VKTAALVPFADMLNHYRPRETSWTFDQSIDA---------FTITSLGTIGTGAQVYDSYG 270

Query: 298 LNKSNAELALDYGFI------ESKSDRNAYTLTLEISESDP--FFGDKLDIAETNGLGES 349
             K N    L+YGF       E   + N   +  ++S++D   F+  +  + E+      
Sbjct: 271 -KKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKRAYLHESGIYTMD 329

Query: 350 AYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
           A                + RL+ +   D F   S  +     H   P+S  NE    + +
Sbjct: 330 ARLSCSHSDANTREGFSFARLI-VATEDEF---SSMKMKSPAHSSPPISFDNEIRALQYL 385

Query: 410 RDACKSALSGFHTTIEEDEQLLQGGN---LDPRLRIAVGVRAGEKRVLQQIDEI 460
           RD     LS + TTIEED +LL          R++    +R GEK+V +   E+
Sbjct: 386 RDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIR-GEKQVCRYFQEL 438


>gi|297804126|ref|XP_002869947.1| PTAC14 [Arabidopsis lyrata subsp. lyrata]
 gi|297315783|gb|EFH46206.1| PTAC14 [Arabidopsis lyrata subsp. lyrata]
          Length = 483

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 113/430 (26%), Positives = 178/430 (41%), Gaps = 85/430 (19%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +DI    +  V++E+P++  I           PD V     I  + +   P
Sbjct: 99  PDGFGVYASKDIEPRRRARVIMEIPLELMITIRQKHPWMFFPDIVPIGHPIFDIINSTDP 158

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W I +A  L+    ++D  WR+Y D LP   EC+ S +  +EE+L ELQ   L+ST 
Sbjct: 159 EIDWDIRLACLLLFSFYRDDHFWRLYGDFLPAADECS-SLLLATEEDLAELQDPDLVSTI 217

Query: 188 LGVKEYVQNEYLKV-EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLV-- 243
              ++ V   + K     + L  K+L   P   + F+WA  I ++R  S + R   LV  
Sbjct: 218 RQQQKRVLEFWEKNWHSGVPLKIKRLAEDP---ERFIWAVSIAQTRCISMQTRIGALVQE 274

Query: 244 ---LIPLADLINHS--PGI----TTEDYAYEI-KGAGLFSRDLLFSLRTPVPVKAGEQVL 293
              +IP AD++NHS  P        +D   E+   AG               +K GE++ 
Sbjct: 275 LNMMIPYADMLNHSFEPNCFLHWRPKDRMLEVMSNAG-------------QAIKKGEEMT 321

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           I Y   + N  L   YGF    +  +A   +     S       L +    GL E  Y D
Sbjct: 322 INYMPGQKNNMLMERYGFSTPVNPWDAIKFS---GASRIHLNSFLSVFNIFGLPEEYYHD 378

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNT----IWGHLDLPVSHANEELICRVV 409
                              L G D F+  ++         W  +DLP   + E    + +
Sbjct: 379 -----------------SELSGGDTFVDGAVIAAARTLPTWSDIDLPPIPSAERKAVKEL 421

Query: 410 RDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELD 469
           +D C+  L+ + TT ++D++LL   + + R   A  V+    R +  I +I K       
Sbjct: 422 QDECRKMLAEYPTTADQDQKLLDSMS-EARTTFATAVKYRMHRKM-FIGKIIK------- 472

Query: 470 ELEYYQERRL 479
            L+ YQER L
Sbjct: 473 ALDIYQERLL 482


>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
          Length = 428

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 173/381 (45%), Gaps = 32/381 (8%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           + A  D+   +V   VP    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 29  IAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 88

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   E ++ EY +++
Sbjct: 89  KSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNELD 148

Query: 203 EEIILPNK--QLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSP 255
               +     Q +P  I  + F   F I + +AF  ++      Q + L     L+   P
Sbjct: 149 TVWFMAGSLFQQYPYDIPTEAF--PFEIFK-QAFVAIQSCVVHLQKVSLARRFALVPLGP 205

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
            +    Y    K A L + D    L    P KAGE +++ +   + N++L L+YGF++  
Sbjct: 206 PLLA--YRSNCK-AMLAAVDGSVQLVVDRPYKAGESIVV-WCGPQPNSKLLLNYGFVDED 261

Query: 316 SDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTLPPAMLQYLRLVAL 373
           +  +   +   ++  DP + DK  +A+ NG      F + +G  R     ML YLRL  +
Sbjct: 262 NSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYV 321

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
             +D   ++S+  +   G +  PVS   E  +   + D  +  L+G+ TT+ EDE LL  
Sbjct: 322 --SDPSEMQSVISSQ--GPI-CPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLAD 376

Query: 434 GNLDPRLRIAVGVRAGEKRVL 454
            NL+P+  +A  +   EK++L
Sbjct: 377 SNLNPKKLVATQLVRLEKKML 397


>gi|348679693|gb|EGZ19509.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Phytophthora sojae]
          Length = 606

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 169/378 (44%), Gaps = 22/378 (5%)

Query: 75  KVVSPKSPIRPA---TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS- 130
           KV SP S +  A    F  G+      D+ +   +L +PM   ++  + A   +G L   
Sbjct: 179 KVESPTSSVLKAEGFNFGRGMAYTVTEDVEQGAELLSLPMSKVMSVASAARGRVGLLLEV 238

Query: 131 --GLKPWISVALFLIREKK-KEDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLLS 185
              L P I++ L L+ E+     S +  ++  LP  E  +ST+F+SE +L E++G+QLL 
Sbjct: 239 NPDLPPAIALGLHLLEEQALGAKSNFSEFVSSLPGVEAINSTLFYSENQLKEMEGSQLLR 298

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFP----RPITLDDFLWAFGILRSRAFSRLRGQ- 240
            TLG  + V+  Y  + + +  P     P    +  TLD F WA G++ + AF     + 
Sbjct: 299 YTLGRAQAVEAFYDALLQPVTSPEAVDPPIFKEQDFTLDKFRWAMGVVWASAFPVGEDEA 358

Query: 241 NLVLIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
           ++VL P+ D I     +  E D A       +        +    P++ G +V +     
Sbjct: 359 DVVLAPVLDTIGICTDVADEGDEACPPNQIEVDQSSQRLVVHASSPLEKGREVRLSMP-G 417

Query: 300 KSNAELALDYGFIESKSDR--NAYTLTLEISESDPFFGDKLDIAETNGLGES--AYFDIV 355
           KS+A+  L+ GF   ++ +  +   LT+ +  SD     K  +  T  L ES  A + + 
Sbjct: 418 KSSAQFMLNNGFARDRASKKLDKLDLTVTLDPSDALASVKSYLLHTQ-LNESVNASYALF 476

Query: 356 LGRT-LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACK 414
            G + +   + + L++  L G +    + +            +S  NE +  R +   C 
Sbjct: 477 HGSSKIDDEISKSLKMKLLSGAELSRFKELLDPKEEAEKRNILSLRNEFVFTRAIMSTCT 536

Query: 415 SALSGFHTTIEEDEQLLQ 432
           + L  + T++E+D++ L+
Sbjct: 537 TLLQQYPTSVEQDQEELE 554


>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
 gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
          Length = 456

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 83/295 (28%), Positives = 130/295 (44%), Gaps = 38/295 (12%)

Query: 68  WQWLRDQKVVSPKS----PIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           WQ+++ Q+ +  +      +R A F + G GL+A RD+   E+++ +P    I  +TV  
Sbjct: 31  WQYIQLQRWLKGRGFQGRHLRAAEFADTGRGLMATRDLKPGELIIALPETCLITTETVLQ 90

Query: 123 SEIGSLCSGLKPWIS----VALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVE 177
           S +G      +P +S    +  FLI E+   D S W+ YLD++P      V+W E E++ 
Sbjct: 91  SYLGKYIRLWRPHVSPLLALCTFLIAERFAGDCSQWKPYLDVIPSTYSCPVYW-ELEIIH 149

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-LFPRPI----TLDDFLWAFGILRSR 232
           L    L    L  K  VQ   L  E      + Q LF   +    T D   WA+  + +R
Sbjct: 150 LLPAPLRKKALEQKTEVQE--LHTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTR 207

Query: 233 AF-------SRLRGQNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRD-LLFSLRT 282
                     RL  Q  V  L P  DL+NHSP +  E           FS+D   + +RT
Sbjct: 208 TVYMKHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVE---------AEFSKDRRCYEIRT 258

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDK 337
               +  +Q  I Y  +  N  L L+YGF+ + +   +  +T +   +    GDK
Sbjct: 259 NSGCRKHDQAFICYGPH-DNQRLLLEYGFVAANNPHRSVYVTKDAILAHLSPGDK 312


>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
          Length = 492

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 118/455 (25%), Positives = 186/455 (40%), Gaps = 83/455 (18%)

Query: 62  AQVETFWQWLRDQKV-VSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           ++ E F +WL    V +S K+ +        G  LVA+ D A++E++  VP    ++   
Sbjct: 7   SKTEAFVKWLNHVGVRISAKAELTCLRADGRGRALVAKGDFAEDELIFSVPRTSTLSV-K 65

Query: 120 VAASEIGSLCSGLKP--------WISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
            A  E+ S    + P        W ++   +I E  + +S W  Y ++LP   DS VFWS
Sbjct: 66  AALPEMLSGRQDISPEDIESMPGWAALTAVIISEGLRPESKWAPYFNVLPTKLDSLVFWS 125

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS 231
            EEL ELQ + +L      K+  ++   K EE       ++ P    +D F      + +
Sbjct: 126 PEELAELQASAVL------KKVGKD---KAEEIFHQSISKVTPEGTDVDIFHRVASTIMA 176

Query: 232 RAFS-------RLRGQN-------------LVLIPLADLINHSPGITTEDYAYEIKGAGL 271
            AF           G N             L +IPLAD++N          A     A L
Sbjct: 177 YAFDIPDIEQEDEEGANEDDLVDDDEQKTSLAMIPLADMLN----------ADADNNARL 226

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFIESKSDRNAYTLTLEISES 330
                   +RT  P+K GE++L  Y  L +S+  L   YG++   +D+ A     EIS S
Sbjct: 227 HYDGEELEMRTINPIKTGEEILNDYGQLPRSD--LLRRYGYV---TDKYATFDVAEISTS 281

Query: 331 ---DPFFGD----------------KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLV 371
              D  + D                +L++A    + E A+ D+       P +   L  +
Sbjct: 282 TITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAH-DVGHATEEWPCISDELVAL 340

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLPV-SHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
                     E          +  P  S    EL+ + ++   +     + TT+ EDE L
Sbjct: 341 VYLLLVG---EETLAAIQSSKMSFPSRSKMETELVGKALQRILERREREYATTVVEDENL 397

Query: 431 LQGGNLDPRLRIAVGVRAGEKRVLQQID---EIFK 462
           LQ GN   R+++A+ VR GEK VL++     EIFK
Sbjct: 398 LQSGNHSNRVKMAIQVRMGEKVVLREAKKRREIFK 432


>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 478

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 113/420 (26%), Positives = 181/420 (43%), Gaps = 70/420 (16%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-ASEIGSLCSG------------LKPWIS 137
           G G+VA +DIA   V+  +P K  IN +T     +I  + +G            L  W S
Sbjct: 41  GRGIVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGS 100

Query: 138 VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG---VKEY 193
           + L +I E  +   SPW+ Y ++LPE  D+ +FW   +L  L+G+ +LS  +G     E 
Sbjct: 101 LILVMIYEYLQGNASPWKTYFEVLPEKFDTLMFWESPDLEYLKGSAVLS-KIGKDEADEM 159

Query: 194 VQNEYLKV---EEEIILPNKQLFPRPITLDDFLWAFG-ILRSRAF--------------- 234
            ++  L V      I  P     P    L       G I+ + AF               
Sbjct: 160 FRSRILPVISANAGIFFPQGVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQEDEEW 219

Query: 235 --SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
              R     L ++P+AD++N       E  A+   G    S   + +LR   P+KAGE++
Sbjct: 220 VEDREGKTMLGMVPMADILNAD----AEFNAHVNHGEDDLS---VTALR---PIKAGEEI 269

Query: 293 LIQYDLNKSNAELALDYGFIESKSDRN----------AYTLTLEISESDPFFG------D 336
           L  Y  +  N+EL   YG++  K  R             TLT ++  +D  +       D
Sbjct: 270 LNYYGPH-PNSELLRRYGYVTPKHSRYDVVEIPWDLVQSTLTEQLRLTDEVWKQVAEHVD 328

Query: 337 KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP 396
             D+ +   L   +      G    PA +Q +    L      +L++I +  + G L +P
Sbjct: 329 PEDLEDVFVLERESGEPDSEGHLQTPAKVQEVS-AELEEQLKDVLKAIKK--VRGDL-IP 384

Query: 397 VSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
                +E+   VV    +  L+ + TT EEDE +L  GN+  R ++AV VR GEKR++++
Sbjct: 385 DKRKRDEVYQCVVVSTLQKLLAQYPTTAEEDEAMLASGNVTSRQKLAVEVRLGEKRLIKE 444


>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
          Length = 440

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRKF--QDSNLVPARFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIQSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YIAKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVC-LEPEVVNLLPIS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RG--------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R             L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQWECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHCYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNHRLFLEYGFVSGHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
          Length = 536

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/374 (25%), Positives = 167/374 (44%), Gaps = 28/374 (7%)

Query: 100 IAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-EDSPWRV 155
           +   +V  EVPM   +  + V   E +  L   + L     +AL+L+ EKK+ +DS W  
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202

Query: 156 YLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
           Y+  L           +S + W+E EL  L+G+ +    +   E ++ EY +++    + 
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGITTEDYA 263
                  P  +    + F I + +AF  ++      Q + L     L+   P + T  Y 
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT--YK 319

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
              K A L +      L    P KAGE +++ +   + N+ L L+YGFI+  +  +   +
Sbjct: 320 SNCK-AMLTAVGDSVRLVVDRPYKAGEPIIV-WCGPQPNSRLLLNYGFIDEDNPYDRIVI 377

Query: 324 TLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP--PAMLQYLRLVALGGTDAFLL 381
              ++  DP F +K  +A+ NG      F + +G+       ML YLRL  +   D   +
Sbjct: 378 EASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDE--M 435

Query: 382 ESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLR 441
           +SI  +        PVS   E  +   +    +S L+ + TT++ED+ +L  GNL+P+  
Sbjct: 436 QSILSSE---GDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKE 492

Query: 442 IAVGVRAGEKRVLQ 455
           +A  +   EK++L 
Sbjct: 493 VATRLVRLEKKLLH 506


>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 705

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 139/307 (45%), Gaps = 43/307 (14%)

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHSPGITTED 261
           NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+ G+ T  
Sbjct: 340 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHTNGLITTG 396

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
           Y  E        R    +L+     +AGEQ+ I Y   +SNAE  +  GF    +  +  
Sbjct: 397 YNLED------DRCECVALQD---FQAGEQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRV 446

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTLPP---AMLQYLRLVALGGTD 377
            + L +S+SD  +  K ++    G+    +  +  L  T PP    +L +LR+  +  T+
Sbjct: 447 KIKLGVSKSDRLYAMKAEVLARAGIPTYVWSSVFALHFTEPPISAQLLAFLRVFCM--TE 504

Query: 378 AFLLE------SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL 431
             L E      +I R    G+ + PVS  NE  +   + D     L  + TT EED+ +L
Sbjct: 505 EELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVL 564

Query: 432 QGGNLDPRLRIAVGVRAGEKRVLQQI-------DEIFKERELELDELEYYQERRLKDLGL 484
           +  +L  R ++A+ +R GEK +L++         + F++   E   L  Y+E    +LGL
Sbjct: 565 KNPDLPARTKMAIKLRLGEKEILEKAVQSAATNRKCFRKHMEEQAPLPKYEE---SNLGL 621

Query: 485 V-GEQGD 490
           + G  GD
Sbjct: 622 LEGGAGD 628


>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
          Length = 514

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 171/379 (45%), Gaps = 28/379 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           + A  D+ K +V   VP    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 174

Query: 150 DSPWRVYLDILPEC-------TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   E ++ EY +++
Sbjct: 175 KSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNELD 234

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGI 257
               +        P  +    ++F I + +AF  ++      QN+ L     L+   P +
Sbjct: 235 TVWFMAGSLFQQYPFDIPTEAFSFEIFK-QAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
               Y    K A L + D    L    P KAG+ +++ +   + NA+L L+YGF++  + 
Sbjct: 294 LA--YCSNCK-AMLTAVDGAVELVVDRPYKAGDPIVV-WCGPQPNAKLLLNYGFVDEDNP 349

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTLPPAMLQYLRLVALGG 375
            +   +   ++   P + DK  +A+ NG      F + +G  R     ML YLRL  +  
Sbjct: 350 YDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYM-- 407

Query: 376 TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
           +D   ++S+  +   G +  P+S   E  +   + +     LSG+ TT +ED+ LL   +
Sbjct: 408 SDPSEMQSVISSQ--GPV-CPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPS 464

Query: 436 LDPRLRIAVGVRAGEKRVL 454
           L PR R+A  +   EK++L
Sbjct: 465 LSPRKRVATRLVQLEKKIL 483


>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 476

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 112/464 (24%), Positives = 189/464 (40%), Gaps = 93/464 (20%)

Query: 67  FWQWLR-DQKVVSPK----------SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWI 115
           F  WLR +   VSPK          +  R       +   A+    K E +  +P K  +
Sbjct: 15  FIAWLRANGATVSPKLTLQATAAFNADSRTQVLHRRVIASAEAGFDKEEELFSIPRKLLL 74

Query: 116 NPDTVAASEI----GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
           +  T + +E+          L  W+ + + ++ E   +DS WR YLD+LPE  D+ +FW+
Sbjct: 75  SASTSSIAELLLENKKEACALVGWMPLVVAMMYEITNKDSFWRPYLDLLPETLDTPMFWN 134

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAF----- 226
           +++L  L+GT  LS  LG KE  +  +     E I+P  +L P    L     A      
Sbjct: 135 DDDLELLEGTSTLS-HLG-KEDAETIFT----EQIVPFMKLHPTHFDLKVHNMALYHRVA 188

Query: 227 GILRSRAFS--------------------------------RLRGQNLVLIPLADLINHS 254
            ++ + +FS                                + R + + ++PLAD+++H 
Sbjct: 189 SVIMAYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRMEKIAMVPLADMLDHK 248

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFIE 313
            G            A LF      ++    P  AG ++   Y DL  SN+EL   YGFI+
Sbjct: 249 TGCN---------NARLFYGKTTLAMSCIEPCAAGHELYNTYGDL--SNSELLRKYGFID 297

Query: 314 SKSDRNAYTLTLEI----SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLR 369
             ++ N+  + +E+     ES  F  + ++  E  G     +   +    LPP  L+   
Sbjct: 298 DVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFH--IPADALPPQELE--- 352

Query: 370 LVALGGTDAFLLESIFRNTIWGHLD--LPVSHANEELICRVVRDACKSALS-------GF 420
                 + A L +S  +      LD    +      L+ +  R   ++ L+        +
Sbjct: 353 -----ASIALLFQSPKQVRALRALDDEDEIRSFLATLVNKCRRKVSETLLAFGQKRAEEY 407

Query: 421 HTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
            TT EEDE+ L+  +L  R ++A+ VR GE+ +L       KER
Sbjct: 408 TTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLKER 451


>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 113/465 (24%), Positives = 189/465 (40%), Gaps = 104/465 (22%)

Query: 63  QVETFWQWLRDQK--VVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           Q + F QWL  Q    +SPK  +R       G G+VA+ +I + E +  +P    +   T
Sbjct: 16  QSDGFMQWLSQQTDVTISPKIEVRDLRHQGSGRGVVARSNIQEGEDLFHLPHHIVLMVKT 75

Query: 120 VAASEI-GSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE 177
              ++I       L PW+S+ + +I E    E S W+ Y  +LP   D+ +FWSEEE  +
Sbjct: 76  SRLNQILADDLKNLGPWLSLVVVMIYEYSLGEQSNWKQYFQVLPSKFDTLMFWSEEEFSQ 135

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDD------------ 221
           LQ + ++   +G ++  ++ + KV   ++  +  LFP PI    + DD            
Sbjct: 136 LQASAVVD-KVGKRDAEEDIFEKV-LPLVRAHPDLFP-PIDGVMSYDDDTGAQALLELAH 192

Query: 222 ------FLWAFGILRSRAFSRLRGQNLV----------LIPLADLINHSPGITTEDYAYE 265
                   +AF I ++           +          ++PLADL+N            +
Sbjct: 193 RMGSLIMAYAFDIEKAEEEESEGEDGYLTDDEEQLPKGMVPLADLLNADA---------D 243

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFIESKSDRNAYTLT 324
              A LF  +    +R   P+KAG+++   Y +L +S+  L   YG++   +D  A    
Sbjct: 244 RNNARLFQEEGALVMRAIKPIKAGDEIFNDYGELPRSD--LLRRYGYV---TDNYAQYDV 298

Query: 325 LEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP-AMLQYLRLVALGGT------- 376
           +E+                 G+  +A FD +  +  P   +L  L ++  G         
Sbjct: 299 VELP--------------LTGICHAAGFDNIEDKEYPQLKLLDQLEILEDGYCILRPSPE 344

Query: 377 DAFL-----------------LESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
           D  L                  E + R         P+  A E    R++ DA +S +  
Sbjct: 345 DTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGARE---ARILLDAAQSKMGQ 401

Query: 420 FHTTIEEDEQLLQG-------GNLDPRLRIAVGVRAGEKRVLQQI 457
           + TTI+ED+ LLQ           + R  +AV VR GEK +LQ +
Sbjct: 402 YGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQAL 446


>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
 gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
 gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
 gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 463

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 74/261 (28%), Positives = 126/261 (48%), Gaps = 23/261 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISV-ALFLIREKKK- 148
           G  L A + I   + +L+VP    I PD +  S+I  L S     I + A  LIREKK  
Sbjct: 70  GRSLFASKVIYAGDCMLKVPFNAQITPDELP-SDIRVLLSNEVGNIGMLAAVLIREKKMG 128

Query: 149 EDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           + S W  Y+  LP+  +  S++FW E+EL  ++ + +   T+  K  ++ ++  V +   
Sbjct: 129 QKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFSFVAQAFK 188

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH---SPGITTEDYA 263
                +  RP  L+DF++A+ ++ SRA+     + + LIP AD +NH   S  I   D  
Sbjct: 189 QHCPIVTERP-DLEDFMYAYALVGSRAWEN--SKRISLIPFADFMNHDGLSASIVLRDED 245

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
            ++      + D  +S         G++V I+Y    SNA L LD+GF    +  +   +
Sbjct: 246 NQLSEV---TADRNYS--------PGDEVFIKYG-EFSNATLMLDFGFTFPYNIHDEVQI 293

Query: 324 TLEISESDPFFGDKLDIAETN 344
            +++   DP    KL + +T+
Sbjct: 294 QMDVPNDDPLRNMKLGLLQTH 314


>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
           domestica]
          Length = 437

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/425 (22%), Positives = 175/425 (41%), Gaps = 60/425 (14%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K       +RP  F   G GL+A + +   E+++ +P +  +  DTV  S +G 
Sbjct: 39  KWLKKRKF--EDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTVIRSYLGD 96

Query: 128 LCSG----LKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +     + P +++  FLI EK   + SPW+ YLD+LP+     V       +E +  +
Sbjct: 97  YITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPKAYTCLV------CLEPEVVR 150

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPN------KQLFPRPIT----LDDFLWAFGILRSR 232
           LL   L +K   + + ++V++  I         + LF   +        FLWA+  + +R
Sbjct: 151 LLPRPLQMK--AEEQRMQVQKLFISSRGFFSSLQSLFTEDVKHVFHYHAFLWAWCTINTR 208

Query: 233 AFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
                  Q            L P  DL+NHSP +  E        A        + +RT 
Sbjct: 209 TVYMKHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVE--------AAFNEETCCYEIRTT 260

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE-----ISESDPFFGDKL 338
              K  E++ I Y  +  N  L L+YGF+ S +  +A  + ++     +   D     K+
Sbjct: 261 SHCKKFEELFICYGPH-DNHRLLLEYGFVASNNPHSAVYIAIDSLVDHLPSVDKQMNKKI 319

Query: 339 DIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS 398
            + + +G  E    ++  G   P   L     +   G + F   + ++  + G +   +S
Sbjct: 320 SLLKEHGFSE----NLTFGWDGPSWKLLTALKLLCLGPEEF---TCWKKVLLGEV---IS 369

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLL-QGGNLDPRLRIAVGVRAGEKRVLQQI 457
             NE+    + R  C   +    T + +   +  +   L+ +L +   +   E R+LQ  
Sbjct: 370 DTNEQKSLVLSRQICSYFIEETKTALHKISHMKNEQTALEYQLTLVENLWVEELRILQVS 429

Query: 458 DEIFK 462
            +I K
Sbjct: 430 ADILK 434


>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 445

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 139/312 (44%), Gaps = 40/312 (12%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGS 127
           WL++   V   + +  A FP  G G+   R   + E +L +P    W      A   +G 
Sbjct: 4   WLKESGAVGLDA-LELAEFPVIGRGVRTLRRFNEGERILTIPRDVLWTVEHAYADPLLGP 62

Query: 128 LCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           +    +P +SV       +  +R ++      R +L  +P+   S++F++E+EL    GT
Sbjct: 63  VLRSARPPLSVDDTLATYILFVRSRESGYDGLRSHLAAVPKSYSSSIFFTEDELEVCAGT 122

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRA--FSRLRG 239
            L + T  +   ++++Y                R +    + WA   + SRA  F+   G
Sbjct: 123 SLYAITKQLGRCIEDDY----------------RALV---YKWALCTVWSRAMDFALPDG 163

Query: 240 QNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
           +++ +L P AD++NHS  +  + +AY+     L       S+      +AG+QV I Y  
Sbjct: 164 KSVRLLAPFADMLNHSSEVR-QCHAYDPLSGNL-------SILAGKGYEAGDQVFIHYGS 215

Query: 299 NKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
             +N  L L YGF+   +  ++Y L LE     PFF  K  +    GL  ++   + L  
Sbjct: 216 VPNNRLLRL-YGFVIPSNPNDSYDLVLETHPLAPFFEQKRKLWALAGLDSTSTISLTLTD 274

Query: 359 TLPPAMLQYLRL 370
            LP  +L+YLR+
Sbjct: 275 PLPNNVLRYLRI 286


>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
 gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
          Length = 496

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 114/431 (26%), Positives = 186/431 (43%), Gaps = 78/431 (18%)

Query: 91  GLGLVAQRDIAKNEVVL-----------------EVPMKFWINPDTVAASEIGS---LCS 130
           G G+VA+ DIA + V+                  E+P  F +  D    S+ G      S
Sbjct: 46  GRGIVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSS 105

Query: 131 GLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG 189
               W  + L LI E  + D S W+ YLD+LP   D+ +FWS  EL ELQ + L+ T +G
Sbjct: 106 SQDSWTLLILILIYEYLQGDASQWKPYLDVLPSAFDTPMFWSPTELAELQASALV-TKVG 164

Query: 190 VKE---YVQNEYLKV---EEEIILP-------NKQLFPRPITLDDFLWAFGILRSR---A 233
            +E    ++++ L V    + +  P       + QLF     +   + A+     +   A
Sbjct: 165 REEADRMIRSKILPVIRGHDHVFFPHGRQRLDDDQLFELAHRMGSAIMAYAFDLEKDDDA 224

Query: 234 FSRLRGQN-----------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
                 Q+           L ++P+AD++N       E  AY   GA   +     +LRT
Sbjct: 225 NEEASEQDEWVDDREGRTMLGMVPMADMLNAD----AEFNAYINHGADSLT---ATALRT 277

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESD--PFFGDKLD 339
              +KAGE++L  Y     N EL   YG++  K  R +   L  ++ E+      GD++ 
Sbjct: 278 ---IKAGEEILNYYG-PLPNGELLRRYGYVTPKHARYDVVELPWDLIEAGLRERVGDRMS 333

Query: 340 IAETNGLGESAYFD------IVLGRTL--PPAMLQYLRLVALGG-----TDAFLLESIFR 386
            A+   + +    D       VL R+   P +  Q        G     +D F       
Sbjct: 334 PADWEQVNKLTRSDDDFEESFVLERSSEDPDSTGQLEGDATFTGLPDELSDQFKTFLKAT 393

Query: 387 NTIWGHLDLPVSHANEELI-CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVG 445
            T+ GH+DL       +++  +VV DA ++    + T++E+D+ L+  G L  R R+ V 
Sbjct: 394 KTV-GHVDLVSDKDTRKVVFLQVVFDALRARERQYATSLEDDQSLVDAGQLTERQRMGVW 452

Query: 446 VRAGEKRVLQQ 456
           VR GEK++L++
Sbjct: 453 VRMGEKQLLRE 463


>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
 gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
          Length = 371

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/260 (26%), Positives = 115/260 (44%), Gaps = 28/260 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK-E 149
           G  L A R +      LE+P    I P+ V  S++  L S   P   ++LFL+ EK K +
Sbjct: 5   GRALFATRRVPAGSRFLEIPRIAIITPENVP-SQVSHLLSTSNPKTRLSLFLLSEKHKAQ 63

Query: 150 DSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           +S W  YL  LP+  D  ST+FW +EEL  L+ +     T+   + +++E+  V E  + 
Sbjct: 64  ESQWAPYLRCLPQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEF-HVLEANVF 122

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIK 267
           P  +     ++L DF+ A+   +              IP AD  NH     T     + K
Sbjct: 123 PWCRDVLGEVSLTDFMHAYSTDQ--------------IPFADFFNHDHNCQTRLSYDKEK 168

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
              +   D  +        KAG+++ + Y  +  N+ LA+DYGF  + +      + + +
Sbjct: 169 DCAVAVADQDY--------KAGDEIFLSYG-STPNSILAVDYGFAVASNPHEQVEVPMGV 219

Query: 328 SESDPFFGDKLDIAETNGLG 347
           S +DP    KL     + + 
Sbjct: 220 SLTDPLRDLKLQTLSRHNMS 239


>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
 gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
          Length = 502

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/382 (24%), Positives = 177/382 (46%), Gaps = 32/382 (8%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS---LCSGLKPWISVALFLIREKKK-E 149
           + A  D+  ++V + VP    +  + V  +E  +     + L     +AL+L+ EKK+ +
Sbjct: 103 VAASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGK 162

Query: 150 DSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   + ++ EY +++
Sbjct: 163 KSFWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELD 222

Query: 203 EEIILPNK--QLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSP 255
               +     Q +P  I  + F   F I + +AF  ++      Q + L     L+   P
Sbjct: 223 TVWFMAGSLFQQYPYDIPTEAF--PFEIFK-QAFVAIQSCVVHLQKVSLARRFALVPLGP 279

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
            +    Y+   K A L + D    L    P KAGE +++ +   + N++L L+YGF++  
Sbjct: 280 PLLA--YSSNCK-AMLTAVDGAVELVVDRPYKAGEPIVV-WCGPQPNSKLLLNYGFVDED 335

Query: 316 SDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYLRLVAL 373
           +  +   +   ++  DP + DK  +A+ NG      F +  G+       +L YLRL  +
Sbjct: 336 NPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLGYV 395

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
             +D   ++S+  +   G +  PVS   E+ +   +    ++ L+G+ T+I EDE +L  
Sbjct: 396 --SDPSEMQSVISSQ--GPV-CPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLAD 450

Query: 434 GNLDPRLRIAVGVRAGEKRVLQ 455
            NL+P+ R+A  +   EK++L+
Sbjct: 451 PNLNPKKRVATQLVRLEKKMLK 472


>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 171/379 (45%), Gaps = 28/379 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           + A  D+ K +V   VP    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 117 VAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 176

Query: 150 DSPWRVYLDILPEC-------TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   E ++ EY +++
Sbjct: 177 KSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNELD 236

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGI 257
               +        P  +    ++F I + +AF  ++      QN+ L     L+   P +
Sbjct: 237 TVWFMAGSLFQQYPFDIPTEAFSFEIFK-QAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
               Y    K A L + D    L    P KAG+ +++ +   + NA+L L+YGF++  + 
Sbjct: 296 LA--YCSNCK-AMLTAVDGAVELVVDRPYKAGDPIVV-WCGPQPNAKLLLNYGFVDEDNP 351

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTLPPAMLQYLRLVALGG 375
            +   +   ++  DP + DK  +A+ NG      F + +G  R     ML YLRL  +  
Sbjct: 352 YDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYM-- 409

Query: 376 TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
           +D   ++S+  +   G +   +S   E  +   + +     LSG+ TT +ED+ LL   +
Sbjct: 410 SDPSEMQSVISSQ--GPV-CTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPS 466

Query: 436 LDPRLRIAVGVRAGEKRVL 454
           L PR R+A  +   EK++L
Sbjct: 467 LSPRKRVATRLVQLEKKIL 485


>gi|428177025|gb|EKX45907.1| hypothetical protein GUITHDRAFT_138732 [Guillardia theta CCMP2712]
          Length = 505

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 96/399 (24%), Positives = 177/399 (44%), Gaps = 55/399 (13%)

Query: 64  VETFWQWLRDQKVVSPKSPIR-PATFPEGLGLVAQRDIAKNEVVLEVP-MKFWINP-DTV 120
           ++ F  WL+ + ++  +S +   +T   GLGL  + D+   +++L +P ++F  N  ++ 
Sbjct: 77  IKAFEGWLKGKGIIREESKVCLESTESCGLGLFCREDVKAGDLLLSLPALEFMSNKLESS 136

Query: 121 AASEIGSLCSGLKPWISVALFLIREKKKE-----------------DSPWRVYLDILPEC 163
           +     S  S ++    V   +  E+++                  +S ++ Y+D+LPE 
Sbjct: 137 SLQARPSFLSFIRQTDQVLSAMSWERRRSGNAALALQLLLHRNLGGESEFKPYIDLLPEY 196

Query: 164 TD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD 221
            D   T  WS EE  +L   ++L  ++ +   ++ E+  ++E +            +L+ 
Sbjct: 197 HDYEMTWLWSVEEQQDLLSGKILKDSMSITSQIEREHHTIKEVLGRFQDCAEFGEFSLES 256

Query: 222 FLWAFGILRSRAF--------SRLRG-QNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
           + WA   + SRAF        +R +G QNL+L+PL D++NHSP  +   ++ +   AG  
Sbjct: 257 YKWAQATIMSRAFDLDEGQETARRQGEQNLLLVPLCDMVNHSPDAS---FSIDCDAAGNV 313

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ESKSDRNAYTLTLEISESD 331
           +   LF+       KAG++V I Y  + SN +L L +GF+ E         +TLE+ +  
Sbjct: 314 N---LFASEN---YKAGQEVHINYG-SSSNEQLLLSFGFVLEGGWQAQETEITLEVPQDV 366

Query: 332 PFFGDKLDIAETNGLGESAY---FDIVLGRTLPPAMLQYLRLVALG-GTDAFLLESIFRN 387
             F  K ++    GL  SA+     +    +  P   ++ +   +  G D F   +  + 
Sbjct: 367 EGFEIKRNLLFNGGL-PSAWQLVLPVPFHDSQQPHSREFSQQRTMSQGEDVFDFLASLQT 425

Query: 388 TIWGH--------LDLPVSHANEELICRVVRDACKSALS 418
              G         L  P+S  NE L  R + D C  A S
Sbjct: 426 LSQGKGNFNPLALLQRPLSSFNEGLAVRFILDQCHRASS 464


>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
           purpuratus]
          Length = 455

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 65/285 (22%), Positives = 127/285 (44%), Gaps = 34/285 (11%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           Q  T  +W+++         ++PA F + G GL+ ++++   + ++E+P    +    + 
Sbjct: 42  QYITLMKWMKEHGFNCKGCCLKPAVFSDTGRGLMTKKNLRPGDSIVEIPRHLLVTAKDIL 101

Query: 122 ASEIGSLCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEE-- 174
            +E+G +    +    P+  V  FL+ E+ K + S W  Y+++LP+   +  F S ++  
Sbjct: 102 NTELGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKDFTTPAFGSTKQAD 161

Query: 175 ---LVELQGTQLLSTTLGVKEYVQNEYLKVEE-EIILPNKQLFPRPITLDDFLWAFGILR 230
              L  +  ++ ++    ++   ++     E+ E   P  ++F    +LD F+WA+ ++ 
Sbjct: 162 FDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIF---FSLDSFVWAWFVIN 218

Query: 231 SR----------AFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
           SR          AF      +  L P  DL+NHSPG            AG       + +
Sbjct: 219 SRSVYIEPSGCEAFDPKASDDFALAPFLDLLNHSPGAEVT--------AGFDPVSNCYRI 270

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
           +T     A +QV I Y  +  N  L L+YGF+   +  +A +  L
Sbjct: 271 KTLDSYHAYDQVFIHYGPH-DNVNLLLEYGFVIPSNPHDAVSFEL 314


>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
          Length = 440

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 138/314 (43%), Gaps = 49/314 (15%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPQS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITT------EDYAYEIKGAGLFSRDLLFSLRTP 283
           R +  +        L P  DL+NHSP +        E ++YEI+    + R         
Sbjct: 214 RHRECLSAELDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRH-------- 265

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKL 338
                 E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+
Sbjct: 266 ------EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKI 318

Query: 339 DIAETNGLGESAYF 352
            I + +G  E+  F
Sbjct: 319 SILKDHGYIENLTF 332


>gi|24640264|ref|NP_727144.1| CG32732 [Drosophila melanogaster]
 gi|22831862|gb|AAF46222.2| CG32732 [Drosophila melanogaster]
 gi|28316927|gb|AAO39485.1| RE55639p [Drosophila melanogaster]
 gi|220957744|gb|ACL91415.1| CG32732-PA [synthetic construct]
          Length = 537

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/333 (26%), Positives = 143/333 (42%), Gaps = 46/333 (13%)

Query: 46  ATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNE 104
           A C   S S       A+VE F  W +D  V S    I  A FP   LGL A R +AK+E
Sbjct: 100 AVCPQISDSPDDQTRLAKVEAFSAWAKDGGVHSEGLEI--AIFPGYQLGLRATRPLAKDE 157

Query: 105 VVLEVPMKFWINPDTVAASEI-GSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPE 162
           +VL VP K  ++ +  +   + G +       +++A  L+ EK + E S WR Y+D+LP 
Sbjct: 158 LVLSVPRKLILSEENNSDCRLFGKMTQATH--LNLAYDLVIEKIRGEFSEWRPYIDVLPA 215

Query: 163 CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPITL 219
             ++ ++++ +++  L+GT   +  +     +  +Y    K    +  P+      P   
Sbjct: 216 KYNTVLYFTTKQMELLRGTAAAALAMRQCRVIAKQYAFLYKYAHTMTEPSTGNRSHPGER 275

Query: 220 DDFLWAFGIL----RSRAFSRLRGQNLV----------------LIPLADLINHSPGITT 259
             F    G+     R    + +  QNLV                LIP  D+ NH PG  T
Sbjct: 276 GLFFTQHGLCYKLYRWAVSTVMTRQNLVPSEKQESEDGPKLISALIPYWDMANHRPGKIT 335

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
             YA         SR L  + +    V  GEQ  I Y  ++SN +L +  GF++  + ++
Sbjct: 336 SFYA-------TVSRQLECTAQE--AVNTGEQFFIYYG-DRSNTDLLVHNGFVDPNNTKD 385

Query: 320 AYTLTLEISESDPFFG------DKLDIAETNGL 346
              + + +S +D          DKL+I  T  L
Sbjct: 386 YVNIRVGLSLTDALAAKRASILDKLNIRHTAEL 418


>gi|291410015|ref|XP_002721306.1| PREDICTED: SET domain containing 4 [Oryctolagus cuniculus]
          Length = 440

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 134/312 (42%), Gaps = 45/312 (14%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K       + PA FP  G GL++   + + ++++ +P    +  DTV  S +G 
Sbjct: 38  KWLKDRKFEDKN--LAPARFPGTGRGLMSTVSLQEGQMIISLPESCLLTTDTVIESYLGP 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D SPW+ YL+ILP+     V       ++ +   
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSPWQPYLEILPKAYTCPV------CLDPEVVN 149

Query: 183 LLSTTLGVKEYVQN----EYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA- 233
           LL   L +K   Q     E+           + LF  PI    +    LWA+  + +RA 
Sbjct: 150 LLPKPLQMKAEEQRARLWEFFASSRGFFSSLQPLFVEPIDSIFSYSALLWAWCTVNTRAV 209

Query: 234 FSRLRGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP 285
           + R R +  +        L P  DL+NHSP +  E        A        + +RT   
Sbjct: 210 YLRRRPRECLSAEPDTCALAPYLDLLNHSPHVQVE--------AAFNEETRCYEIRTASR 261

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDI 340
            +  E+V I Y  +  N  L L+YGF+  ++      ++ EI       +D     K+ I
Sbjct: 262 FRKHEEVFICYGPH-DNQRLLLEYGFVSVRNPHACVYVSGEILVKYLPPTDKQLNKKVAI 320

Query: 341 AETNGLGESAYF 352
            + +G  E+  F
Sbjct: 321 LKDHGFIENLTF 332


>gi|308810511|ref|XP_003082564.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116061033|emb|CAL56421.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
          Length = 1472

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 75/263 (28%), Positives = 113/263 (42%), Gaps = 30/263 (11%)

Query: 83  IRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS----EIGSLCSGLKPWIS 137
           +RP  F   G G    RD+ + +V+LEVP++   + D   A     EI   C      ++
Sbjct: 682 VRPVEFAATGRGHGVVRDVQRGDVLLEVPLRRGFSYDDAMADDEMREIAKACVRRDDVVA 741

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
           + + L R + KE +    +++ LP+  D    WSE+EL EL GT  L  T  + E  + +
Sbjct: 742 LHVCLERYRGKE-AKHAAHVEALPKTFDCAFNWSEDELSELVGTTCLKDTRALIEETRED 800

Query: 198 YLKVEEEIILPNKQ--LFPRPITLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLIN 252
           Y  +   ++   K   L  R +  + + WA   L SR    +R    +   +IP  D+ N
Sbjct: 801 YDAIGRRLMAMGKGGWLLERGVDYERYAWARQCLWSRQCDLMRPDGTRTRAMIPYFDIFN 860

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPV------KAGEQVLIQYDLNK-SNAEL 305
           HSP             A L     L + R  V V      K GEQ  I Y   + +NA+L
Sbjct: 861 HSP------------EAPLGKTHKLNAERNCVTVYAGRDYKEGEQAFISYGSGEAANAKL 908

Query: 306 ALDYGFIESKSDRNAYTLTLEIS 328
              YGF    +      LTL I+
Sbjct: 909 LTWYGFCIENNPYEELDLTLTIT 931


>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
          Length = 478

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 114/446 (25%), Positives = 183/446 (41%), Gaps = 114/446 (25%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE------------------------IG 126
           G G++A++DI ++  +  +P    I+  T   +E                          
Sbjct: 38  GRGIIAKQDIPEDTTLFTIPRNIIISIQTSDLAEKLPGIFEQPVDADDDNEDDDNEDDQD 97

Query: 127 SLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
                L  W S+ L ++ E  + E S W+ YLDILP+  ++ +FW+ +EL EL+GT L +
Sbjct: 98  HESEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQAFETPIFWTPDELKELEGTSLTT 157

Query: 186 TTLGVKEYVQNEYLKVEEEIILP----NKQLFPRP----ITLDDFL------------WA 225
             +G KE   +  L+   E ILP    +  +F  P    +  DD L            +A
Sbjct: 158 EKIGKKE--SDRMLR---ERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212

Query: 226 FGILRSRAFSRLR---------GQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           F +      S            G++L+ ++P+AD++N       E  A+   G  L    
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGKSLIGMVPMADMLN----ANAEFNAHVHHGDQL---- 264

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD------------RNAYTL 323
            + SLR  +P  AG ++L  Y    S +EL   YG++ S+              R A   
Sbjct: 265 QVTSLRESIP--AGSEILNYYGPLPS-SELLRRYGYVTSEHHRYDVAEISWSLVRTALAE 321

Query: 324 TLEISES-------------DPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRL 370
            L++SE              + FF  + D  E      S+Y     G    P +L+ +  
Sbjct: 322 ELKLSEDTIADIERKLESELEEFFVIERDAGEP-----SSY-----GTLTQPPVLREIST 371

Query: 371 VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
                T AFL     R+        P    +E +   V+  A ++ L  + T+ ++DE L
Sbjct: 372 ELEEQTKAFLKALKKRD--------PKRKRSETICNTVLEKALRTRLGQYPTSAKQDESL 423

Query: 431 LQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           L    L  R R+AV VR GEKR+LQ+
Sbjct: 424 LSKEGLSKRHRMAVEVRLGEKRLLQE 449


>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
           n-methyltransferase i [Nannochloropsis gaditana CCMP526]
          Length = 385

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 52/183 (28%), Positives = 94/183 (51%), Gaps = 14/183 (7%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KE 149
           G GL+ +  I     + ++P+   I+ +   A+  G+L + +  + ++AL LI+E+    
Sbjct: 151 GRGLICKYPINMGNALFQLPLSIVIDKEKSLAAFDGALPADINEYFAIALMLIKERALGP 210

Query: 150 DSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            S W  Y+D+LP  E  + T+ W E +L  L+ + L++ T  +K  +  E+  +EE+ + 
Sbjct: 211 SSFWAPYIDVLPTTEEVNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMR 270

Query: 208 PNKQLF-PRPITLDDFLWAFGILRSRAFSRLRG----------QNLVLIPLADLINHSPG 256
               +F P   T + +LWAF  + SRA     G          +++++ P ADLINH+P 
Sbjct: 271 ARSDVFDPSVFTFEAYLWAFINIFSRAIRVKIGGKRGPSGEEEESIIMCPYADLINHNPF 330

Query: 257 ITT 259
             T
Sbjct: 331 ANT 333


>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 111/236 (47%), Gaps = 14/236 (5%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS---GLKPWISVALFLIREKK 147
           G G +A+ D+   + VLE+P+   I+ + V  S +  + S   G+ P   + L+ ++EK 
Sbjct: 184 GRGTIAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMMLLWSMKEKH 243

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
             DS +RVY D LPE  ++ + +    +  L GT L    +  KE+++ +Y ++   +  
Sbjct: 244 IVDSEFRVYFDTLPEAFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNELFPALCN 303

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLV---LIPLADLINHSPGITTEDYA 263
            +  +FP    + ++FLWA  +  S +   +     V   L+P+A  +NHS       + 
Sbjct: 304 NHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSL------HP 357

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
           + +    + S       R   P +AGE+  + Y  N S + L   YGF+    + N
Sbjct: 358 HILHYGKVDSDTDSLKFRLSRPCRAGEECYLSYG-NYSGSHLVTFYGFLPEGDNVN 412


>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
          Length = 271

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 133/273 (48%), Gaps = 28/273 (10%)

Query: 219 LDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGITTE-DYAYEIKGAGLFSRD 275
           ++ F W+FGIL SR   RL   +  + L+P AD++NHS  + T  DY  + KG       
Sbjct: 1   MESFKWSFGILFSRMV-RLPSMDGKVALVPWADMLNHSCDVETFLDYDKQSKG------- 52

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPF 333
           ++F+  T  P + GEQV I Y   KSN EL L YGF+  E  +  ++  L+L + +SD  
Sbjct: 53  IVFT--TDRPYQPGEQVFISYG-KKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGS 109

Query: 334 FGDKLDIAETNGLGESAYFDI-VLGRTLPPAMLQYLRLVALGGTDAF-LLESIFRNTIWG 391
           + +KL++ +  GL  S  F I + G  L      YL +        F  + +   N I  
Sbjct: 110 YKEKLELLKKYGLSGSQCFPIRITGWPLELMAYAYLAVSPSSMRGQFEKMAAAASNKITS 169

Query: 392 HLDLPVSHANEELICRVVRDACKSALSGFHTTIEE----DEQLLQGGNLDPRL---RIAV 444
             D       E+ + + + D+C+S++S ++  ++     D  +     L+ RL   ++AV
Sbjct: 170 TKDFKYPEIEEQAL-QFILDSCESSMSKYNKFLQASGSLDLDVTSPKQLNRRLFLKQLAV 228

Query: 445 GVRAGEKRVLQQIDEIFKE--RELELDELEYYQ 475
            +   E+R+L +   I +   R++   EL  ++
Sbjct: 229 DLCTSERRILYRAQYILRRQLRDMRDGELRAFK 261


>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
 gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 113/243 (46%), Gaps = 41/243 (16%)

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGL-KPWISVALFLIREKKKED 150
           +G  A  DI + E +L VP    + P  V          GL +P + +A  L++ +   +
Sbjct: 43  IGCTATADICQGERLLYVPHSACVTPSGV---------QGLYEPQVMLAASLVKHRTDPN 93

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII--LP 208
           SP+  YL  LP   D  + WS +ELV L+GT +          +    L+V + ++   P
Sbjct: 94  SPFHDYLQSLPSEFDHPLEWSADELVCLKGTTVWE--------MHQLSLEVVDSVVELCP 145

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNLVLIPLADLINHSPGITTEDYAYEIK 267
           N    PR +      WA  ++ SRAF S + G  L +IPLAD  NHS          E++
Sbjct: 146 NS---PRAM----IRWAVEVMMSRAFESEVCG--LCVIPLADQFNHS-STKWHTRVREVE 195

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           G         F +    PVK GE++   Y L  +N  L L +GFIE  +  + + +T+E+
Sbjct: 196 GG--------FQMLAEKPVKKGEEIFNNYGL-YTNEMLLLTHGFIEFDNPHDHF-ITIEV 245

Query: 328 SES 330
           S +
Sbjct: 246 SNA 248


>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 456

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 70/239 (29%), Positives = 113/239 (47%), Gaps = 40/239 (16%)

Query: 88  FPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-------ASEIGSLCSGLKPWISVAL 140
           F     L A + I   + +L VP    I  D +          E+GS+       +++ L
Sbjct: 43  FSRVRSLFASKSIQTGDCILRVPYSAQIASDNLLPELSDLLGDEVGSVAK-----LAIVL 97

Query: 141 FLIREKKKEDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
            L+ +K  ++S W  Y+  LP+  +  ST+FWS+ EL  +  + +   T+  K  ++ ++
Sbjct: 98  -LVDQKVGQESKWAPYISRLPQLGEMHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDF 156

Query: 199 LKVEEEIILPNKQLFP---RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-- 253
           L ++     P  + FP   R IT  DF+ A+ +++SRA+   +G  + LIP AD +NH  
Sbjct: 157 LTIK-----PVLEHFPQISRSITFQDFMHAYALVKSRAWGSTKG--VSLIPFADFLNHDG 209

Query: 254 -SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
            S  +   D   E K     + D     R   P    E+VLI+Y    SNA L LD+GF
Sbjct: 210 FSEAVVLND---EDKQVSEVAAD-----RNYAP---HEEVLIRYG-KFSNATLLLDFGF 256


>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 1 [Galdieria
           sulphuraria]
          Length = 487

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 111/464 (23%), Positives = 191/464 (41%), Gaps = 73/464 (15%)

Query: 27  LLTSLTKKPSFRL--KSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIR 84
           +L   T +PS RL   SR F   CS           + +++ F+ WL  + V  PK  I+
Sbjct: 47  ILNVPTLRPSSRLCKPSRLFLV-CS-----------SGRLDLFYHWLTRENVYMPK--IK 92

Query: 85  PATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP--WISVALF 141
                +GL G+VA   I  +E  L+VP    +    V   E  ++   + P  W     +
Sbjct: 93  LDQNKDGLRGVVAVEGIECDESFLKVPRDLSLQ---VTEHEECTMSEFVDPELWSQENWY 149

Query: 142 LIREKKKED-------SPWRVYLDILPECTDS-TVFWSEEELVELQGTQLLSTTLGVKEY 193
           +    K          S W+ Y+DILP   ++  V+WS  EL +LQ   L+      + Y
Sbjct: 150 VKLSLKLLKEKYLGKLSLWKPYIDILPHALNTGLVYWSSSELAQLQYRPLIEEVKINQYY 209

Query: 194 VQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS--RLRGQNLVLIPLADLI 251
            +  Y +V E +  P + ++ +    + F WA  +++SRAF    +  +   L+P+ D++
Sbjct: 210 REALYTRVFESLSSPVR-VWLQNEKENVFFWALDMVQSRAFGIPDVGNKTYALLPMMDML 268

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH     T      I           + ++T   +  G  + I Y     N  L   YGF
Sbjct: 269 NHRVNSQTHFLYDSIANQ--------YEMKTYSKLSPGTDIYISYG-PLDNDHLLHFYGF 319

Query: 312 IESKSDRNAYTLT-----------LEISESDP--FFGDKLDIAETNGLGESAYFDIVLGR 358
           +++ +  + + +             E  ++ P     +KL +     + E+       G+
Sbjct: 320 LQTNNPSDYFQVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYEN-------GK 372

Query: 359 TLPPAMLQY-------LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRD 411
           T       Y       LR+     TD    + I  N   G     +S  N+  + +V+  
Sbjct: 373 TFHLYHDHYDDEIDIILRVFMASKTD---WQQIQENFAMGLFHKALSLENQLHVWQVIIG 429

Query: 412 ACKSALSGFHTTIEEDEQLLQGGN-LDPRLRIAVGVRAGEKRVL 454
            CK  L    T++EEDEQLL+  + L  +L++A+  R  +K +L
Sbjct: 430 GCKHLLKDMKTSVEEDEQLLKNKDQLSTKLQLAIQFRLEKKYIL 473


>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 492

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 81/334 (24%), Positives = 147/334 (44%), Gaps = 55/334 (16%)

Query: 70  WLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
           W+++   V+    I+   F E G GL A +D+ ++E++  +P    +  D V  S +  L
Sbjct: 91  WIKENGGVADNVTIKH--FNEMGYGLEAAKDLEESELICAIPKNVMMTLDNVKVSPLKYL 148

Query: 129 CSG---LKPW--ISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ--- 179
                 LK    +++ALFLI E  K E+S W  Y+  LP   ++ +++   + +E++   
Sbjct: 149 YENNPILKNMGNVALALFLILEHVKNENSFWHHYISSLPSDYNTVLYFDLNDFLEMKNSP 208

Query: 180 ----GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS 235
                T+          Y  N +    +E  L  + +F    T   + WA   + +R   
Sbjct: 209 TFEMATKHCKNIARQYAYFNNLFQNSNDEASLILRNVF----TYQLYRWAVSTVMTR--- 261

Query: 236 RLRGQNLV---------------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
               QN +               LIPL D+ NH+ G  +  Y  +        R    + 
Sbjct: 262 ----QNFIPSSSTSNDVENGINGLIPLWDMCNHTNGYLSTQYKVD--------RSECLAC 309

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
           +   P K GEQVLI Y   +SN++  +  GF+  ++  +++ L L IS+SD   G + ++
Sbjct: 310 K---PFKKGEQVLIFYG-ERSNSDFLVHNGFVYDENPHDSFRLRLGISKSDKLHGLRCEL 365

Query: 341 AETNGLGESAYFDIVLGRT-LPPAMLQYLRLVAL 373
            +  G+ +S  F +  G   +   +L +LR+  +
Sbjct: 366 LKDLGIPDSGDFYLYSGSEPVRENLLAFLRIFNM 399


>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
          Length = 433

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 170/401 (42%), Gaps = 66/401 (16%)

Query: 86  ATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA-------SEIGSLCSGLKPWIS 137
           A++P+ G G+ A  ++     +  +P    ++  T          SEI +     + W+ 
Sbjct: 28  ASYPDMGCGMRATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEIST-----QGWVG 82

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
           + L L+ E+    S W+ YL+ +P C DS +FWS++EL ELQG+ +L   +G +E  +  
Sbjct: 83  LILCLMYEQIDPASHWKRYLNSMPTCFDSLMFWSDDELRELQGSSVLD-KIG-REEAEGS 140

Query: 198 YLKVEEEIILPNKQLFPRPI---TLDDFLWAFGILRSRAFSRLRG--------------- 239
           Y  +    +  +  +F +P+   +L  +     ++ SR+F                    
Sbjct: 141 YYSILVPYLSKHADIF-KPLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAAY 199

Query: 240 ---QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
              + + ++P+AD++N   G       Y               + T   + AGEQ+   Y
Sbjct: 200 HEVETVGMVPMADVLNAKSGSANACLVYHPDA---------LVMTTTKEIAAGEQIFNTY 250

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGE--SAYFDI 354
           + +  NA+L   YG ++         L   +  S    G K D+   + L +     + +
Sbjct: 251 N-DPPNADLLRRYGHVDE------VNLNDNVEISADLIGCK-DLERVDWLLDRLDDVYTL 302

Query: 355 VLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACK 414
                LP   +  ++++    ++       FR  I    DLP    +E    R VR+  +
Sbjct: 303 TQAEDLPEDFITAVKILTASKSE-------FRK-IQKADDLPDDVLDEATAMR-VREILQ 353

Query: 415 SALSGFHTTIEEDEQLLQGGN-LDPRLRIAVGVRAGEKRVL 454
             L+ + +TIEEDE LL     L  R R A+ VR GEKR+L
Sbjct: 354 MRLAQYSSTIEEDESLLASSTMLTSRSRAALLVRLGEKRIL 394


>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 463

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 127/291 (43%), Gaps = 32/291 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV---AASEI 125
           QWL  +   S  S +       G  L A + I   + +L+VP +  I  D +     S I
Sbjct: 38  QWLERKAACSISSSLFIGNSSYGRSLFASKIIQTGDCILKVPYRVQITADNLLPEIRSLI 97

Query: 126 GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD--STVFWSEEELVELQGTQL 183
           G     +    +V   LI +K  + S W  Y+  LP+  +  +TVFW+E EL  ++ + +
Sbjct: 98  GEEVGNIAKLATV--ILIEKKLGQGSEWYPYISCLPQQGELHNTVFWTESELEMIRPSSV 155

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGI-----------LRSR 232
              T+  K  ++ ++L + + I   + Q F    T  DF+ A  +           + SR
Sbjct: 156 YQETIDQKSQIEKDFLAI-KHIFECSHQSFGDS-TYKDFMHACTLVLFDHFNVELPVGSR 213

Query: 233 AFSRLRGQNLVLIPLADLINH---SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           A+    G  L LIP AD +NH   S  I   D   + K         + + R   P   G
Sbjct: 214 AWGSTNG--LALIPFADFLNHDGVSEAIVMSD---DDKQCSEVQSLQIIADRDYAP---G 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
           EQVLI+Y    SNA L LD+GF    +  +   +  +I + DP    KL++
Sbjct: 266 EQVLIRYG-KFSNATLMLDFGFTIPYNIYDQVQIQFDIPKHDPLRDMKLEL 315


>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 110/236 (46%), Gaps = 14/236 (5%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS---GLKPWISVALFLIREKK 147
           G G +A+ D+   + VLE+P+   I+ + V  S +  + S   G+ P     L+ ++EK 
Sbjct: 184 GRGTIAKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMTLLWSMKEKH 243

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
             DS +RVY D LPE  ++ + +    +  L GT L    +  KE+++ +Y ++   +  
Sbjct: 244 IVDSEFRVYFDTLPEAFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNELFPALCN 303

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLV---LIPLADLINHSPGITTEDYA 263
            +  +FP    + ++FLWA  +  S +   +     V   L+P+A  +NHS       + 
Sbjct: 304 NHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSL------HP 357

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
           + +    + S       R   P +AGE+  + Y  N S + L   YGF+    + N
Sbjct: 358 HILHYGKVDSDTDSLKFRLSRPCRAGEECYLSYG-NYSGSHLVTFYGFLPEGDNVN 412


>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
          Length = 667

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 107/221 (48%), Gaps = 11/221 (4%)

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           LIPL D+ NH+ G  +  Y        +  R    +++     KAGEQ+ I Y  ++SNA
Sbjct: 453 LIPLWDMCNHTNGTISTAYN------PVLDRSECLAVKN---FKAGEQLFIFYG-SRSNA 502

Query: 304 ELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT-LPP 362
           +L +  GF+   +D + Y + L IS+SDP    +  +     +  +  F I  G + +  
Sbjct: 503 DLFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLLGKLSIASTCDFSIRKGASPIDG 562

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +L +LR+  +         +  ++   GH+D  +  A E    R +    K  LS + T
Sbjct: 563 QLLAFLRVFNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKT 622

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKE 463
           T++EDE+LL      P   +A+ +RA EKR++++  E  ++
Sbjct: 623 TLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQ 663



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 78/155 (50%), Gaps = 14/155 (9%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPE----GLGLVAQRDIAKNEVVLE 108
           +S       A +E F +W  +       + +   +  E    GLG+ A  DIA++ +V+ 
Sbjct: 76  SSVDNRASAATIENFLKWANENG-----AQLNGCSIDEFEGYGLGVKANVDIAESSLVIA 130

Query: 109 VPMKFWINPDTVAASEIGSLCS-----GLKPWISVALFLIREKKKEDSPWRVYLDILPEC 163
           VP K  ++ +    S +  L       G  P +++++FL+ EK K DS W+ Y+DILP+ 
Sbjct: 131 VPRKLMMSVENAKESVLKDLIEKDKILGSMPNVALSIFLLLEKYKGDSFWKPYIDILPKT 190

Query: 164 TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
             + +++S +EL EL+G+  L   L   + +  +Y
Sbjct: 191 YTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225


>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
 gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
          Length = 548

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 112/456 (24%), Positives = 187/456 (41%), Gaps = 68/456 (14%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A+VE F  W +D  V S    I  A FP   LGL A R +AK E+VL VP K   + +  
Sbjct: 120 AKVEAFSAWAKDGGVHSEGLEI--AIFPGYQLGLRANRPLAKEELVLSVPRKLIFSEENN 177

Query: 121 AASEI-GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           +   + G +       ++  L LI + + E S WR Y+D+LP    + ++++ +++  L+
Sbjct: 178 SDCRLFGKMTQATHLNLAYDL-LIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTKQMERLR 236

Query: 180 GTQLLSTTLG-----VKEYV-QNEYLKVEEEIILPNKQ------LF--PRPITLDDFLWA 225
           GT   S  L       K+Y     Y     E    N+       LF   R +    + WA
Sbjct: 237 GTAACSLALRQCRVIAKQYAFLYRYAHTLAESSTGNRSHPGERGLFFTQRGLCYKLYRWA 296

Query: 226 FGILRSRAFSRLRGQNLV----------------LIPLADLINHSPGITTEDYAYEIKGA 269
              + +R       QNLV                LIP  D+ NH PG  T  YA      
Sbjct: 297 VSTVMTR-------QNLVPSEKQEAQDSPKFISALIPYWDMANHRPGKITSFYA------ 343

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
              SR L  + +  V   AGEQ  I Y  ++SN +L +  GF++  + ++   + + +S 
Sbjct: 344 -AVSRQLECTAQEAVA--AGEQFFIYYG-DRSNTDLLVHNGFVDVNNLKDYVNIRVGLSP 399

Query: 330 SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP------AMLQYLRLVALGGTD-AFLLE 382
           +D     +  I +   +  +A       R LP        +L ++R+  +          
Sbjct: 400 TDALAAKRASILDKLNIRHTAEL-----RVLPAPDFISKELLAFVRVFKMSAEQLDHWCS 454

Query: 383 SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRI 442
            + R     H+D  +   +E    + + D  K  L+ F   + E +++ +      +L++
Sbjct: 455 DLERAGDLLHIDCALETDHETRTWQFLEDRLKLLLAVFDKEMREADEVKELS----QLQL 510

Query: 443 AVGVRAGEKRVLQQIDEIFKERELELDELEYYQERR 478
             G   G++  L  +     ER +    L+Y QE R
Sbjct: 511 KDGASLGQEIELMLLLYRRLERSILAGALQYAQEHR 546


>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
          Length = 424

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 119/437 (27%), Positives = 178/437 (40%), Gaps = 78/437 (17%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           + F QWLRD     PK    P T P GL G VA   +A  E +L +P +  I+ D     
Sbjct: 12  QRFLQWLRDNGATFPKLQW-PVTSPNGLRGTVAAAAVASGEPMLCIPRRLLISEDLCWRD 70

Query: 124 -EIGSLCSGLKPWIS-----VALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELV 176
            ++G +    +   +     +ALFL+RE    D S +  YL +LP   +S   W++ EL 
Sbjct: 71  PQLGRVFQDNRDVFTRDDPVLALFLVRELLLADRSFFHPYLAVLP-YPESVQDWTQAELG 129

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFS 235
           EL   +L+         +   Y +V   +       FP  + T D F +A+  +++R F 
Sbjct: 130 ELHDERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQARTFG 189

Query: 236 RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQ 295
           R R     L+P AD +NH+   T   Y +++   GLF                G +V   
Sbjct: 190 R-RLPWTALVPFADCLNHTNVATK--YDFDVNDNGLFR----LYPSGATSFAQGAEVFNS 242

Query: 296 YDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIV 355
           Y   +SN +L LDYGF             L  +E D                   Y D+ 
Sbjct: 243 YG-RRSNFQLLLDYGF------------ALPDNEWD-------------------YVDVE 270

Query: 356 LG--RTLPPAM-LQYL-RLVALGGTDAFLLESIFRNTIWGHLDLPV-----SHANEELIC 406
           +G  R  P    L+++ R+V +    +  L+ +F  +    L  PV     S A  EL  
Sbjct: 271 IGKDRAGPRGRKLRFMKRVVRIDRQSS--LDELFPPSFLAGLADPVPDEEQSEAAAELSE 328

Query: 407 RVVRDACKSALSGFHT----------TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ- 455
           R    A   AL    +          T E DE++LQ      RLR AV  R G ++++Q 
Sbjct: 329 RT---ALCDALEWLRSILIETIADWGTAENDERILQHAASSDRLRAAVVFRTGRRQIVQK 385

Query: 456 ---QIDEIFKERELELD 469
              QID+       +LD
Sbjct: 386 VITQIDDKLVSARRQLD 402


>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Vitis vinifera]
          Length = 515

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 91/335 (27%), Positives = 156/335 (46%), Gaps = 29/335 (8%)

Query: 138 VALFLIREKKK-EDSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLG 189
           +AL+L+ EKK+ + S W  Y+  L           +S + WSE EL  L G+   +  L 
Sbjct: 163 LALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLE 222

Query: 190 VKEYVQNEYLKVEEEIILPNK--QLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNL 242
             E ++ EY +++    +     Q +P  I  + F   F I + +AF  ++      Q +
Sbjct: 223 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--PFEIFK-QAFVAIQSCVVHLQKV 279

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            L     L+   P +    Y    K A L + D    L    P KAGE +++ +   + N
Sbjct: 280 SLARRFALVPLGPPLLA--YRSNCK-AMLAAVDGSVQLVVDRPYKAGESIVV-WCGPQPN 335

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RTL 360
           ++L L+YGF++  +  +   +   ++  DP + DK  +A+ NG      F + +G  R  
Sbjct: 336 SKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREA 395

Query: 361 PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGF 420
              ML YLRL  +  +D   ++S+  +   G +  PVS   E  +   + D  +  L+G+
Sbjct: 396 VSDMLPYLRLGYV--SDPSEMQSVISSQ--GPI-CPVSPCMERAVLDQLVDYFERRLAGY 450

Query: 421 HTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
            TT+ EDE LL   NL+P+  +A  +   EK++L 
Sbjct: 451 PTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLN 485


>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 111/269 (41%), Gaps = 27/269 (10%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KEDSP 152
           L A R I   E VL V     I P+ +       L +G+  W  +ALF++ E+   + S 
Sbjct: 3   LFAARPIEVGEQVLRVSGDLMITPNKLPTEVKELLPTGVTEWARLALFILVEQHLGQASQ 62

Query: 153 WRVYLDILPECT--DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           W  Y++ LP C    STVFW +EEL  ++ T L   T+  +  + +E+  V   ++    
Sbjct: 63  WAPYINCLPTCGALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASV-LPVLQKCP 121

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITT------EDYAY 264
            +F   +    F  A+     ++  R     ++ +P  D  NH            E    
Sbjct: 122 HIFGERVLHSKFKQAYAT--GKSLRRSSNTRILTVPFVDFFNHDSNCRALLSYDEERACA 179

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
           E+     ++R              GEQV+I Y     N  LALD+GF  S +  +   + 
Sbjct: 180 EVIADKNYAR--------------GEQVVISYG-RLPNTTLALDFGFTISCNPYDQVEVW 224

Query: 325 LEISESDPFFGDKLDIAETNGLGESAYFD 353
           + +S  DP    KL +   +G+    + D
Sbjct: 225 MALSHRDPLRKMKLALLHAHGMPTVVHAD 253


>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
          Length = 440

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 134/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVC-LEPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RG--------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R             L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQWECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
 gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
          Length = 544

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 138/334 (41%), Gaps = 38/334 (11%)

Query: 46  ATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNE 104
           A C   + S       A+VE F  W +D  V S    I  A FP   LGL A R +AK E
Sbjct: 102 AVCPQITDSPDDQSRLAKVEAFSAWAKDGGVHSEGLEI--AIFPGYQLGLRATRPLAKEE 159

Query: 105 VVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT 164
           +VL VP K   + +  +   +         W  V   +I + + E S WR Y+DILP   
Sbjct: 160 LVLTVPRKLIFSEENNSDCRLFGKMPQATHW--VYDLVIEKIRGEFSEWRPYIDILPAKY 217

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPITLDD 221
            + ++++ +++  L+GT   S  L     +  +Y    +    +  P+      P     
Sbjct: 218 STVLYFTIKQMERLRGTAACSLALRQCRVIAKQYAFLYRYAHTLTEPSDGNRSHPGERGL 277

Query: 222 FLWAFGIL----RSRAFSRLRGQNLV----------------LIPLADLINHSPGITTED 261
           F    G+     R    + +  QNLV                LIP  D+ NH PG  T  
Sbjct: 278 FFTQHGLCYKLYRWAVSTVMTRQNLVPSEKQESQDSPKFISALIPYWDMANHKPGKITSF 337

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
           YA         SR L  + +    V+AGEQ  I Y  ++SN +L +  GF++  + ++  
Sbjct: 338 YA-------AVSRQLECTAQE--AVEAGEQFFIYYG-DRSNTDLLVHNGFVDVNNLKDYV 387

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDIV 355
            + + +S +D     +  I +   +  +A   ++
Sbjct: 388 NIRVGLSPTDALAAKRASILDKLNIRHNAELRVL 421


>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 499

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 74/261 (28%), Positives = 116/261 (44%), Gaps = 38/261 (14%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA--- 121
           +TF  WL  +      S +       G  L A   I   + +L+VP    I+PD++    
Sbjct: 49  DTFLPWLERKAETKISSVLSIGKSSIGRFLFASETIRAGDCILKVPFNVQISPDSLPLPI 108

Query: 122 ----ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPEC--TDSTVFWSEEEL 175
                +EIG++           + L+  K    S W  Y+  LP+     +T+FW E EL
Sbjct: 109 RDLLGNEIGNVAK------LAVVVLLEHKLGLGSEWAPYIIRLPQPWEMHNTIFWKESEL 162

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR---PITLDDFLWAFGILRSR 232
             ++ + L   +L  +  ++ E+L + + +     + FP     I+ DDF+ A+ ++ SR
Sbjct: 163 EMIRKSSLYEESLNQRSQIKREFLAIRKAL-----EAFPEIIDRISCDDFMHAYALVTSR 217

Query: 233 AFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD--LLFSLRTPVPVKAGE 290
           A+    G  + LIP AD +NH       D A E   A L + D   L  +        GE
Sbjct: 218 AWRSTEG--VSLIPFADFLNH-------DGASE---AMLLNDDDKQLSEVVADRDFAPGE 265

Query: 291 QVLIQYDLNKSNAELALDYGF 311
            VLI+Y    SNA L LD+GF
Sbjct: 266 HVLIRYG-KYSNATLMLDFGF 285


>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
          Length = 230

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 107/222 (48%), Gaps = 11/222 (4%)

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            LIPL D+ NH+ G  +  Y        +  R    +++     KAGEQ+ I Y  ++SN
Sbjct: 15  ALIPLWDMCNHTNGTISTAYN------PVLDRSECLAVKN---FKAGEQLFIFYG-SRSN 64

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT-LP 361
           A+L +  GF+   +D + Y + L IS+SDP    +  +     +  +  F I  G + + 
Sbjct: 65  ADLFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLLGKLSIASTCDFSIRKGASPID 124

Query: 362 PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
             +L +LR+  +         +  ++   GH+D  +  A E    R +    K  LS + 
Sbjct: 125 GQLLAFLRVFNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYK 184

Query: 422 TTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKE 463
           TT++EDE+LL      P   +A+ +RA EKR++++  E  ++
Sbjct: 185 TTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQ 226


>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 442

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 95/372 (25%), Positives = 159/372 (42%), Gaps = 45/372 (12%)

Query: 68  WQWLRDQKVVSPKS----PIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           W++++ Q+ +  +      +R A F + G GL+A RD+   E+++ +P    I  +TV  
Sbjct: 31  WEYIQLQRWLKERGFQGRHLRAAEFTDTGRGLMATRDLQPGELIISLPDSCLITTETVLQ 90

Query: 123 SEIG----SLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           S +G    +    + P +++  FLI E+  +E SPW+ YLD+LP      V+W E E++ 
Sbjct: 91  SYLGKYIRTWSPPVSPLLALCTFLIAERVARERSPWKPYLDVLPSSYSCPVYW-ESEIIS 149

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-LFPRPI----TLDDFLWAFGILRSR 232
           L    L    L  +  V+   L  E      + Q LF   I    T     WA+  + +R
Sbjct: 150 LLPAPLRQKALEQQTEVKE--LHTESWSFFVSLQPLFGGNITDIYTYGALRWAWCTVNTR 207

Query: 233 A----FSRLRGQN-----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
                  R  G +       + P  DL+NHSP +  E        A        + +RT 
Sbjct: 208 TVYMKHPRRHGLSAQQDVYAMAPYLDLLNHSPAVQVE--------AAFNEERRCYEIRTN 259

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAET 343
              +  +Q  I Y  +  N  L L+YGFI + +   +  +T +        GDK    + 
Sbjct: 260 SGCRKHDQAFICYGPH-DNQRLLLEYGFIAANNPHRSVYVTKDAIVGYLPPGDKQMPKKW 318

Query: 344 NGLGESAYF-DIVLGRTLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHAN 401
             LGE+ +  ++  G   P   +L  ++L+ L   +     + ++  I G     VS AN
Sbjct: 319 ALLGENDFLGNLTFGLDGPSWKLLTAVKLLCLRPEEF----TSWKKVILGSF---VSDAN 371

Query: 402 EELICRVVRDAC 413
           E     +VR  C
Sbjct: 372 ERDSLELVRKIC 383


>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
          Length = 440

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKFQD--SNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFVEAVDSIFSYSALLWAWCTINTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPRVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI        D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
           anatinus]
          Length = 499

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/315 (26%), Positives = 127/315 (40%), Gaps = 51/315 (16%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ ++     S +RPA FP+ G GL+A + +   E+++ +P    +  DTV  S +G 
Sbjct: 38  KWLKGRRFDG--SNLRPARFPDTGRGLMATKSLKAGEMIISLPEACLLTTDTVLKSPLGD 95

Query: 128 LCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FLI EK+    S W+ YL +LP+     V      L       
Sbjct: 96  YIWKWKPPVSPLLALCTFLIAEKQAGARSLWQPYLGVLPQAYTCPVGLDAAVL------S 149

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-----------RPITLDDFLWAFGILRS 231
           LL   LG +   Q   ++   E+   ++  F            R  TLD   WA+  + +
Sbjct: 150 LLPQPLGRRAREQRTAVR---ELFAASRAFFSSLQPLFSEDVERVFTLDALGWAWCTVNT 206

Query: 232 RAFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           R       Q            L P  DL+NHSPG   E        A        + +RT
Sbjct: 207 RTVYMEHAQRDCFSAEADIYALAPYLDLLNHSPGAQVE--------AAFNKETRCYEIRT 258

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY-----TLTLEISESDPFFGDK 337
               +  E+VLI Y  +  N  L L+YGF+ S +  +        L   +   D     K
Sbjct: 259 ASRCRKYEEVLICYGPH-DNRRLLLEYGFVCSNNPHSNVVVSPDVLVRHLPSGDKQMTKK 317

Query: 338 LDIAETNGLGESAYF 352
           L + + +G  E+  F
Sbjct: 318 LSLLKEHGFLENLTF 332


>gi|428172369|gb|EKX41279.1| hypothetical protein GUITHDRAFT_112741 [Guillardia theta CCMP2712]
          Length = 329

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 139/304 (45%), Gaps = 39/304 (12%)

Query: 41  SRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDI 100
           ++ F A  +L  A   +   +A  E F+QWL+       K  I  A F    G+ A+++I
Sbjct: 18  AKGFGAKPTL--AGKASRIDSADSERFFQWLKAGGASVSKVAI--ADFDGLRGVAAEKNI 73

Query: 101 AKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKE---DSPWRVYL 157
            + EV++ +P    +N        +GS   G+ P  + A  +  EK +E   +  ++ + 
Sbjct: 74  EEGEVIVSIPRDICLN--------LGS--DGVNPGYAAAQLVRIEKDEERRRNGWFQPFF 123

Query: 158 DILP--ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR 215
           D+LP  E  D+T F+SE EL  L+   ++  T      +++ Y     ++ L N   F  
Sbjct: 124 DMLPKYEQCDTTEFYSENELNALEWDAVIQETKSRVAMLRSTY--EASQVGLSNDIKF-- 179

Query: 216 PITLDDFLWAFGILRSRA---FSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
             T ++FLW    + SR    ++   G    LIP+ D+ NH          +++K     
Sbjct: 180 --TWEEFLWGVYQIVSRVLTIYTNEDGAVKYLIPMIDMFNHDAA-----SPHQLKA---- 228

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDL-NKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
           +RD LF +     + AG+Q+   Y   N +N  +  DYGF+ES +  +   L L  + S+
Sbjct: 229 TRDGLFQIIAGKKIFAGQQINFPYGGGNLNNDRIIQDYGFVESSNSHDVKQLLLPAT-SE 287

Query: 332 PFFG 335
           P  G
Sbjct: 288 PVQG 291


>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
 gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
 gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
 gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
          Length = 440

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
           troglodytes]
 gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
 gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
 gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
 gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
          Length = 440

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
          Length = 440

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKFQD--SNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFVEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPRVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI        D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|301119251|ref|XP_002907353.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105865|gb|EEY63917.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 161/346 (46%), Gaps = 31/346 (8%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           +++AL   +  + + S W  ++++LP+   + +++   E+  L+G+ L      ++E V 
Sbjct: 93  LAIALLYEKYMQGDKSKWAKHIELLPKTYHNALYFEAGEIKALEGSNLFFIAQQMEEKVA 152

Query: 196 NEYLKVEEEIILP---------NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL-VLI 245
           ++Y  ++E ++              LF    +LD++ WA   + SR    +  Q+   ++
Sbjct: 153 SDYAVLKESVLFELFENITEGITVDLFDEIFSLDNYKWALSTIWSRFVLPVAKQSFKAMV 212

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           P+ D++NH P         E + +  F  +   F L +     AG Q+ I Y    ++  
Sbjct: 213 PVFDMLNHDP---------EAEMSHFFDMETQCFKLVSHQHWNAGAQMFINYGALSNHKL 263

Query: 305 LALDYGFIESKSDRNAYTLTLEISE-SDPFFGDKLDIAETNGLGESAY-FDIVLGRTLPP 362
           L+L YGF+   +  +A  + L + E S  F+ +K  +   NGL  +   F++V   T   
Sbjct: 264 LSL-YGFVIIGNLFDAVDMWLPMDEASTKFYHEKEQLLLVNGLDHATNPFELVAEET-ND 321

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
            +L   R+  +       LE++    + G +   VS  NE+     +    +  L  F T
Sbjct: 322 LLLMATRIQEIDCDTIDELETLAYKALDGEV---VSLENEQQALTRLIYTLEQMLKAFPT 378

Query: 423 TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELEL 468
           +IEED++LL G N +   R+AV VR  +K +L +   + K + L++
Sbjct: 379 SIEEDDELL-GLNHE---RMAVAVRRSDKYILSENINMLKWKLLDI 420


>gi|451992452|gb|EMD84936.1| hypothetical protein COCHEDRAFT_1149681 [Cochliobolus
           heterostrophus C5]
          Length = 478

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 110/448 (24%), Positives = 180/448 (40%), Gaps = 118/448 (26%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG------------------- 131
           G G++A++DI ++  +  VP    ++  T   S++G+   G                   
Sbjct: 38  GRGIIAKQDIPEDTTLFTVPRNIILSTQT---SDLGAKLPGIFEQHVDGNDDDDGDGQDH 94

Query: 132 ----LKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLST 186
               L  W S+ L ++ E  + D SPW+ YLDILP+  ++ +FW+ +EL EL+GT L + 
Sbjct: 95  EPESLDSWGSLILVMLYEYLQGDASPWKPYLDILPQAFETPIFWTADELKELEGTSLTTE 154

Query: 187 TLGVKEYVQNEYLKVEEEIILP------------------NKQLFPRPITLDDFLWAF-- 226
            +G     + E  ++  E ILP                   + L P    +   + A+  
Sbjct: 155 KIG-----KEESDRMLRERILPIVTSHPNVFFPPGAPLLNEEDLLPLAHRMGSTIMAYAF 209

Query: 227 --------------GILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
                         G +  R    L G    ++P+AD++N       E  A+   G  L 
Sbjct: 210 DLENEDEQSDDEEDGWIEDRDGKSLMG----MVPMADMLN----ANAEFNAHVHHGDQL- 260

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY----------T 322
               + SLR  +P  AG ++L  Y    S +EL   YG++  +  R              
Sbjct: 261 ---QVTSLRESIP--AGSEILNYYGPLPS-SELLRRYGYVTPEHHRYDVAELPWSLVRTA 314

Query: 323 LTLEISESDPFFGD-------KLD-------IAETNGLGESAYFDIVLGRTLPPAMLQYL 368
           L  E++ S+    D       KLD       + E +    S+Y     G    P +L+ +
Sbjct: 315 LAEELNLSEDVIADVEGEVERKLDSELEEFFVIERDAGEPSSY-----GTFTQPPVLREV 369

Query: 369 RLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDE 428
                  T AFL     R+        P       +   V+  A ++ L  + T+ E+DE
Sbjct: 370 SSELEEQTKAFLKALNKRD--------PKRKRGTVIYDTVLEKALRTRLGQYPTSAEQDE 421

Query: 429 QLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
            L+   +L  R R+AV VR GEKR+LQ+
Sbjct: 422 SLISKQDLSKRHRMAVQVRLGEKRLLQE 449


>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
          Length = 440

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
 gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
 gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
          Length = 416

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 140/309 (45%), Gaps = 39/309 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 14  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 71

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 72  YITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVC-LEPEVVNLLPKS 130

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 131 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 189

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL-FSLRTPVPVKA 288
           R +  +        L P  DL+NHSP +       ++K A  F+ +   + +RT    + 
Sbjct: 190 RQRECLSAEPDTCALAPYLDLLNHSPHV-------QVKAA--FNEETHSYEIRTTSRWRK 240

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAET 343
            E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + 
Sbjct: 241 HEEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKD 299

Query: 344 NGLGESAYF 352
           +G  E+  F
Sbjct: 300 HGYIENLTF 308


>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
          Length = 436

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 83/310 (26%), Positives = 139/310 (44%), Gaps = 35/310 (11%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV--------AASEIGSLCSGLKPWISVALF 141
           EGLG+ A R +   E +  +P    I P+ V        +  ++ S        +++A F
Sbjct: 52  EGLGVFANRRLEPGETLFMIPKSCCIYPELVFEDRQLGKSMQKLASAAGEGIEVVALATF 111

Query: 142 LIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
           L REK K  +S ++ ++D+LP  +   + W++EE+  L+GT      L  +E V+    +
Sbjct: 112 LAREKMKGSESSYKPFIDVLPWDSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVA-TE 170

Query: 201 VEEEIILPN--KQLF----PRPITLDDFLW----AFGILRSRAFSRLRG------QNLVL 244
           + E ++ P   KQ F       +T ++F +    AF  + SRAF    G      +  V+
Sbjct: 171 LFEPVLNPKGWKQFFQTIETEKMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGLEERVV 230

Query: 245 IPLADLINHS---PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
           IPL D+ NH    P IT  D A E      F   +    ++   ++ GE++   Y  +K 
Sbjct: 231 IPLLDIFNHGSYGPSITF-DTALERDNEKGFPVRVADKGKS---IEEGEELFGFYG-DKP 285

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL-GESAYFDIVLGRTL 360
           N  +   YGF+         TL++ I E DP+F  K +I +  G+      FDI      
Sbjct: 286 NWNMLTTYGFVSPNPKCQETTLSVSIDEKDPYFAQKEEILKARGMVAVEQLFDIRHDTDP 345

Query: 361 PPAMLQYLRL 370
              ++ Y R+
Sbjct: 346 MGPLINYFRI 355


>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
          Length = 543

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 128/294 (43%), Gaps = 36/294 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI---GSLCSGLKPWISVALFLIREK 146
           EG GLVA   I + E VLE+P +  I  +      +       + L+ W  +A FL    
Sbjct: 114 EGRGLVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETA 173

Query: 147 KK---EDSP-----WRVYLDILPECTDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNE 197
           +    ED+      +  Y+  LP  T S + W E ++  L  G+  L + L  +  V   
Sbjct: 174 QNLSTEDNSSNKYRFATYVKALPRSTGSVLEWPESDVRTLLAGSPSLFSALERRASVAAA 233

Query: 198 YLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR--GQNLVLIPLADLINHSP 255
             ++        +  FP  +      WAF IL SR   RL   G NL L+P AD++NH P
Sbjct: 234 IAEI--------RVNFPE-LNEKTLQWAFDILFSRLI-RLESLGGNLALVPWADMLNHQP 283

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
           G   E +    +G    SR +   L T    + GEQV   Y   + ++EL + YGF  + 
Sbjct: 284 G--CEAFIDLDRG----SRKVC--LTTDRSYEPGEQVWASYG-QRPSSELLISYGFAPAV 334

Query: 316 SDR--NAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
            D   + Y L L+I E DPF   K++   +  +     F + L    P  +LQY
Sbjct: 335 GDNPDDEYALNLQIDEEDPFASAKVNALASQNIQAFETFPLRLN-GYPRQLLQY 387


>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 813

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 109/252 (43%), Gaps = 25/252 (9%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTV----AASEIGSLCSGLKPWISVALFLIREK 146
           G G  A RD A+ EV+LE+P++            A   + S C+       VAL +  E+
Sbjct: 21  GRGQRALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHDD--VVALHVCAER 78

Query: 147 -KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY--LKVEE 203
            + E +    ++  LP   D+  FWSEEEL EL GT  L  T+ ++E  +N+Y  L  + 
Sbjct: 79  FRGEKATRAAHVATLPRSFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKKM 138

Query: 204 EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLINHS---PGI 257
           E I     +    +  + + WA   L SR    L     +   ++P  D+ NHS   P  
Sbjct: 139 EAIGEGGWMREHEVDYERYAWARSNLWSRQCDLLMPDGKRTRAMVPTFDIFNHSAKAPLG 198

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK-SNAELALDYGFIESKS 316
            T     E     +++ D           KAGEQ  I Y   + +N++L   YGF    +
Sbjct: 199 KTHKLNAEKNCVTVYAAD---------DYKAGEQAFISYGSGEAANSKLLTWYGFCIDDN 249

Query: 317 DRNAYTLTLEIS 328
                 +TL I+
Sbjct: 250 PYEELDVTLTIT 261


>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
           gorilla]
          Length = 440

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKF--QDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGRRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GYIENLTF 332


>gi|171678927|ref|XP_001904412.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937534|emb|CAP62192.1| unnamed protein product [Podospora anserina S mat+]
          Length = 466

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 162/384 (42%), Gaps = 52/384 (13%)

Query: 86  ATFP-EGLGLVAQRDIAKNEVVLEVPMK-FWINPDTVAASEIGSLCSGLKPWISVA---- 139
           A FP  G G+ A R   K E +L +P    W      A   +G      +P +SV     
Sbjct: 41  ADFPVTGRGVRALRRFKKGERILTIPCGVLWTVEHAFADPLLGPALRSARPPLSVEDILA 100

Query: 140 --LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
             +  IR ++      R ++  LP    S++F+S++EL    GT L + T  +   + ++
Sbjct: 101 TYILFIRSRESGYDGLRSHVAALPTSYSSSIFFSKDELEVCAGTSLYTITKQLDRSIDDD 160

Query: 198 YLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRA--FSRLRGQNL-VLIPLADLINH 253
           Y  +   ++  ++ L P    T++D  WA   + SRA  F+   G ++ +L P AD++NH
Sbjct: 161 YRALVVGVLAQHRDLLPLDKFTIED--WALCTVWSRAMDFALPDGNSIRLLAPFADMLNH 218

Query: 254 SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           S  +    + Y++    L       S+      +AG+Q  I Y     N+ L   YGF++
Sbjct: 219 SSEV-KPCHVYDVSSGNL-------SVLAGKDYEAGDQAFISYG-PIPNSRLLRLYGFVQ 269

Query: 314 SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
                                  K  +  + GL  +    + L   LP  +L+YLR+   
Sbjct: 270 -----------------------KHKLWVSAGLDSTCTIPLTLTDPLPKNVLRYLRIQRS 306

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
             +D  ++   FR       D  +S +NE  + + + ++    L  F T +E+ E+ L  
Sbjct: 307 DESDLAVIA--FRQI--NATDEKISDSNEVEVLQFLIESFSHLLDSFGTQLEKLEEQLAE 362

Query: 434 GNLDP--RLRIAVGVRAGEKRVLQ 455
           G   P      A  V  GE+RVL+
Sbjct: 363 GVYSPGGNAWAAAHVSLGEQRVLR 386


>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/414 (24%), Positives = 173/414 (41%), Gaps = 44/414 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS- 127
           QWL      S K  ++    PE  G+ +++ +   E +L +P K  I  +    ++IG  
Sbjct: 39  QWLETNGADSKKLTLQEYA-PEVRGVHSRKVLVPGERILVIPKKCLITVEMGKQTDIGRK 97

Query: 128 -LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPEC-TDSTVFWSEEELVELQG 180
            L   +       I + +FL+ + +  E S +R Y   LP   ++  +FWSEEEL  L+G
Sbjct: 98  LLARNVDFVAPKHIFLMMFLLTDMEHVETSFFRNYYSTLPSTLSNMPIFWSEEELSWLKG 157

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRG 239
           + ++      K  ++ +Y     ++I      F R  +LD F WA  I+ SR F   + G
Sbjct: 158 SYIIQQIQERKAAIRKDY-----DVICRVDPSFAR-FSLDRFSWARMIVCSRNFGLTIDG 211

Query: 240 -QNLVLIPLADLINH-SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
            +   L+P AD++NH  P  T+  +   I           F++ +   +  G QV   Y 
Sbjct: 212 VKTAALVPFADMLNHYRPRETSWTFDQSIDA---------FTITSLGTIGTGAQVYDSYG 262

Query: 298 LNKSNAELALDYGFI------ESKSDRNAYTLTLEISESDP--FFGDKLDIAETNGLGES 349
             K N    L+YGF       E   + N   +  ++S +D   F+  +  + E+      
Sbjct: 263 -KKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSPADGQLFYDKRAYLHESGIYTMD 321

Query: 350 AYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
           A                + RL+ +   + F   S  +     H   P+S  NE      +
Sbjct: 322 ARLSCSHSDANTREGFSFARLI-VATEEEF---STMKMKSPAHSSPPISFDNEIRALEYL 377

Query: 410 RDACKSALSGFHTTIEEDEQLLQGGN---LDPRLRIAVGVRAGEKRVLQQIDEI 460
           R+     LS + TTIEED +LL          R++    +R GEK+V +   E+
Sbjct: 378 RNLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIR-GEKQVCRYFQEL 430


>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
 gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
          Length = 439

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/418 (22%), Positives = 176/418 (42%), Gaps = 47/418 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K       + PA FP  G GL+++  + + +V++ +P    +  DTV  S +G 
Sbjct: 36  KWLKERKF-EDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRSSVGP 94

Query: 128 LCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FL+ E+     S W+ YLDILP+     V   E E+V+L    
Sbjct: 95  YIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLPGP 153

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +     +     + LF   +    +   FLWA+  + +RA + + 
Sbjct: 154 LRAKAEEQRARVQDLFAS-SRDFFSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKS 212

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R Q  +        L P  DL+NHSP +  +        A    +   + +RT    +  
Sbjct: 213 RRQECLSSEPDTCALAPFLDLLNHSPHVQVK--------AAFNEKTRCYEIRTASRCRKH 264

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE-----ISESDPFFGDKLDIAETN 344
           ++  I Y  +  N  L L+YGF+   +      ++ E     +  +D     KL I E +
Sbjct: 265 QEAFICYGPH-DNQRLLLEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDH 323

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
           G       ++  G   P   L     +     + F   + ++  + G +   +S  NE+ 
Sbjct: 324 GFTG----NLTFGWDGPSWRLLTALKLLCLEAERF---TSWKKVLIGDV---ISDRNEKA 373

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQ-GGNLDPRLRIAVGVRAGEKRVLQQIDEIF 461
              V +  C   ++  H+ + +   + +  G+L  +L +   +R  E R+LQ   EI 
Sbjct: 374 SLGVAQKICSDFIAEIHSELRKVSDMKERKGSLISQLCLVEALRMEELRILQASAEIL 431


>gi|359473101|ref|XP_002275523.2| PREDICTED: uncharacterized protein LOC100264713 [Vitis vinifera]
 gi|297738036|emb|CBI27237.3| unnamed protein product [Vitis vinifera]
          Length = 482

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/413 (24%), Positives = 174/413 (42%), Gaps = 63/413 (15%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +D+    +  +++E+P++  +           PD V     I  + +   P
Sbjct: 97  PDGFGVYASKDVEPLRRARMIMEIPLELMLTIRKKLPWMFFPDIVPVGHPIFDIINSTNP 156

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+    +ED+ W++Y D LP   ECT S +   EE+L+ELQ  +L ST 
Sbjct: 157 ETDWDLRLACLLLFAFDQEDNFWQLYGDFLPSEGECT-SLLLAKEEDLLELQDPKLASTM 215

Query: 188 LGVKEYVQNEYLKV-EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLV-- 243
              +      + K     + L  K+L   P   + F+WA  I +SR  + ++R   LV  
Sbjct: 216 RDQQRRASEFWEKNWHSGVPLKIKRLARDP---NRFIWAVSIAQSRCINMQMRIGALVQD 272

Query: 244 ---LIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
              LIP AD++NHS  P      + +  K      R L   +     +K GE++ + Y  
Sbjct: 273 ANMLIPYADMLNHSFQPNCF---FHWRFK-----DRMLEVMINAGQRIKKGEEMTVNYMS 324

Query: 299 NKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
              N  L   YGF    S  N + +      +       L +   +GL E  Y +  L  
Sbjct: 325 GLKNDMLMQRYGF---SSPVNPWDVIQFSGNAQIHLDSFLSVFNISGLPEEYYHNSRLSN 381

Query: 359 TLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALS 418
                             D  ++ +      W   D+P   + E    + +++ C+  L 
Sbjct: 382 N------------GDSFVDGAVIAAARTLPTWSDGDVPPMPSMERKAVKQLQEECQQMLL 429

Query: 419 GFHTTIEEDEQLLQGGNLDPR-LRIAVGVRAGEK----RVLQQIDEIFKEREL 466
            F TT E+D+++L     + R L  A+  R   K    +V+Q +D I++ER L
Sbjct: 430 EFPTTSEQDQKILDSMTEERRTLEAAIKYRLHRKLLIEKVIQALD-IYQERIL 481


>gi|389622275|ref|XP_003708791.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
 gi|351648320|gb|EHA56179.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
 gi|440464619|gb|ELQ34017.1| hypothetical protein OOU_Y34scaffold00823g1 [Magnaporthe oryzae
           Y34]
          Length = 419

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/405 (23%), Positives = 172/405 (42%), Gaps = 82/405 (20%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPM-KFWINPDTVAASEIG 126
            WL++   V   + +  A FP  G G+   R   + E +L +P    W      A S +G
Sbjct: 3   NWLKETGAVGLDN-LELADFPITGRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLLG 61

Query: 127 SLCSGLKPWISVA------LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
                ++P +SV       +  +R ++      R ++  LP    S++F++EEEL    G
Sbjct: 62  PALRSVRPPLSVEDILATYILFVRSRESGYDGLRSHVAALPSSYSSSIFFAEEELEVCAG 121

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFS-RLR 238
           T L + T  +++ ++++Y  +   +++ ++ LFP    T++D+ WA   + SRA    L 
Sbjct: 122 TSLYTVTKQLEQRIEDDYRALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLP 181

Query: 239 GQNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
           G N +  L P AD++NHS  +  + +AY                                
Sbjct: 182 GGNSIRLLAPFADMLNHSDNV-KQCHAY-------------------------------- 208

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVL 356
                               D ++ TL++ ++  D   GD+L +  +  L  ++   + L
Sbjct: 209 --------------------DSSSKTLSV-LAGKDYEAGDQLKLWASARLDPTSTISLTL 247

Query: 357 GRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSA 416
              LP  +L+YLR+     +DA  L  +    I       +S +NE  + R + ++    
Sbjct: 248 TDPLPNDVLRYLRIQR---SDASELAGMACQRI--DAAEKISDSNEVEVLRFLVESLSGL 302

Query: 417 LSGFHTTIEE-DEQLLQ-----GGNLDPRLRIAVGVRAGEKRVLQ 455
           L  F T +E+ +EQL +     GGN       A  V +GE+RVL+
Sbjct: 303 LKNFGTQLEKLEEQLAEGVYASGGN----AWAAAHVSSGEQRVLR 343


>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
 gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
          Length = 488

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 144/318 (45%), Gaps = 24/318 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLCS--GLKPWISVALFLIREKK-KE 149
           +VA R +A  EV L VP +  +  D +  SE +  L +   L     +AL+L+ EKK K+
Sbjct: 75  VVAGRSLAAGEVALSVPERLCLTLDRIFESEFVAELLTTDKLSELACLALYLMYEKKLKK 134

Query: 150 DSPWRVYLDIL-------PECTDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNEYLKV 201
            S W  Y+  L       P+  +S + W ++EL  L +G+ LL      +  ++ EY  +
Sbjct: 135 KSFWYPYIKELDKQQARGPQAAESPLLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEAL 194

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQ--NLVLIPLADLINHSP-GIT 258
           +    +        P  L    ++F + + +AF+ ++    +L  +P+A      P G  
Sbjct: 195 DTVWFMAGSLFNKYPFDLPTETFSFELFQ-QAFAVVQASIVHLQGVPIAKRFALVPLGPP 253

Query: 259 TEDYAYEIKGAGLFSRDLLFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
              Y+   K    +  D   S+R  V  PV+AG  V   +   + N+ L L+YG ++  +
Sbjct: 254 LMAYSSTSKNMMTYDEDSR-SVRLVVSGPVEAGRPV-AAWCGPQPNSRLLLNYGVVDEHN 311

Query: 317 --DRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG 374
             D+     T  +  SDP F  K  +    GL     FD+ + R LPP +L Y+ ++AL 
Sbjct: 312 PFDKLQARFTFTLPTSDPLFPAKRAVLSEAGLATQQSFDVSVARPLPPQLLPYM-MLALA 370

Query: 375 GTDAFLLESIFRNTIWGH 392
            T   +    F +T  GH
Sbjct: 371 TTPEQVASVSFSDTA-GH 387


>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
          Length = 475

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/418 (22%), Positives = 176/418 (42%), Gaps = 47/418 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K       + PA FP  G GL+++  + + +V++ +P    +  DTV  S +G 
Sbjct: 72  KWLKERKF-EDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRSSVGP 130

Query: 128 LCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FL+ E+     S W+ YLDILP+     V   E E+V+L    
Sbjct: 131 YIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLPGP 189

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +     +     + LF   +    +   FLWA+  + +RA + + 
Sbjct: 190 LRAKAEEQRARVQDLFAS-SRDFFSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKS 248

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R Q  +        L P  DL+NHSP +  +        A    +   + +RT    +  
Sbjct: 249 RRQECLSSEPDTCALAPFLDLLNHSPHVQVK--------AAFNEKTRCYEIRTASRCRKH 300

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           ++  I Y  +  N  L L+YGF+   +      ++ E+       +D     KL I E +
Sbjct: 301 QEAFICYGPH-DNQRLLLEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDH 359

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
           G       ++  G   P   L     +     + F   + ++  + G +   +S  NE+ 
Sbjct: 360 GFTG----NLTFGWDGPSWRLLTALKLLCLEAERF---TSWKKVLIGDV---ISDRNEKA 409

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQ-GGNLDPRLRIAVGVRAGEKRVLQQIDEIF 461
              V +  C   ++  H+ + +   + +  G+L  +L +   +R  E R+LQ   EI 
Sbjct: 410 SLGVAQKICSDFIAEIHSELRKVSDMKERKGSLISQLCLVEALRMEELRILQASAEIL 467


>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
          Length = 507

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 166/380 (43%), Gaps = 28/380 (7%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKKED 150
           + A  D+   +V   +P    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 108 VAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGN 167

Query: 151 -SPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S WR ++  L           +S + WS EEL    G+ +    L     ++ EY +++
Sbjct: 168 QSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKEIMLERNSGIKREYEELD 227

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPGI 257
               +        P  +    + F I + +AF  ++      QN+ L     L+   P +
Sbjct: 228 TVWFMAGSLFKQYPYDIPTEAFPFEIFK-QAFVAVQSCVVHLQNVNLARRFALVPLGPPL 286

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
            +  Y    K A L +      L      KAGE +++ +   + NA L L+YGF++  + 
Sbjct: 287 LS--YKSNCK-AMLKAVGDSVQLEVDREYKAGEPIVV-WCGPQPNARLLLNYGFVDEDNP 342

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYLRLVALGG 375
            +   + + +   DP + DK  IA+ NG      F+I +GR       ML YLRL  +  
Sbjct: 343 HDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLDMLPYLRLAYV-- 400

Query: 376 TDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
           +D   ++S+  +   G +  PVS   E  +   +    +  L+G+ TT  EDE +L    
Sbjct: 401 SDPSEMQSVLSSQ--GPV-CPVSPCTERAVLDQLSRYFRERLAGYPTTASEDEIVLADPT 457

Query: 436 LDPRLRIAVGVRAGEKRVLQ 455
            +P+ ++A  +   EK++L 
Sbjct: 458 TNPKRQVATQLVLIEKKMLN 477


>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
          Length = 353

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 112/260 (43%), Gaps = 43/260 (16%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-----GSLCSGLKPWISVALFL--- 142
           GL L   +D+ + EV+L +P +  +  D  A         G +   +  +  +A++L   
Sbjct: 47  GLELRLVKDVKRGEVLLAIPRRAILEIDDAATCPCKEYITGEMWQAIPSYAKLAIYLLYS 106

Query: 143 IREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           I   +++  P R Y D+LP+   ST  WSEE + ELQ   ++       E +Q    K++
Sbjct: 107 IDHAEQDPRPLRDYFDVLPKQVLSTFSWSEEAIQELQDPYMI-------EQIQTRRRKIQ 159

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAF--SRLRGQNLVLIPLADLINHSPGITTE 260
                  K L PR IT D  LWA  I+ SRAF  SR  G +LV             +  +
Sbjct: 160 RLFHEIQKGLSPR-ITYDRLLWAIEIVLSRAFAFSRTGGDDLVF--------SGTSVKYD 210

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
           +   E            F +      K G+ V I Y L KSN EL L YGFI   +  + 
Sbjct: 211 NSKQE------------FQIVAEKDFKVGQSVEISYGL-KSNHELLLSYGFILPDNPEDF 257

Query: 321 YTLTLEISESDPFFGDKLDI 340
           + +T    +S  F    L++
Sbjct: 258 FVIT----DSKKFLSSVLNV 273


>gi|212721730|ref|NP_001132096.1| uncharacterized protein LOC100193512 [Zea mays]
 gi|194693412|gb|ACF80790.1| unknown [Zea mays]
 gi|413942692|gb|AFW75341.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
 gi|413942693|gb|AFW75342.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
          Length = 549

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 126/309 (40%), Gaps = 61/309 (19%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGL---VAQRDIAKNEVVLEVPMKFWINPDT 119
           ++E+F QWL+          IR A   +GLG+    A    + + V + VP+   I P  
Sbjct: 11  KLESFLQWLQSNGADLRSCTIR-ACGGKGLGVFSTAAPEPGSNDGVAMVVPLDLAITPMR 69

Query: 120 VAASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSE 172
           V     +G  C  L         + V LFL+ E+++  S W+ YLD+LP    S+++++E
Sbjct: 70  VLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRRPGSLWKPYLDMLPSTFGSSLWFTE 129

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEY-LKVE---EEIILPNKQLFPRPITLDDFLWAFGI 228
           EEL EL+GT L   TL  ++ +Q+ +  KV+   EE++  ++      +  +DFLWA  I
Sbjct: 130 EELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEELLHVDESASSVEVLFEDFLWANSI 189

Query: 229 LRSRAFS----------------RLRGQNLV----------------------------- 243
             +RA +                + R  N                               
Sbjct: 190 FWTRALNIPLPHSYVFLGSCGDQQARADNDAHQEIDITAKDCSADENSKPSNTESIWVEG 249

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           L+P  D  NH+           +  A      +   L    P + G ++ I Y  NK N 
Sbjct: 250 LVPGIDFCNHNVKALATWEVDSVGNATGIPASMYLLLADKSPAETGAEICINYG-NKGNE 308

Query: 304 ELALDYGFI 312
           EL   YGF+
Sbjct: 309 ELLYLYGFV 317


>gi|224005597|ref|XP_002291759.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972278|gb|EED90610.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 541

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 118/471 (25%), Positives = 183/471 (38%), Gaps = 126/471 (26%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDT------VAASEIGSLCSGLK-----PW---I 136
           GLGL + + +  N ++++VP K  ++ +T      V   E+    S  K     PW   +
Sbjct: 86  GLGLYSTQSLTPNTLLIQVPSKLTLSVETPIDYNIVMEKEL--FASNPKSYRNAPWYAAL 143

Query: 137 SVALFLIREKKKEDS-------PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG 189
           SV L    +    +S       PW   +D LP   D+ + WSE  L ELQ T ++     
Sbjct: 144 SVQLNYYDKVNANNSKAGVSMTPW---MDSLPRKFDTPIHWSESSLKELQYTPMMEAVAL 200

Query: 190 VKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLVLIPLA 248
            +   +N+Y     ++     + F   ++ DDF+W     RSRAFS    G     IP A
Sbjct: 201 QQRVWRNQY-----DMFAAASKDFASRVSYDDFVWGCETARSRAFSGAYSGSAFNPIPYA 255

Query: 249 -----------------------------------------------------DLINHSP 255
                                                                D+ NH  
Sbjct: 256 TVAVCVAAYLALGLGTIEQAANGAALVVCGTILRDFVLPKLLKVQKYVICPLIDMANHVG 315

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
              T + A+E      FS     S+  P    AG +V IQY   ++N +L   YGF+ES+
Sbjct: 316 VGATGNVAFE-----YFSDGFSLSVLEPGVPDAGSEVFIQYG-PRNNDQLLQYYGFVESE 369

Query: 316 SDRNAYTL---------TLEISESDPFFGDKLDIAETNG-LGESAYFD------------ 353
           +  + Y L          LE++        +L+  +  G LG  +  D            
Sbjct: 370 NVHDVYILPPIRQWDIRALEVACGRMVGPGRLEKLDRAGLLGRESSIDSSSNTEAANDIG 429

Query: 354 -IVLGRT--LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVR 410
            +VL R   + PA++Q LR  AL  +DA   ES       G+  + VS  NE      ++
Sbjct: 430 GVVLTRATGIDPAVIQALR--ALVSSDAEWEES---GEAIGNFAVQVSPENERAAKLAIQ 484

Query: 411 DACKSALSGFHTTIEEDEQLL-----QGGNLDPRLRIAVGVRAGEKRVLQQ 456
            A +  L    TT+EEDE LL     + G L+    +AV  R  +K++L++
Sbjct: 485 RAIELELESKATTMEEDENLLKVFQAKSGGLEEEEVLAVKFRLEKKKLLKE 535


>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
 gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
          Length = 508

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 121/285 (42%), Gaps = 48/285 (16%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP---EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           A  + F  WL     V     I    F    EG G++A +DIA+ EV+ E+P    +N  
Sbjct: 6   ADTDAFVSWLTADANVKLSQNIEVTHFNTSNEGRGVIAVKDIAEGEVLFEIPRDSILNVL 65

Query: 119 TVAASEIGSL----CSGLKPWISVALFLIRE--KKKEDSPWRVYLDILP--ECTDSTVFW 170
           T + S   S        +  W  + L L+ E   KKE S W  Y ++LP     +  ++W
Sbjct: 66  TSSLSSDFSDLEETLQSIGSWEGLILCLLYEWKGKKEKSKWWKYFNVLPSSNAMNGLMYW 125

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR---PITLDDFLWAFG 227
           +E+EL  L+ + +L   +G K+  +N Y KV   + L  +  FP     +  +DF++A  
Sbjct: 126 NEQELEHLRPSLVLD-RIG-KKSAKNMYHKV---LTLVKESKFPEVLCNVEWEDFVYAAS 180

Query: 228 ILRSRAFSRLRGQNLVL-------------------IPLADLINHSPGITTEDYAYEIKG 268
           ++ + +F    G++  L                   IPLAD +N                
Sbjct: 181 VIMAYSFDVENGESQTLNEEDDDQDEEENTGYIKSMIPLADTLNSDTHQC---------N 231

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           A L   D    +    P+K GEQV   Y  N  NAE+   YG++E
Sbjct: 232 ANLMYDDKFLKMYAIKPIKKGEQVFNIYG-NHPNAEILRRYGYVE 275


>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
          Length = 437

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 158/342 (46%), Gaps = 38/342 (11%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           +++AL   +  +   S W  ++++LP    + +++  EEL  L+G+ +      ++E V 
Sbjct: 97  LAIALLYEKFVRGSKSKWAKHIELLPRTYHNALYFGPEELRALEGSNVYFIAQQMEEKVA 156

Query: 196 NEYLKVEEEIILP---------NKQLFPRPITLDDFLWAFGILRSRAFSRL---RGQNLV 243
           ++Y +++E ++L          N  LF    +L+++ WA   + SR F  +   +     
Sbjct: 157 HDYARLKESVLLELFENVPEGINVDLFDEFFSLENYKWALSTIWSR-FGDVPVAKQSFKA 215

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL-FSLRTPVPVKAGEQVLIQYDLNKSN 302
           ++P+ D++NH P         E + +  F      F L +     AG Q+ I Y    ++
Sbjct: 216 MVPVFDMLNHDP---------EAEMSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNH 266

Query: 303 AELALDYGFIESKSDRNAYTLTLEISE-SDPFFGDKLDIAETNGLGESAY-FDIVLGRTL 360
             LAL YGF+   +  +A  + L + E S  FF +K  +  TNGL  +   F++V   + 
Sbjct: 267 KLLAL-YGFVIIGNPFDAVEMWLPMDEASTKFFQEKEQLLLTNGLDHATNPFELVADES- 324

Query: 361 PPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANE-ELICRVVRDACKSALSG 419
              +L   R+  +        E +    + G +   +S  NE E + R++    +  L  
Sbjct: 325 NDLLLMAARIQEIDCETVEEFEELANKALEGEM---ISLENEQEALTRLIY-TLEKMLES 380

Query: 420 FHTTIEEDEQLLQ-----GGNLDPRLRIAVGVRAGEKRVLQQ 456
           F T+IEED+ LL+       NL+   R+AV VR  +K +L +
Sbjct: 381 FPTSIEEDDILLEQDDKKTDNLNHE-RMAVAVRRSDKYILSE 421


>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
 gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
          Length = 439

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 107/425 (25%), Positives = 176/425 (41%), Gaps = 62/425 (14%)

Query: 69  QWLRDQKVVSPKSPIRPATF-PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+      S +RPA F   G GL+  + +   E+V+ +P K  +   TV  S +G 
Sbjct: 37  KWLKDRGF--GDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTVLNSCLGE 94

Query: 128 LCSGLKPWIS--VAL--FLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S  +AL  FLI EK   E S W+ YLD+LP+ T S     E+++V+L    
Sbjct: 95  YIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-TYSCPVCLEQDVVQLLPEP 153

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFL---WAFGILRSRAFSRLRG 239
           L       +  V   Y+  +               T+ ++    WA+  + +R       
Sbjct: 154 LRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATIFNYSALEWAWCTINTRTIYMKHS 213

Query: 240 QN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           Q            L P  DL+NHSP +  +        A    +   + ++T    K  E
Sbjct: 214 QRECFSLEPDVYALAPYLDLLNHSPNVQVK--------AAFNEQSRNYEIQTNSQCKKYE 265

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAY-----TLTLEISESDPFFGDKLDIAETNG 345
           +V I Y  +  N  L L+YGF+   +  ++      TL       D     KL I + + 
Sbjct: 266 EVFICYGPH-DNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDKQKNAKLSILKEHD 324

Query: 346 LGESAYFDIVLGRTLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
           L E    ++  G   P   +L  L++++LGG D F   + +R  + G +   +S  NE+ 
Sbjct: 325 LLE----NLTFGWDGPSWRLLTALKVLSLGG-DEF---TCWRRALLGDV---ISARNEQQ 373

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLL--------QGGNLDPRLRIAVGVRAGEKRVLQQ 456
              +    C       H  IEE + +L           NL  +L +   +R  + ++LQ+
Sbjct: 374 ALNITTKIC-------HFLIEETQHVLLQISQMKRDKENLKEQLTLVEALRLEDLKILQK 426

Query: 457 IDEIF 461
             EI 
Sbjct: 427 SAEIL 431


>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 108/453 (23%), Positives = 183/453 (40%), Gaps = 49/453 (10%)

Query: 39  LKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPAT----------- 87
           + SRA AAT     A A TNP  +      + + D    +     R              
Sbjct: 5   IASRAVAATTPRARARARTNPRASSTSRRTRAIADANAEATAEDARELAAWLSYDKGVDA 64

Query: 88  ----FPEG----LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS-LCSGLKPWISV 138
               F EG    + +  + D+     VL VP    +    V A  I S L  G    + +
Sbjct: 65  SGLVFKEGARGEVEVALRGDVDAGARVLAVPQDCAVTSVDVDAHPIVSGLAKGRPELVGL 124

Query: 139 ALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSE-EELVELQGTQLLSTTLGVKEYVQN 196
           AL+L  E+ K   S W  Y+  L    D+ +FW+E E+   L+G+ +++  +      + 
Sbjct: 125 ALWLCAERIKGGASDWAPYVKTLAANPDAPLFWTEAEDFALLKGSPIVNDAVERSRSARE 184

Query: 197 EYLKVEEEIILPNKQLFPRP----ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLIN 252
           EY  +  E+I  +   FP       T + F+ A   + ++A          L+PL D+I 
Sbjct: 185 EYAAI-VEVIKGDPTAFPAEAYEFFTEERFVDALATVCAKATWLPTASCYALVPLLDVIT 243

Query: 253 HS----PGITTEDYAYEI-KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD-LNKSNAELA 306
            +    PG++       I + A  +  D    + + V        ++Q D L ++N EL 
Sbjct: 244 IAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAVVKAPANSRVVQLDPLQRNNGELF 303

Query: 307 LDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL-GESAYFDIVLGRTLPPAML 365
           L+ G ++ K   +   +  EI  SD  F  K  + E  G   E+ YF +   R +P  + 
Sbjct: 304 LNTGRVDQKHPGDYLYMRTEIQPSDRLFSAKKQVLEGMGFTAENQYFPVYEDR-MPTQLY 362

Query: 366 QYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
            YLR   +      +  S   + I       VS  NE  I +++   C+  +S + T  E
Sbjct: 363 SYLRFARVQDPGEMMAVSFEEDKI-------VSVMNEYEILQLLMGDCRELMSEYDTNEE 415

Query: 426 EDEQLLQGGNLDPRLRI----AVGVRAGEKRVL 454
           ++  LL+   L   +R+    A  +R  EK+++
Sbjct: 416 DELNLLK---LSDTMRVREIEAAKLRMSEKKLI 445


>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
 gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
          Length = 395

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 89/340 (26%), Positives = 144/340 (42%), Gaps = 60/340 (17%)

Query: 46  ATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNE 104
           A C   + S       A+VE F  W +D  V S    I  A FP   LGL A R +AK+E
Sbjct: 10  AVCPQINDSPDDQTRLAKVEAFSAWAKDGGVHSEGLEI--AIFPGYQLGLRATRPLAKDE 67

Query: 105 VVLEVPMKFWINPDTVAASEI-GSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPE 162
           +VL VP K   + ++ +   + G +       +++A  L+ EK + E S WR Y+D+LP 
Sbjct: 68  LVLSVPRKLIFSEESNSDCRLFGKMTQATH--LNLAYDLVIEKIRGEFSEWRPYIDVLPA 125

Query: 163 CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRP--- 216
              + ++++ +++  L+GT   S  L     +  +Y    +    +  P+      P   
Sbjct: 126 KYSTVLYFTTKQMELLRGTAAASLALRQCRVIAKQYAFLYRYAHTMTEPSTGNRSHPGER 185

Query: 217 --------ITLDDFLWAFGILRSRAFSRLRGQNLV----------------LIPLADLIN 252
                   +  + + WA   + +R       QNLV                LIP  D+ N
Sbjct: 186 GLFFTQHGLCYELYRWAVSTVMTR-------QNLVPSEKQESEDTPKLISALIPYWDMAN 238

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           H PG  T  YA          R L  + +    V AGEQ  I Y  ++SN +L +  GF+
Sbjct: 239 HRPGKITSFYA-------AVPRQLECTAQE--AVDAGEQFFIYYG-DRSNTDLLVHNGFV 288

Query: 313 ESKSDRNAYTLTLEISESDPFFG------DKLDIAETNGL 346
           +  + ++   + + +S +D          DKL+I  T  L
Sbjct: 289 DDNNLKDYVNIRVGLSLTDALAAKRASILDKLNIRYTAEL 328


>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
 gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
          Length = 327

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 123/264 (46%), Gaps = 34/264 (12%)

Query: 88  FPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----VALFL 142
           FP+ G G+++ R++ + + ++ +P    I   TV  S +G      KP ++    ++L+L
Sbjct: 28  FPDTGRGVMSTRNLKEGDCIVSLPENLLITTTTVVNSHLGQYIKTWKPRLTPKQVLSLYL 87

Query: 143 IREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           I EK + +DS W  Y+  LP    +  ++S  E+  L    +   TL  ++ +QN Y  +
Sbjct: 88  IAEKSRGKDSFWYPYIQTLPTSYTTPSYFSTAEVDALPAL-VREATLRHRKVLQNSYKSL 146

Query: 202 EEEIILPNKQ-LFP---RPITLDDFLWAFGILRSRA-FSRLRGQNLV---------LIPL 247
           +    L N + LFP      TL  + WA+  + +R+ + R  G   +         L P 
Sbjct: 147 QTS--LHNLEPLFPDWKTVFTLKSYRWAWATVYTRSVYKRGPGWEFLDPSDPDVYALAPF 204

Query: 248 ADLINHSPGITTE-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
            D++NHSP + T+ D+    K          + ++T    +   QV I YD    N  L 
Sbjct: 205 LDMLNHSPLVQTDTDFNVSSK---------CYEVKTEGACRKYRQVFINYD-PYDNGRLL 254

Query: 307 LDYGFIESKSDRNAYTLTLEISES 330
           ++YGF+  ++  +  T T  + ++
Sbjct: 255 MEYGFVMPRNPHSVVTFTAAVKQN 278


>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
 gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
          Length = 287

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 73/268 (27%), Positives = 116/268 (43%), Gaps = 35/268 (13%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F+QWL      +   P++PA FP  G G++A + +   E++L +P +  I  D +  S I
Sbjct: 24  FFQWLHRNGCRN--VPLKPAVFPGTGRGMMATKALKHEELMLVIPQRLLITMDAIMDSYI 81

Query: 126 GSLCSG----LKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
                     L P  ++A+FL+ EK ++E S WR Y+DILPE      F++E++   L  
Sbjct: 82  APYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEEYSCPTFFTEDDFRLLPN 141

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP---RPITLDDFLWAFGILRSRAF--- 234
           +  L      K+Y  ++  K           LFP         DF WA+  +++RA    
Sbjct: 142 S--LRGKAKAKKYECHKEYKELAPFFKMLADLFPDQEDAFNFKDFKWAWSAIKTRALDVP 199

Query: 235 -SRLRGQNL---------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPV 284
             R   ++L          + PL D INH+       + Y  K   L S       RT  
Sbjct: 200 IGRESCRHLRDAEDTPTPTMFPLVDSINHAAQAKIR-HRYNEKSRCLES-------RTET 251

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFI 312
             +   +V+  Y     N  L L++GF+
Sbjct: 252 VYRRHAEVMNSYG-RADNDNLLLEFGFV 278


>gi|159465555|ref|XP_001690988.1| lysine N-methylase [Chlamydomonas reinhardtii]
 gi|158279674|gb|EDP05434.1| lysine N-methylase [Chlamydomonas reinhardtii]
          Length = 563

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 109/426 (25%), Positives = 172/426 (40%), Gaps = 78/426 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA-SEI 125
           F  WLR Q +++    +R     EG GLVA R + + E +L++P    + P   A  S +
Sbjct: 49  FQAWLRGQGIITQPLVLRQCGR-EGRGLVADRPLGRGEALLQLPDSLLLTPQRAAEESCL 107

Query: 126 GSLCSGLKP-----------------WISVALFLIREKKK-----EDSPWRVYLDILPEC 163
             L   L P                 W  +AL+L   + +       S W  Y+D+LP+ 
Sbjct: 108 APLLRQLSPAGASTSAAGAAALPLPEWSLLALYLAELRGRAAAGDRGSRWAAYVDMLPQR 167

Query: 164 TDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-LFPRPITLD- 220
             + + W  +E  +L +G+ LL     +       + ++   I     + L P  ++L  
Sbjct: 168 PGTVLDWPAKETRQLLRGSPLLRLADSIAAAAAASWEELAPLIARGRAEGLVPAHVSLSK 227

Query: 221 -DFLWAFGILRSRAFSRLRGQNL--VLIPLADLINHSPGITTE----------------- 260
            D  WAFG+L SR   RL G++   VL P ADL+NH   +  E                 
Sbjct: 228 ADLDWAFGVLLSRCI-RLPGRDQLQVLAPWADLLNHD--VNAETGAAAAGAAGSGATGSG 284

Query: 261 ---------DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
                    D+    +G           LRT     AG+QV + Y   KS+ EL L YGF
Sbjct: 285 ASGSGGCHLDWEPTARGGAGA-----LVLRTDRAYAAGQQVYVSYG-PKSSGELLLSYGF 338

Query: 312 I--ESKSDRNAYTLTLEISES---DPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ 366
               + +    Y L + +++S   DP    K ++   +GL  S  F + L   LP  +L 
Sbjct: 339 CPPPAANPHQDYKLLVGVNDSAAADPLAALKAEVLAKHGLPPSLEFPLKL-EGLPAGLLN 397

Query: 367 YLRLVALGGTDAFLLES----IFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
           YL  V         L      +F   ++  LD   + A      + + + C +AL  + T
Sbjct: 398 YLAFVEAAPQVPQELHDLGSLLFEEGVFPLLDGADTLAA---ALKSLGNRCTAALKAYPT 454

Query: 423 TIEEDE 428
           T+E D+
Sbjct: 455 TMEVDQ 460


>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
 gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
          Length = 478

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 109/470 (23%), Positives = 193/470 (41%), Gaps = 98/470 (20%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           +Q + F  WL  +  V     IR A       G G+VAQ DIA+ E +  +P +  +   
Sbjct: 15  SQSDEFISWLSGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVL--- 71

Query: 119 TVAASEIGSLCSG----LKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEE 173
           +   S++  L S     L PW+S+ L +I E    D S W  Y  ILP   D+ +FWS  
Sbjct: 72  STQNSKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRKFDTLMFWSPS 131

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRA 233
           EL ELQG+ ++   +G KE  +   L++   I+  N  LFP    L  +    G      
Sbjct: 132 ELQELQGSAIVD-RIG-KEGAEESILEMIAPIVRANPSLFPPVDGLASYDGDAGTQALLN 189

Query: 234 FSRLRGQNLV---------------------------------LIPLADLINHSPGITTE 260
            + + G  ++                                 ++PLADL+N        
Sbjct: 190 LAHVMGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSKGMVPLADLLNADA----- 244

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ESKSDRN 319
               +   A LF  +    ++   P+ AG ++   Y      A+L   YG++ ++ S  +
Sbjct: 245 ----DQNNARLFQEETGLVMKAIKPISAGAEIFNDYG-EIPRADLLRRYGYVTDNYSPYD 299

Query: 320 AYTLTLEI-----------SESDPF--FGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ 366
              L+LE+           +E+ P   F + L++ + +G       D  L   LP  ++ 
Sbjct: 300 VVELSLELICQAAGLENADTENQPVLQFLEDLEVLD-DGYDIPRPLDDDLAGVLPDELVL 358

Query: 367 YLRLVALGGTDAFLLESIFRNT----IWGHLDLPVSHANEELICRVVRDACKSALSGFHT 422
             + + +   +  L + + +N       GH ++            ++  A +S  + + T
Sbjct: 359 LAKTLCMSSDE--LKQQVSKNKPPKPSLGHKEV-----------TLLSKAIQSKQAQYTT 405

Query: 423 TIEEDEQLL-QGGNLDP---------RLRIAVGVRAGEKRVLQQIDEIFK 462
           T+ +D++LL Q   L+          R ++A+ VR GEK +LQ + ++ +
Sbjct: 406 TLAQDKELLAQLSQLEATTPLEDSARRQKMAIQVRIGEKEILQTLSDMLR 455


>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 451

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 81/325 (24%), Positives = 149/325 (45%), Gaps = 28/325 (8%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFP--EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
            E +  W +++  V    P+R ++ P  E  G+  + +I    +V+ VP +  +  D+  
Sbjct: 9   AEGYLDWAQEELGVVVHHPLRVSSTPGKEERGVFCEENIPAETIVVSVPWEALMTVDSAK 68

Query: 122 ASEI-GSLCSGLKPWISVALFLIREKK--KEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
            +   G + +G +    + L L+  +   KE SP + ++D+LP     T+F+S++EL  L
Sbjct: 69  GTPFEGLMEAGAREDDVLCLLLLYHRHILKERSPLKGHMDVLPREYHQTIFYSDDELELL 128

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPN-----------KQLFPRPITLDDFLWAFG 227
           +GT L + T+  K  V  ++ ++ E + LP+           +      +T +++LWA G
Sbjct: 129 RGTSLHAVTVQWKAQVDTDFREL-EALPLPSPRSEEGGSSTARDALEGFLTKEEYLWALG 187

Query: 228 ILRSRAFSRLR-GQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP 285
            + SR  +  R G+ L  + P+ D+ NH P +++  + Y+       S D L  L T   
Sbjct: 188 TVWSRFVTVERAGRGLKAMAPVFDMFNHGP-LSSTVHGYQ------ESNDCLH-LVTLQD 239

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNG 345
             +G +V   Y     N+ L L +GF    +   +  L   +    P F +K  I   NG
Sbjct: 240 WASGSEVKFSYG-PLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFAEKNKIMLDNG 298

Query: 346 LGESAYFDIVLGRTLPPAMLQYLRL 370
           +  S     +  + L P +L  LR+
Sbjct: 299 VDPSKRPFFLTSKGLDPLLLPALRV 323


>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
          Length = 440

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 76/309 (24%), Positives = 135/309 (43%), Gaps = 39/309 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     + + PA FP  G GL+++  +   ++++ +P    ++ DTV  S +G+
Sbjct: 38  KWLKDRKF--EDTNLIPARFPGTGRGLMSKTSLQVGQMIISLPESCLLSTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ LP+     V W E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLPKTYTCPVCW-EPEVVNLLPRP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRAFSRLR 238
           L +     +  VQ E+     +     + LF   +    T    LWA+  + +RA   LR
Sbjct: 155 LRAKAQEQRTRVQ-EFFTSFRDFFSSLQPLFSEAVENIFTYSALLWAWCTVNTRAV-YLR 212

Query: 239 GQNL----------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
            + L           L P  DL+NHSP +  +        A    +   + +      + 
Sbjct: 213 HRQLRCFSAEPDTCALAPYLDLLNHSPDVQVK--------AAFNEKTRCYEIVAVSSCRK 264

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAET 343
            E+V I Y  +  N  L L+YGF+ +++      ++ +I       +D     K+ I + 
Sbjct: 265 HEEVFICYGPH-DNHRLLLEYGFVSTRNPHACVYVSRDILVKYLPSTDKQMNKKISILKD 323

Query: 344 NGLGESAYF 352
           +   E+  F
Sbjct: 324 HDFIENLTF 332


>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
          Length = 228

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 88/187 (47%), Gaps = 18/187 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 36  EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 95

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 96  ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 155

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 156 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 212

Query: 255 PGITTED 261
            G+T ED
Sbjct: 213 NGLTPED 219


>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 68/261 (26%), Positives = 116/261 (44%), Gaps = 50/261 (19%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKED 150
           G G+ A +DI + +++  +P    ++  T +  +  S  S L  W  + L ++ E +K D
Sbjct: 38  GRGVTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSELSELSGWSPLILCMMYEIEKPD 97

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVK---EYVQNEYLKVEEEIIL 207
           S W+ Y D+LP    + +FW++E+L EL+GT ++S  +G K   E   NE     E II 
Sbjct: 98  SFWKPYFDVLPREFTTPMFWNQEDLKELEGTDIIS-KIGKKESEELFHNEL----EPIIK 152

Query: 208 PNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQN------------------------- 241
               LF  +  T++ F     ++ + +F+    +                          
Sbjct: 153 KYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEEEEEEEEEEEEE 212

Query: 242 ----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY- 296
               + ++P+AD++NH  G            A LF       +R    +K GEQ+   Y 
Sbjct: 213 EEGLISMVPMADMLNHKTGFN---------NARLFHEPDSLQMRAIKDIKEGEQIYNTYG 263

Query: 297 DLNKSNAELALDYGFIESKSD 317
           DL   NA+L   YGF++ K+D
Sbjct: 264 DL--CNADLLRKYGFVDEKND 282


>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
          Length = 397

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 91/395 (23%), Positives = 165/395 (41%), Gaps = 45/395 (11%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----VALFLIREK 146
           G GL+++  + + +V++ +P    +  DTV  S +G      KP +S    +  FL+ EK
Sbjct: 16  GRGLMSKASLQEGQVMISLPESCLLTTDTVIRSSLGPYIKKWKPPVSPLLALCTFLVSEK 75

Query: 147 KKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
                S W+ YLDILP+     V   E E+V+L  + L +     +  VQ+ +       
Sbjct: 76  HAGCRSLWKSYLDILPKSYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSA-RGF 133

Query: 206 ILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRLRGQNLV--------LIPLADLIN 252
               + LF  P+    +   FLWA+  + +RA + R R Q  +        L P  DL+N
Sbjct: 134 FSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDLLN 193

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           HSP +  +        A    +   + +RT    +  ++V I Y  +  N  L L+YGF+
Sbjct: 194 HSPHVQVK--------AAFNEKTRCYEIRTASRCRKHQEVFICYGPH-DNQRLLLEYGFV 244

Query: 313 ESKSDRNAYTLTLE-----ISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
             ++      ++ +     +  +D     K+ I + +G   +  F    G   P   L  
Sbjct: 245 SVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLTF----GWDGPSWRLLT 300

Query: 368 LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
              +     + F   + ++  + G +   +S  NE+    V +  C   +   H  + + 
Sbjct: 301 ALKLLCLEAERF---TSWKKVLLGEV---ISDTNEKTSLGVAQKICSDVIEETHAVLRKV 354

Query: 428 EQLLQGG-NLDPRLRIAVGVRAGEKRVLQQIDEIF 461
             + +G  +L  +L +   +R  E R+LQ   EI 
Sbjct: 355 SDMKEGTVSLRSQLSLVEALRMEELRILQASAEIL 389


>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
 gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
 gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
 gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
 gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
 gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
 gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
          Length = 296

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 88/187 (47%), Gaps = 18/187 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITTED 261
            G+T ED
Sbjct: 281 NGLTPED 287


>gi|258569485|ref|XP_002543546.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237903816|gb|EEP78217.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 480

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 110/445 (24%), Positives = 187/445 (42%), Gaps = 83/445 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATF-----PEGLGLVAQRDIAKNEVVLEVPMKFWIN-PDTV 120
           F  WL+    V     IR A         G+  VA  +IA++E +  +P    ++  ++ 
Sbjct: 25  FMNWLKSHPGVRVSFKIRIADLRSNAAGRGVETVACEEIAQDEELFAIPENLVLSVQNSK 84

Query: 121 AASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
               +      L  W+S+ + +I E      S W  Y  +LP   D+ +FWS++EL ELQ
Sbjct: 85  LKDHLNFTDKELDSWLSLIVTMIYEYLHGGASRWSSYFAVLPTDFDTLMFWSQDELRELQ 144

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF----------------- 222
           G+ +LS  +G +E  +    KV   +IL    LFP P  L  F                 
Sbjct: 145 GSSVLS-KIGRQEADEMIMGKV-YPLILDYPGLFPTPKELSSFNSQQGKEAILHLAHRMG 202

Query: 223 ----LWAFGILR--SRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDYAYE 265
                +AF I     R      G++             ++PLAD++N             
Sbjct: 203 TLIMAYAFDIENEMDREEEDQDGEDGYITDNEQETAKGMVPLADMLNADA---------H 253

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ESKSDRNAYTLT 324
              A LF  D  F +++ VP+   E++   Y      A+L   YG+I E+ S  +   ++
Sbjct: 254 RNNARLFQEDGYFIMKSIVPISMEEEIFNDYG-ELPRADLLRRYGYITENYSPYDVVEIS 312

Query: 325 LE-------ISESDPFFGDKLDIAETNGLGESAY------FDIVLGRTLPPAMLQYLRLV 371
           LE       + ++ P    +L++ ET  + E  Y       D  L   + P ++  LR +
Sbjct: 313 LEAICKVAGVEKNCP----QLELLETAEILEDGYSLLRPETDANLVEAISPELIVLLRTL 368

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLP-VSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
            +        +++ +  + G L  P +  A+ +L+  V+    +S  + + TTI +D++L
Sbjct: 369 TMTP------DNLNQMRVKGKLPKPQLDQASAKLLIEVL----QSRQNDYPTTIAQDDEL 418

Query: 431 LQGGNLDPRLRIAVGVRAGEKRVLQ 455
           L   +   R  +A+ VR GEK VLQ
Sbjct: 419 LHSTH-SHRREMAIRVRKGEKEVLQ 442


>gi|195168946|ref|XP_002025291.1| GL13316 [Drosophila persimilis]
 gi|194108747|gb|EDW30790.1| GL13316 [Drosophila persimilis]
          Length = 568

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 135/311 (43%), Gaps = 51/311 (16%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A+V  F +W +   V +    I  A FP   LGL A +DIA  + VL VP     + + +
Sbjct: 134 AKVTAFSEWAKAGGVKTDCLEI--AIFPGYQLGLRATQDIAAEQPVLSVPRTLIFSEEHL 191

Query: 121 AASEIGSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
             ++    C+  L    ++A  L+ EK +  DS WR Y+D+LP   ++ +++S E++  L
Sbjct: 192 PETDRKLFCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLYFSIEQMQRL 251

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEE-EIILPNKQ-------LFPRP-ITLDDFLWAFGIL 229
           +GT   ++ L     +  +Y  + +   I P+         LF +  +  + + WA   +
Sbjct: 252 RGTAACTSALRQCRVIARQYANMYKCAHIRPDASSASSMGVLFTQHGLCYELYRWAVSTV 311

Query: 230 RSRAFSRLRGQNLV--------------------LIPLADLINHSPGITTEDYAYEIKGA 269
            +R       QNLV                    LIP  D+ NH PG  T  Y      +
Sbjct: 312 MTR-------QNLVPRELQANDDGDDLSQLPISALIPYWDMANHRPGKITSYY-----DS 359

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           G+   D           KAGEQ  I Y  ++SNA+L +  GFI+  + ++   + L +  
Sbjct: 360 GVHQMDCT----AQEACKAGEQFFIYYG-DRSNADLLVHNGFIDVNNRKDYVKIRLGLGL 414

Query: 330 SDPFFGDKLDI 340
           SD     +  I
Sbjct: 415 SDALVEQRAKI 425


>gi|198470241|ref|XP_001355267.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
 gi|198145358|gb|EAL32324.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
          Length = 568

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 135/311 (43%), Gaps = 51/311 (16%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A+V  F +W +   V +    I  A FP   LGL A +DIA  + VL VP     + + +
Sbjct: 134 AKVTAFSEWAKAGGVKTDCLEI--AIFPGYQLGLRATQDIAAEQPVLSVPRTLIFSEEHL 191

Query: 121 AASEIGSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
             ++    C+  L    ++A  L+ EK +  DS WR Y+D+LP   ++ +++S E++  L
Sbjct: 192 PETDRKLFCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLYFSIEQMQRL 251

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEE-EIILPNKQ-------LFPRP-ITLDDFLWAFGIL 229
           +GT   ++ L     +  +Y  + +   I P+         LF +  +  + + WA   +
Sbjct: 252 RGTAACTSALRQCRVIARQYANMYKCAHIRPDASSASSMGVLFTQHGLCYELYRWAVSTV 311

Query: 230 RSRAFSRLRGQNLV--------------------LIPLADLINHSPGITTEDYAYEIKGA 269
            +R       QNLV                    LIP  D+ NH PG  T  Y      +
Sbjct: 312 MTR-------QNLVPRELQANDDGDDLSQLPISALIPYWDMANHRPGKITSYY-----DS 359

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           G+   D           KAGEQ  I Y  ++SNA+L +  GFI+  + ++   + L +  
Sbjct: 360 GVHQMDCT----AQEACKAGEQFFIYYG-DRSNADLLVHNGFIDVNNRKDYVKIRLGLGL 414

Query: 330 SDPFFGDKLDI 340
           SD     +  I
Sbjct: 415 SDALVEQRAKI 425


>gi|440472932|gb|ELQ41762.1| SET domain-containing protein 8 [Magnaporthe oryzae Y34]
 gi|440478704|gb|ELQ59514.1| SET domain-containing protein 8 [Magnaporthe oryzae P131]
          Length = 478

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 133/312 (42%), Gaps = 39/312 (12%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNE-----VVLEVPMKFWINPD 118
           ++T   W R   VV   + I   T  +G GLVAQRD+   +     V+L VP +  +N +
Sbjct: 8   IDTLLIWARFNGVVFDGAAI-TQTEGKGYGLVAQRDLQAKDGEDTTVLLSVPRELLLNSE 66

Query: 119 TVAAS--------EIGSLCSGLKPWISVALFLIR---------EKKKEDSPWRVYLDILP 161
            V           ++        P   V LFL+          E     +PW  Y+  LP
Sbjct: 67  YVEQCSKTDQRFRDLFDAAGHQSPRQDVILFLMAQIIHIWASDEGGGVSNPWTQYIKYLP 126

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPI- 217
                   W+E+E   L+GT L +        ++NE+   L+   EI   N+ L  + + 
Sbjct: 127 RTVPLPTLWNEDERQLLRGTSLEAAVHSKLRALENEFDNLLEKAAEIPSWNEALCEKQVV 186

Query: 218 TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL 277
           T+ D+       RSR    L      ++P  D++NH+  I    Y   +K +      + 
Sbjct: 187 TVSDYARLDAWYRSRCM-ELPASGPTMVPCIDMVNHA-AIPNASY---VKSSDC---GVN 238

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDK 337
             LR+   VK+G+++ I Y   KS AE+   YGF++S++  +   L       +P   D 
Sbjct: 239 LCLRSGALVKSGQEITISYGEKKSAAEMLFSYGFVDSEAAGDEKILV----PVEPPGDDP 294

Query: 338 LDIAETNGLGES 349
           L +A+T   GE+
Sbjct: 295 LVMAKTRIYGEA 306


>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
          Length = 440

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 109/436 (25%), Positives = 183/436 (41%), Gaps = 79/436 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDI-AKNEVVLEVPMKFWINPDTVAASE 124
           F +WL+D+      S +RPA F + G GL+  + +    ++++ +P K  +  DTV +S 
Sbjct: 35  FRKWLKDRGF--EDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSSC 92

Query: 125 IGSLCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           +G      KP +S    +  FLI EK   E S W+ YLD+LP+ T S     E ++V L 
Sbjct: 93  LGEYIMKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-TYSCPVCLEHDVVSLL 151

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP--RPITLDD---------FLWAFGI 228
              L       ++  Q +  KV  E+ + +K  F   +P+  ++           WA+  
Sbjct: 152 PEPL-------RKKAQEQRTKV-HELYISSKAFFSSLQPLFAENTETIFNYSALEWAWCT 203

Query: 229 LRSRAFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
           + +R       Q            L P  DL+NHSP +  +        A    +   + 
Sbjct: 204 INTRTIYMKHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVK--------AAFNEQTRSYE 255

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-----ESKSDRNAYTLTLEISESDPFF 334
           +RT    K  E+V I Y  +  N  L L+YGF+      S    ++ TL       D   
Sbjct: 256 IRTNSLCKKYEEVFICYGPH-DNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDKQR 314

Query: 335 GDKLDIAETNGLGESAYFDIVLGRTLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHL 393
             K+ I + + L E    ++  G   P   +L  L++++LG  D F   + +R T+ G +
Sbjct: 315 NAKVSILKDHDLLE----NLTFGWDGPSWRLLTALKVLSLGA-DEF---TCWRRTLLGDV 366

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL--------QGGNLDPRLRIAVG 445
              +S  NE+    +    C       H  IEE + +L           NL   L +   
Sbjct: 367 ---ISARNEQQALNITAKIC-------HFLIEETQHVLLQISQLKRNNENLKKHLPLVEA 416

Query: 446 VRAGEKRVLQQIDEIF 461
           +R  + ++L++  EI 
Sbjct: 417 LRLEDLKILEKSAEIL 432


>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
          Length = 390

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/305 (22%), Positives = 131/305 (42%), Gaps = 32/305 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG-----LKPWISVALFLIRE 145
           G GL A  + A  E +L +P    ++ ++   S +G+         L P +++A  L+ E
Sbjct: 53  GQGLKATAEAAPGETLLRIPEACMLSEESARRSTLGAYMDSDTMLKLMPNVTLAFHLLLE 112

Query: 146 KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
               DS WR Y+  LP      ++W   +L+ L+G+ L    + + ++V  +Y  +  ++
Sbjct: 113 LHDLDSFWRPYIACLPVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKL 172

Query: 206 IL---PNKQLF-------PRPITLDDFLWAFGILRSRAFSRLR----GQ---NLVLIPLA 248
            +   P+   F       P   T +D+ WA   + +R  S  +    GQ    L LIPL 
Sbjct: 173 SVRANPSCSCFPLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIPLW 232

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
           D+INH+    +  +  E        R+ L     P P K G Q+ + Y  +++N +  L 
Sbjct: 233 DMINHANHPMSTQFDSE--------RECL-EFVCPAPAKPGSQITMWYG-DRNNGQFLLH 282

Query: 309 YGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYL 368
            GF  +    +   +   + E+D  +  K  +     +  +  F +     L P +L + 
Sbjct: 283 QGFFFAGHANDYVNVPFSLDETDSLYKIKALLLRNLTVPPAGDFVLSTDDQLDPQLLAFA 342

Query: 369 RLVAL 373
           R+ ++
Sbjct: 343 RVASM 347


>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
 gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
 gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
          Length = 307

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 119/263 (45%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKFQD--SNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+V I Y  +  N  L L+YGF+
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFV 287


>gi|389644184|ref|XP_003719724.1| SET domain-containing protein 8 [Magnaporthe oryzae 70-15]
 gi|351639493|gb|EHA47357.1| SET domain-containing protein 8 [Magnaporthe oryzae 70-15]
          Length = 478

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 133/312 (42%), Gaps = 39/312 (12%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNE-----VVLEVPMKFWINPD 118
           ++T   W R   VV   + I   T  +G GLVAQRD+   +     V+L VP +  +N +
Sbjct: 8   IDTLLIWARFNGVVFDGAAI-TQTEGKGYGLVAQRDLQAKDGEDTTVLLSVPRELLLNSE 66

Query: 119 TVAAS--------EIGSLCSGLKPWISVALFLIR---------EKKKEDSPWRVYLDILP 161
            V           ++        P   V LFL+          E     +PW  Y+  LP
Sbjct: 67  YVEQCSKTDQRFRDLFDAAGHQSPRQDVILFLMAQIIHIWASDEGGGVSNPWTQYIKYLP 126

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPI- 217
                   W+E+E   L+GT L +        ++NE+   L+   EI   N+ L  + + 
Sbjct: 127 RTVPLPTLWNEDERQLLRGTSLEAAVHSKLRALENEFDNLLEKAAEIPSWNEVLCEKQVV 186

Query: 218 TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL 277
           T+ D+       RSR    L      ++P  D++NH+  I    Y   +K +      + 
Sbjct: 187 TVSDYARLDAWYRSRCM-ELPASGPTMVPCIDMVNHA-AIPNASY---VKSSDC---GVN 238

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDK 337
             LR+   VK+G+++ I Y   KS AE+   YGF++S++  +   L       +P   D 
Sbjct: 239 LCLRSGAVVKSGQEITISYGEKKSAAEMLFSYGFVDSEAAGDEKILV----PVEPPGDDP 294

Query: 338 LDIAETNGLGES 349
           L +A+T   GE+
Sbjct: 295 LVMAKTRIYGEA 306


>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
           troglodytes]
 gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
 gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
 gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
 gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
          Length = 307

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 119/263 (45%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKFQD--SNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK     S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +  +        L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQRECLSAEPDTCALAPYLDLLNHSPHVQVK--------AAFNEETHSYEIRTTSRWRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+V I Y  +  N  L L+YGF+
Sbjct: 266 EEVFICYGPH-DNQRLFLEYGFV 287


>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Sus
           scrofa]
          Length = 326

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/222 (30%), Positives = 108/222 (48%), Gaps = 20/222 (9%)

Query: 212 LFPRPI-TLDDFLWAFGILRSRA--FSRLRGQNLVLIPLADLINHSPGITT-EDYAYEIK 267
            FP  +  ++ F W+FGIL SR      + G+N  L+P AD++NHS  + T  DY    K
Sbjct: 84  FFPEEVFNIESFKWSFGILFSRMVRLPSMDGKN-ALVPWADMMNHSCEVETFLDYDKSSK 142

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTL 325
           G       ++F   T  P + GEQV I Y   KSN EL L YGF+  E  +  ++  L+L
Sbjct: 143 G-------IVFP--TDRPYQPGEQVFISYG-KKSNGELLLSYGFVPKEGTNPSDSVELSL 192

Query: 326 EISESDPFFGDKLDIAETNGLGESAYFDI-VLGRTLPPAMLQYLRLVALGGTDAF-LLES 383
            + +SD  + +KL++ +  GL  S  F I V G  L      YL +        F  + +
Sbjct: 193 SLKKSDESYKEKLELLKKYGLSGSQCFPIRVTGWPLELMAYAYLAVSPSSMRGKFEEMAA 252

Query: 384 IFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
              N      DL      E+ + + + D+C+S++S ++  ++
Sbjct: 253 AASNKTTSKKDLRYPEIEEQAL-QFILDSCESSISKYNKFLQ 293


>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
          Length = 440

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+      + + PA FP  G GL+++  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRNF--EDTNLMPARFPGTGRGLMSKTSLREGQMIISLPGSCLLTTDTVIRSSLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                K    P +++  FL+ EK   D S W+ YLDILP+     V   E E+V L    
Sbjct: 96  YIIKWKPPPSPLLALCTFLVSEKHAGDQSVWKPYLDILPKSYTCPVCL-EPEVVNLLPEP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP----ITLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ ++     +     + LF        +    LWA+  + +RA + R 
Sbjct: 155 LKAKAEEQRMSVQ-QFFASSRDFFSSLQPLFEEATDSVFSYSALLWAWCTVNTRAVYLRT 213

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R ++ +        L P  DL+NHSP +  +    E  G         + +RT    +  
Sbjct: 214 RRRDCLSLEPDTCALAPYLDLLNHSPNVQVKAAFNEETGC--------YEIRTASDYRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           ++V I Y  +  N  L L+YGF+   +      ++ EI       +D     K+ I + +
Sbjct: 266 KEVFICYGPH-DNHRLLLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDH 324

Query: 345 GLGESAYF 352
           G  E+  F
Sbjct: 325 GFLENLTF 332


>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
          Length = 583

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/299 (25%), Positives = 131/299 (43%), Gaps = 36/299 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL---CSGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+PM   I+ + V  S++  +     G+     + L+ ++EK 
Sbjct: 206 GRGAIATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKH 265

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
             +S +  Y + LPE  ++ + +  + ++ L GT LL   +  K+++  +Y ++   +  
Sbjct: 266 NSNSKFNTYFNALPEAFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALCK 325

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGITTE 260
            +  +FP    T + FLWA  +  S          +LR     LIP+A  +NHS      
Sbjct: 326 DHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRT---CLIPIAGFLNHSL----- 377

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
            Y + +    + S+          P   GEQ  + Y  N S++ L   YGFI  + D   
Sbjct: 378 -YPHIMHYGKVDSKTNSLKFCVSKPCNMGEQCYLSYG-NFSSSHLVTFYGFI-PQGDNLY 434

Query: 321 YTLTLEISESDPFFGDKLDIAETNGLGESAYFDI----------VLGRTLPPAMLQYLR 369
            T+ LEI       GD  +  E + + +SA   +          +    LPP +L +LR
Sbjct: 435 DTIPLEIDNPQ---GDCPE--EFHPMSDSATHMVRGTWLSNNHEIFHYGLPPPLLDHLR 488


>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
 gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
          Length = 593

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 131/308 (42%), Gaps = 47/308 (15%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPM 111
           A  T     +++E F +W R   V +    I  ATFP   LGL A RDI   E VL VP 
Sbjct: 164 AQLTEQTRLSKIEAFNEWARAGGVKTDCVEI--ATFPGYQLGLRATRDIKAGEQVLSVPR 221

Query: 112 KFWINPDTVAASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFW 170
           K   + + +   +   L       + V   LI EK +  DSPW+ ++D LP   ++ +++
Sbjct: 222 KLIFSEELLPEKQ-RQLFRNFPTHLKVTYTLIMEKLRGADSPWQPFIDTLPSRYNTVLYF 280

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEY--------LKVEEEIILPNKQLFP-RPITLDD 221
           + E++  L+GT   S  +     +   Y        +++++ ++     LF    +  + 
Sbjct: 281 TVEQMQRLRGTSACSAAVRHCRVIARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYEL 340

Query: 222 FLWAFGILRSRAFSRLRGQNLV----------------LIPLADLINHSPGITTEDYAYE 265
           + WA   + +R       QNLV                LIP  D+ NH  G  T  Y   
Sbjct: 341 YRWAVSTVTTR-------QNLVPRQEIPSDAANLPISALIPYWDMANHRSGKITSFYD-- 391

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
            + AG                K+GEQ  I Y  ++SNA+  +  GF++ ++ ++   + L
Sbjct: 392 -QAAG------QMECTAQEAYKSGEQYFIYYG-DRSNADRLVHNGFVDMQNPKDYVQIRL 443

Query: 326 EISESDPF 333
            +S +D  
Sbjct: 444 GLSPTDAL 451


>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
          Length = 566

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/299 (25%), Positives = 131/299 (43%), Gaps = 36/299 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL---CSGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+PM   I+ + V  S++  +     G+     + L+ ++EK 
Sbjct: 189 GRGAIATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKH 248

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
             +S +  Y + LPE  ++ + +  + ++ L GT LL   +  K+++  +Y ++   +  
Sbjct: 249 NSNSKFNTYFNALPEAFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALCK 308

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGITTE 260
            +  +FP    T + FLWA  +  S          +LR     LIP+A  +NHS      
Sbjct: 309 DHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRT---CLIPIAGFLNHSL----- 360

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
            Y + +    + S+          P   GEQ  + Y  N S++ L   YGFI  + D   
Sbjct: 361 -YPHIMHYGKVDSKTNSLKFCVSKPCNMGEQCYLSYG-NFSSSHLVTFYGFI-PQGDNLY 417

Query: 321 YTLTLEISESDPFFGDKLDIAETNGLGESAYFDI----------VLGRTLPPAMLQYLR 369
            T+ LEI       GD  +  E + + +SA   +          +    LPP +L +LR
Sbjct: 418 DTIPLEIDNPQ---GDCPE--EFHPMSDSATHMVRGTWLSNNHEIFHYGLPPPLLDHLR 471


>gi|335300684|ref|XP_003358991.1| PREDICTED: SET domain-containing protein 4 [Sus scrofa]
          Length = 440

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 98/425 (23%), Positives = 178/425 (41%), Gaps = 62/425 (14%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+      + + PA FP  G GL+++  + + ++V+ +P    +  DTV  S +G 
Sbjct: 38  KWLKDRNFED--TNLIPARFPGTGRGLMSKTSLQEGQLVIALPESCLLTTDTVLRSYLGP 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D SPW+ YL++LP+ T +     E E+V L    
Sbjct: 96  YIAKWQPPPSPLLALCTFLVSEKHAGDQSPWKPYLEVLPK-TYTCPVCLEPEVVNLLPGP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L S     +  V  E+     +     + LFP  +    +    LWA+  + +RA + + 
Sbjct: 155 LKSKAREQRTRVW-EFFSSSRDFFSSLQPLFPEAVESIFSYSALLWAWCTVNTRAVYMKQ 213

Query: 238 RGQN--------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +           L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RPRQCFSTEPDTCALAPYLDLLNHSPAVQVK--------AAFNEESRCYEIRTGTSCRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETN 344
           E+V I Y  + S+  L L+YGF+  ++      +  +I       +D     K+ I + +
Sbjct: 266 EEVFICYGPHGSH-RLLLEYGFVSPRNPHACVYVPKDILVKYLPSTDKQMNKKISILKDH 324

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
              E    ++  G   P   L     +   G + F   + ++  + G +   +S A+E+ 
Sbjct: 325 DFIE----NLTFGWDGPSWRLLTALKLLCLGAEDF---TCWKKVLLGEV---ISDAHEKT 374

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQGGN--------LDPRLRIAVGVRAGEKRVLQQ 456
              + +  C       H  IEE   +LQ  +        L  +L +   +R  E ++LQ 
Sbjct: 375 SLDIAQKIC-------HYFIEETNAMLQKISHMKDEDVALINQLTLVETLRTEELKILQA 427

Query: 457 IDEIF 461
             EI 
Sbjct: 428 STEIL 432


>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 889

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 85/355 (23%), Positives = 136/355 (38%), Gaps = 68/355 (19%)

Query: 88  FPEGLGLVAQRDIAKNEVVLEVPMKFWINPD-TVAASEIGSL--CSGLKPWISVALFLI- 143
            PEG GLVA R + K+E +L VP +  +  D  +  S  G L    G+  W  +A FL  
Sbjct: 54  IPEGRGLVATRRVRKHEKLLNVPAQLLLTADVALQHSAYGGLLESCGVPAWSVLATFLAE 113

Query: 144 --REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
             R+ + + + W  Y+D LP  T   + W+ EE+  L+GT  +       E +      V
Sbjct: 114 TRRQPEGDKNVWGQYVDALPSQTGCVLEWASEEVDLLRGTAAMRAA---DEIIAACSASV 170

Query: 202 EE--EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITT 259
            E   I+  +  +   P+T  D  W F +L SR   RL G+                   
Sbjct: 171 AELAPILRESASMPGGPLTEQDLRWGFSMLLSRLI-RLPGKQ------------------ 211

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES--KSD 317
                               L   V V  G+         KS+ +L L YGF+ +   + 
Sbjct: 212 -------------------DLEACVYVSYGQ---------KSDTQLLLSYGFMPAPLSNP 243

Query: 318 RNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTD 377
            +A  L L +   DP F  K  + E  G      F + L  +LP  ++ Y   +     D
Sbjct: 244 HSACNLRLSLQRDDPCFDAKRALLEEAGHSACMEFPLRLD-SLPQKLINYAAFLCTEAPD 302

Query: 378 AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
             ++  I +  +       V  +  +    ++   CK AL+    T+  D++ L 
Sbjct: 303 RRVVSQIDQTGV-------VDGSARKGAINLITKECKVALTAIPATLAADQEALN 350


>gi|115466126|ref|NP_001056662.1| Os06g0127200 [Oryza sativa Japonica Group]
 gi|55295876|dbj|BAD67744.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|55296208|dbj|BAD67926.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|113594702|dbj|BAF18576.1| Os06g0127200 [Oryza sativa Japonica Group]
 gi|218197481|gb|EEC79908.1| hypothetical protein OsI_21454 [Oryza sativa Indica Group]
 gi|222634879|gb|EEE65011.1| hypothetical protein OsJ_19962 [Oryza sativa Japonica Group]
          Length = 557

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 95/190 (50%), Gaps = 14/190 (7%)

Query: 59  PPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGL--VAQRDIAKNEVVLEVPMKFWIN 116
           P  A++++F QWL+          IR     EG G+   A    A +EVV+ VP+   I 
Sbjct: 11  PGDAKLDSFLQWLQANGADLRGCTIRRCG-REGYGVFSTAAEAGATDEVVMVVPLDLAIT 69

Query: 117 PDTVAASE-IGSLC------SGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVF 169
           P  V     +G  C       G+   + V LFL+ E+ +  S W+ YLD+LP    S+++
Sbjct: 70  PMRVLQDPLVGPRCRALFEEGGVDDRLLVMLFLMVERLRPSSLWKPYLDMLPSTFGSSIW 129

Query: 170 WSEEELVELQGTQLLSTTLGVKEYVQ----NEYLKVEEEIILPNKQLFPRPITLDDFLWA 225
           ++E+EL EL+GT L   T+  ++ +Q    N+   +  E++  ++      +  +DFLWA
Sbjct: 130 FTEDELAELEGTTLHRATVMQRKSLQTLFDNKVKGLVGELLNVDESGSSIEVRFEDFLWA 189

Query: 226 FGILRSRAFS 235
             I  +RA +
Sbjct: 190 NSIFWTRALN 199


>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 100/414 (24%), Positives = 177/414 (42%), Gaps = 46/414 (11%)

Query: 44  FAATCSLHSASATTNPPTAQV-------ETFWQWLRDQKVVSPKSPIRPATFPEGL-GLV 95
           F   C   S     + P  Q        +   +WL+D K    K  I   +  EG   L 
Sbjct: 14  FLTLCQDDSEYEGNSKPKRQTYDDPDPYKNLIEWLKDGKAEISKVQIEVQS--EGHRTLR 71

Query: 96  AQRDIAKNEVVLEVPMKFWINPDTVAASEIGS---LCSGLKPWISVALF---LIREKKKE 149
           A + I + E VL +P   +++ + V  S + +   +    KP      F   L++E +++
Sbjct: 72  ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQINYKPNNIQTYFVNHLLQENRRK 131

Query: 150 DSPWRVYLDILP-ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
            S W+ Y+D+LP + +    ++  E+   L+G+  L T +  ++  + EY  ++E +   
Sbjct: 132 YSFWKPYIDVLPKDVSGFPTYFDAEQDALLKGSPTLFTVINQRKVFKEEYENLKEAV--- 188

Query: 209 NKQLFPRPITLDDFLWAFGILR-SRAFSRLRG---QNLVLIPLADLINHSPGITTEDYAY 264
            K+      T DDF+  F IL  SR+F+   G   Q  +L+PLAD INH      + Y Y
Sbjct: 189 -KEFQKYGYTYDDFI-KFRILTISRSFTVQIGEKEQQQLLVPLADFINHDNNGFLK-YGY 245

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
                G F       ++    ++ GE++   Y    SN    ++YGF    +  N + L 
Sbjct: 246 SKDADGFF-------MQAVRNIQKGEELFYNYG-QWSNKYFFMNYGFASLTNPMNQFDLD 297

Query: 325 LEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYL---RLVALGGTDAFL- 380
           + ++++D  F  K+ + +    G   + + ++  T      Q L   R   +   D FL 
Sbjct: 298 ICLNKNDRLFNLKISLTK----GNMCWGNRLVNETDHDTFRQSLATVRFTQISKLDDFLQ 353

Query: 381 LESIFRN--TIWGHLDLPVSHAN-EELICRVVRDACKSALSGFHTTIEEDEQLL 431
           LE   +N    W      +     E+   + ++    + L  F +TIE+DE+ L
Sbjct: 354 LEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNFASTIEDDERRL 407


>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
          Length = 565

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 158/378 (41%), Gaps = 66/378 (17%)

Query: 121 AASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILP-ECTDSTVFWSEEELVELQ 179
           A+ E+ S+ SG +P + ++L   R    E S W   L   P E  +   FW  E+L EL+
Sbjct: 186 ASGELSSMISGDEPLLVLSLIYER-YVAETSHWNELLLSCPGEYPNVPSFWDWEDLAELE 244

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILP------------NKQLFPRPITLDDFLWAFG 227
           G  +L   L  K   Q    + E   +LP             K  F    +++  +WA  
Sbjct: 245 GLDVLDDVLAKK--AQLAQFQTETMAVLPFIHEALAGGCRFGKDEFLECFSIEAMMWARA 302

Query: 228 ILRSRAFS-RLRGQNLV-LIPLADLINHSPGITT---------EDYAYEIKGAGLFSRDL 276
              SRAF+  + G+ ++ L+P+AD+INH                D+  +I GA L ++D+
Sbjct: 303 TFDSRAFNLNVDGRVVIALVPVADMINHHNRSDVLVRRVEPNGGDFVMQI-GASLTAQDI 361

Query: 277 LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISES---DPF 333
                       G ++ + Y     N EL   YGF+   ++ +     L+  E+   D +
Sbjct: 362 ------------GREIWMSYG-PLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDEW 408

Query: 334 FGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL 393
            G +  +    GL  +    I      PPA++  LR V L   + F  +++ RN  +  L
Sbjct: 409 DGRRAALVAKYGLHLAGCCWICHDGRPPPALVALLR-VHLAEAEEF--DTMERNGPFASL 465

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN---------------LDP 438
                   E  +   + D  +  L  F T++EEDE+LL+ G+               L  
Sbjct: 466 ----GAGTEARVFATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSC 521

Query: 439 RLRIAVGVRAGEKRVLQQ 456
             R+A+ +R G KR+  +
Sbjct: 522 NKRLAILLRVGMKRIAHR 539


>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
 gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 353

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 106/213 (49%), Gaps = 16/213 (7%)

Query: 138 VALFLIREKKK-EDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYV 194
           +A  LIREKK  + S W  Y+  LP+  +  S++FW E+EL  ++ + +   T+  K  +
Sbjct: 2   LAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQI 61

Query: 195 QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH- 253
           + ++  V +        +  RP  L+DF++A+ ++ SRA+     + + LIP AD +NH 
Sbjct: 62  EKDFSFVAQAFKQHCPIVTERP-DLEDFMYAYALVGSRAWEN--SKRISLIPFADFMNHD 118

Query: 254 --SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
             S  I   D   E      FS   + + R   P   G++V I+Y    SNA L LD+GF
Sbjct: 119 GLSASIVLRD---EDNQLSEFSTLQVTADRNYSP---GDEVFIKYG-EFSNATLMLDFGF 171

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETN 344
               +  +   + +++   DP    KL + +T+
Sbjct: 172 TFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTH 204


>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 524

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 138/327 (42%), Gaps = 51/327 (15%)

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKED- 150
           L L+  + + K EVV+ +P+   +  D+V                 +AL L+ E++K D 
Sbjct: 56  LSLLTGQALNKGEVVMSIPISLCMTVDSV-----------------LALHLMAERRKGDG 98

Query: 151 SPWRVYLDILPECTDSTVFW----SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           S W+ YL  LP+  D+ + W    +EEE   L GT +   +  +   V+ ++    EE  
Sbjct: 99  SFWKQYLRTLPDDVDTPLRWLVEQAEEEFRLLDGTMVGLLSRMMHSQVRKDW----EEFH 154

Query: 207 LPNKQLFPR---PITLDDFLWAFGILRSRAFSRLR-------GQNLVLIPLADLINHSPG 256
           LP  +  P     +T +D+LWA   + SR+F                ++P+ +  NH P 
Sbjct: 155 LPLVEAHPEILGGVTFEDYLWAMSSIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPS 214

Query: 257 IT---TEDYAYEIKGAGL-FSRDLLFSLRTPVPVKAG------EQVLIQYDLNKSNAELA 306
                +E   ++ +  GL          R  + V AG      EQ  I Y    SNA+L 
Sbjct: 215 AADSLSEMIEFQAQEGGLSMGIGEPGRARGTLRVSAGRDYAAREQFFILYG-RYSNAKLL 273

Query: 307 LDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD----IVLGRTLPP 362
             YGF+ + +        + + ++DP F  K  + + + L  +  +D    +  G  + P
Sbjct: 274 YSYGFVLASNPYGGLDYWVRVPQTDPGFAWKQALLDEHPLTAAQAYDFSGTVRAGGWISP 333

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTI 389
           A+L  +R+  L   +  + E  F   +
Sbjct: 334 ALLATVRVAQLTADERPVAEKAFEGKM 360


>gi|71656153|ref|XP_816628.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70881769|gb|EAN94777.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 155/375 (41%), Gaps = 66/375 (17%)

Query: 121 AASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILP-ECTDSTVFWSEEELVELQ 179
           A+ E+ S+ SG +P + ++L   R    E S W   L   P E  +   FW  E+L EL+
Sbjct: 186 ASGELSSMISGDEPLLVLSLIYER-YVAETSHWNELLFSCPGEYPNVPTFWDWEDLAELE 244

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILP------------NKQLFPRPITLDDFLWAFG 227
           G  +L   L  K   Q    + E   +LP             K  F    +++  +WA  
Sbjct: 245 GLDVLDDVLAKK--AQLAQFQTETMAVLPFIHEALAGSCRLGKDEFLECFSIEAMMWART 302

Query: 228 ILRSRAFS-RLRGQNLV-LIPLADLINHSPGI---------TTEDYAYEIKGAGLFSRDL 276
              SRAF+  + G+ ++ L+P+AD+INH                D+  +I GA L ++D+
Sbjct: 303 TFDSRAFNLNVDGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDFVMQI-GASLTAQDI 361

Query: 277 LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISES---DPF 333
                       G ++ + Y     N EL   YGF+   ++        +  E    D +
Sbjct: 362 ------------GREIWMSYG-PLQNWELLQFYGFVVEGNEHERLPFPFDFPEGAVGDEW 408

Query: 334 FGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL 393
            G +  +  T GL  +    I      PPA++  LR V L   + F  +++ RN  +  L
Sbjct: 409 DGRRAALVATYGLHLAGCCWICHDGRPPPALVALLR-VHLAEAEEF--DTMERNGPFASL 465

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN---------------LDP 438
                   E  +   + D  +  L  F T++EEDE+LL+ G+               L  
Sbjct: 466 ----GAGTEARVVATIADTIRCILDLFSTSLEEDERLLENGSGPVATHAGDDGNTQPLSC 521

Query: 439 RLRIAVGVRAGEKRV 453
             R+A+ +R G KR+
Sbjct: 522 NKRLAILLRVGMKRI 536


>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
 gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
          Length = 520

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 87/389 (22%), Positives = 162/389 (41%), Gaps = 23/389 (5%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS-LCSGLKPWISVALFLIREK-KK 148
           G+ ++ + D       L VP    +    V    I S L SG    I +AL+L  E+ K 
Sbjct: 77  GVRVILKADAEAGATALRVPQSAAVTSVDVGEHPIVSELASGRPELIGLALWLCAERIKG 136

Query: 149 EDSPWRVYLDILPECTDSTVFWSE-EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
             S W  Y+  L    D+ +FW++ ++   L+G+ + +  +   +  + EY  + E +I 
Sbjct: 137 GASEWAPYVKTLRANPDAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITE-VIK 195

Query: 208 PNKQLFPRP----ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHS----PGITT 259
            +   +P      +T   F+ A   + ++A      Q   L+PL D+I+      PG+  
Sbjct: 196 SDPSSYPPEAYEFLTEARFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLP 255

Query: 260 EDYAYEIKGAGLFSRDLLFS---LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
              +  +   G    D+  +   LR      A  +V+    L ++N EL L+ G+++ K 
Sbjct: 256 PSASDGVVRCGPADYDVDTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKH 315

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGT 376
             +   +  +I  SD  F  K  + E  G   +  +  V    +P  +  YLR   +   
Sbjct: 316 PGDYIYMKTDIQTSDRLFTAKKQVLEGMGFTAADQYFPVYKDRMPTQLYSYLRFSRVQDP 375

Query: 377 DAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNL 436
              +  S   + I       VS  NE  I +++   C+  ++ + T  E++  LL+  + 
Sbjct: 376 GEMMAVSFEEDKI-------VSVMNEYEILQILMGDCRELMAEYDTNEEDELNLLKLSDQ 428

Query: 437 DPRLRI-AVGVRAGEKRVLQQIDEIFKER 464
            P   I A  +R  EK+++       ++R
Sbjct: 429 MPVREIEAAKLRMSEKKLIGATMNAVRKR 457


>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
          Length = 963

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 107/422 (25%), Positives = 182/422 (43%), Gaps = 77/422 (18%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-GSLCSGLK-------------PWI 136
           G G+VA +DI    V+  VP    ++ +T   SE+ G L    +             PW 
Sbjct: 534 GRGIVALQDIPAEAVLFTVPRSGILSSET---SELKGKLPEIFQETAMEVDDKPQQDPWS 590

Query: 137 SVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT--LGVKEY 193
           ++ + ++ E  K  +S W+ Y+D+LP   ++ +FWS+ EL ELQ +   S       +E 
Sbjct: 591 TLIIVMMYEYFKGSESKWKPYIDVLPSSFETPMFWSDAELDELQASATRSKVGKASAEEM 650

Query: 194 VQNEYLKVEEEIILPNKQLFPRPITL--DDFLWAFGILRSRAFS-RLRGQN--------- 241
            Q++ L V    I  N+ LFP   T   DD +     + S   S     QN         
Sbjct: 651 FQDKVLPV----IRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDET 706

Query: 242 ------------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
                       + ++P+AD++N       E  A+   G    +   + +LRT   +KAG
Sbjct: 707 EEWVEEREAKSTMGMVPMADILN----ADAEYNAHVNYGDDALT---VTALRT---IKAG 756

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRN-----AYTLTLE-ISESDPFFGDKLDIA-- 341
           E++   Y  +  N+EL   YG++  K  R       +TL  E ++ S     ++LD A  
Sbjct: 757 EEIFNYYGPH-PNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQLDKARE 815

Query: 342 --ETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTD-----AFLLESIFRNTIWGHLD 394
             +++ L ++   +       P   L      +    D       LL++I +      +D
Sbjct: 816 CLDSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVD 875

Query: 395 LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
                  +E+   ++  A  +  S + T+I EDE++L G ++  R R AV VR GEKR++
Sbjct: 876 ---KRKRDEIQHNILIRALDALASRYPTSISEDERILAGNDISERRRAAVTVRLGEKRLI 932

Query: 455 QQ 456
           Q+
Sbjct: 933 QE 934


>gi|302824754|ref|XP_002994017.1| hypothetical protein SELMODRAFT_138060 [Selaginella moellendorffii]
 gi|300138120|gb|EFJ04899.1| hypothetical protein SELMODRAFT_138060 [Selaginella moellendorffii]
          Length = 424

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 97/406 (23%), Positives = 171/406 (42%), Gaps = 67/406 (16%)

Query: 89  PEGLGLVAQRD---IAKNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G+G+ A +D   I +  V+LEVPM+  +           PD V     +  + +   P
Sbjct: 44  PDGVGVYAAKDMTDIERPRVILEVPMEILVTVSSNLPWMFFPDIVPIGHPVFDVINSTDP 103

Query: 135 W----ISVALFLIREKKKEDSPWRVYLDILPECTDST--VFWSEEELVELQGTQLLSTTL 188
                I +   L+   +++ + WR+Y D LP   +ST  +  +EEEL  LQ + L     
Sbjct: 104 ETDYDIRLGCLLLLALEQDGNFWRLYGDYLPNQAESTDLLLATEEELDALQDSVLSQCIK 163

Query: 189 GVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD--FLWAFGILRSRAF-------SRLRG 239
             +E +Q ++    EE   P   L  + +  D   F WA G+ RSR+        S+++ 
Sbjct: 164 DEQEKIQKKW----EECFHPGAPLKLKRLARDSDRFKWAVGMARSRSHTLTMKIGSKVQV 219

Query: 240 QNLVLIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
            N++  P AD++NHS  P  +    A +        R L   ++    ++ G+++ + Y 
Sbjct: 220 ANII-SPYADMLNHSFQPTCSLRWRACD--------RVLEVVVKAGQTIREGDEMTLDYL 270

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
             K   +    +GF    +  N    +  +S     F     I+   GLG + Y      
Sbjct: 271 PKKPTRDYMHRFGFSSPVNPWNVVQFSGSVSIHRDSFLSAFGIS---GLGPNYYL----- 322

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
           R  P A             D  +L +     +W   DLP   + E+   R ++  C+  L
Sbjct: 323 RDDPMA-------------DGAVLAAARILPMWSDADLPFLPSMEKRAVRELQKECRQLL 369

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQQIDEIF 461
           S + ++ E+D +L+     D   R + A+  R+ EK  L+++  I 
Sbjct: 370 SAYPSSYEQDVKLMAAAGEDKSVRWKSALKYRSDEKLFLRKVVRIL 415


>gi|302842147|ref|XP_002952617.1| hypothetical protein VOLCADRAFT_118106 [Volvox carteri f.
           nagariensis]
 gi|300261961|gb|EFJ46170.1| hypothetical protein VOLCADRAFT_118106 [Volvox carteri f.
           nagariensis]
          Length = 713

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 87/306 (28%), Positives = 131/306 (42%), Gaps = 82/306 (26%)

Query: 87  TFPEG-LGLVAQRDIAKNEVVLEVPM----------KFWINPDTV--AASEIGSLCSGLK 133
           T P G  G++A+ DIA+ E++L +P+          +F  +P+    A   +     GL 
Sbjct: 23  TLPSGDRGVIARSDIAEGELLLLLPIDCAIYIPTDEEFKKHPNDFPDAVRYLREAHPGLS 82

Query: 134 PWISVALFLIREKKKED-SPWRVYLDILP-ECTDSTVFWSEEELVELQGTQLLSTTLGVK 191
           P+++  L L+ E  +   SPW  Y+  LP  C D  + W+EEE +EL+GT L  +     
Sbjct: 83  PFLATTLVLMSEMTRGSVSPWAAYVATLPASCPDCLLNWTEEEKLELKGTSLEQSG---P 139

Query: 192 EYVQNEYLKVEEEIILPNKQLFP------------RPITLD----DFLWAFGILRSRAFS 235
           +   + Y +    I+     L+P               TLD     F  A G+++SRAF 
Sbjct: 140 DPAVDVYRRHVAPILACRTDLWPGLAAKEPPAAEATGATLDAGLAAFARAAGLVQSRAF- 198

Query: 236 RLRGQNLV-----------------LIPLADLINHS------------------------ 254
            L  +N V                 L+P  D+INHS                        
Sbjct: 199 HLEAENWVSGAKEIAHLENGGTQVFLLPGIDMINHSHNPDRRNAHLERLNVAQAAAAKLL 258

Query: 255 ---PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
              PG   ED     KG G+   +  F +R   P+KAGE+VL  Y  N S+A+L   YGF
Sbjct: 259 EREPG--EEDAREGAKGVGVRGVEAFFVMRADKPIKAGEEVLHTYG-NLSDAQLLQTYGF 315

Query: 312 IESKSD 317
           ++S+ D
Sbjct: 316 LDSEDD 321


>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
 gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
          Length = 536

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 160/374 (42%), Gaps = 47/374 (12%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLC--SGLKPWISVALFLIREKKK-E 149
           ++A   I K  V+ EVP++ +++  T      +G +   +  +PW+++   L+ E+ + +
Sbjct: 65  MIASGAIDKGSVLAEVPLQAFLSEKTAERCRLVGPMLRKNDFRPWLTMCAHLLVERSRGK 124

Query: 150 DSPWRVYLDILPECTDSTV----FWSEEELVEL-QGTQLLSTTLGVKEYVQNEY---LKV 201
           +S W  Y+  LP   + ++     W  E + EL QG+ +L T     +  Q ++   L  
Sbjct: 125 ESFWHPYIAALPSVDELSISHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTA 184

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPG 256
             E  LP  +     ++  D  WA  +L SRAFS           L L+P AD++NH   
Sbjct: 185 GIEKFLPGGE----TLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSS 240

Query: 257 ITTEDYAYEIKGAGLFSRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
              E          +F +D    SL        G++V   Y    + ++L LDYGF++ +
Sbjct: 241 AGEESCL-------IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDE 293

Query: 316 SDRNAYTLTLEISESDPFFGD-KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLR----- 369
           ++  A  L  ++    P        + E  GL        ++   +  ++L + R     
Sbjct: 294 NENYAVDLPAQVL--GPVRSSANAALLEALGLPAGGTLVSIVPEGVDESVLAWTRAAIAS 351

Query: 370 ---LVALG-GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
              L A G  +D    ++I       +   P++  NE  + R +  AC++  S + T +E
Sbjct: 352 PRELSAAGWKSDGKYQKAIM------YFSEPINRDNECEVLRRLIAACENLWSKYPTNLE 405

Query: 426 EDEQLLQGGNLDPR 439
            D + L GG  D R
Sbjct: 406 HDLRELTGGESDMR 419


>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
          Length = 549

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 62/246 (25%), Positives = 120/246 (48%), Gaps = 15/246 (6%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL---CSGLKPWISVALFLIREKKKE 149
           G +A++D+   ++ LE+P+   I+ + V  +++  +     G+     + L+ ++EK   
Sbjct: 175 GAIARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNC 234

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
           DS +++Y D LPE  ++ + +S + +  L GT LL   +  ++++  +Y ++   +    
Sbjct: 235 DSKFKIYFDTLPEKFNTGLSFSIQAITMLDGTLLLEEIMQARQHLHAQYDELFPALCNNF 294

Query: 210 KQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN---LVLIPLADLINHSPGITTEDYAYE 265
             +FP  + T + FLWA  +  S +   +         LIPLA  +NHS       Y  +
Sbjct: 295 PDIFPPELYTWEKFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFLNHSLCPHVMHYG-K 353

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
           +  A   +  L F L    P ++GE+  + Y  N S++ L   YGF+  + D +   + L
Sbjct: 354 VDPA---TNSLKFCLSR--PCRSGEECCLSYG-NFSSSHLITFYGFL-PQGDNSYDVIPL 406

Query: 326 EISESD 331
           +I  SD
Sbjct: 407 DIDGSD 412


>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
          Length = 467

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 101/430 (23%), Positives = 170/430 (39%), Gaps = 103/430 (23%)

Query: 95  VAQRDIAKNEVVLEVPMKFWINPDT-----------VAASEIGSLCSGLKPWISVALFLI 143
           VA  DI     +  +P K  IN +T                      GL  W S+ L +I
Sbjct: 48  VALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLDSWSSLILIMI 107

Query: 144 REK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            E  + E+S W+ Y D+LP   D+ +FWS+ EL +LQ +  +   +G K   +N + K  
Sbjct: 108 YEYLQGENSKWKPYFDVLPSSFDTPMFWSDNELDQLQASH-MRHKIG-KADAENMFQKTL 165

Query: 203 EEIILPNKQLFPRPITLDDFL-------------WAF-------------GILRSRAFSR 236
             II  N ++F      D  L             +AF             G +  R    
Sbjct: 166 LPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLENDEEEEEEADGWVEDRDGKS 225

Query: 237 LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
           + G    ++P+AD++N                A +   +   ++ +  P+KAGE++L  Y
Sbjct: 226 MMG----MVPMADILNADAEFN----------AHVNHEEESLTVTSLRPIKAGEEILNYY 271

Query: 297 DLNKSNAELALDYGFIESKSDR-NAYTLTLEISES--DPFFGD----------------- 336
             +  N+EL   YG++  K  R +   +  +I ES     FG                  
Sbjct: 272 GPH-PNSELLRRYGYVTEKHSRYDVVEIPWDIVESALTSNFGIPGQVLEQIRGALEEDEE 330

Query: 337 -------KLDIAETNGLG---ESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFR 386
                  + +  E N  G   E A F+  +   L   +  +L+ +     DA        
Sbjct: 331 FEDTFVLERETGEVNSDGTFAEPARFE-SMPEDLQEQLKTFLKGIKKAQPDA-------- 381

Query: 387 NTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGV 446
                   +P     +E+   V+    ++ ++ + T+I EDE LL+  +L+ R R+A+ V
Sbjct: 382 --------IPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLK-QDLNQRTRMAIAV 432

Query: 447 RAGEKRVLQQ 456
           R GEK++LQ+
Sbjct: 433 RLGEKKLLQE 442


>gi|310800174|gb|EFQ35067.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 485

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 131/305 (42%), Gaps = 42/305 (13%)

Query: 68  WQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-- 125
           W    D  +V  ++   P     GLGLV+ +D+++ E  L++P    I  + + ++E   
Sbjct: 14  WAMFNDVDLVDVEAREIPGC---GLGLVSNKDLSREEETLDIPTLLRIPHELILSAEAVE 70

Query: 126 --GSLCSGLKPWISVA----------LFLIREKKKED-----------SPWRVYLDILPE 162
               +    +  ++VA          +FL+ ++   D           +PW  Y+  LP 
Sbjct: 71  NYAKVDKNFRQLLNVAGHKSTRHDICIFLLTQQILADQPETSLHGGVPTPWTEYIKYLPP 130

Query: 163 CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIILPNKQLFPR-PIT 218
               T  W+  E   L GT L S T      + +E+ ++ E    + L N+  +    ++
Sbjct: 131 QVPVTTLWTVRERQMLNGTSLESATAAKIVALSDEFDELREVSSSLPLWNELFWESGKVS 190

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD-LL 277
           L D++      RSR    L      ++P+ DL NHS   +  +  YE       S+D ++
Sbjct: 191 LIDWVRVDAWFRSRCL-ELPKSGEAMVPVLDLANHS---SKANAYYEQN-----SKDEVV 241

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDK 337
             LR    V +GE++ I Y   KS AE+   YGFI+  S  +  TL L   E DP    K
Sbjct: 242 LLLRPGCRVSSGEEMTISYGDAKSGAEMLFSYGFIDPASAADRITLPLTPLEDDPLGKAK 301

Query: 338 LDIAE 342
           L I E
Sbjct: 302 LHIFE 306


>gi|346326264|gb|EGX95860.1| SET domain protein [Cordyceps militaris CM01]
          Length = 467

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 84/197 (42%), Gaps = 28/197 (14%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           +PW  Y+  LP+       WSE   + L+GT L          +QN              
Sbjct: 118 TPWSEYIKFLPKAIPVPTLWSEPHRLLLRGTSLEDALEAKVATLQN-------------- 163

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAG 270
                  TL D+  A    RSR    L   +  ++P  D++NH+   T   Y YE + +G
Sbjct: 164 -----GATLSDWFLADAWYRSRCL-ELPHASHAMVPALDMVNHAGAGTATAY-YEEEASG 216

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISES 330
               D+   +R    V AG+++ I Y   KS AE+   YGF+++ S   A  LTL +   
Sbjct: 217 ----DVSLRIRPGARVPAGDEISISYGDGKSAAEMLFSYGFLDADSSTTARQLTLRL--- 269

Query: 331 DPFFGDKLDIAETNGLG 347
           DPF  D L +A+    G
Sbjct: 270 DPFPDDPLGLAKRRAFG 286


>gi|242774560|ref|XP_002478464.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218722083|gb|EED21501.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 491

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 89/197 (45%), Gaps = 12/197 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  Y+  L   T     WSE+ELV L GT L          ++ E+ ++ E    I  
Sbjct: 123 NPWSEYIKFLSSETLLPTLWSEDELVLLYGTSLKDAVDHKLAALEAEFDRLREATKSIAW 182

Query: 208 PNKQLFPRP--ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
             ++ +     +TLDD+     + RSRA   L G   V++P  D+ NH+ G  T    YE
Sbjct: 183 CEREWWDEENGLTLDDWKVVDAMYRSRALD-LPGSGHVMVPCVDMANHASGEETVAL-YE 240

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-SDRNAYTLT 324
             G     R+ +  LR    ++ GE+V I Y   K  +E+   YGF+ES   D     L 
Sbjct: 241 TDG----ERNAVLQLRWGKKLRKGEEVTITYGDEKGASEMIFSYGFLESSVEDARQLFLP 296

Query: 325 LEISESDPFFGDKLDIA 341
           L+I + DP    K  I 
Sbjct: 297 LDIPDDDPLKQTKKKIC 313


>gi|255574450|ref|XP_002528137.1| conserved hypothetical protein [Ricinus communis]
 gi|223532435|gb|EEF34228.1| conserved hypothetical protein [Ricinus communis]
          Length = 483

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 105/426 (24%), Positives = 179/426 (42%), Gaps = 76/426 (17%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +DI    +  +++E+P++  +           PD +     I  + +   P
Sbjct: 98  PDGFGVYASKDIEPLRRARMIMEIPIELMLTISKKLPWMFFPDIIPVGHPIFDIINSTDP 157

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     +D+ W++Y D LP   ECT S +  +EE+L+ELQ   L S  
Sbjct: 158 ETDWDLRLACLLLFSFDCKDNFWQLYGDFLPSEDECT-SLLLATEEDLLELQDQNLAS-- 214

Query: 188 LGVKEYVQNEYLKVEEE-----IILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQN 241
             +    Q+  L+  E+     + L  K+L   P   + F+WA  + +SR  + ++R   
Sbjct: 215 --IMRKQQHRALEFWEKNWHSGVPLKIKRLAREP---ERFIWAVSMAQSRCINMQMRVGA 269

Query: 242 LV-----LIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
           LV     LIP AD++NHS  P      + +  K      R L   +     +K GEQ+ +
Sbjct: 270 LVQDANMLIPYADMLNHSFQPNCF---FHWRFK-----DRMLEVMINAGQQIKKGEQMTV 321

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
            Y   + N      YGF    S  N + +      +       L     +GL E  Y + 
Sbjct: 322 NYMSGQKNDLFMQRYGF---SSSVNPWDVIQFSGNACIHLDSFLSAFNISGLPEEYYHNN 378

Query: 355 VLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACK 414
            L  T P   +           D  ++ +      W   D+P   + E    + +++ C+
Sbjct: 379 QLTST-PDTFV-----------DGAVIAAARTLPTWTDGDVPPLPSIERKAAKELQEECQ 426

Query: 415 SALSGFHTTIEEDEQLLQGGNLDPR-LRIAVGVRAGEKRVLQQIDEIFKERELELDELEY 473
             L+ F TT EED+ LL       R L  A+  R   K++++++          +  L+ 
Sbjct: 427 QMLAEFPTTSEEDQTLLDTVPDSRRTLEAAIKYRLHRKKLIEKV----------IQALDI 476

Query: 474 YQERRL 479
           YQER L
Sbjct: 477 YQERLL 482


>gi|357125312|ref|XP_003564338.1| PREDICTED: uncharacterized protein LOC100842937 [Brachypodium
           distachyon]
          Length = 558

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 96/196 (48%), Gaps = 13/196 (6%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGL--VAQRDIAKNEVVLEVP 110
           ++A  +    ++++F QWL+          IR     +G G+        A + V++ VP
Sbjct: 2   SAAAASCDDGKLQSFLQWLQANGTDLRGCTIRACDRNKGFGVYSTGAEVCATDGVMMVVP 61

Query: 111 MKFWINPDTVAASE-IGSLC------SGLKPWISVALFLIREKKKEDSPWRVYLDILPEC 163
           +   + P  V     +G  C       G+   + V LFL+ E+ +  S W+ YLD+LP  
Sbjct: 62  LDLAVTPMRVLQDPLVGPRCRALFEEGGVDDRLLVMLFLMAERLRPTSLWKPYLDMLPST 121

Query: 164 TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ----NEYLKVEEEIILPNKQLFPRPITL 219
             S+V++S++EL EL+GT L   T+  ++ +Q    ++   + EE++  ++      +  
Sbjct: 122 FGSSVWFSDDELAELEGTTLHRATVMQRKSLQTLFDDKVKGLVEELLRVDESGSSSEVQF 181

Query: 220 DDFLWAFGILRSRAFS 235
           +DFLWA  I  +RA +
Sbjct: 182 EDFLWANSIFWTRALN 197


>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
           [Ostreococcus tauri]
          Length = 446

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 107/428 (25%), Positives = 182/428 (42%), Gaps = 62/428 (14%)

Query: 81  SPIRPATFPEGL--GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGL--KPWI 136
           S IR     EG   G+   RD+ + E++  VP++  ++  T +A    +L  GL  +P  
Sbjct: 10  SAIRFVPTSEGAERGVATTRDVTRGELLATVPLEKCVS--TSSARADATLWRGLSARPGA 67

Query: 137 SV----ALFLIREK--KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV 190
           S+    A  ++RE     E S +  +L +LP  TD+ V W E+EL ELQG+ +++    +
Sbjct: 68  SLDGILAAHVLREAFGLGERSAFWPWLRLLPSETDAAVGWDEDELRELQGSNVVAFARAI 127

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPI--------TLDDFLWAFGILRSRAFSRLRGQNL 242
           K+  + EY  ++   +  +   FP           T + F WA  ++ SRA   L+  + 
Sbjct: 128 KKSWREEYDALDFAGLGVD---FPEAFGGEHAAHYTFEKFTWARFVVWSRAID-LKTDST 183

Query: 243 ------VLIPLADLINHSPG------ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                 +L+P+ D+ NH+P          +  A +I     F R+               
Sbjct: 184 SAPVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNT-------------- 229

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
           ++   YD   S   L L YGFI   +      +T+++S+ D     K  +   +GL  + 
Sbjct: 230 ELRFNYDTKPSQYFL-LQYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTK 288

Query: 351 YFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVR 410
                  R L   +L   R++A+  ++     S+  +         VS  N+     V+ 
Sbjct: 289 RNFEWKVRGLDYDLLAAARIIAMDESELDDDTSVALSVSGAS----VSAKNDARTKAVLL 344

Query: 411 DACKSALSGFHTTIEEDEQLLQGGNLDP------RLRIAVGVRAGEKRV-LQQIDEIFKE 463
            +  ++L G+ TT+ ED   +   N         R R AV +R  EK + L   D +FKE
Sbjct: 345 KSLITSLDGYGTTLGEDNSYIARFNTSSDELPKKRKRFAVLLRMREKGILLASADALFKE 404

Query: 464 RELELDEL 471
              E +EL
Sbjct: 405 LPNEGEEL 412


>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
 gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
          Length = 573

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 85/356 (23%), Positives = 149/356 (41%), Gaps = 52/356 (14%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           Q++T  +W +D   +         T   G+  +A+  I   E ++ VP    I  + +A 
Sbjct: 6   QLKTCVEWCKDHGAIIDDRLEFKVTQAAGVTAIAKSVIKTTEPLISVPANLLITKE-LAE 64

Query: 123 SEIGSLCSGL---KPWISVALFLIREKKKEDS-PW-RVYLDILPECTDSTVFWSEEELVE 177
            E GS    +    P   V LF  + K    + P+ + Y DILP   D   FW  +E+  
Sbjct: 65  KEFGSASGAVSSENPNALVQLFTAKMKFDPSARPFHKPYFDILPTKLDQPYFWKLQEVEL 124

Query: 178 LQGTQL-LSTTLGVKEYVQNEYLKVEEEIILP-NKQLFPRPITLD--------------- 220
           L+GT + L     +++ V+  ++ +++  + P + +L+ +    D               
Sbjct: 125 LKGTDIYLLMKQNLRKIVKEWHVLLDQLKLKPEDGELYEQSEAQDFDILKYICEYREQHK 184

Query: 221 --------DFLWAFGILRSRAFSRL-------RGQNLVLIPLADLINHSPGITTEDYAYE 265
                    +LWA GI  SRAF +L             L PL DL+NH    T   + + 
Sbjct: 185 SISWKSFVGYLWATGIFTSRAFPKLILEEKCSSINEAFLYPLVDLLNHKND-TKVKWTFT 243

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
                  S++++         K GE+V   Y   KSN +L L YGF++ ++  +   LTL
Sbjct: 244 NDNVCFVSQEIM---------KEGEEVFNNYG-EKSNEDLLLSYGFVQDQNPYDLTRLTL 293

Query: 326 EISES--DPFFGDKLDIAETNGLGESAY-FDIVLGRTLPPAMLQYLRLVALGGTDA 378
            +++   D     +L  +E N + +    F I     LP +M+ +   +   G++A
Sbjct: 294 RLTKEMIDEALNAELGFSEKNKVADDCVQFQITAVEPLPSSMVNFFGYLCKLGSEA 349


>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
 gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
          Length = 353

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 72/244 (29%), Positives = 112/244 (45%), Gaps = 43/244 (17%)

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGL-KPWISVALFLIREKKKED 150
           +G  A  DI + E +L VP    + P  V          GL +P + +A  L++ +   +
Sbjct: 43  IGCTATADICQGERLLFVPHSACVTPSGV---------QGLYEPQVMLAASLVKHRTDPN 93

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGT---QLLSTTLGVKEYVQNEYLKVEEEIIL 207
           SP+  YL  LP   +  + WS +ELV L+GT   ++   +L V + V           + 
Sbjct: 94  SPFHDYLQSLPSEFEHPLEWSADELVCLKGTTVWEMHQLSLEVVDSVAE---------LC 144

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
           PN    PR +      WA  ++ SRAF S + G  L +IPLAD  NHS          E+
Sbjct: 145 PNS---PRAM----IRWAVEVMMSRAFESEVCG--LCVIPLADQFNHS-STKWHTRVREV 194

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           +          F +    PVK GE++   Y L  +N  L L +GFIE  +  + + +T+E
Sbjct: 195 EEG--------FQMLAEKPVKKGEEIFNNYGL-YTNEMLLLTHGFIEFDNPHDHF-ITIE 244

Query: 327 ISES 330
           +S +
Sbjct: 245 VSNA 248


>gi|67516817|ref|XP_658294.1| hypothetical protein AN0690.2 [Aspergillus nidulans FGSC A4]
 gi|40746310|gb|EAA65466.1| hypothetical protein AN0690.2 [Aspergillus nidulans FGSC A4]
 gi|259489037|tpe|CBF88977.1| TPA: SET domain protein (AFU_orthologue; AFUA_1G13520) [Aspergillus
           nidulans FGSC A4]
          Length = 480

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 57/188 (30%), Positives = 92/188 (48%), Gaps = 11/188 (5%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQL-LSTTLGVKEYVQN-EYLKVEEEIILP 208
           + W  Y+  LP       FW+ EEL  L+GT L L+    +K   +  E+L+   E I  
Sbjct: 116 NAWSEYVKFLPSFITLPTFWTMEELELLRGTSLRLAYEAKIKALEKELEHLRETTEAIEW 175

Query: 209 NKQLF--PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            ++L+     +TL+D+ +   + RSR    L G    ++P  D+ NH+   +T +  YE 
Sbjct: 176 CRELWWDEDSVTLEDWKYLDAVFRSRVLD-LPGYGHAMVPCIDMANHASD-STVNALYEK 233

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAYTLTL 325
              G    D +  LR    ++  E+V I Y  +K+ +E+   YGF++S +SD     L L
Sbjct: 234 DDNG----DAILQLRPRKNLRLDEEVTISYGQDKAASEMVFSYGFLDSERSDAKQMVLDL 289

Query: 326 EISESDPF 333
           +I E DP 
Sbjct: 290 DIPEDDPL 297


>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
 gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
          Length = 464

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 111/430 (25%), Positives = 181/430 (42%), Gaps = 70/430 (16%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWI------NPDTVAASEIGSLC--SGLKP---W-IS 137
            G+GLVA +D+ +   ++ +P +  +      N   +A SE G +C  +   P   W + 
Sbjct: 46  HGMGLVATQDLPQGSTIITLPRRIPMPMPDPENAAVLAPSE-GVICEIANRVPEELWAMR 104

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ-- 195
           + L L+ E+ ++ S W  Y+ +LP      +F+S    V+++       T  VK+  +  
Sbjct: 105 LGLKLLYERAQKGSYWWPYISMLPHSFTLPIFFSG---VDIESIDYAPVTHQVKKRCRFL 161

Query: 196 ----NEYLKVEE--EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQ-----NLVL 244
               +E  K+E   E I P    F   +      WA   + SRAF R+ G      + ++
Sbjct: 162 LQFSSELAKLESLPEEIHPFAGQF---VDSGALGWAMAAVSSRAF-RIHGVTNKLCSAMM 217

Query: 245 IPLADLINHS--PGITTEDYAYEIKGAGLFSRDL----LFSLRTPVPVKAGEQVLIQYDL 298
           +PL D+ NHS  P    E+           SRD        + T   ++ G  + + Y  
Sbjct: 218 LPLIDMCNHSFQPNAHIEED---------LSRDAQDVSFLKVVTKRNLEKGSAITLNYG- 267

Query: 299 NKSNAELALDYGFI--ESKSDR-----------NAYTLT-LEISESDPFFG------DKL 338
             SN  L LDYGF+  ++  DR           NA  +  L  + S PF        D+L
Sbjct: 268 PLSNDLLLLDYGFVIPDNPHDRIELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRL 327

Query: 339 DIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS 398
                   GES    +     +   +L  LR++     +      +     WG ++  VS
Sbjct: 328 KQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWG-VESMVS 386

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQID 458
             NEE + R +        + F TTIEEDE  L   +L    RIAV  R  +KR++ ++ 
Sbjct: 387 SDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVL 446

Query: 459 EIFKERELEL 468
           E  K+R ++L
Sbjct: 447 ESLKKRLMDL 456


>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
 gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
          Length = 537

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 85/338 (25%), Positives = 142/338 (42%), Gaps = 60/338 (17%)

Query: 48  CSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVV 106
           C   S S       A+VE F  W +D  V S    I  A FP   LGL + R +AK+E+V
Sbjct: 102 CPQISDSPDDQTRLAKVEAFSAWAKDGGVHSEGLEI--AIFPGYQLGLRSTRPLAKDELV 159

Query: 107 LEVPMKFWINPDTVAASEI-GSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECT 164
           L VP K   + ++ +   + G +       +++A  L+ EK + E S WR Y+D+LP   
Sbjct: 160 LSVPRKLIFSEESNSDCRLFGKMTQATH--LNLAYDLVIEKIRGEFSEWRTYIDVLPAKY 217

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRP----- 216
            + ++++ +++  L+GT   S  L     +  +Y    +    +  P+      P     
Sbjct: 218 STVLYFTTKQMELLRGTAAASLALRQCRVIAKQYAFLYRYAHTMTEPSTGNRSHPGERGL 277

Query: 217 ------ITLDDFLWAFGILRSRAFSRLRGQNLV----------------LIPLADLINHS 254
                 +  + + WA   + +R       QNLV                LIP  D+ NH 
Sbjct: 278 FFTQHGLCYELYRWAVSTVMTR-------QNLVPSEKQESEDTPKLISALIPYWDMANHR 330

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G  T  YA   +     +++          V AGEQ  I Y  ++SN +L +  GF++ 
Sbjct: 331 QGKITSFYAAVPRQLECTAQE---------AVDAGEQFFIYYG-DRSNTDLLVHNGFVDD 380

Query: 315 KSDRNAYTLTLEISESDPFFG------DKLDIAETNGL 346
            + ++   + + +S +D          DKL+I  T  L
Sbjct: 381 YNLKDYVNIRVGLSLTDALAAKRASILDKLNIRHTAEL 418


>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
 gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
          Length = 278

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 73/288 (25%), Positives = 126/288 (43%), Gaps = 48/288 (16%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +W            IR  +   G GL A  D  +NE ++ +P+   I    +A  E+ 
Sbjct: 6   FMEWAVGNGAYHSGIDIRDCSNEGGKGLFATTDFRENETIISIPVGLIITAGFIA--EMP 63

Query: 127 SLCS-----GLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
             C       LKP+ ++  F + E K+++S W  YL++LP+      F +   L      
Sbjct: 64  DYCDVFKRYCLKPFEALVYFFLVE-KEQNSKWTPYLEVLPK-----SFSTPASLHPSLKP 117

Query: 182 QLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFPRPITLDDFLWAFGILRSRAFSR-- 236
           +     L  + YVQ   LK+  E    IL +  ++      D FLWA+ I+ +R   R  
Sbjct: 118 EDFPYCLRKQWYVQKNELKIMYEKFVTILADNTIW------DHFLWAWHIVNTRCIYRNN 171

Query: 237 --------LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVK 287
                       +L ++PL D++NHS          + +   ++ S+  L+ +    P++
Sbjct: 172 KLHPLIDNTEDDSLAIVPLIDMLNHSN---------DSQCCAIWDSKFNLYKVIVTRPIR 222

Query: 288 AGEQVLIQYDLNKSNAELALDYGF-----IESKSDRNAYTLTLEISES 330
            GEQ+ I Y  + +N  L ++YGF     I  K + +  +L L I++S
Sbjct: 223 KGEQIFICYG-SHTNGSLWIEYGFYLKDNICDKVEISLGSLILNITDS 269


>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
 gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
          Length = 527

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 160/374 (42%), Gaps = 47/374 (12%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLC--SGLKPWISVALFLIREKKK-E 149
           ++A   I K  V+ EVP++ +++  T      +G +   +  +PW+++   L+ E+ + +
Sbjct: 65  MIASGAIDKGSVLAEVPLQAFLSEKTAERCLLVGPMLRKNDFRPWLTMCAHLLVERSRGK 124

Query: 150 DSPWRVYLDILPECTDSTV----FWSEEELVEL-QGTQLLSTTLGVKEYVQNEY---LKV 201
           +S W  Y+  LP   + ++     W  E + EL QG+ +L T     +  Q ++   L  
Sbjct: 125 ESFWHPYISALPSVEELSISHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTA 184

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPG 256
             E  LP  +     ++  D  WA  +L SRAFS           L L+P AD++NH   
Sbjct: 185 GIEKFLPGGE----TLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSS 240

Query: 257 ITTEDYAYEIKGAGLFSRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK 315
              E          +F +D    SL        G++V   Y    + ++L LDYGF++ +
Sbjct: 241 AGEESCL-------IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDE 293

Query: 316 SDRNAYTLTLEISESDPFFGD-KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLR----- 369
           ++  A  L  ++    P        + E  GL        ++   +  ++L + R     
Sbjct: 294 NENYAVDLPAQVL--GPVRSSVNAALLEALGLPAGGTLVSIVPEGVDESVLAWTRAAIAS 351

Query: 370 ---LVALG-GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
              L A G  +D    ++I       +   P++  NE  + R +  AC++  S + T++E
Sbjct: 352 PRELSAAGWKSDGKYQKAIM------YFSEPINRDNECEVLRRLIAACENLWSKYPTSLE 405

Query: 426 EDEQLLQGGNLDPR 439
            D   L GG  D R
Sbjct: 406 HDLDELTGGESDMR 419


>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 481

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 108/454 (23%), Positives = 182/454 (40%), Gaps = 85/454 (18%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + F  WL+    V     I+ A       G G+VA  +I ++E +  +P    ++     
Sbjct: 18  DEFMCWLKQSAGVRVSPKIKIADLRSEGAGRGIVALSNINEDEELFAIPQNLVLSFQNSK 77

Query: 122 ASEIGSLC-SGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
             ++  +    L PW+ + L +I E  +   SPW  Y  +LP   D+ +FW++EEL EL 
Sbjct: 78  LKDLLHISEKDLGPWLCLILVMIYEYLQGGASPWSRYFQVLPTEFDTLMFWTDEELRELS 137

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF----------------- 222
           G+ +L+     K   +   L+    I+  N  LFP    L  +                 
Sbjct: 138 GSAVLNKI--GKSDAEAAILRDIFPIVSTNPHLFPPISGLGSYDSPDGRATLLSLAHRMG 195

Query: 223 ----LWAFGILRSR--------AFSRLRGQNLV--LIPLADLINHSPGITTEDYAYEIKG 268
                +AF I +           +    G+ L   ++PLADL+N            +   
Sbjct: 196 SLIMAYAFDIEKGEDEEGEVQDGYITDEGEELTKGMVPLADLLNADA---------DRNN 246

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS 328
           A LF  D   ++++  P++ GE++   Y      A+L   YG++   +D  A     EIS
Sbjct: 247 ARLFQEDGYLAMKSIKPIRNGEEIFNDYG-ELPRADLLRRYGYV---TDNYAQYDEAEIS 302

Query: 329 ES--------DPFFGD----KLDIAETNGLGESAYF------DIVLGRTLPPAMLQYLRL 370
                      P   D    +L+  E  G+ +  Y       +  L   LP  +L  L  
Sbjct: 303 MQAICKAAGIQPATPDPEQPRLEFLENLGVFDDGYGIPRLNPNTPLMEALPEELLIALNT 362

Query: 371 VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
           + +        E   +N    +  L V+ A       ++  A + AL G+ TT+E+D+QL
Sbjct: 363 LTMPPEQ--FSERKTKNKA-PNPTLGVTQAT------LLGHAVQQALGGYITTLEQDKQL 413

Query: 431 L------QGGNLDP-RLRIAVGVRAGEKRVLQQI 457
           L      Q   +   R ++A+ VR GEK +L Q+
Sbjct: 414 LATISNVQASTVSGRRFKMALQVRMGEKEILSQL 447


>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 403

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 150/334 (44%), Gaps = 34/334 (10%)

Query: 153 WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQL 212
           WR Y D LP   DS + WS+EEL  LQG+ L    +  ++  + EY  +   +   + + 
Sbjct: 65  WRAYCDALPAAVDSLMMWSDEELEVLQGSALRQRAVFRRDLCKREYDALFPALARADPET 124

Query: 213 FP--RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG-----ITTEDYAYE 265
           F      + D F WA+  + +RAF     Q + L+P  D+ N +       +  ++ A E
Sbjct: 125 FGDVEAYSFDVFRWAYATVMARAFVLPDLQCMALLPGLDIYNSARDAEKCVVERDEGACE 184

Query: 266 IKGAGLFSR-DLLFSLRTPV-PVKAGEQVLIQYDLNKSNAELALDYGFI-ESKSDRNAYT 322
           +  +  F   +   +LR  V  V+AG Q+   Y  + S   L L++GF+   + +R +  
Sbjct: 185 VDDSSSFDESEARVTLRVGVGGVQAGSQLFHDYADHASGGAL-LEFGFVYHGERERGSGV 243

Query: 323 LTLEI----------SESDPFFGDKLDIAETNGLGESAYFDIV-LGRTLPPAM--LQYLR 369
             L++          + S  F  D+ D+     + +S  F+I  +G    PA+  ++ ++
Sbjct: 244 DALDVCLKPALARLDARSRAFLVDE-DVFHKFNVRKSLTFEISNVGGVYKPALKGIECIK 302

Query: 370 LVALGGTDAFLL--ESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
                   A  L  +    +++   LD        +++ RV+RD  +   +   TTI+ED
Sbjct: 303 PEMWRAARALALRPDEPLPSSLDARLDDDGERRARDILIRVLRDERERYSA---TTIDED 359

Query: 428 EQLLQ----GGNLDPRLRIAVGVRAGEKRVLQQI 457
             LL+    GG    R  +A+ VR GEK +L  +
Sbjct: 360 IDLLRRASDGGPSRRREAMALDVRLGEKLLLDDV 393


>gi|384246167|gb|EIE19658.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 523

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 121/462 (26%), Positives = 183/462 (39%), Gaps = 98/462 (21%)

Query: 69  QWLRDQK-VVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           QW+      VSP   + P     G GL A +     E+++ +P    ++ D      +  
Sbjct: 46  QWVSSSGGTVSPTVHVSPPDSVMGAGLRASKACRSGELLVSLPRSCQLSYDGSTEPNLLQ 105

Query: 128 LCSGLKP--W-ISVALFLIREK-KKEDSPWRVYLDILPECTDST-VFWSEEELVELQGTQ 182
           L S +    W   +AL +++E+    DSP+  Y+D LP       +F+S + +  L+   
Sbjct: 106 LISKVPEELWGAKLALRVLKERIMGPDSPFHSYIDNLPMGVPGIPMFFSPDAIRALEQYP 165

Query: 183 LLSTTLGVKEYVQ------NEYLKVEEEIILPNKQLFP---RPITLDDFLWAFGILRSRA 233
            LS    VK+  +      +E+L       LP     P    P+  +   WA  +  SRA
Sbjct: 166 PLSEQ--VKKRCRWLLSFSSEHLSA-----LPGSPADPFLGTPVDANILGWALAMTTSRA 218

Query: 234 FSRLRG--QNLVLIPLADLINHS----------PGITTEDYAYEIKGAGLFSRDLLFSLR 281
           F R++G      L+PL D+ NHS          PG + E  A         SRD      
Sbjct: 219 F-RVQGPQHPAALLPLIDMSNHSFAPNCEVKPGPGGSVEMVA---------SRD------ 262

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN------------AYTLTLEISE 329
               ++A E +L+ Y     N  L LDYGF+   +  +            A  L  +   
Sbjct: 263 ----IRAEEDLLLSYG-KLDNTFLLLDYGFMVPGNPHDTVLIRYDPIMFEAGHLLAQGQG 317

Query: 330 SDPFFGDKLDIA--------ETNGLGESAYFDIVLGRTL---PPAMLQYLRLVALGGTDA 378
           + P     L  A        E N LG  A  ++  GR+L    P +L   RL     T A
Sbjct: 318 APPDNAPDLGSATFQQQLLTEMNLLGPKADLELAFGRSLQRCSPRLLAVARLF----TAA 373

Query: 379 FLLESIFRNTI-WGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG--- 434
              E   R+    G  D P+S  NE    R+V       L+ F TT+E+D + L+GG   
Sbjct: 374 SAAEVRGRSADDLGAWDQPLSLINEVKALRMVAGVAAMLLTRFPTTLEQDRRALEGGPAV 433

Query: 435 ------------NLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
                        L P +R AV +RA +K VL ++ +    R
Sbjct: 434 AAEGGLPEGGPHQLAPDVRAAVQLRAEKKAVLSEVLQDIGHR 475


>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 105/422 (24%), Positives = 188/422 (44%), Gaps = 37/422 (8%)

Query: 77  VSPKSPIRPATFPEGLGL------VAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS 130
           V P +P+     PEG         +   ++    V+L +PM   ++ ++ A   +G L  
Sbjct: 13  VKPLAPMSTVLQPEGFNFGRGTAYITTENVEVGSVLLSLPMSQVMSVESAARGRVGLLLE 72

Query: 131 ---GLKPWISVALFLIREKK-KEDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLL 184
               L   I++ L L+ E+     S +  ++  LP  E  +ST+F+SE+E+  L+G+QL 
Sbjct: 73  VNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIEAINSTLFYSEDEMKGLEGSQLQ 132

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI------TLDDFLWAFGILRSRAFSRLR 238
             TLG  + V   Y  + + +   +++    PI      TLD F WA G++ S  F    
Sbjct: 133 RFTLGRAQAVDAFYDALVQPVT--SREAVDPPIFHKSEFTLDKFRWAMGVVWSSTFQ--F 188

Query: 239 GQNLVLIPLADLINHSPGITTE-----DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
           G+N   + LA ++N + GI T+     + A       + +     ++   V    G++V 
Sbjct: 189 GENEDDVILAPVLN-TIGICTDLNQEGNEACPETSIKVDTDTQRLTVYASVAYSKGQEVR 247

Query: 294 IQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESDPFFGDKLDIAETNGLGES--A 350
           +     KS+ +L L +GF  +++ + +   LT+ +  SD     K  + +T  L ES  A
Sbjct: 248 LSMP-GKSSTQLMLSHGFARARASKLDKLDLTVTLDSSDTLAPLKNYLLQTQ-LNESINA 305

Query: 351 YFDIVLGRT-LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
            ++   G + +   +   LR+  L G +    + +   T        VS  NE +  R V
Sbjct: 306 TYEFFYGSSKIDEYVSTSLRMKLLSGGELARYKELLTPTEGEEHRPIVSLRNEFVFTRAV 365

Query: 410 RDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER-ELEL 468
              C + L  + T+IE+D++ L    L  +  +     A  +R+L    +I  E  EL L
Sbjct: 366 ISTCTTLLKQYPTSIEQDQENL--AKLSDKDDVESVRIAHVQRILIMEKQILNETMELAL 423

Query: 469 DE 470
           D+
Sbjct: 424 DQ 425


>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
          Length = 440

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 68/263 (25%), Positives = 117/263 (44%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K       + PA FP  G GL+++  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKERKFEDMN--LTPARFPGTGRGLMSKISLQEGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSG----LKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +     L P +++  FL+ EK   D S W+ YL++LP+     V   E E+V+L    
Sbjct: 96  YIAKWQPPLSPLLALCTFLVAEKHAGDRSVWKPYLEVLPKAYTCPVCL-EPEVVDLLPKP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRAF---- 234
           L +     +  +Q  +     +     + LF   +    +   FLWA+  + +RA     
Sbjct: 155 LKAKAREQRTRLQ-AFFTSSRDFFSSLRPLFSEAVESIFSYSAFLWAWCTVNTRAVYMKP 213

Query: 235 SRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
            R R          L P  DL+NHSP +           AG       + +RT    +  
Sbjct: 214 RRRRCFSAEPDTYALAPYLDLLNHSPDVQVR--------AGFNEETRCYEIRTVSSCRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+V I Y  +  N  L L+YGF+
Sbjct: 266 EEVFICYGPH-DNQRLLLEYGFV 287


>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
 gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
          Length = 688

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 74/269 (27%), Positives = 122/269 (45%), Gaps = 38/269 (14%)

Query: 68  WQWLRDQKVVSPKSPIRPATFPE---GL----GLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           WQ++R      P     PA       GL    GL A R +   E VLE+ +   I P T 
Sbjct: 406 WQYMRRNVGACPGLFFFPARLDRSRGGLARFRGLFASRPVRAGERVLEISLDLMIAP-TR 464

Query: 121 AASEIGSL-CSGLKPWISV--------------ALFLIREKKKEDSPWRVYLDILPECT- 164
              ++ +L  S   P+IS                +F+ ++ + + S W  Y+  LPE   
Sbjct: 465 LPDQLSTLQSSAWAPYISCLPEPAELDNTVLDYRVFVSQKFQLQSSAWAPYISCLPEPAE 524

Query: 165 -DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI-ILPNKQLFPRPITLDDF 222
            D+T  W + EL  L+ + L   T    E +  E+ +V+  + + P  QLF + ++++DF
Sbjct: 525 LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQVQNALDVWP--QLFGK-VSVEDF 581

Query: 223 LWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           +  +  + SR  +      LV+IP+ D  NH+       +A ++   GL +  ++ + R 
Sbjct: 582 MHVYATVFSRPLAIGEDSTLVMIPMLDFFNHN----AASFA-KLSFNGLLNYAVVTADRD 636

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGF 311
                  +Q+ I    + SNAELALDYGF
Sbjct: 637 ---CAENDQIWINCG-DLSNAELALDYGF 661


>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 519

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 79/299 (26%), Positives = 131/299 (43%), Gaps = 41/299 (13%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG------SLCSGLKPWISVALFLI 143
           E  G+ A+R I  +   + +P +  I  +   A+ IG       L       I + ++L+
Sbjct: 62  EMRGVHARRSIPPHTTCVSIPRRCLITVEMGQATPIGRAILQADLDLDAPKHIFLMIYLL 121

Query: 144 REKKKEDSP--WRVYLDILPEC-TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
            ++K   S   +  Y +ILP    +  +FWS  EL EL+G+ LLS      + +Q++Y  
Sbjct: 122 WDRKTHGSSSFFHPYYEILPPTLRNMPIFWSAFELQELEGSHLLSQIADRGQAIQDDY-- 179

Query: 201 VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL-VLIPLADLINHSPGIT 258
              E IL          TLD+F WA   + SR F  ++ G     L+P AD++NH     
Sbjct: 180 ---EAILEVAPSLGTLCTLDEFKWARMCVCSRNFGLQIDGHRTSALVPHADMLNHYRPRE 236

Query: 259 TEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF-IESKSD 317
           T+    E+           F++ +   ++AG QV   Y   K N    L+YGF +E   +
Sbjct: 237 TKWTFDEVTQC--------FTITSLQSIQAGAQVYDSYG-QKCNHRFLLNYGFAVEDNRE 287

Query: 318 RNAYT-----LTLEISESDPFFGDKLDI---AETNGLGESAYFDIV-------LGRTLP 361
            + +      L L +  +D  F DKL+     ETN +  +    ++       +GR +P
Sbjct: 288 LDGFCPNEVPLELYVDPADILFQDKLEFWTRGETNQISGAVTAGLIAQAVGGSMGRGVP 346


>gi|380477696|emb|CCF44010.1| SET domain-containing protein, partial [Colletotrichum
           higginsianum]
          Length = 448

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 75/279 (26%), Positives = 120/279 (43%), Gaps = 37/279 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE----IGSLCSGLKPWISVA------- 139
           GLGL++ +D++++E  LE      I  + V ++E       +    +  +  A       
Sbjct: 34  GLGLISNKDLSRDEETLETSTLLRIPHELVLSAEAVENYAKVDKNFRQLLDAAGHKSTRH 93

Query: 140 ---LFLIREKKKED-----------SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
              L+L+ +K   +           +PW  Y+  LP        W+E+E   LQGT L S
Sbjct: 94  DICLYLLTQKVLSERLKTSLQGGVSTPWTEYVKYLPPRVPVPTLWTEQERDMLQGTSLES 153

Query: 186 TTLGVKEYVQNEYLKVEE--EIILPNKQLF--PRPITLDDFLWAFGILRSRAFSRLRGQN 241
            T      + +E+ ++ E    +    +LF     I+L D++      RSR    L    
Sbjct: 154 ATAAKIVALTDEFDELRETSSTLTFWNELFWESEKISLIDWVRVDAWFRSRCL-ELPKSG 212

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
             ++P+ DL NHS   +  +  YE  G      +++  LR    V +GE++ I Y   KS
Sbjct: 213 EAMVPVLDLANHS---SEANAYYEENGKD----EVVLLLRPGCRVSSGEEMTISYGDAKS 265

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
            AE+   YGFI+  S  +  TL L   E DP    KL I
Sbjct: 266 GAEMLFSYGFIDPVSAADRMTLPLMPLEDDPLGKAKLHI 304


>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 470

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 108/441 (24%), Positives = 182/441 (41%), Gaps = 71/441 (16%)

Query: 62  AQVETFWQWLRDQKV-VSPK-SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           A+  TF  WL++  +  +PK + +       G G+VA  DI  +E++  +P    +N   
Sbjct: 8   ARTATFSAWLQEMGIRTNPKMALVDLRQEGRGRGVVAIEDIDDDEIIFSIPRSAVLNAQN 67

Query: 120 VAASEIGS-LCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
                I   L   +  W+++   L+ E + +DS W  YL ILPE  +S VFWS+ EL EL
Sbjct: 68  AKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAILPEQLNSLVFWSDSELAEL 127

Query: 179 QGTQLLS---------------TTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI------ 217
           Q + ++                T  G+ ++   E       +I+      P P       
Sbjct: 128 QASAVVKKIGKQGAEDMFKTYITPQGL-QHSSTEMCHKVASVIMAYAFDIPDPSEGPTSG 186

Query: 218 -----TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
                  DD +   G       S        +IPLAD++N            +   A L 
Sbjct: 187 GKGEEAADDLVSDDGEDEKTILS--------MIPLADMLNADA---------DRNNARLI 229

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFI-ESKSDRNAYTLTLE---- 326
             +    +R   P+  GE++   Y  L +S+  L   YG++ +  S  +   ++ E    
Sbjct: 230 CDNEDLEMRAIKPIAKGEEIFNDYGQLPRSD--LLRRYGYVTDGYSAYDVAEISAELIVS 287

Query: 327 ------ISESDPFFGD-----KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGG 375
                 +  S P         +LD+AE  G+ + + FD+V      P++   L       
Sbjct: 288 LFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDES-FDLVHSSPDEPSIPDELLAFL--- 343

Query: 376 TDAFLLESIFRNTIWGHLDLPV-SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
               + ES  +  +     LP  S    EL  +V+    ++  + + TT+EEDE LL+  
Sbjct: 344 YLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAILLQARENEYSTTVEEDEDLLKNA 403

Query: 435 NLDPRLRIAVGVRAGEKRVLQ 455
           +L  R  +A+ VR+GEK+VL+
Sbjct: 404 DLPIRTAMAIQVRSGEKKVLR 424


>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
          Length = 485

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 107/419 (25%), Positives = 175/419 (41%), Gaps = 67/419 (15%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN-PDTVAASEIGSLCSGLKPWISVALFLIREK-KK 148
           G G+ A RDIA++E +  +P    ++  ++ A S +G     L PW+S+ + +I E  + 
Sbjct: 48  GRGICASRDIAEDEELFVIPDDLILSVQNSEARSVLGLDDKQLGPWLSLIITMIYEYYQG 107

Query: 149 EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
           E S W  Y  ILP   D+ +FW++E+L ELQG+ ++   +G K    +  L+    +I  
Sbjct: 108 EQSKWYPYFRILPSSFDTLMFWTDEQLSELQGSAVVG-KIG-KAAADDTILQKVVPLIQA 165

Query: 209 NKQLFP-----RPITLDDFLWAFGILRSRAFSRLRGQ--NLVLIPLADLINHSPGITTED 261
           N   FP      P+   D   A   L  R  S +     ++     AD      G  T+D
Sbjct: 166 NPHHFPPRPNMPPLNSPDSQNALLCLAHRMGSIIMAYAFDIEKADEADEDTAEDGYMTDD 225

Query: 262 YAYEIKG----------------AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
                KG                A LF  +  F ++    + +GE++   Y      A+L
Sbjct: 226 EDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYG-ELPRADL 284

Query: 306 ALDYGFIESKSDRNAYTLTLEIS-----------ESDPFFGD-KLDIAETNGLGESAYF- 352
              YG++   +D  A    +E S           +S P   + +L++ +   + E  Y  
Sbjct: 285 LRRYGYV---TDNYAQYDVVEFSLDGICKVAGLPDSKPSSTNPRLELLDNLDMLEEGYSI 341

Query: 353 -----DIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICR 407
                +  L   +P   L  LR + L       +E + R         P   A+E     
Sbjct: 342 PRVPPNGTLKDAIPEDFLVLLRALTLP------VEDLNRLKARNKAPKPEFSASE---AS 392

Query: 408 VVRDACKSALSGFHTTIEEDEQLL-----QGGNLDP----RLRIAVGVRAGEKRVLQQI 457
           ++R    S  S + T+I+EDE +L     Q G ++     R ++AV VR GEK +L QI
Sbjct: 393 LLRSLVNSRQSEYPTSIQEDENILRYLEQQNGYINDSILIRRKMAVQVRKGEKEILTQI 451


>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 485

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 105/457 (22%), Positives = 182/457 (39%), Gaps = 88/457 (19%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + F  WL+ +  V     I+ A       G G+VA  DI ++E +  +P    ++    +
Sbjct: 18  DEFMSWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQNSS 77

Query: 122 ASEIGSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
             ++         PW+ + + +I E  +   S W  Y  +LP   D+ +FW++EEL EL 
Sbjct: 78  LKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEELRELS 137

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG------------ 227
           G+ +L+  +G  +   N  L+    ++  N   FP    +  F    G            
Sbjct: 138 GSAVLN-KIGRSDAEAN-ILRNILPLVSGNPSHFPPMSGVASFDSPEGKAALLSLAHRMG 195

Query: 228 -ILRSRAFSRLRGQN------------------LVLIPLADLINHSPGITTEDYAYEIKG 268
            ++ + AF   +G+N                    ++PLADL+N            +   
Sbjct: 196 SLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSKGMVPLADLLNADT---------DRNN 246

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS 328
           A LF  D   S+R+  P++ GE++   Y      A+L   YG++   +D  A    +EIS
Sbjct: 247 ARLFQEDCYLSMRSIKPIRKGEEIFNDYG-ELPRADLLRRYGYV---TDNYAQYDEVEIS 302

Query: 329 ESDPFFGDK------------LDIAETNGLGESAY------FDIVLGRTLPPAMLQYLRL 370
                +               L+  E  G+ +  Y       +  L   LP  +L  L +
Sbjct: 303 MRTICYAAGIPSAIPGPEHPCLEFLENMGVFDDGYSIPRLNLNTPLMEVLPEELLIALNV 362

Query: 371 VALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
           + +   D F    +  N       L V+ A       ++  A +  L  + TT+EED+ L
Sbjct: 363 LTM-PPDQF--NQLKMNNKVPKPRLGVTEAT------LLATAVQQTLGDYTTTLEEDKDL 413

Query: 431 LQG-GNLDP---------RLRIAVGVRAGEKRVLQQI 457
           L    N+           RL++A+ VR GEK++L Q+
Sbjct: 414 LSSISNIQESTDSTASARRLKMALQVRIGEKQILSQV 450


>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
          Length = 485

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 107/453 (23%), Positives = 180/453 (39%), Gaps = 86/453 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           FW   R    VSPK  I    +   G G+VA  DI ++E +  +P    +        ++
Sbjct: 22  FWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQNSRLKDL 81

Query: 126 GSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL 183
                    PW+ + + +I E  +   S W  Y  +LP   D+ +FW++EEL EL G+ +
Sbjct: 82  LDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEELRELSGSAV 141

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG-------------ILR 230
           L+  +G  +   N  L+    ++  N   FP    +  F    G             ++ 
Sbjct: 142 LN-KIGRSDAEAN-ILRNILPLVSGNPSHFPPMSGVASFDSPEGKAALLSLAHRMGSLIM 199

Query: 231 SRAFSRLRGQN------------------LVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
           + AF   +G+N                    ++PLADL+N            +   A LF
Sbjct: 200 AYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADA---------DRNNARLF 250

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
             D   S+R+  P++ GE++   Y      A+L   YG++   +D  A    +EIS    
Sbjct: 251 QEDCYLSMRSIKPIRKGEEIFNDYG-ELPRADLLRRYGYV---TDNYAQYDEVEISMRTI 306

Query: 333 FFGD------------KLDIAETNGLGESAY------FDIVLGRTLPPAMLQYLRLVALG 374
            +              +L+  E  G+ +  Y       +  L   LP  +L  L ++ + 
Sbjct: 307 CYAAGIPSAIPGPEHPRLEFLENMGVFDDGYSIPRLNLNTPLMEVLPKELLIALNVLTM- 365

Query: 375 GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG- 433
             D F    +  N       L V+ A       ++  A +  L  + TT+EED+ LL   
Sbjct: 366 PPDQF--NQLKMNNKVPKPRLGVTEAT------LLATAVQQTLGDYTTTLEEDKDLLSSI 417

Query: 434 GNLDP---------RLRIAVGVRAGEKRVLQQI 457
            N+           RL++A+ VR GEK++L Q+
Sbjct: 418 SNIQESTDSTASARRLKMALQVRIGEKQILSQV 450


>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
          Length = 440

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 123/264 (46%), Gaps = 34/264 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     + + PA FP  G GL+++  + + ++++ +P    I  DTV  S +G+
Sbjct: 38  KWLKDRKF--EDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLITTDTVIRSYLGT 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPQAYTCPVCL-EPEVVNLFPKP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ E+     +     + LF   +    +    LWA+  + +RA + + 
Sbjct: 155 LKAKAEEQRARVQ-EFFSSSRDFFSSLQPLFSEAVESIFSYRALLWAWCTVNTRAVYVKH 213

Query: 238 RGQNL--------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDL-LFSLRTPVPVKA 288
           R +           L P  DL+NHSP         E++  G F+ +   + +RT    + 
Sbjct: 214 RQRQCFSTEPNTYALAPYLDLLNHSP---------EVQVKGAFNEETRCYEIRTASNCRK 264

Query: 289 GEQVLIQYDLNKSNAELALDYGFI 312
            E+V I Y  +  N  L L+YGF+
Sbjct: 265 HEEVFICYGPH-DNQRLLLEYGFV 287


>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 490

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 91/197 (46%), Gaps = 13/197 (6%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           R+K    +PW  Y+  +P       F+S EE   L+GT L   +   LG  E   +   +
Sbjct: 119 RQKIGISNPWTEYIRFMPASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQ 178

Query: 201 VEEEIILPNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
             EEI    +  +       T DD+ +   + RSR     R  +  ++P  D+ NH+   
Sbjct: 179 ATEEIPWCQEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSGH-AIVPCVDMANHACED 237

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KS 316
           + +   Y+ +GAG    + +  LRT   ++ GE+V I Y   K  +E+   YGF+E+ ++
Sbjct: 238 SVK-AKYDEEGAG----NAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERT 292

Query: 317 DRNAYTLTLEISESDPF 333
           D     L LEI + DP 
Sbjct: 293 DAKQIFLDLEIPDDDPL 309


>gi|449525577|ref|XP_004169793.1| PREDICTED: uncharacterized protein LOC101224730 [Cucumis sativus]
          Length = 553

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 93/188 (49%), Gaps = 11/188 (5%)

Query: 58  NPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           N   A +E F QWL+          I+ +   +G GL +  + A + V+L VP+   I P
Sbjct: 3   NSDQANLELFLQWLQVNGADLRGCTIKYSDLSKGCGLFSA-NYASDGVLLVVPLDLAITP 61

Query: 118 DTVAASEI-GSLCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFW 170
             V    + G  C  +     V       LFL+ E+ +E+S W  YLD+LP    + +++
Sbjct: 62  MRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVERLRENSSWNPYLDVLPTRFGNPLWF 121

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII---LPNKQLFPRPITLDDFLWAFG 227
           +++EL+EL+GT L   T   K  +Q+ Y    ++++   L  +    R ++ +DFLWA  
Sbjct: 122 TDDELLELKGTTLYRATELQKNSLQSLYENKVKKLVSRLLTLEGFTGREVSFEDFLWANS 181

Query: 228 ILRSRAFS 235
           I  +RA +
Sbjct: 182 IFWARALN 189


>gi|347841961|emb|CCD56533.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 377

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 135/309 (43%), Gaps = 60/309 (19%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           + F +W  +Q VV   + + P  FPE GLGLVA ++    +++++VP+K         A+
Sbjct: 7   QEFTKWAVNQGVVI--NDVSPFRFPEKGLGLVANKNFETGDILVQVPIKV-----LRKAT 59

Query: 124 EIGSLCSGLKPWISV-ALFLIREKKKEDSPWRVYLDILPECTDST----VFW--SEEELV 176
           ++ S  + L P I+V ALF +         W   +  LP   D      +FW  S +EL+
Sbjct: 60  DVPSQFAALAPDIAVHALFALSLDSLLGPEW---IATLPSKQDMHSSMPLFWDISLQELL 116

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR-PITLDDFLWAFGILRSRAFS 235
                 LL T +   E + + +  +         + FP  PIT D F++ + I+ SR F 
Sbjct: 117 PYSSRALLKTQM---ENITSAWTAI--------CKTFPEPPITYDGFIYNYSIVNSRTFY 165

Query: 236 RL------------RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
            L            +   L L P AD INHS   T +     +  AG       ++L   
Sbjct: 166 YLSPTIKPSKPQPSKENRLALNPFADYINHSSEPTVDA---TLSRAG-------YTLTAS 215

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAET 343
            P+K G +V I Y  + +N  L ++YGFI   +  +  TL       DP+    L + + 
Sbjct: 216 QPIKQGSEVHISYG-SHNNDFLLVEYGFILEDNRWDEVTL-------DPWITPLLSVEQK 267

Query: 344 NGLGESAYF 352
             L E+ + 
Sbjct: 268 EHLEETGFL 276


>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
          Length = 441

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 90/407 (22%), Positives = 170/407 (41%), Gaps = 43/407 (10%)

Query: 81  SPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP----W 135
           +P++PA F +   GL   + I    +++ +P    +   TV  S +G      KP     
Sbjct: 48  TPLQPALFSDTDRGLQTLQPIQPGGMLVSLPESCLLTTSTVLHSYLGPFLKSWKPRPSSL 107

Query: 136 ISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV 194
           +++ +FL+ E+ + E S W  Y+D+LP C+     +  + ++ +  + +       +E +
Sbjct: 108 VALCVFLVCERHRGEASDWFPYIDVLP-CSYCCPPYFTDTVMAVLPSGVRRRAEEQREGL 166

Query: 195 QNEYLKVEEEIILPNKQLFPRP----ITLDDFLWAFGILRSRAF-------SRLRG-QNL 242
           Q+ Y  V ++  +  + +   P    +T +   WA+  + +R+        S L G  N 
Sbjct: 167 QHLY-AVHQDFFMSLQPVLSHPPEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPDNY 225

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            L P  DL+NH P +  +        AG       + +R+   V+   Q  I Y  +  N
Sbjct: 226 ALAPFLDLLNHRPDVQVK--------AGFNRTSGCYEIRSISGVQRYHQAFINYG-SHDN 276

Query: 303 AELALDYGFIESKSDRNAY----TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
             L L+YGF+ S +  +       L  E+   D    +K+     NG  +    ++ L  
Sbjct: 277 QRLLLEYGFVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFLQ----NLTLSD 332

Query: 359 TLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
             P   +L  LRL++      +L    +R  + G +   VS    +   +  +  C+  L
Sbjct: 333 EGPSWRLLTALRLLSPLPIPRWLSSQRWRTLLLGQM---VSDEEAQRSVQTAKTLCERLL 389

Query: 418 SGFHTTIEEDEQLLQGGNLDPR--LRIAVGVRAGEKRVLQQIDEIFK 462
               T+++E   LLQ      R  L++   +R  E+ +L    E+ K
Sbjct: 390 RETRTSLQEISHLLQQSERPVREQLQVVGSLRREERCILGNCLEVLK 436


>gi|302812508|ref|XP_002987941.1| hypothetical protein SELMODRAFT_126991 [Selaginella moellendorffii]
 gi|300144330|gb|EFJ11015.1| hypothetical protein SELMODRAFT_126991 [Selaginella moellendorffii]
          Length = 424

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 96/406 (23%), Positives = 170/406 (41%), Gaps = 67/406 (16%)

Query: 89  PEGLGLVAQRD---IAKNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G+G+ A +D   I +  V+LEVPM+  +           PD V     +  + +   P
Sbjct: 44  PDGVGVYAAKDMTDIERPRVILEVPMEILVTVSSNLPWMFFPDIVPIGHPVFDVINSTDP 103

Query: 135 W----ISVALFLIREKKKEDSPWRVYLDILPECTDST--VFWSEEELVELQGTQLLSTTL 188
                I +   L+   +++ + WR+Y D LP   +ST  +  +EEEL  LQ + L     
Sbjct: 104 ETDYDIRLGCLLLLALEQDGNFWRLYGDYLPNQAESTDLLLATEEELDALQDSVLSQCIK 163

Query: 189 GVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD--FLWAFGILRSRAF-------SRLRG 239
             +E +Q ++    EE   P   L  + +  D   F WA G+ RSR+        S+++ 
Sbjct: 164 DEQEKIQKKW----EECFHPGAPLKLKRLARDSDRFKWAVGMARSRSHTLTMKIGSKVQV 219

Query: 240 QNLVLIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
            N++  P AD++NHS  P  +    A +        R L   ++    ++ G+++ + Y 
Sbjct: 220 ANII-SPYADMLNHSFQPTCSLRWRACD--------RVLEVVVKAGQTIREGDEMTLDYL 270

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
                 +    +GF    +  N    +  +S     F     I+   GLG + Y      
Sbjct: 271 PKNPTRDYMHRFGFSSPVNPWNVVKFSGSVSIHRDSFLSAFGIS---GLGPNYYL----- 322

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
           R  P A             D  +L +     +W   DLP   + E+   R ++  C+  L
Sbjct: 323 RDDPMA-------------DGAVLAAARILPMWSDADLPFLPSMEKRAVRELQKECRQLL 369

Query: 418 SGFHTTIEEDEQLLQGGNLDP--RLRIAVGVRAGEKRVLQQIDEIF 461
           S + ++ E+D +L+     D   R + A+  R+ EK  L+++  I 
Sbjct: 370 SAYPSSYEQDVKLMAAAGEDKSVRWKSALKYRSDEKLFLRKVVRIL 415


>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/246 (27%), Positives = 111/246 (45%), Gaps = 34/246 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-----GSLCSGLKPWISVALF--LI 143
           G G+VA +DI+  EVV+ VP +  + P+  + SE       +  SG     S+ L   L+
Sbjct: 29  GRGVVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALM 88

Query: 144 REKK-KEDSPWRVYLDILPECTDST-VFWSEEELVELQGTQLLSTTLGVKEY-------- 193
            EKK  + S W+ YLD LP+      +FW  E+L  L+GT L+    G K          
Sbjct: 89  TEKKLGKSSKWKGYLDFLPKSIPGMPLFWDSEQLQSLEGTSLIEKMNGCKAMPDRPLEPP 148

Query: 194 -----VQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLA 248
                V   +L+    + LP+     R +    ++WA  ++ + +F+    +   ++P+ 
Sbjct: 149 CKFNSVVLPFLQSNAHLKLPHNAASTRRL----YVWATAMVSAYSFTIGEDRFQAMVPMW 204

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
           D +NH  G       +  +   L  R +   L     +  GEQV+  Y  +  N+EL   
Sbjct: 205 DALNHITGHANVRLHHCARKGAL--RMIATCL-----ITKGEQVINSYG-DLPNSELLRR 256

Query: 309 YGFIES 314
           YGF+E+
Sbjct: 257 YGFVET 262


>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
 gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
          Length = 495

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 134/311 (43%), Gaps = 56/311 (18%)

Query: 65  ETFWQWLR--DQKVVSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + FW+WL    +  VSPK  I       EG  +VA   I K+EV+ E+P    +N DT  
Sbjct: 11  DEFWRWLHCVAKYSVSPKVRIEDFREINEGRCVVASEHIEKDEVLFEIPRDSILNVDT-- 68

Query: 122 ASEI-----------GSLCSGLKPWISVALFLIREK--KKEDSPWRVYLDILPECTD--S 166
            SE+            ++   +  W ++ L L  E   KKE+S W  Y  +LP+ TD  +
Sbjct: 69  -SELFKNHYEGYIDGKTVIEEIGLWETLILCLFYEMFVKKEESFWSQYFAVLPKATDFNT 127

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWA 225
            ++W + EL  L+ + +L      K    +E L    E +  N  +      T D FL  
Sbjct: 128 LMYWEDRELENLKPSFILERIGKDKSVAMHEKLM---EFVEKNLDVIETSSFTWDRFLLV 184

Query: 226 FGILRSRAFSRLRG----------------QNLV--LIPLADLINHSPGITTEDYAYEIK 267
             I+ + +F   RG                ++L+  +IPLAD +N        +  Y+  
Sbjct: 185 ASIIMAYSFDIERGECDADEEEEEEEEDIERSLIKSMIPLADTLNADTKRCNANLIYD-- 242

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE---SKSDRNAYTLT 324
            +G      +  +    P+KA EQ+   Y  N +N EL   YG++E   + +D    +LT
Sbjct: 243 -SG------VLKMCAIKPIKANEQIYNTYG-NHANFELLRRYGYVEVDGTLNDCGEISLT 294

Query: 325 LEISESDPFFG 335
             I  +  +FG
Sbjct: 295 TIIDATIKYFG 305


>gi|449451195|ref|XP_004143347.1| PREDICTED: uncharacterized protein LOC101219815 [Cucumis sativus]
          Length = 553

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 92/188 (48%), Gaps = 11/188 (5%)

Query: 58  NPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           N   A +E F QWL+          I+ +   +G GL +  + A + V+L VP+   I P
Sbjct: 3   NSDQANLELFLQWLQVNGADLRGCTIKYSDLSKGCGLFSA-NYASDAVLLVVPLDLAITP 61

Query: 118 DTVAASEI-GSLCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFW 170
             V    + G  C  +     V       LFL+ E+ +E+S W  YLD+LP    + +++
Sbjct: 62  MRVLQDPLYGPECRAMYEEGEVDDRFLMILFLMVERLRENSSWNPYLDVLPTRFGNPLWF 121

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII---LPNKQLFPRPITLDDFLWAFG 227
           ++ EL+EL+GT L   T   K  +Q+ Y    ++++   L  +    R ++ +DFLWA  
Sbjct: 122 TDNELLELKGTTLYRATELQKNSLQSLYENKVKKLVSRLLTLEGFTGREVSFEDFLWANS 181

Query: 228 ILRSRAFS 235
           I  +RA +
Sbjct: 182 IFWARALN 189


>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 490

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 91/197 (46%), Gaps = 13/197 (6%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           R+K    +PW  Y+  +P       F+S EE   L+GT L   +   LG  E   +   +
Sbjct: 119 RQKIGISNPWTEYIRFMPASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQ 178

Query: 201 VEEEIILPNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
             EEI    +  +       T DD+ +   + RSR     R  +  ++P  D+ NH+   
Sbjct: 179 ATEEIPWCQEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSGH-AIVPCVDMANHACED 237

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KS 316
           + +   Y+ +GAG    + +  LRT   ++ GE+V I Y   K  +E+   YGF+E+ ++
Sbjct: 238 SVK-ARYDEEGAG----NAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERT 292

Query: 317 DRNAYTLTLEISESDPF 333
           D     L LEI + DP 
Sbjct: 293 DAKQIFLDLEIPDDDPL 309


>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 480

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 77/302 (25%), Positives = 128/302 (42%), Gaps = 57/302 (18%)

Query: 54  SATTNPPTA-----QVETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEV 105
           S+T + P A     Q + F QWL  Q  V+  S I          G G+VA+ DI + E 
Sbjct: 2   SSTEHFPDADDFQRQTDGFMQWLSQQTGVTISSKIEVQDLHHQGSGRGVVARSDIQEGED 61

Query: 106 VLEVPMKFWINPDTVAASEI-GSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPEC 163
           +  +P +  +   T   +EI       L PW+S+ + +I E    E S W  Y  +LP  
Sbjct: 62  LFHLPQRVVLMVKTSPLNEILADELKNLGPWLSLVVVMIYEYSLGERSNWNQYFQVLPTK 121

Query: 164 TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TL 219
            D+ +FWS EEL +LQ + ++   +G K+  ++ + K+   ++  +  LFP P+    + 
Sbjct: 122 FDTLMFWSGEELSQLQASAVIH-KIGKKDAEEDIFEKI-IPLVRSHPDLFP-PVNGVMSY 178

Query: 220 DD------------------FLWAFGILRSRAFSRLRGQNLV----------LIPLADLI 251
           DD                    +AF I +            +          ++PLADL+
Sbjct: 179 DDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEGEDGYLTDDEEQLPKGMVPLADLL 238

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYG 310
           N            +   A LF  D    +R   P+K G+++   Y +L +S+  L   YG
Sbjct: 239 NADA---------DRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSD--LLRRYG 287

Query: 311 FI 312
           ++
Sbjct: 288 YV 289



 Score = 41.6 bits (96), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 7/65 (10%)

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQG-------GNLDPRLRIAVGVRAGEKRVLQQID 458
            R++ D+ KS LS + TT+E+D+ +LQ           + R ++AV VR GEK +LQ + 
Sbjct: 388 ARILLDSVKSKLSQYGTTVEQDKAILQQFASTSSLSTSERRRKMAVEVRLGEKEILQHVS 447

Query: 459 EIFKE 463
            + ++
Sbjct: 448 VMLQD 452


>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
 gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
          Length = 464

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 110/431 (25%), Positives = 183/431 (42%), Gaps = 72/431 (16%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWI------NPDTVAASEIGSLC--SGLKP---W-IS 137
            G+GLVA +D+ +   ++ +P +  +      N   +A SE G +C  +   P   W + 
Sbjct: 46  HGMGLVATQDLPQGSTIITLPRRVPMPMPDPENAAVLAPSE-GVICEIANRVPEELWAMR 104

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ-- 195
           + L L+ E+ ++ S W  Y+ +LP      +F+S    V+++       T  VK+  +  
Sbjct: 105 LGLKLLYERAQKGSYWWPYISMLPHSFTLPIFFSG---VDIESIDYAPVTHQVKKRCRFL 161

Query: 196 ----NEYLKVEEEIILPNKQLFP---RPITLDDFLWAFGILRSRAFSRLRGQ-----NLV 243
                E  K+E    LP +++ P   + +      WA   + SRAF R+ G      + +
Sbjct: 162 LQFSAELAKLES---LP-EEVHPFAGQSVDSGALGWAMAAVSSRAF-RIHGVTNKLCSAM 216

Query: 244 LIPLADLINHS--PGITTEDYAYEIKGAGLFSRDL----LFSLRTPVPVKAGEQVLIQYD 297
           ++PL D+ NHS  P    E+           SRD        + T   ++ G  + + Y 
Sbjct: 217 MLPLIDMCNHSFQPNAHIEED---------LSRDAQDVSFLKVVTKRNLEKGSAITLNYG 267

Query: 298 LNKSNAELALDYGFI--ESKSDR-----------NAYTLT-LEISESDPFFG------DK 337
              SN  L LDYGF+  ++  DR           NA  +  L  + S PF        D+
Sbjct: 268 -PLSNDLLLLDYGFVIPDNPHDRIELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDR 326

Query: 338 LDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPV 397
           L        GES    +     +   +L  LR++     +      +     WG ++  V
Sbjct: 327 LKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWG-VESMV 385

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           S  NEE + R +        + F TTIEEDE  L   +L    RIAV  R  +KR++ ++
Sbjct: 386 SSDNEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRV 445

Query: 458 DEIFKERELEL 468
            E  K+R ++L
Sbjct: 446 LESLKKRLMDL 456


>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
           distachyon]
          Length = 532

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 63/256 (24%), Positives = 117/256 (45%), Gaps = 18/256 (7%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           ++  +W  DQ V   KS ++ A F   G G+VA  +I    + LE+P    I+ + +  S
Sbjct: 151 DSLLKWGEDQGV---KSKLQIAFFQGAGRGMVASENIGVGHIALEIPESLIISEELLCQS 207

Query: 124 EIGSLCSGLKPWISVALFLI---REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
           ++      L    +  + L+   RE+    S ++++ + LP   ++ + +    L  L+G
Sbjct: 208 DMFLALKDLNSITTETMLLLWSMRERHNPSSNFKMFFETLPSNFNTGLNFGIGALAALEG 267

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRG 239
           T L    +  ++++  +Y ++   +     ++F + I T D+FLWA  +  S +   +  
Sbjct: 268 TLLFDELMQARQHLHQQYDELFPMLCTKFPEIFTQDIYTWDNFLWACELWYSNSMMVVLS 327

Query: 240 QN---LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
                  LIP+A L+NHS      +Y    +     ++ L F L    P KAG+Q  + Y
Sbjct: 328 SGKLTTCLIPVAGLLNHSVYPHILNYGRVDQA----TKSLKFPLSR--PCKAGQQCFLSY 381

Query: 297 DLNKSNAELALDYGFI 312
               S + L   YGF+
Sbjct: 382 G-KHSGSHLITFYGFL 396


>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
          Length = 439

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 115/263 (43%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D++     + + PA FP  G GL+++  + + + ++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRRFED--ATLIPARFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D SPW+ YL++LP+     V   E E+V L    
Sbjct: 96  YIAKWQPPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPKAYTCPVCL-EPEVVNLLPNP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRAFSRLR 238
           L +     + +VQ E+           + LF   I    +     WA+  + +RA    R
Sbjct: 155 LKTKAWEQRSHVQ-EFFSSSRGFFSSLQPLFSEAIETIFSYRALRWAWCTVNTRAVYMKR 213

Query: 239 GQNLV---------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
              L          L P  DL+NHSP +  +        A        + +RT       
Sbjct: 214 PPQLCLSPEPDTCALAPYLDLLNHSPDVQVK--------AAFNEETRCYEIRTATRCGKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           ++V I Y  +  N  L L+YGF+
Sbjct: 266 KEVFICYGPH-DNHRLLLEYGFV 287


>gi|428185794|gb|EKX54646.1| hypothetical protein GUITHDRAFT_100121 [Guillardia theta CCMP2712]
          Length = 354

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 70/230 (30%), Positives = 102/230 (44%), Gaps = 19/230 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTV-AASEIGSLCSGLKPWISVALFLIREKKKE 149
           G GL A RD+  +EV L+VP +  ++  TV   S    +  G+     +AL L+ EK + 
Sbjct: 43  GRGLRATRDLLPDEVALKVPSELLLSDQTVLHHSNYEDMQEGMSWSSRLALGLLNEKSRG 102

Query: 150 D-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV----KEYVQNEYLKVEEE 204
           D S  R Y+ +LP        W E+ELVELQ   L   T  +     E  ++     E  
Sbjct: 103 DRSVHRDYIGMLPAPPRVLSRWKEDELVELQNRTLEGETDMIYFWRHENWEDTITSAERV 162

Query: 205 IILPNKQLFPRPITLDDFLWAFGILRSRA--FSRLRGQNLVLIPLADLINHSPGITTEDY 262
           +       F + I+  DFL A  ++ SRA      RG    L+P+ DL NHSP      Y
Sbjct: 163 LTGSEFDSFRQRISQQDFLDAHDLVCSRAIKLETARGVERFLVPVFDLANHSP--RGGRY 220

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           A +  G           L T + V  G +V + Y   ++N +  L YG +
Sbjct: 221 AMDEDGN--------VCLITGIEVTRGAEVCLDYG-GRTNDQFLLHYGHL 261


>gi|115448513|ref|NP_001048036.1| Os02g0733800 [Oryza sativa Japonica Group]
 gi|46390671|dbj|BAD16153.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|113537567|dbj|BAF09950.1| Os02g0733800 [Oryza sativa Japonica Group]
 gi|215768359|dbj|BAH00588.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 486

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 109/441 (24%), Positives = 183/441 (41%), Gaps = 57/441 (12%)

Query: 63  QVETFWQWL-RDQKVVSPKSPIRPATFPE-GLGL---VAQRDIAKNEVVLEVPMKFWIN- 116
           Q     +W+ R+   V P   +R    PE GLG+    A+ DI   +V++ +P +  +  
Sbjct: 53  QPPDLLRWVQREGGFVHPA--LRVVDHPEHGLGVSAAAAEGDIPPGDVLIALPGRLPLRL 110

Query: 117 --PDTVAASEIGSLCSGLKP--W-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
             P   A + +  L   +    W + + L L++E+ K DS W  Y+  LPE     +F+ 
Sbjct: 111 RRPAGAADAVLVQLADQVPEELWAMRLGLRLLQERAKSDSFWWPYIANLPETFTVPIFFP 170

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFPRPITLDDFLWAFGI 228
            E++  LQ   LL        ++     +V+ +   + L +     + +      WA   
Sbjct: 171 GEDIKNLQYAPLLHQVNKRCRFLLEFEKEVKHKLGTVPLEDHPFCGQDVNSSSLGWAMSA 230

Query: 229 LRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
             +RAF RL G+  +L+PL D+ NHS        A  ++   + S D+   +     +  
Sbjct: 231 ASTRAF-RLHGEIPMLLPLIDMCNHS----FNPNARIVQEGNVDSPDMSVKVVAETKIDQ 285

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL----------TLEISESDPFFGD-- 336
              V + Y    ++  L LDYGF+ + +  +   L          ++    S P F    
Sbjct: 286 NAAVTLNYGCYPNDFFL-LDYGFVITSNSYDQVELSYDGTLLDAASMAAGVSSPNFSAPA 344

Query: 337 --KLDI-AETNGLGESAYFDIVLG--RTLPPAMLQYLRLV------ALGGTDAFLLESIF 385
             + DI ++ N  GE A   + +G    +   +L  LR++      A+ G D   L S+ 
Sbjct: 345 KWQQDILSQLNLYGEGAILKVSIGGPEIVDGRLLAALRVIIAADPDAVSGHDLKTLMSLK 404

Query: 386 RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL--RIA 443
                     P+  A E    R V   C  AL  FHT I EDE +L+G   +P L   +A
Sbjct: 405 EKA-------PLGPAVEASALRTVLALCTFALQHFHTKIMEDEAILKG---EPPLTTELA 454

Query: 444 VGVRAGEKRVLQQIDEIFKER 464
           V  R  +K +L  + +    R
Sbjct: 455 VQFRLQKKLLLLDVIQNLSRR 475


>gi|410082986|ref|XP_003959071.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
 gi|372465661|emb|CCF59936.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
          Length = 584

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 112/475 (23%), Positives = 189/475 (39%), Gaps = 95/475 (20%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEV---VLEVPMKFWINP 117
           T  + +F  W +D      +S     +   G+     ++I  + +   ++ VP K  I  
Sbjct: 5   TDLLTSFLNWSKDNGAKINESIEFKTSKDSGISATIIKEIPSDAISKPLISVPSKLLI-T 63

Query: 118 DTVAASEIG----SLCSGLKPWISVALFLIREKKKEDSP----WRVYLDILPECTDSTVF 169
           + +A  E      +L S   P   + L+L + K    +     ++ YLDILP       F
Sbjct: 64  NELALKEFNVSSKNLSSLFNPNALIQLYLCKLKFNATTAKSDFFKPYLDILPPNVPHPYF 123

Query: 170 WSEEELVELQGTQLLSTTLGVKEYVQN------------EYLKVEEE------------- 204
           W++ +L  LQGT    T + +K+ +QN            E   +E+E             
Sbjct: 124 WNKSQLQLLQGTD---TLIKIKQNLQNLITEWYELLNVLEITPIEKEGTAFDVNDIDSIF 180

Query: 205 -IILPN-KQLFPRPITLDDFLWAFGILRSRAFSRL--------RGQNLVLIPLADLINHS 254
             I  N K   P+  +   +LW+FGI  SRAF  +              L P+ DL+NH 
Sbjct: 181 SYISENVKTTSPKWNSFIAYLWSFGIFTSRAFPEILINPDNCSNVNQAFLYPIVDLLNHK 240

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G T+  + +E   A  F+ +      T +    G+         KSN EL L YGF++S
Sbjct: 241 NG-TSVKWTFEDDQAHFFTNEKNLKKHTELFNNYGD---------KSNEELLLGYGFVQS 290

Query: 315 KSDRNAYTLTLEIS----ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRL 370
            +  +   LTL++     ES P FG  L+   T G  E   F +     LP  +L+    
Sbjct: 291 NNAHDDTKLTLKLDPQLIESMPSFGIVLNKENTVG-TECLQFTLSSRNPLPRNLLKLFGF 349

Query: 371 VALGGTDAFLLESIFRNTIWGHLDL-PVSHANEELICRVVR-DACKSALSGFHTTIEEDE 428
           ++   ++  L    +RN + G  +L  +  A  +   R  + DA     S  +T +    
Sbjct: 350 LSKLKSETSL---NYRNILEGSDELSSILDAKIQFFKRASKIDASTLEPSSNNTVV---- 402

Query: 429 QLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDLG 483
                      ++I     AG++R+              L+EL+ +Q+  LK++ 
Sbjct: 403 -----------IKILKDFFAGQRRIYIS----------SLEELQKFQKTTLKNIN 436


>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
          Length = 292

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 86/185 (46%), Gaps = 18/185 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGITT 259
            G+ T
Sbjct: 281 NGLVT 285


>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
          Length = 485

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 107/453 (23%), Positives = 180/453 (39%), Gaps = 86/453 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           FW   R    VSPK  I    +   G G+VA  DI ++E +  +P    +        ++
Sbjct: 22  FWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQNSRLKDL 81

Query: 126 GSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL 183
                    PW+ + + +I E  +   S W  Y  +LP   D+ +FW++EEL EL G+ +
Sbjct: 82  LDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEELRELSGSAV 141

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG-------------ILR 230
           L+  +G  +   N  L+    ++  N   FP    +  F    G             ++ 
Sbjct: 142 LN-KIGRSDAEAN-ILRNILPLVSGNPSHFPPMSGVASFDSPEGKAALLSLAHRMGSLIM 199

Query: 231 SRAFSRLRGQN------------------LVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
           + AF   +G+N                    ++PLADL+N            +   A LF
Sbjct: 200 AYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADA---------DRNNARLF 250

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
             D   S+R+  P++ GE++   Y      A+L   YG++   +D  A    +EIS    
Sbjct: 251 QEDCYLSMRSIKPIRKGEEIFNDYG-ELPRADLLRRYGYV---TDNYAQYDEVEISMRTI 306

Query: 333 FF------------GDKLDIAETNGLGESAY------FDIVLGRTLPPAMLQYLRLVALG 374
            +              +L+  E  G+ +  Y       +  L   LP  +L  L ++ + 
Sbjct: 307 CYVAGIPSAIPGPEHPRLEFLENMGVFDDGYSIPRLNLNTPLMEVLPKELLIALNVLTM- 365

Query: 375 GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG- 433
             D F    +  N       L V+ A       ++  A +  L  + TT+EED+ LL   
Sbjct: 366 PPDQF--NQLKMNNKVPKPRLGVTEAT------LLATAVQQTLGDYTTTLEEDKDLLSSI 417

Query: 434 GNLDP---------RLRIAVGVRAGEKRVLQQI 457
            N+           RL++A+ VR GEK++L Q+
Sbjct: 418 SNIQESTDSTASARRLKMALQVRIGEKQILSQV 450


>gi|354502761|ref|XP_003513450.1| PREDICTED: SET domain-containing protein 4 [Cricetulus griseus]
          Length = 440

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 121/277 (43%), Gaps = 32/277 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     + + PA FP  G GL+++  + + ++++ +P    +  +TV  S +G 
Sbjct: 36  RWLKGRKF--EDTGLVPACFPGTGRGLMSKTALQEGQMIISLPESCLLTTNTVIRSSLGP 93

Query: 128 LCSGLKP----WISVALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP     +++  FLI E+     S W+ YLDILP+     V   E ++V+L    
Sbjct: 94  YMKKWKPPPSPLLALCTFLISERHAGGQSLWKSYLDILPKSYTCPVCL-EPDVVDLLPQP 152

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +           + LF  P+    +   FLWA+  + +RA + R 
Sbjct: 153 LKAKAEEQRADVQD-FFASSRAFFSTLQPLFVEPVDGIFSYSAFLWAWCTVNTRAVYLRS 211

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
             Q  +        L P  DL+NHSP +  +    E  G         + +RT    +  
Sbjct: 212 TRQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFSEKTGC--------YEIRTASRCRKH 263

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           EQV I Y     N  L L+YGF+   +      + +E
Sbjct: 264 EQVFICYG-PYDNQRLLLEYGFVSVCNPHACVPVNIE 299


>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
 gi|194693232|gb|ACF80700.1| unknown [Zea mays]
 gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
 gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 252

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/173 (29%), Positives = 91/173 (52%), Gaps = 8/173 (4%)

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETN 344
           P KAGE ++I +   ++N+ L L+YGF++  +  +   +   ++  DP + +K  +A+ N
Sbjct: 56  PYKAGEPIII-WCGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRN 114

Query: 345 GLGESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANE 402
           G      F++ +G+       ML YLRL  +   D   ++SI  +   G    PVS   E
Sbjct: 115 GKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISNPDE--MQSILSSE--GDT-CPVSPCTE 169

Query: 403 ELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
             +   +    +S L+G+ TT++EDE +L  GNL+P+  +A  +   EK++L 
Sbjct: 170 RAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKMLH 222


>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
 gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
          Length = 447

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/305 (25%), Positives = 133/305 (43%), Gaps = 40/305 (13%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +W +   +V  K  IRPA F   G G++A   I  +E V+ VP +  I   +V  S IG+
Sbjct: 55  KWAKRNGMVFKK--IRPAIFSSTGRGMLAIERIHSSECVISVPERLLITASSVLESAIGN 112

Query: 128 LCS-----GLKPWIS--VALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELV--- 176
             +     G K      + LFL+ EK  E  S W  Y+  LP+  ++  +++ +EL    
Sbjct: 113 YVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPDTFNTPCYFTRKELFLLP 172

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR 236
           E    Q       +K+  ++ + K   +++      F R +  + F WA+ ++ +R+   
Sbjct: 173 EQCREQAFEQVTQIKQSYKS-FAKAYNDVLQDFDCNFWRTVDFESFKWAWCVVNTRSVYH 231

Query: 237 ----LRGQ----NLVLIPLADLINHSPGITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVK 287
                R Q    N  L PL DL+NH           + +  G F S    + +      +
Sbjct: 232 DEPNRRAQPIDGNCALAPLLDLLNHCD---------KAEMCGRFNSSSKNYEINVITEYQ 282

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY-----TLTLEISESDPFFGD-KLDIA 341
            G QV I Y  +  N  L L+YGF+  ++  N+Y     T+   +  S+ F  + K ++ 
Sbjct: 283 KGTQVFINYGPH-DNTRLFLEYGFVLPRNVHNSYRFTRSTILSHLGMSNSFLSNAKEELI 341

Query: 342 ETNGL 346
           E N L
Sbjct: 342 ERNSL 346


>gi|218197347|gb|EEC79774.1| hypothetical protein OsI_21183 [Oryza sativa Indica Group]
          Length = 470

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 98/420 (23%), Positives = 171/420 (40%), Gaps = 95/420 (22%)

Query: 89  PEGLGLVAQRDI---AKNEVVLEVPMKFWIN----------PDTVA-ASEIGSLCSGLKP 134
           P+G+ + A RD+    +  V++E+P++  +           PD +     I  +     P
Sbjct: 116 PDGMAVYASRDVDPLRRARVIMEIPLELMLTITQKRPWMFFPDIIPLGHPIFDIIESTDP 175

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV 190
              W + +A  L+     ED+ W++Y D LP   + T                 S  L  
Sbjct: 176 ETDWDLRLACLLLYAFDVEDNFWQLYGDFLPSVDECT-----------------SLLLAP 218

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLR-GQNL----VL 244
           K           + I L  K+L P     + FLWA  I++SR+ + +LR G  L    VL
Sbjct: 219 KH----------KTIPLKLKRLAP---DHERFLWALSIVQSRSVNLKLRMGAFLQDANVL 265

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           +P AD++NHSP      + +  K      R +   ++    VK G+++ I Y ++  N+ 
Sbjct: 266 VPYADMLNHSPDANCFLH-WRFK-----DRMVEVMIKAGRAVKKGDEMTIDY-MSGVNSS 318

Query: 305 LALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAM 364
               YGF    S  N + L    S++       L +    GL +  Y++  L        
Sbjct: 319 FMERYGF---SSPTNPWELINFSSDAKIHLDSFLSVFNIAGLHDELYYNAALT------- 368

Query: 365 LQYLRLVALGGTDAFLLESIFRNT----IWGHLDLPVSHANEELICRVVRDACKSALSGF 420
                     G + F+   +         W   D+P   + E    + +++ C + L  F
Sbjct: 369 ---------SGENNFVDGGVVAAARTLPTWSEGDVPAIPSLERKSAQALQEECHTMLESF 419

Query: 421 HTTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
            TTI++D+++L   G++     IA+  R   K +LQ+I          +D L+ YQ++ L
Sbjct: 420 STTIQQDQEILDSDGHIRRTREIAIKYRLHRKLLLQKI----------IDALDIYQDKIL 469


>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
          Length = 571

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 97/409 (23%), Positives = 170/409 (41%), Gaps = 42/409 (10%)

Query: 86  ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE---IGSLCSGLKPWISVALFL 142
           AT  E       +++   +V+L +P    +    V A E   +  L       + +AL+L
Sbjct: 111 ATTTEMAEFTLSKNVEGGDVILSIPQDNCVT--AVDAKEHPIVAPLIEEKPELVQLALWL 168

Query: 143 IREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNEYLK 200
             EK K + S W  YL  L    +S + ++EEE  EL +GT +       ++  + EY  
Sbjct: 169 CCEKAKAKGSEWWPYLKTLNGNPNSVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEA 228

Query: 201 VEEEIIL-PNKQLFPRPI----TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLI---N 252
           +   I   P K  +P  +    T   F+ A  I+ +RA          ++PL D I    
Sbjct: 229 LRAAIAEDPGK--YPLDVYAFLTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPICG 286

Query: 253 HSPGITTEDYA----YEIKG--AGLFSRDLLFS----------LRTPVPVKAGEQVLIQY 296
             P ++ ED +    YEI+    GL +    ++          L       AG ++L   
Sbjct: 287 APPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQID 346

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVL 356
              ++N+EL L +G ++ +   +      E+SE+DP +  K  + E  G  +      V 
Sbjct: 347 HSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQGFADKGQTFPVY 406

Query: 357 GRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSA 416
              +P   L YLR   +  ++     S   + +       VS  NE    +++   C+  
Sbjct: 407 KDRMPREFLSYLRFARVTNSEELFAVSFTEDKV-------VSPMNEYETLQLLMADCRDR 459

Query: 417 LSGFHTTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           +S + T  EEDE LLQ   ++  ++R A  +R  EK ++ ++    + R
Sbjct: 460 MSAYDTN-EEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNAVRRR 507


>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 514

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 107/421 (25%), Positives = 170/421 (40%), Gaps = 69/421 (16%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINP-----DTVAASEIGSLCSGLKPWISVALFLIRE 145
           G G+   R+++  E++ EVP++  +       D      IG+  SG      +A  ++RE
Sbjct: 82  GRGVATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGA--SGASGDAILAAHVLRE 139

Query: 146 KKKEDS-----PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
                S     PW   L +LP   DSTV W+E+EL EL G+ ++  T  +K   + EY  
Sbjct: 140 AFDAGSKSAYWPW---LRLLPRDVDSTVGWNEDELSELSGSNVVVFTRAIKAQWRMEY-- 194

Query: 201 VEEEIILPN-KQLFP--------RPITLDDFLWAFGILRSRAFSRLRGQNL------VLI 245
             + + +P   + FP           T D F WA  I+ SRA   L  ++       VL+
Sbjct: 195 --DALDVPTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAID-LSTESAEAPTIRVLV 251

Query: 246 PLADLINHSPG------ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
           PL D+ NH+PG            A ++  A  F              +   ++   YD  
Sbjct: 252 PLLDMANHAPGGKLRPEWDARSNAVKVYAASAF--------------REHTELRFNYDTK 297

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT 359
            S   L L YGFI   +       T+ +S+ D     K ++   +GL           R+
Sbjct: 298 PSQYFL-LQYGFIPETNPAECVEATVRVSDHDSLRDAKEELLRLHGLDPKKRNFEWKPRS 356

Query: 360 LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
           +   +L   R++ +   +     S+            VS  N+     V+  +  S L  
Sbjct: 357 IDYDLLAATRVITMDEAEMSDATSL----TLAVSGASVSAKNDARTKAVLLKSLASFLES 412

Query: 420 FHTTIEEDEQLL----QGGNLDP----RLRIAVGVRAGEKRV-LQQIDEIFKERELELDE 470
           + TT+ ED + +       N +P    R R AV +R  EK++ L   D +FKE   E  E
Sbjct: 413 YTTTLAEDNEYVARVDDESNDEPLPGKRKRFAVLLRMREKQILLASADALFKELPDEDQE 472

Query: 471 L 471
           L
Sbjct: 473 L 473


>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
 gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
          Length = 488

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 110/463 (23%), Positives = 190/463 (41%), Gaps = 99/463 (21%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPA-------TFPEGLGLVAQRDIAKNEVVLEVPMKFWI-- 115
           + F  WL   K  SP   + P        +   G G+ A RDIA++E +  +P    +  
Sbjct: 18  DDFLLWL---KRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSV 74

Query: 116 -NPDTVAASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEE 173
            N +  +A E+      L PW+S+ + +I E  + E S W  Y  ILP   D+ +FW++E
Sbjct: 75  QNSEARSALELDD--KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPSSFDTLMFWTDE 132

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-----RPITLDD------- 221
           +L+ELQG+ ++   +G K    +  L+    +I  N + FP      P+   D       
Sbjct: 133 QLLELQGSAVVG-KIG-KAAADDTILQKVVPLIQANPRHFPPRPNMPPLNSSDSQNALLC 190

Query: 222 ---------FLWAFGILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTED 261
                      +AF I ++        ++             ++PLAD+ N         
Sbjct: 191 LAHRMGSIIMAYAFDIEKTDEVDEDTAEDGYMTDDEDEPAKGMVPLADIFNADA------ 244

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
              +   A LF  +  F ++    + +GE++   Y      A+L   YG++   +D  A 
Sbjct: 245 ---QRNNARLFQEEGSFVMKAIKNIHSGEEIFNDYG-ELPRADLLRRYGYV---TDNYAQ 297

Query: 322 TLTLEIS-----------ESDPFFGD-KLDIAETNGLGESAYF------DIVLGRTLPPA 363
              +E S           +S+P   + +L++ +   + E  Y       +  L  T+P  
Sbjct: 298 YDVVEFSLDSICKVAGLPDSEPSSTNPRLELLDNLDMLEEGYSIPRIPPNGTLKDTIPKD 357

Query: 364 MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTT 423
            L  LR + L       +E + R         P    +E  + R +   C+   S + T+
Sbjct: 358 FLVLLRALTLP------IEDLNRLKARNKAPKPEFSTSEASLLRSLV-TCRQ--SEYPTS 408

Query: 424 IEEDEQLL-----QGGNLDP----RLRIAVGVRAGEKRVLQQI 457
           ++EDE +L     Q G ++     R ++AV VR GEK +L QI
Sbjct: 409 VQEDESILRCLEQQNGYINDSIPIRKKMAVQVRKGEKEILTQI 451


>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
           magnipapillata]
          Length = 819

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/336 (22%), Positives = 142/336 (42%), Gaps = 43/336 (12%)

Query: 66  TFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE 124
            F QW +   + +S K  +         G++A  DI K EV+  VP +  +N +T     
Sbjct: 102 NFLQWCKANNLNLSSKVKVDFNGTSHRYGMLATEDIKKGEVLFTVPRQLLLNQNTATLKN 161

Query: 125 -----------IGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD--STVFWS 171
                       G   +    W+ + + L+ E  ++DS W  YL ++PE ++    +FW 
Sbjct: 162 RLNEFEKWLDTHGKSLNDSSGWLPLLITLMWEFNQKDSFWASYLLLVPEISEFGHPLFWK 221

Query: 172 EEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP--RPITLDDFLWAFGI 228
           EEE  +E QG  LL+  +  +E ++ EY +     +  NK LF      +L+ F      
Sbjct: 222 EEEYNLEFQGMPLLNDIIVDRENIETEYAEFVLLFLRRNKDLFGSLENYSLEFFKRMVAF 281

Query: 229 LRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
           + + +F+    ++  ++P+AD++NH               A L        + +   +K 
Sbjct: 282 VMAYSFTE-DEESPSMVPMADILNHHSN----------NNAHLVFHKSNLQMISIRRIKK 330

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSD---------RNAYTLTLE---ISESDPFFGD 336
           GE+V   +     N EL   YG++E  S+         ++ Y +       +  DP+   
Sbjct: 331 GEEVFNTFG-KLGNTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLA 389

Query: 337 KLDIAETNGLGESAYFDIV--LGRTLPPAMLQYLRL 370
           K+++    G+ E   F +    G    P ++Q+L++
Sbjct: 390 KINLLNRTGIAEVDAFFMFDKNGLRCGPDLIQFLKI 425


>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
 gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
 gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
          Length = 491

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 105/422 (24%), Positives = 181/422 (42%), Gaps = 46/422 (10%)

Query: 47  TCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQR-----DI 100
           +CS    +AT   P  Q     +W++ +     ++ +R A  PE GLG+ A       DI
Sbjct: 42  SCSAAFTAATRLAP--QPPDLVRWVQREGGFVHRA-LRVANHPEHGLGISASSAAAYGDI 98

Query: 101 AKNEVVLEVP--MKFWINPDTVAASEIGSLCSGLKP---W-ISVALFLIREKKKEDSPWR 154
              +V++ +P  +   +   T AA ++    +   P   W + + L L++E+ K DS W 
Sbjct: 99  PIGDVLIALPSQLPLRLRRPTSAADDVLVQLAQQVPDELWAMKLGLRLLQERAKSDSFWW 158

Query: 155 VYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV---QNEYLKVEEEIILPNKQ 211
            Y+  LPE     +F+  E++  LQ   +L        ++   + E  +    + L +  
Sbjct: 159 PYIANLPETFTVPIFFPGEDIKNLQYAPILHQVNKRCRFLLEFEKEVQQKLHTVPLVDHP 218

Query: 212 LFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGL 271
            + + +      WA     SRAF RL G+  +L+PL D+ NHS        A  ++   +
Sbjct: 219 FYGQDVNSSSLGWAMSAASSRAF-RLHGEVPMLLPLIDMCNHS----FNPNARIVQERSV 273

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL-------- 323
            S D+   +     +K  E + + Y    ++  L LDYGF+ +++  +   L        
Sbjct: 274 NSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFL-LDYGFVITQNPYDQVELSYDGALLD 332

Query: 324 --TLEISESDPFFGD----KLDI-AETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGT 376
             ++    S P F      + DI ++ N  GE A   + LG    P ++    L AL   
Sbjct: 333 AASMAAGVSSPNFSAPAKWQQDILSQLNLHGEGAVLKVSLGG---PDVVDGRLLAALRVL 389

Query: 377 DAFLLESIFR---NTIWG-HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
            A   E++ +   NT+    + +P+    E    R V   C  AL  FHT I +D+ +L 
Sbjct: 390 LADDPEAVHKHDLNTLMSLDVQVPLGPTVEASALRTVLALCAIALQHFHTKIMDDQAILG 449

Query: 433 GG 434
           GG
Sbjct: 450 GG 451


>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 475

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 117/487 (24%), Positives = 205/487 (42%), Gaps = 110/487 (22%)

Query: 62  AQVETFWQWLRDQKV-VSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWIN-PD 118
           A  + F  WLR     +SPK  +    +   G G+VA ++I ++EV+  +P    ++  +
Sbjct: 6   AAGQAFLAWLRRSGTDISPKVQLEDLRSKGAGRGVVATQEIREHEVLFRIPRSAVLSVEN 65

Query: 119 TVAASEI-GSLCSGLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELV 176
           ++ ++EI  S    L PW+S+ L ++ E    D S W  Y  +LP   ++ +FWSE EL 
Sbjct: 66  SILSTEIPTSTFDLLGPWLSLILVMLYEHLNGDASNWAPYFAVLPNEFNTLMFWSEHELA 125

Query: 177 ELQGTQLLSTT---------LG-----VKEYV------------QNEYLKVEEEIILPNK 210
           ELQ + +L+           LG     +KE+             + E ++ E+ I L +K
Sbjct: 126 ELQASAVLAKIGREGANEAFLGQLVPVIKEFAGIFFSGDSRAAQKAEEMRDEKNITLMHK 185

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV-------------LIPLADLINHSPGI 257
                   +   + A+      A  R                    +IPLAD++N     
Sbjct: 186 --------MGSLIMAYAFDIEPATPRKDVDEEGFAEEEEDEALPKGMIPLADMLNADADR 237

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD-LNKSNAELALDYGFI-ESK 315
                      A LF       ++   P+KAGE++   Y  L +S+  L   YG++ E+ 
Sbjct: 238 CN---------ARLFYEQKYLEMKALKPIKAGEEIFNDYGPLPRSD--LLRRYGYVTENY 286

Query: 316 SDRNAYTLTLEI-----SESDPFFGDKLDIAETNGLGESAYFDI------VLGRTLPPAM 364
           +  +   + +E+     + S  +   +L   +   + +S Y DI       L  +L P +
Sbjct: 287 AQYDVVEVPMELVSEVATASGSWIDQRLQYLDEQEVIDSGY-DITASSPFTLQESLSPEL 345

Query: 365 LQYLRLVALGGTDAFLLESIFRNTIWGHLDLP----VSHANEELICRVVRDACKSALSGF 420
           +  +  + L   +    E + R +      LP    +S    EL+ ++V    ++ L+ +
Sbjct: 346 IILVESMLLPDEE---FERLQRKS-----KLPKPEKISSKGAELLHKIV----QARLAQY 393

Query: 421 HTTIEEDEQLLQGG-NLDP------RLRIAVGVRAGEKRVLQQIDEIFKE---------- 463
            TT+E D QL+      DP      R  +A  VR GEK++L+Q +E   E          
Sbjct: 394 KTTLEHDLQLVNDAPTPDPSDVTRHRQAMARAVRIGEKQILRQFEEALAEMLARAGGIAK 453

Query: 464 RELELDE 470
           R+ E+DE
Sbjct: 454 RQREVDE 460


>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
          Length = 452

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/334 (25%), Positives = 135/334 (40%), Gaps = 60/334 (17%)

Query: 155 VYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP 214
           +Y++ LP   D  ++W +++  ELQG +     +   ++    Y +V + +   N Q   
Sbjct: 134 LYVNALPRYVDLPLYWDDKQFEELQGCEEARRAM---QHGARFYSQVYKHLFGANNQF-- 188

Query: 215 RPITLDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGITTEDYAYEIKGAGL- 271
             +  + F WA  IL SRA S   GQN    LIP  D  NH+ G  +++  + +      
Sbjct: 189 --VNAEAFFWAISILMSRATS---GQNQPFALIPFFDWFNHA-GNGSDNCRHALDSDECV 242

Query: 272 --FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL---- 325
             F     F++ T    + GEQ+ I Y  +  N  L  +YGF    +  +   L +    
Sbjct: 243 QDFDMQKGFTIHTTRSYEPGEQLFINYG-SHGNLRLLRNYGFTMPNNPYDVVNLPMPAAL 301

Query: 326 -EISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAM--LQYLRLVALGGTDAFLLE 382
            + +E+DP F  K D+           F     ++  PA+  L++    +L       LE
Sbjct: 302 QQPNEADPAFAQKRDL----------LFSATGSQSAMPALNSLRFQNDGSLAQNAKHWLE 351

Query: 383 ----------SIFRN--------TIWGHLDLPVS--HANEELICRVVRDACKSALSGFHT 422
                      IFR              L LP S  H     + +V+    K   S F  
Sbjct: 352 VMLATPEELNEIFRQAASQGSPEAATSSLRLPSSLTHKVRSEVGKVLASRLKQHTSSF-- 409

Query: 423 TIEEDEQLLQGG--NLDPRLRIAVGVRAGEKRVL 454
             EED+  L+G    + P LR  + +R GEK+ L
Sbjct: 410 --EEDDVFLRGNEQQMAPWLRSCLHIRMGEKQTL 441


>gi|242094496|ref|XP_002437738.1| hypothetical protein SORBIDRAFT_10g001630 [Sorghum bicolor]
 gi|241915961|gb|EER89105.1| hypothetical protein SORBIDRAFT_10g001630 [Sorghum bicolor]
          Length = 478

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/187 (31%), Positives = 95/187 (50%), Gaps = 15/187 (8%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGL---VAQRDIAKNEVVLEVPMKFWINPDT 119
           ++E+F QW +          IR A   +GLG+    A    A + V + VP+   I P  
Sbjct: 11  KLESFLQWFQANGADLRGCTIR-ACGGKGLGVFSTAAPEPGANDGVAMVVPLDLAITPMR 69

Query: 120 VAASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSE 172
           V     IG  C  L         + V LFL+ E+++  S W+ YLD+LP    S+++++E
Sbjct: 70  VLQDPLIGPRCHALFEEGRVDDRLLVMLFLMAERRRPGSLWKPYLDMLPSTFGSSLWFTE 129

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEY-LKVE---EEIILPNKQLFPRPITLDDFLWAFGI 228
           EEL EL+GT L   T+  ++ +Q+ +  KV+   EE++  ++      +  +DFLWA  I
Sbjct: 130 EELAELEGTTLHRATVIQRKSLQSSFDEKVKGLVEELLHVDESESSIEVLFEDFLWANSI 189

Query: 229 LRSRAFS 235
             +RA +
Sbjct: 190 FWTRALN 196


>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
 gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
          Length = 490

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 105/403 (26%), Positives = 176/403 (43%), Gaps = 52/403 (12%)

Query: 69  QWL-RDQKVVSPKSPIRPATFPE-GLGLVAQR---DIAKNEVVLEVP--MKFWINPDTVA 121
           +W+ R+   V P   +R A  PE GLG+ A     DI   +V++ +P  +   +   T A
Sbjct: 63  RWVQREGGFVHPA--LRVANHPEHGLGVSAAAPDGDIPPGDVLIALPGRLPLRLRRPTGA 120

Query: 122 ASEIGSLCSGLKP---W-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           A ++    +   P   W + + L L++E+ K DS W  Y+  LPE     +F+  E++  
Sbjct: 121 ADDVLVQLAQQVPEELWAMKLGLRLLQERAKSDSFWWPYIANLPETFTVPIFFPGEDIKN 180

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEI-----ILP--NKQLFPRPITLDDFLWAFGILR 230
           LQ   LL        ++    L+ E+EI      +P  +   + + +      WA     
Sbjct: 181 LQYAPLLHQVNKRCRFL----LEFEKEIQQKLHTVPSVDHPFYGQDVNSSSLGWAMSAAS 236

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           SRAF RL G+  +L+PL D+ NHS        A  ++   + S D+   +     ++   
Sbjct: 237 SRAF-RLHGEIPMLLPLIDMCNHS----FNPNARIVQEGSVNSLDMSVKVVAEKKIEQNA 291

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT----------LEISESDPFFGD---- 336
            + + Y  + ++  L LDYGF+ + +  +   L+          +    S P F      
Sbjct: 292 SITLNYGCHPNDFFL-LDYGFVITPNPYDQVELSYDGTLLDAASMAAGVSSPNFSAPAKW 350

Query: 337 KLDI-AETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL 395
           + DI ++ N  GE A   + LG    P ++    L AL    A   E++ ++ +   + L
Sbjct: 351 QQDILSQLNLHGEGAILKVSLGG---PDIVDGRLLAALRVLLADDPEALHKHDLKTLMSL 407

Query: 396 ----PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
               P+  A E    R V   C  AL  FHT I +DE +L+GG
Sbjct: 408 DVQAPLGPAIEASALRTVLALCAIALQHFHTKIMDDEAILRGG 450


>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 524

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 106/471 (22%), Positives = 190/471 (40%), Gaps = 58/471 (12%)

Query: 38  RLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGL----- 92
           R   R   +  S H+  +  +    +VE    WL+  K+      ++    P+G      
Sbjct: 49  RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLK--KLNHDDCNLKLERCPQGGSGSGY 106

Query: 93  -GLVAQRDIAKNEVVLEVPMKFWINPDTVA-ASEIGSLC--SGLKPWISVALFLIREKKK 148
                   +     +++VP K  +  +T     ++G L   S L PW ++ L L+ E+ +
Sbjct: 107 GAFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERAR 166

Query: 149 -EDSPWRVYLDILPECTD----STVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVE 202
            E S W  Y+ +LP+  +      + WS++   E L+G+ +L  T       + +Y   E
Sbjct: 167 GETSFWYPYIAVLPKELELIGIHPMLWSQKMRREWLEGSPMLDVTERRLAICREDY---E 223

Query: 203 EEIILPNKQLFPR-----PITLDD--FLWAFGILRSRAFS-RLRGQNL----------VL 244
             ++    +L PR     PI++ +    WA  +L SR+FS  L+ Q L           L
Sbjct: 224 AMLLAGAGRLTPRGNEGEPISITETAVQWAATMLLSRSFSLNLQTQKLRPGSFAEDTIAL 283

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           +P AD++NHS     E      + +G+ +      L+       GEQV   Y  + S + 
Sbjct: 284 VPWADMLNHSSSAGRESCLVYDQKSGVAT------LQAHRTYSEGEQVFDSYGPSCSPSR 337

Query: 305 LALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAM 364
           L LDYGF++ ++  ++  L   +        ++L + E  GL        +    +  ++
Sbjct: 338 LLLDYGFVDEENTNHSVDLPASVLGPVNSKANEL-LLEAMGLPLDGAIFSLTSAGVDESV 396

Query: 365 LQYLRLVALGGT---DAFLLESIFRN--------TIWGHLDLPVSHANEELICRVVRDAC 413
           + + R+         DA   E I           T+      P++  NE  + R +   C
Sbjct: 397 MAWTRVAVATRQELYDAGWKEGIRERAAGYPSAATVMFRFSTPINRDNESEVLRRLLSTC 456

Query: 414 KSALSGFHTTIEEDEQLLQ--GGNLDPRLRIAVGVRAGEKRVLQQIDEIFK 462
           +  L  + TT E+D  +L   G  L    + A+     EK  L+   + FK
Sbjct: 457 EFLLQKYPTTYEQDMDMLARPGLTLPWGRKQALRALISEKLALRDSQQTFK 507


>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
          Length = 536

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/221 (29%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           +AG+Q+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+
Sbjct: 246 QAGDQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 304

Query: 347 GESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLE------SIFRNTIWGHLDLPV 397
             S+ F   L  T PP    +L +LR+  +  T+  L E      +I R    G+ + PV
Sbjct: 305 PTSSVF--ALHSTEPPISAQLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNAEFPV 360

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           S  NE  +   + D     L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++ 
Sbjct: 361 SWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKA 420

Query: 458 -------DEIFKERELELDELEYYQERRLKDLGLV-GEQGD 490
                   E +++   E   L  Y+E    DLGL+ G  GD
Sbjct: 421 VKSAAVNREYYRKHMEERAPLPRYEE---SDLGLLEGGVGD 458



 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY 217


>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 157/383 (40%), Gaps = 76/383 (19%)

Query: 121 AASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT----DSTVFWSEEELV 176
           A+ E+ S+ SG +P + ++L   R    E S W    D+L  C     +   FW  E+L 
Sbjct: 186 ASGELSSMISGDEPLLVLSLIYER-YVAETSHWN---DLLCSCPVDYPNVPSFWDWEDLA 241

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP------------NKQLFPRPITLDDFLW 224
           EL+G  +L   L  K   Q      E   +LP             K  F    +++  +W
Sbjct: 242 ELEGLDVLDDVLAKK--AQLAQFHTETMAVLPFIYEALAGSCRLGKDEFLECFSIEAMMW 299

Query: 225 AFGILRSRAFS-RLRGQNLV-LIPLADLINHSPGITT---------EDYAYEIKGAGLFS 273
           A     SRAF+  + G+ ++ L+P+AD+INH                D+  +I GA L +
Sbjct: 300 ARATFDSRAFNLNVDGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDFVMQI-GASLTA 358

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF 333
           +D+            G ++ + Y     N EL   YGF+  +++ +      +  E    
Sbjct: 359 QDI------------GRELWMSYG-PLQNWELLQFYGFVVEENEHDRLPFPFDFPEG--V 403

Query: 334 FGDKLD-----IAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNT 388
            GD+ D     +  T GL  +    I      PPA++  LR V L   + F  +++ RN 
Sbjct: 404 AGDEWDRRRATLVATYGLHLAGRCWICHDGRPPPALVALLR-VHLAEAEEF--DTMERNG 460

Query: 389 IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN------------- 435
            +  L        E  +   + D  +  L  F T++EEDE LL+ G+             
Sbjct: 461 PFASL----GAGTEARVVATIADTIRCILDLFSTSLEEDEWLLENGSGPVATIAGDDDIT 516

Query: 436 --LDPRLRIAVGVRAGEKRVLQQ 456
             L    R+A+ +R G KR+  +
Sbjct: 517 QPLSCNKRMAILLRMGVKRIAHR 539


>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
          Length = 458

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/221 (29%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           +AG+Q+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+
Sbjct: 168 QAGDQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 226

Query: 347 GESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLE------SIFRNTIWGHLDLPV 397
             S+ F   L  T PP    +L +LR+  +  T+  L E      +I R    G+ + PV
Sbjct: 227 PTSSVF--ALHSTEPPISAQLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNAEFPV 282

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           S  NE  +   + D     L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++ 
Sbjct: 283 SWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKA 342

Query: 458 -------DEIFKERELELDELEYYQERRLKDLGLV-GEQGD 490
                   E +++   E   L  Y+E    DLGL+ G  GD
Sbjct: 343 VKSAAVNREYYRKHMEERAPLPRYEE---SDLGLLEGGVGD 380



 Score = 39.3 bits (90), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 5/106 (4%)

Query: 98  RDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIREKKKEDSP 152
           R     E+ L VP K  +  ++   S +G L S  +       I++A  L+ E+   +S 
Sbjct: 34  RSKKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLCERASPNSF 93

Query: 153 WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y
Sbjct: 94  WQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY 139


>gi|413948642|gb|AFW81291.1| hypothetical protein ZEAMMB73_810642 [Zea mays]
          Length = 340

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 134/307 (43%), Gaps = 33/307 (10%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
           + +A  L+     ED+ W++Y D LP   ECT S +   +E+L+EL+   L S  L  +E
Sbjct: 45  LRLACLLLYAFDIEDNFWQLYSDFLPSVDECT-SLLLAPKEDLMELEDEDLASQMLKHQE 103

Query: 193 YVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNL-----VLIP 246
              + + K  ++ I    +   R    + FLWA  I++SR+ + +LR         VL P
Sbjct: 104 RAIDFWQKHWDKPIPLKLKRLAR--DHERFLWALSIVQSRSVNLKLRMGAFIQDANVLAP 161

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
            AD++NHSP      + +  K      R L   ++    +K G+++ I Y ++  N++  
Sbjct: 162 YADMLNHSPNANCFLH-WRFKD-----RMLEVMIKAGQAIKKGDEMTIDY-MSGVNSKFM 214

Query: 307 LDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ 366
             YGF    S  N + L    S +       L +    GL +  Y +  L          
Sbjct: 215 ERYGF---SSPTNPWELINFSSPATIHMDSFLSVFNIAGLHDELYHNSALTS-------- 263

Query: 367 YLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEE 426
              +V     D  ++ +      W   D+P   + E    + +++ C+  L  F TTIE+
Sbjct: 264 ---VVETDFVDGAVVAAARALPTWSDGDVPAIPSVERKSAQALQEECRQMLDSFPTTIEQ 320

Query: 427 DEQLLQG 433
           D+Q+L  
Sbjct: 321 DQQILDS 327


>gi|50554725|ref|XP_504771.1| YALI0E34397p [Yarrowia lipolytica]
 gi|49650640|emb|CAG80378.1| YALI0E34397p [Yarrowia lipolytica CLIB122]
          Length = 492

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/258 (25%), Positives = 116/258 (44%), Gaps = 31/258 (12%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           +E  + W +D         I  +  P+    +  +   K++ ++ +P K  IN +  A  
Sbjct: 1   MEGLFSWAKDNGA-KISEEIEVSNLPDFGDSIVAKAALKDQELINIPTKLIINHEK-AVK 58

Query: 124 EIGSLCSGL-----KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
           E G+  S       K  ++   F + + K  DS +  Y+  LP+   + ++++ E+   L
Sbjct: 59  EFGAASSTFSTVADKQSLTKYFFAVEKNKGSDSFFHPYISQLPKKVTTPLYFTPEQQESL 118

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-SRL 237
            GT L        E+ +N+  ++ E+     +Q     +TL+D+L+A  I  SR+F  RL
Sbjct: 119 VGTNL--------EFYKNDKTELWEKEFKKLQQFVKTSVTLEDYLYASTIFTSRSFPERL 170

Query: 238 ---RGQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
              + ++L +LIP+ DLINH P    E   + + G         F+ +    V  G QV 
Sbjct: 171 MDPKNEDLSMLIPVLDLINHKPLTAVE---WNVTGDA-------FAFKACSDVVKGSQVF 220

Query: 294 IQYDLNKSNAELALDYGF 311
             Y  +K N EL   YGF
Sbjct: 221 NNYG-SKGNEELLGAYGF 237


>gi|330798760|ref|XP_003287418.1| hypothetical protein DICPUDRAFT_32466 [Dictyostelium purpureum]
 gi|325082565|gb|EGC36043.1| hypothetical protein DICPUDRAFT_32466 [Dictyostelium purpureum]
          Length = 479

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 92/405 (22%), Positives = 162/405 (40%), Gaps = 88/405 (21%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA-------------------------SEI 125
           G G++A  DI +NE+++ +P K+ I+  +  +                         +  
Sbjct: 25  GRGVIANNDIKENEILISIPSKYLIHSHSKFSIPSLNIPELNNSDSSNSSSSSDDIYTPF 84

Query: 126 GSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
            +    L     ++L LI EK  K+ S W  YL+ LP+    T  +S+EE+  L     +
Sbjct: 85  HNCLKKLNSKQRISLILIIEKLIKKHSIWFNYLNELPDDYTITSTYSDEEIESLSYPIYV 144

Query: 185 STTLGVKEYVQNEY-LKVEEEIILPNKQLFPRPITLDD-------------FLWAFGILR 230
            ++  +K  + N + L  E   +     L    I L+D             ++W +G ++
Sbjct: 145 ESSKKLKNEMLNSFKLFCEIFQLYYGTDLDRVVIELNDLQVKLSDILNKELYIWCWGTIQ 204

Query: 231 SRAF---SRLRGQN----------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDL- 276
           +R +     ++  N            L+PLADL NH+  + TE          LF+ +L 
Sbjct: 205 TRTYFYDKNMKKNNSKENNEEKDDCTLVPLADLFNHTSNVETE---------ALFNDELN 255

Query: 277 LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA------------YTLT 324
            + ++T  P   G QV I Y    SN  L   YGFI   +D+++            + + 
Sbjct: 256 CYQVKTKTPFSKGSQVFISYG-KHSNFTLMNYYGFIIENNDQDSIPLLQSNCIPTEFAVP 314

Query: 325 LEISESDPFFGDKLDIAETNGLGESAYFD---IVLGRTLPPAMLQYLRLVALGGTDAFLL 381
              S+    +  K+ I   N  G S Y D   +V+     P    YL ++ +     ++ 
Sbjct: 315 PTSSDEAKLYEKKIGIL--NNYGLSIYSDGKFLVMQDEKLPFSWNYLTVLKV----LYMT 368

Query: 382 ESIFRNTI---WGHLDLPVSHANEELICRVVRDACKSALSGFHTT 423
           +    N +     H D P+S  NE L+   + +  ++ LS F  T
Sbjct: 369 KEEINNQLELNLFHYDEPISKNNENLVLSFLENLTENQLSFFKNT 413


>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
          Length = 440

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 116/263 (44%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     + + PA FP  G GL+++  + + +V++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRKF--EDTNLIPACFPGTGRGLMSKTSLQEGQVIISLPETCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YIAKWRPPPSPLLALCTFLVSEKHAGDQSVWKPYLEILPKAYTCPVCL-EPEVVNLFPKP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRAF---- 234
           L +     +  V+ E+           + LF   +    +    LWA+  + +RA     
Sbjct: 155 LRAKAEEQRARVR-EFFSSSRGFFSSLQPLFSEAVGSIFSYRALLWAWCTVNTRAVYVKP 213

Query: 235 SRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
            R R          L P  DL+NHSP +  E        A        + +RT    +  
Sbjct: 214 RRRRCFSAEPDTCALAPYLDLLNHSPHVQVE--------AAFNEETRCYEIRTASSCRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+V I Y  +  N  L L+YGF+
Sbjct: 266 EEVFICYGPH-DNQRLLLEYGFV 287


>gi|149059901|gb|EDM10784.1| hypothetical protein RDA279, isoform CRA_d [Rattus norvegicus]
          Length = 399

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 89/395 (22%), Positives = 164/395 (41%), Gaps = 45/395 (11%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----VALFLIREK 146
           G GL+++  + + +V++ +P    +  DTV  S +G      KP +S    +  FL+ E+
Sbjct: 18  GRGLMSKASLQEGQVIISLPESCLLTTDTVIRSSVGPYIKKWKPPVSPLLALCTFLVSER 77

Query: 147 KK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
                S W+ YLDILP+     V   E E+V+L    L +     +  VQ+ +     + 
Sbjct: 78  HAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFAS-SRDF 135

Query: 206 ILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRLRGQNLV--------LIPLADLIN 252
               + LF   +    +   FLWA+  + +RA + + R Q  +        L P  DL+N
Sbjct: 136 FSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDLLN 195

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           HSP +  +        A    +   + +RT    +  ++  I Y  +  N  L L+YGF+
Sbjct: 196 HSPHVQVK--------AAFNEKTRCYEIRTASRCRKHQEAFICYGPH-DNQRLLLEYGFV 246

Query: 313 ESKSDRNAYTLTLE-----ISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
              +      ++ E     +  +D     KL I E +G       ++  G   P   L  
Sbjct: 247 AFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFTG----NLTFGWDGPSWRLLT 302

Query: 368 LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
              +     + F   + ++  + G +   +S  NE+    V +  C   ++  H+ + + 
Sbjct: 303 ALKLLCLEAERF---TSWKKVLIGDV---ISDRNEKASLGVAQKICSDFIAEIHSELRKV 356

Query: 428 EQLLQ-GGNLDPRLRIAVGVRAGEKRVLQQIDEIF 461
             + +  G+L  +L +   +R  E R+LQ   EI 
Sbjct: 357 SDMKERKGSLISQLCLVEALRMEELRILQASAEIL 391


>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 419

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 74/293 (25%), Positives = 121/293 (41%), Gaps = 28/293 (9%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPD-TVAASEIGSLCSGLKPWISVALFLIREKKKE 149
           G GL+A  D    + V+ +P +  + P  T            LK   S+ALF+  +K   
Sbjct: 44  GRGLMATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIVALKQHPSIALFIAWQKIHP 103

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
              W  Y+DILP   D+     + +L+ +    +       +  +  +Y  V   + +  
Sbjct: 104 TPEWSPYIDILPRSFDTMPLCIDLKLLAMLPYDIQEIAKNQQSKLDTDYAFVCTALAVSG 163

Query: 210 KQLFPRPITLDDFLWAFGILRSRA------------FSRLRGQNLV-LIPLADLINHSPG 256
            ++ P+    D F WA+ ++ +R              S +  Q ++ L P  D +NH+  
Sbjct: 164 YEMIPK----DIFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPIITLAPFLDCLNHTST 219

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
                       AG  + +  + +RT VP K G QV I Y  + +N  LA +YGF   K+
Sbjct: 220 ARI--------SAGYDTVEKAYIIRTLVPYKKGSQVFINYGPHDNNFLLA-EYGFAILKN 270

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETNGL-GESAYFDIVLGRTLPPAMLQYL 368
             N   L  E+      FG   D+ ++ GL GE    +  LG  L  AM  Y+
Sbjct: 271 PFNHVVLDREVDFMMQHFGTVSDLLKSEGLYGEFIIANDDLGYRLMNAMRLYV 323


>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 463

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 66/285 (23%), Positives = 120/285 (42%), Gaps = 31/285 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG-----LKPWISVALFLIRE 145
             G+ A ++I   + ++ VP    +  + + +S +  L S        P +++A+F++ E
Sbjct: 112 AYGMKANKNITVGDKLVTVPRALMMTEENIPSSPLWKLHSQDMMLRNMPNVALAIFILVE 171

Query: 146 --KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
             +K + S W  YL  LP    + V++   +L  L+G+      L +   +  +Y   ++
Sbjct: 172 SLRKDKKSFWHSYLTTLPVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKK 231

Query: 204 EIILPNKQLFPRPITLDD------FLWAFGILRSRAFSRLRGQN-----LVLIPLADLIN 252
              L N    P  + L D      + WA   L SR  +     N       LIPL D+ N
Sbjct: 232 LFQLSND---PASVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPSENVSALIPLWDMFN 288

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           H  G  + D+         ++              A EQV I Y + ++NA+  +  GF+
Sbjct: 289 HRSGRLSTDFVKSSNVCVCYADG---------DYAADEQVYIFYGV-RTNADFLVHNGFV 338

Query: 313 ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
              ++ +A  + L +S SDP +  +  + +T  L   A F +  G
Sbjct: 339 YPDNEHDAVKIRLGVSRSDPLYSLRYRLLQTLSLPALAEFYLTPG 383


>gi|50307933|ref|XP_453965.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49643100|emb|CAG99052.1| KLLA0E00441p [Kluyveromyces lactis]
          Length = 558

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/319 (26%), Positives = 137/319 (42%), Gaps = 46/319 (14%)

Query: 83  IRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWI-NPDTVAASEIGSLCSGLKPWISVALF 141
           I P T   G+G   +  +   E ++++   F +   + +   EI S CS   P     L+
Sbjct: 29  IDPVT---GVGAYLKHKLHNKEALIKISQDFILTKAEALQYFEIDS-CSSSNPNAITQLY 84

Query: 142 LIREKKKEDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
           LI  +  + + W+ YLD+LP   D  S + W   EL  ++G+ L   T      + +E+ 
Sbjct: 85  LIALQSNQSAKWQPYLDVLPSLDDISSPLVWQPHELEIIRGSDLYIKTKRKLASLLDEWY 144

Query: 200 KVEEEIILPNKQ-------------LFPRPITLDDF---LWAFGILRSRAFSRLRGQNLV 243
           ++  E+ L +++                +  ++D F   LWA  I  SRAF  +   N  
Sbjct: 145 EILTELNLCSEKAKKYYELQDRDNIAVEKCYSVDSFAAYLWAHLIFSSRAFPSIIYDNSA 204

Query: 244 ------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
                 L+P+ DL+NH       D     K  G F   + FS    +  K GE  L    
Sbjct: 205 GLEEGFLLPIVDLLNH-----KSDTKVHWKSEGSF---ITFSSEEIIEAK-GE--LYNNY 253

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESD----PFFGDKLDIAETNGLGESAYFD 353
            +KSN EL L YGF    +  +A +++L++ E        +G   +I   N  G+S  FD
Sbjct: 254 GDKSNEELLLGYGFAIDSNPHDATSISLKLDEKTLSEASNYG--TEINPNNVSGDSVRFD 311

Query: 354 IVLGRTLPPAMLQYLRLVA 372
           I L   LP  +++    ++
Sbjct: 312 ISLKDPLPVPLIKLFGFIS 330


>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
 gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
 gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
           nidulans FGSC A4]
          Length = 484

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 112/462 (24%), Positives = 186/462 (40%), Gaps = 97/462 (20%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + F  WL  +  V   S IR A       G G+VAQ DI ++E +  +P    +   +  
Sbjct: 18  DEFTTWLSSRPGVKVNSKIRIADLRANAAGRGVVAQADIDEDEELFAIPRDLVL---STH 74

Query: 122 ASEIGSLCSG----LKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELV 176
            S++  L S     L PW+S+ L +I E  +   S W  Y  +LP+  D+ +FWS EEL 
Sbjct: 75  NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQGGKSTWAPYFKVLPQNFDTLMFWSPEELE 134

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI------------------- 217
           ELQG+ ++   +G K+  +   LK+   ++  N  LFP PI                   
Sbjct: 135 ELQGSAVVE-KIG-KQGAEESILKLIIPVVRANPALFP-PINGLASYDGDVGAQALLGLA 191

Query: 218 -TLDDFL--WAFGILRSRAFSRLRGQNLVL------------IPLADLINHSPGITTEDY 262
            T+   +  +AF I          G++  L            +PLAD++N          
Sbjct: 192 HTMGSLIMAYAFDIETPENEDEREGEDGYLTDEEEEQSSKGMVPLADMLNAD-------- 243

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT 322
           AY    A LF  +    ++   P++AGE++   Y      ++L   YG++   +D  A  
Sbjct: 244 AYR-NNARLFQEEESLVMKAIKPIRAGEEIFNDYG-EIPRSDLLRRYGYV---TDNYASY 298

Query: 323 LTLEISESDPFFGDKLD-IAETNGLGESAYFD----------------IVLGRTLPPAML 365
             +E+S         LD +  + GL +S+  +                  + R  P   L
Sbjct: 299 DVIELS---------LDTLCSSAGLSDSSIENQPRLEFLESLELLDDGYAISRPSPNDTL 349

Query: 366 QYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIE 425
             +    L      L  S  + T       P   A  +   +++ D+   +   + TTIE
Sbjct: 350 TDIIPEELVVLLKTLTLSPEQFTHQQQKQKPPKAAFGQEEAKILFDSILMSEKRYSTTIE 409

Query: 426 EDEQLLQGGNL----------DPRLRIAVGVRAGEKRVLQQI 457
           +D++LL   N           D R ++A+ VR GEK + + +
Sbjct: 410 QDQELLAQINQVEAVSPLEGSDRRRKMAIQVRLGEKEIFEAV 451


>gi|357137816|ref|XP_003570495.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 485

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 114/453 (25%), Positives = 187/453 (41%), Gaps = 51/453 (11%)

Query: 47  TCSLHSASATTNPPTAQVETFWQWL-RDQKVVSPKSPIRPATFPE-GLGLVA---QRDIA 101
            CS  SA+A    P  Q     +W+ R+   V P   +R +  PE GLG+ A     DI 
Sbjct: 38  ACSAASATAPRIAP--QPPDLLRWVQREGGFVHPA--LRVSDHPEYGLGVSATGADGDIP 93

Query: 102 KNEVVLEVPMKFWIN---PDTVAASEIGSLCSGLKP--W-ISVALFLIREKKKEDSPWRV 155
              V++++P +  +    P   A + +  L   +    W + + L L++E+ K DS W  
Sbjct: 94  PGTVLIDLPGRIPLRLRRPAGAADAVLMQLADQVPEELWAMRLGLRLLQERTKSDSFWWP 153

Query: 156 YLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQL 212
           Y+  LPE     +F+  E++  LQ   LL        ++     +V+ +   + L +   
Sbjct: 154 YIANLPETFTVPIFFPGEDIKNLQYAPLLHQINKRCRFLLEFEKEVQHKLGTVPLADHPF 213

Query: 213 FPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
             + +      WA     SRAF RL G+  +L+PL D+ NHS        A  ++   + 
Sbjct: 214 CGQDVNSSSLGWAMSAASSRAF-RLHGEIPMLLPLVDMCNHS----FSPNARIVQDGDVD 268

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL--------- 323
           S D+   +     +     V + Y    ++  L LDYGF+ + +  +   L         
Sbjct: 269 SPDMSVKVVADTQIDQNATVTLNYGCYPNDFYL-LDYGFVVTSNPYDQVELSYDGALLDA 327

Query: 324 -TLEISESDPFFGDKLD-----IAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTD 377
            ++    S P F          +++ N  GE A   + LG    P ++    L AL    
Sbjct: 328 ASMAAGVSSPNFSTPAKWQQDFLSKLNLHGEGAILKVSLGG---PDIVDGRLLAALRVLL 384

Query: 378 AFLLESIFRNTIWGHLDL----PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
           A   E++  + +     L    P+  A E    R V   C  AL  FHT I EDE +L+G
Sbjct: 385 AAGPETVQEHDLKTLTSLNKKAPLGPAVESSALRTVLALCAIALQHFHTKIMEDEAVLKG 444

Query: 434 GNLDPRL--RIAVGVRAGEKRVLQQIDEIFKER 464
              +P L   +AV  R  +K +L  + +    R
Sbjct: 445 ---EPPLTTELAVQFRLQKKLMLVDVMQNLSRR 474


>gi|307173810|gb|EFN64588.1| SET domain-containing protein 4 [Camponotus floridanus]
          Length = 376

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 126/318 (39%), Gaps = 49/318 (15%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
           WL ++  +S +  I P  FP  G GL   + I  NEV++++P +  I  DT+  S I  L
Sbjct: 10  WLLNENCLSIRHLI-PEYFPLTGRGLKTLKHIECNEVLIQLPFRMLITTDTLLQSNIRFL 68

Query: 129 ----CSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEE--------L 175
                    P   +A FL+ E      S W +YL  LP+   +  F S +E        L
Sbjct: 69  FLNTTDSFNPQCMLATFLVYETHLGIKSKWYLYLKTLPQSFTNPDFCSNKEKRILPSFIL 128

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-------LFPRPITLDDFLWAFGI 228
             L     L +   +        +K  + + + NK           + IT + + WA+ +
Sbjct: 129 NSLHQAHRLESNFSL-------LMKAVKHLDIINKNHCSHCNLHLRKIITFEKYKWAYYV 181

Query: 229 LRSRA-------------FSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           + +RA             F+  +  NL L P  DL NH+      D A ++      +++
Sbjct: 182 VNTRAVYIDTKLLREKNIFNIKQPNNLALAPFLDLFNHNV-----DTAVKVSIITDNNQN 236

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG-FIESKSDRNAYTLTLEISESDPFF 334
             + + T  P     QV I Y  + +N +L +DYG FI        Y   LEI       
Sbjct: 237 QFYQIITLKPFDRESQVFINYGAH-NNLKLYIDYGFFIPCNPLDEIYFDILEIQRCFDIP 295

Query: 335 GDKLDIAETNGLGESAYF 352
             KLD    N   ++  F
Sbjct: 296 RSKLDFITFNSFHKNMSF 313


>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 155/378 (41%), Gaps = 66/378 (17%)

Query: 121 AASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDST-VFWSEEELVELQ 179
           A+ E+ S+ SG +P + ++L   R    E S W   L   P    +   FW  E+L EL+
Sbjct: 186 ASGELSSMISGDEPLLVLSLIYER-YVAETSHWNELLLSCPGGYPNVPSFWDWEDLAELE 244

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILP------------NKQLFPRPITLDDFLWAFG 227
           G  +L   L  K   Q    + E   +LP             K  F    +++  +WA  
Sbjct: 245 GLDVLDDVLAKK--AQLAQFQTETMAVLPFIHEALAGGCRFGKDEFLECFSIEAMMWARA 302

Query: 228 ILRSRAFS-RLRGQNLV-LIPLADLINHSPGI---------TTEDYAYEIKGAGLFSRDL 276
              SRAF+  + G+ ++ L+P+AD+INH                D+  +I GA L ++D+
Sbjct: 303 TFDSRAFNLNVDGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDFVMQI-GASLTAQDI 361

Query: 277 LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISES---DPF 333
                       G ++ + Y     N EL   YGF+   ++ +      +  E    D +
Sbjct: 362 ------------GREIWMSYG-PLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDEW 408

Query: 334 FGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL 393
            G +  +  T GL  +    I      PPA++  LR V L   + F  +++ R   +  L
Sbjct: 409 DGRRAALVATYGLHLAGRCWICHDGRPPPALVALLR-VHLAEAEEF--DTMERKGPFASL 465

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN---------------LDP 438
                   E  +   + D  +  L  F T++EEDE+LL+ G+               L  
Sbjct: 466 ----GAGTEARVVATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSC 521

Query: 439 RLRIAVGVRAGEKRVLQQ 456
             R+A+ +R G KR+  +
Sbjct: 522 NKRLAILLRMGMKRIAHR 539


>gi|322802325|gb|EFZ22721.1| hypothetical protein SINV_12919 [Solenopsis invicta]
          Length = 435

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 128/314 (40%), Gaps = 40/314 (12%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
           WL  +  +S    I P  FP  G GL   + I KNEV++++P++  I  D +  S++ +L
Sbjct: 33  WLLSENCMSISYFI-PEHFPLSGRGLKTLKRIEKNEVLIQLPLRMLITTDILMQSDVKTL 91

Query: 129 -----CSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                     P   +A FL+ E      S W +YL  LP+   +  F S +E   L    
Sbjct: 92  FLYSTTDSFSPQCMLATFLVYETHLGIKSKWYLYLKTLPQSFTNPDFCSNKEKAIL--PD 149

Query: 183 LLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFP-------RPITLDDFLWAFGILRSR 232
            +   L     +Q ++   +K  + + + ++   P       + IT   + WA+ ++ +R
Sbjct: 150 FILHPLHQAHKLQKDFSLLMKAVKRLDINSRNSCPHCNACLQKIITFAKYKWAYYVVNTR 209

Query: 233 A------------FSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
           A            F+  +  NL L P  DL NH      +     +       ++  + +
Sbjct: 210 AVYIDNGVCKENVFNIKQPNNLALAPFLDLFNHDINTAVKVSIVTVSDC----QNKFYQI 265

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKL 338
            T  P   G QV I Y  + S  +L +DYGF    +  D   + +  +I        +KL
Sbjct: 266 VTLKPFDKGSQVFINYGAHDS-LKLYIDYGFFIPHNPLDEIKFDI-FDIQRCFDVSRNKL 323

Query: 339 DIAETNGLGESAYF 352
           D    NG  +S  F
Sbjct: 324 DFIMLNGFHKSMSF 337


>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
          Length = 538

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 64/221 (28%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           +AG+Q+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+
Sbjct: 246 QAGDQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 304

Query: 347 GESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLESIFRNT------IWGHLDLPV 397
             S+ F   L  T PP    +L +LR+  +  T+  L E +  ++        G+ + PV
Sbjct: 305 PTSSVF--ALHFTEPPISAQLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNSEFPV 360

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           S  NE  +   + D     L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++ 
Sbjct: 361 SWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKA 420

Query: 458 -------DEIFKERELELDELEYYQERRLKDLGLV-GEQGD 490
                   E +++   E   L  Y+E    DLGL+ G  GD
Sbjct: 421 VKSAAMNREYYRKHMEERAPLPRYEE---SDLGLLEGGVGD 458



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSILGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +    +Y
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY 217


>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
          Length = 378

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 87/391 (22%), Positives = 162/391 (41%), Gaps = 45/391 (11%)

Query: 95  VAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----VALFLIREKKKE- 149
           +++  + + +V++ +P    +  DTV  S +G      KP +S    +  FL+ EK    
Sbjct: 1   MSKASLQEGQVMISLPESCLLTTDTVIRSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGC 60

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            S W+ YLDILP+     V   E E+V+L  + L +     +  VQ+ +           
Sbjct: 61  RSLWKSYLDILPKSYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARG-FFSTL 118

Query: 210 KQLFPRPI----TLDDFLWAFGILRSRA-FSRLRGQNLV--------LIPLADLINHSPG 256
           + LF  P+    +   FLWA+  + +RA + R R Q  +        L P  DL+NHSP 
Sbjct: 119 QPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDLLNHSPH 178

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
           +  +        A    +   + +RT    +  ++V I Y  +  N  L L+YGF+  ++
Sbjct: 179 VQVK--------AAFNEKTRCYEIRTASRCRKHQEVFICYGPH-DNQRLLLEYGFVSVRN 229

Query: 317 DRNAYTLTLE-----ISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLV 371
                 ++ +     +  +D     K+ I + +G       ++  G   P   L     +
Sbjct: 230 PHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTG----NLTFGWDGPSWRLLTALKL 285

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL 431
                + F   + ++  + G +   +S  NE+    V +  C   +   H  + +   + 
Sbjct: 286 LCLEAERF---TSWKKVLLGEV---ISDTNEKTSLGVAQKICSDVIEETHAVLRKVSDMK 339

Query: 432 QGG-NLDPRLRIAVGVRAGEKRVLQQIDEIF 461
           +G  +L  +L +   +R  E R+LQ   EI 
Sbjct: 340 EGTVSLRSQLSLVEALRMEELRILQASAEIL 370


>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
 gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
          Length = 474

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 109/466 (23%), Positives = 194/466 (41%), Gaps = 98/466 (21%)

Query: 65  ETFWQWLRDQKV-VSPK---SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWIN-PDT 119
           + F  WLR   V +SPK     +R A    G G+VA +DI ++E++  +P    ++  ++
Sbjct: 9   QAFLAWLRRSGVEISPKIQLEDLRNAQ--AGRGVVATQDIPEHELLFRIPRTAILSVENS 66

Query: 120 VAASEI-GSLCSGLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVE 177
           + ++EI  +    L PW+S+ L ++ E    D S W  Y  +LP   ++ +FWSE+EL E
Sbjct: 67  ILSTEIPAATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTEFNTLMFWSEDELAE 126

Query: 178 LQGTQLLSTTLG---------------VKEYV------------QNEYLKVEEEIILPNK 210
           LQ + +L+  +G               +KE+             + E ++ E  ++L +K
Sbjct: 127 LQASAVLN-KIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNVLLMHK 185

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV-------------LIPLADLINHSPGI 257
                   +   + A+      A SR                    +IPLAD++N     
Sbjct: 186 --------MGSLIMAYAFDVEPATSRKDVDEEGFAEEEEDEALPKGMIPLADMLNADA-- 235

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD-LNKSNAELALDYGFIESKS 316
                  +   A LF  +    ++   P++AGE+V   Y  L +S+  L   YG++   +
Sbjct: 236 -------DCNNARLFYEEKYLEMKALKPIRAGEEVFNDYGPLPRSD--LLRRYGYV---T 283

Query: 317 DRNAYTLTLEI---------SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
           D  A    +EI         +++     ++L+  E   + ++ Y DI             
Sbjct: 284 DNYAQYDVVEINMDLVTELATDAGIHSKERLEYLEEQEIVDTGY-DITASNPFTIQESLS 342

Query: 368 LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICR---VVRDACKSALSGFHTTI 424
             LV L  T     E   R      L  P     E++  R    + +  ++ ++ + TT+
Sbjct: 343 PELVILVETLLLPAEEFERLKTKSKLPKP-----EKITSRGAGFLHELVQARIAQYATTL 397

Query: 425 EEDEQLLQGGNL-------DPRLRIAVGVRAGEKRVLQQIDEIFKE 463
           EED Q ++   L       + R  +A  VR GEK++L+Q  +   E
Sbjct: 398 EEDAQSIKEAPLIRDAKPKERRFAMARAVRVGEKQLLKQTQDALAE 443


>gi|320170563|gb|EFW47462.1| hypothetical protein CAOG_05400 [Capsaspora owczarzaki ATCC 30864]
          Length = 479

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 73/285 (25%), Positives = 130/285 (45%), Gaps = 37/285 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL--CSGLKPWISVALFL------ 142
           G GLVA++ I    V   +P+   IN +    S++G +   S L     +++FL      
Sbjct: 69  GRGLVAKQAIPPKTVFARIPLTALINIEHAMVSDLGPVIDASDLSDQEIMSVFLWHQLHG 128

Query: 143 ---IREKKKEDSPWRVYLDILPECTDS--TVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
              + +    +S W+ +LD LP+  +   T+ W+ E+L  L G+ L   +    + ++  
Sbjct: 129 CGQVEDGGVAESNWQPFLDTLPDRQEMHLTMLWTPEQLAHLDGSLLRDFSERRIQVLEAS 188

Query: 198 YLKVEEEII--LPNKQLFP-RPITLDDFLWAFGILRSRAFS-RLR-GQNL-----VLIPL 247
           + + ++      P+ +       TL+DFLW   I  SR  + R+R G+        L+P+
Sbjct: 189 FKRHQQSTFGKFPSAESCDWTKFTLEDFLWGMAIGWSRTHAVRVRDGEGAWQTANCLVPV 248

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRD--LLFSLRTPVPVKAGEQVLIQYDLNKS---N 302
           ADL+N        D A ++  A  ++ D    F  RT   +   E++L QY+ + +   N
Sbjct: 249 ADLLN-------TDIASKV-NAECYTNDESTHFECRTRHQLAQSEELLAQYNADSASIDN 300

Query: 303 AELALDYGFI-ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
             L +DYGF+    S R A T+   I   DP    +L + + + +
Sbjct: 301 HHLLMDYGFVLNDDSARRAATIGRPIPLDDPDRAKRLSVLKQHKM 345


>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
 gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
          Length = 421

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 81/325 (24%), Positives = 123/325 (37%), Gaps = 73/325 (22%)

Query: 87  TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREK 146
           T P+  GL A RDI   E +L V     +  D +       L SG+  W  +AL L+ EK
Sbjct: 58  TGPDKRGLFAVRDIKAGECILRVSRDTMMTADRLPLEFQQLLSSGVSEWAQLALLLLFEK 117

Query: 147 KK-EDSPWRVYLDILPE--CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           +  E S W  Y+  LP      ST FW +EEL  +Q + L   T+  +  ++ E+ +++ 
Sbjct: 118 RAGEASIWAPYISCLPRWGTIHSTAFWRKEELTMIQESSLSYETMSRRAAIREEFNEMQS 177

Query: 204 EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH---SPGITTE 260
                                                    +P AD +NH   S  + T 
Sbjct: 178 -----------------------------------------VPFADFMNHDWSSNAMLTY 196

Query: 261 DY---AYEIKGAGLFSRDLLFSLRTP-----VPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           D    + E++   ++S  L  +L            AGEQV I +     NA LALD+GF 
Sbjct: 197 DTDNGSTEVEEVKVYSDCLYIALFCAQLFADKNYAAGEQVTISFG-PLCNASLALDFGFT 255

Query: 313 ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL--------GESAYFDIVL-------- 356
              +  +   L L IS  D    +KL    ++ +         +S      L        
Sbjct: 256 VPYNPWDKVQLWLGISRRDSLRKEKLQYLHSHEMLTLTNPDGSDSGGMSFTLREVCDSAG 315

Query: 357 -GRTLPPAMLQYLRLVALGGTDAFL 380
            GR LP  +    R+V    ++A L
Sbjct: 316 NGRELPGPLFTMARVVCATSSEALL 340


>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 457

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 113/273 (41%), Gaps = 27/273 (9%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPAT-FPEG-------LGLVAQRDIAKNEVVLEVPMKFWI 115
           +E F  WLR       +  I   +   EG       +G+ A+RDI   E V  +P     
Sbjct: 1   MEAFKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACF 60

Query: 116 NPDTVAASEIG---SLCSGLKPWI-SVALFLIREKK-KEDSPWRVYLDILPECT-DSTVF 169
             +  A ++      L +G   W+ S+   L+ E+     S W  YLD LP    D  + 
Sbjct: 61  TCENCAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSEPDVVMM 120

Query: 170 WSE--EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG 227
           WSE  E    L GT +  +    +   + E+ +   + +L   +   + +  DDFL A  
Sbjct: 121 WSEDGERRRYLCGTDIEQSLRDERAAARTEWTR-HVKPVLDTLRGAAKDVGFDDFLAARS 179

Query: 228 ILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR-------DLLFSL 280
           +  SRAF+        L+P+ADL NH  G     Y  + +G    S        D LF +
Sbjct: 180 VASSRAFTVNPRVGAGLVPIADLFNHRTG-GHHVYLSDARGTAAVSERDEGSDDDALF-V 237

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           R     KAGE+V   Y     NA+L   YGF +
Sbjct: 238 RVVKASKAGEEVFNTYG-KLGNAKLLCSYGFAQ 269


>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
 gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 252

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/173 (28%), Positives = 91/173 (52%), Gaps = 8/173 (4%)

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETN 344
           P KAGE ++I +   ++N+ L L+YGF++  +  +  ++   ++  DP + +K  +A+ N
Sbjct: 56  PYKAGEPIII-WCGPQTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRN 114

Query: 345 GLGESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANE 402
           G      F++ +G+       ML YLRL  +   D   ++SI  +   G    PVS   E
Sbjct: 115 GKHAIQNFNVYVGKEKQTVAEMLPYLRLGYISDPDE--MQSILSSE--GD-TCPVSPCTE 169

Query: 403 ELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
             +   +    +S L+G+ TT+ EDE +L  G+L+P+  +A  +   EK++L 
Sbjct: 170 RAVLDQLGGYLESRLAGYPTTLNEDEAMLADGSLEPKQEVATRLVRLEKKMLH 222


>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
 gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
          Length = 493

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 106/433 (24%), Positives = 178/433 (41%), Gaps = 42/433 (9%)

Query: 56  TTN--PPTAQVETFWQWLRDQKVVSPKSPIRPATFPE--GLGLVAQRDIAKNEVVLEVP- 110
           TTN  P    V  F +W  ++        +R A   E  GLGL +   I   E ++ VP 
Sbjct: 62  TTNGRPRMETVAHFMRWAVERGCQVEN--VRVAEHAEYGGLGLESCGPIPAGECIITVPR 119

Query: 111 -MKFWIN--PDTVAASEI--GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD 165
            M F++   P      E+  G++ S  +  I +AL LI E+ +  S W+ YLD+LP+   
Sbjct: 120 SMFFYVTNEPRYRQLLELMPGAMMSE-QGNIMLALALIMERFRAKSDWKPYLDLLPDRYT 178

Query: 166 STVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPITLDDF 222
           + ++++ E++ EL  T      L + +++  +Y    +  +E +   +  F    T D F
Sbjct: 179 TPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRRFVQEKVDELRDCF----TYDVF 234

Query: 223 LWAFGILRSRA------FSRLRG--QNLVLIPLADLINHS-PGITTEDYAYEIKGAGLFS 273
            WA   + +R        +   G    L LIPL D+ NH+ P    E            +
Sbjct: 235 RWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATN 294

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF 333
             L  SL   V   A   + I Y   +++AE  +  GF+  ++          +  + P 
Sbjct: 295 EQLECSLTREVSDIASVPIFIVYG-TRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPL 353

Query: 334 FGDKLDIAETNGLGESAYFDIVLGR------TLPPAMLQYLRLVALGGTDAFLLE--SIF 385
           + ++  + E  G+  +  F     R      T  P   + + L  +    A  L+  +  
Sbjct: 354 YKERAHLLELLGMPTTGTFSFGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAM 413

Query: 386 RNTIWGHLDLPVSHANEELICRVVR---DACKSALSGFHTTIEEDEQLLQGGNLDPRLRI 442
           + T    L    +    EL  R  R      K  L  + TTIE+DE LL+      R R+
Sbjct: 414 KETQRQTLRTYQALLPAELCARTERWLATVMKIMLLRYPTTIEQDEALLKTNRHHIR-RL 472

Query: 443 AVGVRAGEKRVLQ 455
            +  R GEK++L+
Sbjct: 473 LIEYRLGEKQILR 485


>gi|222632731|gb|EEE64863.1| hypothetical protein OsJ_19720 [Oryza sativa Japonica Group]
          Length = 367

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 98/420 (23%), Positives = 170/420 (40%), Gaps = 95/420 (22%)

Query: 89  PEGLGLVAQRDI---AKNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G+ + A RD+    +  V++E+P++  +           PD +     I  +     P
Sbjct: 13  PDGMAVYASRDVDPLRRARVIMEIPLELMLTITQKRPWMFFPDIIPLGHPIFDIIESTDP 72

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV 190
              W + +A  L+     ED+ W++Y D LP   + T                 S  L  
Sbjct: 73  ETDWDLRLACLLLYAFDVEDNFWQLYGDFLPSVDECT-----------------SLLLAP 115

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLR-GQNL----VL 244
           K           + I L  K+L P     + FLWA  I++SR+ + +LR G  L    VL
Sbjct: 116 KH----------KTIPLKLKRLAP---DHERFLWALSIVQSRSVNLKLRMGAFLQDANVL 162

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           +P AD++NHSP      + +  K      R +   ++    VK G+++ I Y ++  N+ 
Sbjct: 163 VPYADMLNHSPDANCFLH-WRFK-----DRMVEVMIKAGHAVKKGDEMTIDY-MSGVNSS 215

Query: 305 LALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAM 364
               YGF    S  N + L    S++       L +    GL +  Y +  L        
Sbjct: 216 FMERYGF---SSPTNPWELINFSSDAKIHLDSFLSVFNIAGLHDELYHNAALT------- 265

Query: 365 LQYLRLVALGGTDAFLLESIFRNT----IWGHLDLPVSHANEELICRVVRDACKSALSGF 420
                     G + F+   +         W   D+P   + E    + +++ C + L  F
Sbjct: 266 ---------SGENNFVDGGVVAAARTLPTWSEGDVPAIPSLERKSAQALQEECHTMLESF 316

Query: 421 HTTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRL 479
            TTI++D+++L   G++     IA+  R   K +LQ+I          +D L+ YQ++ L
Sbjct: 317 STTIQQDQEILDSDGHIRRTREIAIKYRLHRKLLLQKI----------IDALDIYQDKIL 366


>gi|326521260|dbj|BAJ96833.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 95/199 (47%), Gaps = 14/199 (7%)

Query: 50  LHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIA--KNEVVL 107
           + S +A  +   A++  F QWL+          IR A    G G+ A    A   + VV+
Sbjct: 1   MSSPAAAPSDDDAKLRCFLQWLQANGAYLRGCTIR-ACGRTGFGVYATAVDAGGADGVVM 59

Query: 108 EVPMKFWINPDTVAASE-IGSLC------SGLKPWISVALFLIREKKKEDSPWRVYLDIL 160
            VP+   + P  V     +G  C       G+   + V LFL+ E+ +  S W+ YLD+L
Sbjct: 60  AVPLDLAVTPMRVLQDPLVGPRCRALFEEGGVDDRLLVMLFLMAERLRPTSLWKPYLDVL 119

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ----NEYLKVEEEIILPNKQLFPRP 216
           P    S+V++ +EEL E++GT L   T+  ++ +Q    ++   + EE++  +       
Sbjct: 120 PSTFGSSVWFDDEELAEVEGTTLHRATVMQRKSLQKLFDDKVKGLVEELLHVDGSGSSIE 179

Query: 217 ITLDDFLWAFGILRSRAFS 235
           +  +DFLWA  I  +RA +
Sbjct: 180 VRFEDFLWANSIFWTRALN 198


>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
          Length = 408

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 66/221 (29%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           +AG+Q+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+
Sbjct: 118 QAGDQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 176

Query: 347 GESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLE------SIFRNTIWGHLDLPV 397
             S+ F   L  T PP    +L +LR+  +  T+  L E      +I R    G+ + PV
Sbjct: 177 PTSSVF--ALHSTEPPISAQLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNAEFPV 232

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           S  NE  +   + D     L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++ 
Sbjct: 233 SWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKA 292

Query: 458 -------DEIFKERELELDELEYYQERRLKDLGLV-GEQGD 490
                   E +++   E   L  Y+E    DLGL+ G  GD
Sbjct: 293 VKSAAVNREYYRKHMEERAPLPRYEE---SDLGLLEGGVGD 330


>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
          Length = 284

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 85/183 (46%), Gaps = 18/183 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A RDI   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKV 201
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y    KV
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
            +     NK       T +D+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGI 257
            G+
Sbjct: 281 NGL 283


>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
          Length = 449

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 157/394 (39%), Gaps = 54/394 (13%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC--SGLKPWISVALFLIREKKKEDS 151
           L A+ DI   EV+L +P            S +  +   + +    ++ L+LI E+    S
Sbjct: 55  LHAKEDIEAGEVILSLPANLLFPTRVSDHSPVVHMIENTTIGRITAICLYLISERADSSS 114

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            W+ +L  LP      + +SE++++  Q +         K+ V+ EY    E+ + P   
Sbjct: 115 HWKPWLQSLPPRFFHALSYSEDDMLHFQASSFKELRDRKKKNVRQEY----EQTVAPLLH 170

Query: 212 LFP------------RPITLDDFL-----WAFGILRSRAFSRLRGQN---------LVLI 245
             P            + +T +DF      WA+ ++ +R                  LVL 
Sbjct: 171 KLPAFDPLLAAVDKPQNVTREDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLG 230

Query: 246 PLAD-LINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           PLAD  I+ + G+     A E +   +FS     +  +P+ +  G           SN E
Sbjct: 231 PLADSFIHGASGVKISYDAQEHRC--VFSALHKVAKNSPISIGVGMS---------SNME 279

Query: 305 LALDYGFIESKSDRNAYTLTLEIS-ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA 363
           L  + GF+   +  N   +  ++   SD     +  + +   L     + +  G  +P  
Sbjct: 280 LLANRGFMMQNNGNNFVLMKFQLDRNSDMHASARESMMKQLNLSNPMTYVVRYGE-MPQG 338

Query: 364 MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTT 423
           +L  LR+ +L   +          +    L  PV+  NE    R++  +C S L+ + TT
Sbjct: 339 LLASLRIQSLSPVEF--------GSYGKALATPVTLENEWRAYRLLISSCNSILAMYPTT 390

Query: 424 IEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           IEEDE +L        LR AV +R  EK + + I
Sbjct: 391 IEEDEIVLTQTKTSRHLRAAVLLRREEKLIYESI 424


>gi|302422352|ref|XP_003009006.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261352152|gb|EEY14580.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 485

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 86/194 (44%), Gaps = 10/194 (5%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  Y+  LP        WSE+E   LQGT L        + + NE+  + E   ++  
Sbjct: 117 TPWTEYVKFLPREVPVPTMWSEQERDFLQGTSLELAVSAKIQALTNEFEALREKSSDLPF 176

Query: 208 PNKQLFPRP-ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            N   + +  + L D+       RSR+   L G  + ++P+ DL NH+P   T +  YE 
Sbjct: 177 WNAIFWDKNNVILADWFLVDAWYRSRSL-ELPGVGVSMVPVLDLANHAP---TPNAYYEE 232

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
                   +LL  LR    + AG++V I Y   KS AE+   YGFI+     +   L L 
Sbjct: 233 SARREGDVELL--LRPGSTLAAGDEVTISYGAGKSGAEMLFSYGFIDPARSTDTVALPLA 290

Query: 327 ISESDPFFGDKLDI 340
             E DP    K+ I
Sbjct: 291 PLEDDPLSKAKVHI 304


>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
          Length = 1114

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 90/336 (26%), Positives = 135/336 (40%), Gaps = 55/336 (16%)

Query: 137 SVALFLIREKKK-EDSPWRVYLDILPECTDSTVF-WSEEELVELQGTQLLSTTLGVKEYV 194
           ++AL L+ E++K   S W  ++ +LP      +  WSE EL EL G+  L      +  V
Sbjct: 100 ALALRLLYERRKGAKSRWGPHIALLPATPPHALLRWSEAELAELAGSDALELANRWRSQV 159

Query: 195 QNEYLKVEE------EIILPNKQL---------FPRPITLDDFLWAFGILRSRAFSRLRG 239
            +++ ++ +      E   P KQL         FP  + L+ F WA  ++ SR  S  R 
Sbjct: 160 SSDFSEIVDKSRAAVEESDPGKQLSAAVKASLRFPW-LDLEGFSWAVSMIWSRCVSVSRK 218

Query: 240 QN---LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
                   +P+ D+ NH PG   E++ ++    G       F LR     K G+++ + Y
Sbjct: 219 GAPPIKAFLPVVDMHNHDPG-APENHGFDDARDG-------FVLRRTGNAKKGDELKLCY 270

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAY----F 352
           D    NA L L YGF    +      L   +S   P +  K    E  GLG +A     F
Sbjct: 271 D-GLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALEKLGLGATADGAAPF 329

Query: 353 DIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRD- 411
            +     LP  +L      AL    A L E            LP   A  E   R     
Sbjct: 330 RLAADDALPERLL-----TALMAQRATLDE------------LPGLPATSEATARAAAGD 372

Query: 412 ---ACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAV 444
              AC + L+ +  + +ED   L      PRLR+A+
Sbjct: 373 LVAACDALLAAYRGSEDEDAAALADPATPPRLRLAL 408


>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 485

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 106/479 (22%), Positives = 180/479 (37%), Gaps = 132/479 (27%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           + F  WL+ +  V     I+ A       G G+VA  DI ++E +  +P    +   +  
Sbjct: 18  DEFMSWLKQRPGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVL---SFQ 74

Query: 122 ASEIGSLCS----GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELV 176
            S +  L         PW+ + + +I E  +   S W  Y  +LP   D+ +FW++EEL 
Sbjct: 75  NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEELR 134

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG--------- 227
           EL G+ +L+  +G  +   N +  +   ++  N  LFP    +  F    G         
Sbjct: 135 ELSGSAVLN-KIGRSDAEANIFRNI-LPLVSGNPSLFPPMSGVASFDSPEGKAALLSLAH 192

Query: 228 ----ILRSRAFSRLRGQN------------------LVLIPLADLINHSPGITTEDYAYE 265
               ++ + AF   +G+N                    ++PLADL+N            +
Sbjct: 193 RMGSLVMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADA---------D 243

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
              A LF  D   S+R+  P++ GE++   Y      A+L   YG++   +D  A    +
Sbjct: 244 RNNARLFQEDCYLSMRSIKPIRKGEEIFNDYG-ELPRADLLRRYGYV---TDNYAQYDEV 299

Query: 326 EISESDPFFGD------------KLDIAETNGLGESAY------------------FDIV 355
           EIS     +               L+  E  G+ +  Y                    I 
Sbjct: 300 EISMRSICYAAGIPSAIPGPEHPHLEFLENMGVFDDGYSIPRLNLNTPVMEVFPEELLIA 359

Query: 356 LG-RTLPPAMLQYLRLVA------LGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRV 408
           L   T+PP     L++        LG T+A LL +  + T+W                  
Sbjct: 360 LNVLTMPPDQFNQLKMNNKVPKPRLGVTEATLLATAVQQTLW------------------ 401

Query: 409 VRDACKSALSGFHTTIEEDEQLLQG-GNLDP---------RLRIAVGVRAGEKRVLQQI 457
                      + TT+E+D+ LL    N+           R ++A+ VR GEK++L Q+
Sbjct: 402 ----------DYTTTLEKDKDLLTSISNIQESTHSTASARRFKMALQVRIGEKQILSQV 450


>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
 gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
          Length = 496

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 114/468 (24%), Positives = 181/468 (38%), Gaps = 98/468 (20%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA----------------------ASEIGSL 128
           G G++A+ DI  + V+  +P    +   T A                        E G+ 
Sbjct: 39  GRGIIAKADIPADTVLFTIPRSAILCAATSALRDKIPDVFDLEGDHGAGHSDSGDEDGAA 98

Query: 129 CSGLK-PWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLST 186
            S  +  W  + L LI E  + E S WR YLD+LP   D+ +FWS  EL ELQ + L++ 
Sbjct: 99  SSSSQDSWTLLILVLIYEHLQGEASRWRPYLDVLPPTFDTPMFWSPTELSELQASALVA- 157

Query: 187 TLGVKE---YVQNEYLKV---EEEIILP-------NKQLFPRPITLDDFLWAFGI----- 228
            +G  E    ++ + L V    EE+  P       + QLF     +   + A+       
Sbjct: 158 KVGRAEADRMIEAKVLPVIRAHEEVFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLEND 217

Query: 229 --------LRSRAFSRLRGQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
                             G+ ++ ++P+AD++N                A +   D   +
Sbjct: 218 DSDNDEADEDDEWVEDREGRTMLGMVPMADMLNADAEFN----------AHINHGDDALT 267

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESD--PFFGD 336
                P++AG+++L  Y     N EL   YG++  K  R +   L  E+ E++     G 
Sbjct: 268 ATALRPIRAGDEILNYYG-PLPNGELLRRYGYVTPKHSRYDVVELPWELVEAELKGRVGT 326

Query: 337 KLDIAETNGLGESAYFD------IVLGRTL--PPAMLQYLRLVALGGTDAFL-LESIFRN 387
               +E + + E    D       VL R+   P +  Q      L G   F  L      
Sbjct: 327 HWTASEWDKVRELVRADDDFEGSFVLERSSADPDSTGQ------LSGDAVFAGLPDELGE 380

Query: 388 TIWGHLDLPVSHANEELICRVVRD----------ACKSALSG----FHTTIEEDEQLLQG 433
                L      AN E+  R   D          A  +AL      + T++E+D QL+  
Sbjct: 381 QFKAFLKAAKKVANSEIAARAASDKEMRKELYLQAVLAALQARERQYATSLEDDRQLMDA 440

Query: 434 GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKD 481
           GNL  R  +AV VR GEK +L++     +    ELDEL+    R  ++
Sbjct: 441 GNLTGRQEMAVCVRIGEKVLLREAQSWVRR---ELDELQSQASRTTRE 485


>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 400

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 163/413 (39%), Gaps = 89/413 (21%)

Query: 102 KNEVVLEVPMKFWINPDTVAASEIGSLCSGL---------KPWISVALFLIREKKKEDSP 152
           + E++ EVP +F ++           L   L         +  ++  L+L+       S 
Sbjct: 4   EGELLAEVPARFILHSRNERVCHAADLRKALAAHPRVASHRHMLAAVLWLLESVNCAQSF 63

Query: 153 WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP---- 208
           W+ YL  LP+   +   W++EEL E+  T +L   +   EY + +       I+LP    
Sbjct: 64  WQPYLSELPDAVATVDRWNQEELAEVGHTLMLYEMV---EYKKKKIAADYAAILLPFLQE 120

Query: 209 NKQLFPRPI-TLDDFLWAFGILRSRA--FSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
           N QLF   I + +++  A  ++ SR   FS L G++ V IP  D +NHS   T       
Sbjct: 121 NTQLFGGSIPSEEEYRRALSLVYSRTFDFSELIGEH-VFIPFVDFLNHSINDTG------ 173

Query: 266 IKGAGLFSRDL---LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-----ESKSD 317
            K A  +S +     F L        GE+V I Y   K++++L   YGF+     E   D
Sbjct: 174 -KAACTYSYNHDKDCFELLAGADYDEGEEVFISYG-EKTSSQLLASYGFMYENNAEDTVD 231

Query: 318 RNAYTLTL---------------------------EISESDPFFGDKLDIAETNGLGESA 350
             A   +L                           +++  D FF  K D       G+  
Sbjct: 232 ITASLFSLSPPHSTSTSTSTSSSSSSTRRSPPSLKQLTLEDLFFSLKTD----QHTGQP- 286

Query: 351 YFDIVLGRTLPPAMLQYLRLVALGGTD--AFLLESIFRNTIWGHLDLPVSH-----ANEE 403
                    L  ++L ++R+  L       FL + + +       DL  SH      NE 
Sbjct: 287 --------FLHASLLNFIRIQHLSTEQKACFLTQDLGQRH-----DLVRSHIIFDDQNEA 333

Query: 404 LICRVVRDACKSALSGFHTTIEEDEQLLQGG-NLDPRLRIAVGVRAGEKRVLQ 455
              R +  A +  L  F TTIEEDE LL    +L    R A+  R  EKR+L+
Sbjct: 334 ETNRFLLSALERMLKAFPTTIEEDEHLLATCEDLTSGKRWAIRYRKEEKRLLR 386


>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 92/386 (23%), Positives = 154/386 (39%), Gaps = 41/386 (10%)

Query: 95  VAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLCS--GLKPWISVALFLIREKKK-ED 150
           VA RD+   E+ L +P    I  D V   E +  L +   L     + L+L+ EKK    
Sbjct: 48  VAARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQ 107

Query: 151 SPWRVYLDILPEC-------TDSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVE 202
           S W  ++  L            S + W E ++ E L G+ L++      + ++ EY +++
Sbjct: 108 SVWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELD 167

Query: 203 EEIILPNKQLFPRP-------ITLDDFLWAFGILRSRAFSRLRG----QNLVLIPLAD-- 249
               +        P        +L  F   F  +++ +   L+G    +   L+PL    
Sbjct: 168 TVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQA-SVVHLQGVPLSKRFALVPLGPPL 226

Query: 250 LINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDY 309
           L   S       Y  E K   L + D  ++   P+    G Q          N  L L+Y
Sbjct: 227 LSYSSTAKAMLTYNREAKEVQL-AVDRSYTKGEPIEAWCGPQ---------PNRRLLLNY 276

Query: 310 GFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLR 369
           G +   +  +   LT+ +  +DP F  K  + + N L     F +   + LP  +L YLR
Sbjct: 277 GIVTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQRDKGLPELLLPYLR 336

Query: 370 LVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQ 429
           L     TDA  L+    +T       P+S  NE  +   +    +  L  + TT EEDE 
Sbjct: 337 LAHC--TDAASLKLATLDTCCA---APISPENERTVLHQLASHLQDRLDRYKTTCEEDEV 391

Query: 430 LLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           +++     PR ++A  +   EK +L+
Sbjct: 392 IIRSTTAGPRQKVAARLLRIEKAILR 417


>gi|302914506|ref|XP_003051150.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
           77-13-4]
 gi|256732088|gb|EEU45437.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
           77-13-4]
          Length = 499

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 61/194 (31%), Positives = 87/194 (44%), Gaps = 13/194 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IIL 207
           +PW  YL  LP        WSE E   LQGT L +        + NE+ ++ E+   +  
Sbjct: 135 TPWTEYLKFLPRHVPVPTMWSEVERALLQGTSLEAALEAKLAALNNEFDELREKSSGLAF 194

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            N   + +   T+ D++    + RSR     R  +  ++P  D+ NHS   T   Y  + 
Sbjct: 195 WNSLFWEKETATIQDWILIDALYRSRCLELPRAGD-AMVPGLDMANHSHDPTAY-YEEDD 252

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           K       D++  LR  V V  GE+V I Y  +KS AE+   YGFI+  S  +  TL LE
Sbjct: 253 KD------DVVLLLRLGVEVTGGEEVSISYG-DKSPAEMLFSYGFIDRDSAAHDLTLPLE 305

Query: 327 ISESDPFFGDKLDI 340
               DP    KL I
Sbjct: 306 ALPDDPLGKAKLHI 319


>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 88/200 (44%), Gaps = 19/200 (9%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           R+K    +PW  Y+  +P       F+S EE   L+GT L   +   LG  E       +
Sbjct: 119 RQKIGVSNPWTEYIRFMPASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFEHLRQ 178

Query: 201 VEEEIILPNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
             E+I    +  +       T DD  +   + RSR     R  +  ++P  D+ NH+   
Sbjct: 179 ATEDIHWCQEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGH-AIVPCVDMANHA--- 234

Query: 258 TTEDYA---YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
             ED     Y+  GAG    + +  LRT   ++ GE+V I Y   K  +E+   YGF+E+
Sbjct: 235 -CEDLVKARYDEDGAG----NAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVEN 289

Query: 315 -KSDRNAYTLTLEISESDPF 333
            ++D     L LEI + DP 
Sbjct: 290 ERTDAKQIFLDLEIPDDDPL 309


>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
          Length = 410

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 93/391 (23%), Positives = 163/391 (41%), Gaps = 73/391 (18%)

Query: 138 VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           ++L L+ EK K E S W  ++  +P   D+   WSE+ +  LQ    L+     K  V +
Sbjct: 26  LSLHLLVEKWKAERSRWWRFIRSIPPSYDTLENWSEQSVARLQYKPFLAIAARRKRVVND 85

Query: 197 EYLKVEEEIILPNK---------------QLFPRPITLDDFLWAFGILRSRA--FSR--- 236
           E+ +++  +    K               QL     + +D+LWA G + +R+  + R   
Sbjct: 86  EFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAGTVSTRSCHYERKSG 145

Query: 237 --LRGQNL-VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
             LRG+ +  L+P+ D +NHS           +   G     +++ +      + GEQV+
Sbjct: 146 YSLRGETVGCLVPVLDFLNHSTA--------PVAACGFCKDAMVYRVTCLRSYEEGEQVM 197

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP--------------------- 332
           I Y  N SNA L   YGF+   +  ++  L L    + P                     
Sbjct: 198 IHYG-NWSNAGLLEHYGFVLEDNPLDSCMLWLRHPPNPPEHLIATPTLKSEFRRMQDMII 256

Query: 333 FFGDKLDIAETNG----LGESAY-----FDIVLGRTLPPAMLQYLRLVALGGTDAFLLES 383
            FG +++   TN     +GE+       F++  G    PA L +  L  L  + A+L   
Sbjct: 257 SFGKEVEEDGTNSRAPRMGETKAASWLEFELYAGDEGAPAELSWNILTFLRWS-AYLANG 315

Query: 384 IFRNTIWGHL-----DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
           I+  + +  L     D PVS  NE    +++  + ++ LS    T + DE    G + D 
Sbjct: 316 IYDASFFDKLLEADEDSPVSLQNETEALQMLEKSLQNVLSTLAIT-DGDEANKCGNDHD- 373

Query: 439 RLRIAVGVRAGEKRVLQQIDEIFKERELELD 469
             RIAV ++    +++ +     + R +EL+
Sbjct: 374 --RIAVTLKKQAGKIIMKHMSWLRLRHIELE 402


>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
          Length = 376

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 64/240 (26%), Positives = 110/240 (45%), Gaps = 21/240 (8%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPW 153
           ++A  DI   EV++ VP  F I  +++     G+    L P   +AL L+   + + S W
Sbjct: 1   MMATEDIEAGEVIVSVPRNFLITNESLTKL-YGT--HSLSPHQLLALHLVLLTRDKQSWW 57

Query: 154 RVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF 213
           + Y D+LP   ++       EL+      L   T+  K+ +  +Y+   +     +KQL 
Sbjct: 58  KPYTDLLPMHFNTMPVNYPSELLSHLPNSLKQETMQQKDNIHTDYVTCLK--FCKSKQL- 114

Query: 214 PRPITLDDFLWAFGILRSRAFSRL------RGQNLVLIPLADLINHSPGITTEDYAYEIK 267
           P+ IT ++F WA+  + +R           +G+N+ L P+ D +NH+     E       
Sbjct: 115 PQDITAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLDFLNHTTEAKIE------- 167

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
            +G   R   F ++T    K GEQV I Y  +  N  +  +YGF+ +++  N   L  EI
Sbjct: 168 -SGFNIRTQRFEIKTLTAYKKGEQVYINYGPH-DNLAMLKEYGFVLNENIYNFVLLDDEI 225


>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
           [Gallus gallus]
          Length = 283

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 85/183 (46%), Gaps = 18/183 (9%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +GSL S  +       I++A  L+ 
Sbjct: 104 EGFGLKATREIKAEELFLWVPRKLLMTVESAKNSVLGSLYSQDRILQAMGNITLAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+   +S W  Y+  LP   D+ +++ E+E+  L+ TQ +       +    +Y    + 
Sbjct: 164 ERANPNSFWLPYIQTLPSEYDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYAYFYKV 223

Query: 205 I-ILPNKQLFP--RPITLDDFLWAFGILRSRAF-------SRLRGQNLVLIPLADLINHS 254
           I   PN    P     T DD+ WA   + +R         SR+    L LIPL D+ NH+
Sbjct: 224 IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRV---TLALIPLWDMCNHT 280

Query: 255 PGI 257
            G+
Sbjct: 281 NGL 283


>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1118

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 112/482 (23%), Positives = 180/482 (37%), Gaps = 121/482 (25%)

Query: 61   TAQVETFWQWLRDQKVVSPKSPIRPATFPE------------GLGLVAQRDIAKNEVVLE 108
            TA+   F QW R          +  ATF +            G G+VA RDI  +  +  
Sbjct: 658  TARTAAFLQWFR---------ALPGATFSDAIEIVDLRSRDAGRGIVALRDIPADTTLFT 708

Query: 109  VPMKFWINPDTVAASEI----------GSLCSGLKPWISVALFLIREK-KKEDSPWRVYL 157
            +P    IN DT +  E                 L  W ++ L ++ E      S W+ Y+
Sbjct: 709  IPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMMYEFFLGHQSKWKPYI 768

Query: 158  DILPECTDSTVFWSEEELVELQGTQLLST--TLGVKEYVQNEYLKVEEEIILPNKQLFPR 215
            D+LP   D+ +FWSEEEL  LQ +  ++       +E  +   +      I  N  +F  
Sbjct: 769  DVLPLTFDTPMFWSEEELSYLQASATVNKIGKADAEEMFRTRLIPA----IRGNPSVFAS 824

Query: 216  PITLDD--------------FLWAF--------------GILRSRAFSRLRGQNLVLIPL 247
                 D                +AF              G +  R    + G    ++ +
Sbjct: 825  SGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGWVEDREGKSMMG----MVAM 880

Query: 248  ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
            AD++N                A +   D   ++ +   +KAGE++L  Y  +  N+EL  
Sbjct: 881  ADILNADAEFN----------AHVNHGDEELTVTSIRDIKAGEEILNYYGPH-PNSELLR 929

Query: 308  DYGFIESKSDR-----------------------NAYTLTLEISESDPF---FGDKLDIA 341
             YG+I  K  R                       +  T T++  + D     F  + D  
Sbjct: 930  RYGYITEKHSRYDVVEIPWDAVQHSLMSELGVPQDIMTETMDKMDQDDLEDIFVLERDSG 989

Query: 342  ETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHAN 401
            E N  G  A   +V G  +PP + + L+          LL+ +  N I            
Sbjct: 990  EPNPDGTFAGPAVVDG--MPPDLKEQLKATIK------LLQKVDGNLISD------KRKR 1035

Query: 402  EELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIF 461
            ++++   + +  +   S + TTI EDE LL   +L  R R+AV VR GEK++LQ+  + F
Sbjct: 1036 DDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVRLGEKKLLQEAFDHF 1095

Query: 462  KE 463
             E
Sbjct: 1096 SE 1097


>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
 gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
          Length = 306

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/175 (32%), Positives = 86/175 (49%), Gaps = 9/175 (5%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDT-VAASEIGSLC--SGLKPWISVALFLIREKKKE 149
           GLVA ++I K E +L VP    I  D+     E+G +   + +  W  +A +LI E   E
Sbjct: 101 GLVALKNIRKGEKLLFVPPSLVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLE 160

Query: 150 DSP-WRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
            S  W  Y+  LP    S ++W+  EL   L  + +    +     V   Y  + + I  
Sbjct: 161 GSSRWISYIAALPRQPYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFS 220

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGITT 259
            +  LFP  +  ++ FLW+FGIL SR   RL   +  + L+P AD++NHSP + T
Sbjct: 221 RHPDLFPEEVYNIETFLWSFGILFSR-LVRLPSMDGRVALVPWADMLNHSPEVET 274


>gi|218191524|gb|EEC73951.1| hypothetical protein OsI_08832 [Oryza sativa Indica Group]
          Length = 486

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 100/407 (24%), Positives = 168/407 (41%), Gaps = 52/407 (12%)

Query: 63  QVETFWQWL-RDQKVVSPKSPIRPATFPE-GLGL---VAQRDIAKNEVVLEVPMKFWIN- 116
           Q     +W+ R+   V P   +R    PE GLG+    A+ DI   +V++ +P +  +  
Sbjct: 53  QPPDLLRWVQREGGFVHPA--LRVVDHPEHGLGVSAAAAEGDIPPGDVLIALPGRLPLRL 110

Query: 117 --PDTVAASEIGSLCSGLKP--W-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
             P   A + +  L   +    W + + L L++E+ K DS W  Y+  LPE     +F+ 
Sbjct: 111 RRPAGAADAVLVQLADQVPEELWAMRLGLRLLQERAKSDSFWWPYIANLPETFTVPIFFP 170

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFPRPITLDDFLWAFGI 228
            E++  LQ   LL        ++     +V+ +   + L +     + +      WA   
Sbjct: 171 GEDIKNLQYAPLLHQVNKRCRFLLEFEKEVKHKLGTVPLEDHPFCGQDVNSSSLGWAMSA 230

Query: 229 LRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
             +RAF RL G+  +L+PL D+ NHS        A  ++   + S D+   +     +  
Sbjct: 231 ASTRAF-RLHGEIPMLLPLIDMCNHS----FNPNARIVQEGNVDSPDMSVKVVAETKIDQ 285

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL----------TLEISESDPFFGD-- 336
              V + Y    ++  L LDYGF+ + +  +   L          ++    S P F    
Sbjct: 286 NAAVTLNYGCYPNDFFL-LDYGFVITSNSYDQVELSYDGTLLDAASMAAGVSSPNFSAPA 344

Query: 337 --KLDI-AETNGLGESAYFDIVLG--RTLPPAMLQYLRLV------ALGGTDAFLLESIF 385
             + DI ++ N  GE A   + +G    +   +L  LR++      A+ G D   L S+ 
Sbjct: 345 KWQQDILSQLNLYGEGAILKVSIGGPEIVDGRLLAALRVIIAADPDAVSGHDLKTLMSLK 404

Query: 386 RNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
                     P+  A E    R V   C  AL  FHT I EDE +L+
Sbjct: 405 EKA-------PLGPAVEASALRTVLALCTFALQHFHTKIMEDEAILK 444


>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 541

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 111/237 (46%), Gaps = 31/237 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL---CSGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+P+   I+ + V  S++  +     G+     V L+ +REK 
Sbjct: 168 GRGAIASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKH 227

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII- 206
             DS ++ Y D L E   + + +    ++EL GT LL   +  KE ++  Y    +E+I 
Sbjct: 228 NLDSKFKPYFDSLQENFCTGMSFGVNAIMELDGTLLLDEIMQAKELLRERY----DELIP 283

Query: 207 -LPN-KQLF-PRPITLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGI 257
            L N + +F P   T + +LWA  +  S +        +L+     LIP+A  +NHS   
Sbjct: 284 LLSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLK---TCLIPVAGFLNHSI-- 338

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFI 312
               Y + +K   +       SL+ PV  P   GEQ  + Y  N S++ L   YGF+
Sbjct: 339 ----YPHIVKYGKVCVE--TSSLKFPVSRPCNKGEQCFLSYG-NYSSSHLLTFYGFL 388


>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
          Length = 386

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/261 (26%), Positives = 115/261 (44%), Gaps = 55/261 (21%)

Query: 83  IRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS---- 137
           + PA F + G GL   R++   ++++ +P    +   TV  S +GS     KP +S    
Sbjct: 48  LHPAAFADTGRGLQVLRNVKPGDMLISLPESCLLTTSTVLNSYLGSFIKSWKPHLSPLLA 107

Query: 138 VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           + +FL+ E+ + E S W  Y+D+LP+      ++++E +       LL  ++  K   Q 
Sbjct: 108 LCVFLVCERHRGEASDWFPYIDVLPKSYTCPAYFTDEVMA------LLPPSVQRKAREQR 161

Query: 197 EYLKVEEEIILPNKQLFP--RPI---------TLDDFLWAFGILRSRAF-------SRLR 238
           E ++   EI   NK  F   +P+         T +   WA+  + +R+          LR
Sbjct: 162 EAVR---EIHSSNKAFFRSLQPVLTQPAEDVLTYEALRWAWCSVNTRSVFMLHSSNDFLR 218

Query: 239 GQNL-VLIPLADLINHSPGITT------EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQ 291
           GQ++  L P  DL+NH P +        E   YEI+     SR L +           +Q
Sbjct: 219 GQDVYALAPFLDLLNHCPDVQVKASFNEETKCYEIRSV---SRMLQY-----------QQ 264

Query: 292 VLIQYDLNKSNAELALDYGFI 312
             I Y  +  N  L L+YGF+
Sbjct: 265 AFINYG-SHDNQRLMLEYGFV 284


>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
          Length = 403

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 66/260 (25%), Positives = 117/260 (45%), Gaps = 29/260 (11%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG------SLCSGLKPWISVALFLIREK 146
           G+ A RD+   E+++EVP+K  I  +   A+++G       L       + + LF++ ++
Sbjct: 60  GVHATRDLETEEILVEVPLKCLITVEMGKATDVGRAVLEAELELDAPKHVFLMLFVLLDR 119

Query: 147 KKEDSPWRVYLDILPEC-TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
           +   + +  Y DILP   ++  +FW  +EL  L+G+ LL+     K  ++ +Y     E 
Sbjct: 120 RDSSTFFAPYYDILPSTLSNMPIFWQPDELEWLKGSYLLTQIEERKRAIKADY-----EA 174

Query: 206 ILPNKQLFPRPITLDDFLWAFGILRSRAFSRL--RGQNLVLIPLADLINHSPGITTEDYA 263
           I      F    TL++F WA   + SR F  +    +   ++P AD++NH     T+   
Sbjct: 175 ICGIWPSFIDVCTLEEFKWARMCVCSRNFGVVVNGARTSAMVPYADMLNHFRPRETKWTF 234

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF-IESKSDRNAYT 322
              +GA        F++ +   +  G Q+   Y   K N    L+YGF IE   + + + 
Sbjct: 235 DNSRGA--------FTITSLQKISVGSQIYDSYG-QKCNHRFLLNYGFAIEDNKEPDGFC 285

Query: 323 -----LTLEISESDPFFGDK 337
                L L++S  D     K
Sbjct: 286 PNEAALLLQLSPHDALASRK 305


>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
 gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
          Length = 513

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 124/262 (47%), Gaps = 33/262 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL---CSGLKPWISVALFLIREKK 147
           G G +A +D+   ++ LE+P+   I+ + V  S++  +     G+     + L+ ++E+ 
Sbjct: 144 GRGAIATKDLKVGDIALEIPVSIIISEEHVHKSDMYHILEKIDGITSETMLLLWSMKERH 203

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
              S +++Y D LPE   + + +  + ++ L GT LL   +  KE+++ +Y    +E++ 
Sbjct: 204 NCSSKFKIYFDTLPEEFKTGLSFGVDAIMALDGTLLLEEIMQAKEHLRVQY----DELVP 259

Query: 208 PNKQLFPRPI-----TLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHS-- 254
           P  + +P        T + FLWA  +  S +        +LR     LIP+A  +NHS  
Sbjct: 260 PLCKNYPDVFLPELYTWEQFLWACELWYSNSMKVMFVDGKLRT---CLIPIAGFLNHSLY 316

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-E 313
           P I    +  ++  A   +  L F L  P     GEQ  + Y  N S++ L   YGF+ +
Sbjct: 317 PHIV---HYGKVDSA---TNTLKFPLTRPCCF--GEQCCLSYG-NFSSSHLITFYGFMPQ 367

Query: 314 SKSDRNAYTLTLEISESDPFFG 335
             +  +   L +++ ++D   G
Sbjct: 368 GDNPCDVIPLDIDVGDADCIEG 389


>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 510

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 105/423 (24%), Positives = 189/423 (44%), Gaps = 39/423 (9%)

Query: 77  VSPKSPIRPATFPEGLGL------VAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS 130
           V P +P+     PEG         +   ++     +L +PM   ++ ++ A   +G L  
Sbjct: 83  VKPVAPMSTVLQPEGFNFGRGTAYITAENVEVGSELLSLPMSQVMSVESAARGRVGLLLE 142

Query: 131 ---GLKPWISVALFLIREKK-KEDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLL 184
               L   I++ L L+ E+     S +  ++  LP  E  +ST+F+SE+E+ EL+G+QL 
Sbjct: 143 VNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIEAINSTLFYSEDEMNELEGSQLQ 202

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI------TLDDFLWAFGILRSRAFSRLR 238
             TLG  + V+  Y  + + +   +++    PI      TLD F WA G++ S  F    
Sbjct: 203 RFTLGRAQAVEAFYDALVQPVT--SREAVDPPIFHKSEFTLDKFRWAMGVVWSSTFQ--F 258

Query: 239 GQNLVLIPLADLINHSPGITTE-----DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
           G+N   + LA ++N + GI T+     + A       + +     ++   V     ++V 
Sbjct: 259 GENEDDVILAPVLN-TIGICTDLNQEGNEACPETSIKVDTDTQRLTVYASVAYSKSQEVR 317

Query: 294 IQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESDPFFGDKLDIAETNGLGES--A 350
           +     KS+ +L L +GF  +++ + +   LT+ +  SD     K  + +T  L ES  A
Sbjct: 318 LSMP-GKSSTQLMLSHGFARARASKLDKLDLTVTLDPSDTLAPLKNYLLQTQ-LNESINA 375

Query: 351 YFDIVLGRT-LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP-VSHANEELICRV 408
            ++   G + +   +   LR+  L G +    + +      G    P VS  NE +  R 
Sbjct: 376 TYEFFYGSSKIDEYVSTSLRMKLLSGGELARYKELL-TPAEGEEHRPIVSLRNEFVFTRA 434

Query: 409 VRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER-ELE 467
           V   C + L  + T+IE+D++ L    L  +  +     A  +R+L    +I  E  EL 
Sbjct: 435 VISTCTTLLKQYPTSIEQDQENL--AKLSDKDDVESVRIAHVQRILIMEKQILNETMELA 492

Query: 468 LDE 470
           LD+
Sbjct: 493 LDQ 495


>gi|302818853|ref|XP_002991099.1| hypothetical protein SELMODRAFT_429412 [Selaginella moellendorffii]
 gi|300141193|gb|EFJ07907.1| hypothetical protein SELMODRAFT_429412 [Selaginella moellendorffii]
          Length = 428

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 102/398 (25%), Positives = 158/398 (39%), Gaps = 39/398 (9%)

Query: 86  ATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-EIGSLCSGLKPWISVAL-- 140
           A FP+  G GL   R++ + E++L VP    I         E G +        SV +  
Sbjct: 32  AVFPDQGGRGLGVARNVEQGEMILRVPFAALIGVHCAREDPEFGKVLVDFAHLSSVQILT 91

Query: 141 -FLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
            +L+ E  K   S W  YL   P+   +   +S  E  ELQ    +S      E  Q ++
Sbjct: 92  AYLLSEVAKSRSSRWFSYLRHNPQVHHNLPHFSAMEAEELQVEDAISMAKSSFEDTQRQW 151

Query: 199 LKVEEEIILPNKQLFPRPIT-LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
               E   L ++   PR  T    +LWA   + SR          VL P+ DL N+   I
Sbjct: 152 ---RETSSLLSRLRLPRKFTTFKAWLWAAATISSRTLHVPWDDAGVLCPIGDLFNYDAPI 208

Query: 258 ------TTED----YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
                   ED    +   +   G  +    +        K G+Q LI Y    +N EL  
Sbjct: 209 ERTLSSRNEDDEHKFTSRLTDGGYETSISSYCFYARRSYKNGQQALICYG-QYTNLELLE 267

Query: 308 DYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
            YGF+   +  +   + L  SE    FG K    ++      A + I        ++LQ 
Sbjct: 268 HYGFLLPDNPCDVIYIPLPSSEE---FGLKSTGDKSERQHNLAAYCIEASGKPSFSLLQQ 324

Query: 368 LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
           LRL A+    A L +S   +       LP+S  +++L+   ++  C+  L    TTIE+D
Sbjct: 325 LRLRAV---PASLRKS---HGYMASQGLPISRESDQLVFLWLQKKCQKLLEKLPTTIEQD 378

Query: 428 EQLLQ--------GGNLDPRLRIAVGVRAGEKRVLQQI 457
           E+ L+        G     R ++A+  R G K +LQ+ 
Sbjct: 379 ERALEQLEKSTSSGSCGVERHKLAIQWRLGYKTILQRC 416


>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 162/411 (39%), Gaps = 67/411 (16%)

Query: 86  ATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC-----SGLKPWISV 138
           + FP   G GL A RD+++ E++L VP    +   ++   E  S+      S   P I  
Sbjct: 41  SHFPHAGGRGLAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILT 100

Query: 139 ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
              L    K + S W  YL  LP   D+   +S+ E   LQ    +  T       +   
Sbjct: 101 ICLLAEMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVT-------ERAI 153

Query: 199 LKVEEEI-----ILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH 253
           LK E E      ++   +L P+      +LWA   + SR           L P+ D  N+
Sbjct: 154 LKAELEWKKAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGDFYNY 213

Query: 254 -SPG-----------ITTEDYAYEIKGAGLFSRDL-LFSLRTPVPVKAGEQVLIQYDLNK 300
            +PG              +D   +    G +  DL  +        K GEQVL+ Y    
Sbjct: 214 AAPGEEPCGWEDLKDAEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSYG-TY 272

Query: 301 SNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
           +N EL   YGF+  E+ +D+    L  E+  S  +  D L I +    G+ ++       
Sbjct: 273 TNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQN---GKPSF------- 322

Query: 359 TLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL---DLPVSHANEELICRVVRDACKS 415
               A+L  LRL A   +         R ++ GHL      +S  NE  +   +  +C  
Sbjct: 323 ----ALLSALRLWATPASQ--------RRSV-GHLVYSGTQLSSENEIFVMEWIAKSCHV 369

Query: 416 ALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKEREL 466
            L    T++EED  LL       R ++AV  R   KR+L  +D I +  E+
Sbjct: 370 VLENLPTSVEEDSLLLSM----ERWKLAVQWRLRHKRIL--VDCISRCTEI 414


>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
          Length = 514

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 135/314 (42%), Gaps = 32/314 (10%)

Query: 80  KSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISV 138
           KS ++ A F   G G++A   I   ++ LE+P    I+ + +  SE+           S 
Sbjct: 136 KSRLQIAYFQGAGRGMIASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSE 195

Query: 139 ALFLI---REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
            + L+   RE+    S ++ Y D LP   ++ + +  + L  L+GT L    +  +++++
Sbjct: 196 TMLLLWSMRERYNLGSKFKPYFDTLPANFNTGLSFGIDALAALEGTLLFDEIIQARQHLR 255

Query: 196 NEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQ---NLVLIPLADLI 251
            +Y ++   +     ++F + + T DDFLWA  +  S +   +      +  L+P+A L+
Sbjct: 256 QQYDELFPLLCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLL 315

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NHS      +Y    +     ++ L F L    P  AGEQ  + Y  +  +  L   YGF
Sbjct: 316 NHSVSPHILNYGRVDEA----TKSLKFPLSR--PCDAGEQCFLSYGKHPGSHLLTF-YGF 368

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR-----------TL 360
           +  + D     + L++  S     D  DI   +    S    +V G             L
Sbjct: 369 L-PRGDNPYDVIPLDLDTS----ADDEDITAQSSATTSQTTHMVRGTWLSTSGGFPTYGL 423

Query: 361 PPAMLQYLRLVALG 374
           P  +L YLR  ALG
Sbjct: 424 PQPLLTYLR-AALG 436


>gi|413942690|gb|AFW75339.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
          Length = 571

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 106/264 (40%), Gaps = 57/264 (21%)

Query: 105 VVLEVPMKFWINPDTVAASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYL 157
           V + VP+   I P  V     +G  C  L         + V LFL+ E+++  S W+ YL
Sbjct: 77  VAMVVPLDLAITPMRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRRPGSLWKPYL 136

Query: 158 DILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY-LKVE---EEIILPNKQLF 213
           D+LP    S+++++EEEL EL+GT L   TL  ++ +Q+ +  KV+   EE++  ++   
Sbjct: 137 DMLPSTFGSSLWFTEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEELLHVDESAS 196

Query: 214 PRPITLDDFLWAFGILRSRAFS----------------RLRGQNLV-------------- 243
              +  +DFLWA  I  +RA +                + R  N                
Sbjct: 197 SVEVLFEDFLWANSIFWTRALNIPLPHSYVFLGSCGDQQARADNDAHQEIDITAKDCSAD 256

Query: 244 ---------------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
                          L+P  D  NH+           +  A      +   L    P + 
Sbjct: 257 ENSKPSNTESIWVEGLVPGIDFCNHNVKALATWEVDSVGNATGIPASMYLLLADKSPAET 316

Query: 289 GEQVLIQYDLNKSNAELALDYGFI 312
           G ++ I Y  NK N EL   YGF+
Sbjct: 317 GAEICINYG-NKGNEELLYLYGFV 339


>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
 gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
          Length = 1037

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 102/438 (23%), Positives = 172/438 (39%), Gaps = 107/438 (24%)

Query: 91   GLGLVAQRDIAKNEVVLEVPMKFWIN----------PDTVAASEIGSL---CSGLKPWIS 137
            G G++A +DI     +  +P K  IN          PD     +          L  W S
Sbjct: 607  GRGIIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSS 666

Query: 138  VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
            + L L+ E  + E S W+ Y D+LP   D+ +FWSE EL +LQ +  +   +G K   ++
Sbjct: 667  LILVLMYEYLQGEKSQWKPYFDVLPSSFDTPMFWSESELDQLQASH-MRHKIG-KADAES 724

Query: 197  EYLKVEEEIILPNKQLFPRPITLDDFL-------------WAF--------------GIL 229
             + K    II  N  +F      DD L             +AF              G +
Sbjct: 725  MFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDGWV 784

Query: 230  RSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
              R    + G    ++P+AD++N        D  +    A +   +   ++ +  P+KAG
Sbjct: 785  EDREGKSMMG----MVPMADILN-------ADAEF---NAHVNHEEESLTVTSLRPIKAG 830

Query: 290  EQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISES------------------ 330
            E++   Y  +  N+EL   YG++  +  R +   +  ++ ES                  
Sbjct: 831  EEIFNYYGPH-PNSELLRRYGYVTERHSRYDVVEIPWDVVESVMRLNFGISGQVLEKLRH 889

Query: 331  ---------DPFFGDKLDIAETNGLGE---SAYFDIVLGRTLPPAMLQYLRLVALGGTDA 378
                     D F  ++ +  E N  G     A F+     ++P  + + L+   L G   
Sbjct: 890  GLEEEEEFEDTFVLER-ETGEVNSDGTFSGPARFE-----SMPEDLQEQLKTF-LKGVKK 942

Query: 379  FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
               E+I           P     +E+   V+    ++  S + T+  ED  LLQ  +L  
Sbjct: 943  AQPEAI-----------PDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQDLSQ 991

Query: 439  RLRIAVGVRAGEKRVLQQ 456
            R R+A+ VR GEK++LQ+
Sbjct: 992  RTRMAIEVRLGEKKLLQE 1009


>gi|116200550|ref|XP_001226087.1| hypothetical protein CHGG_10820 [Chaetomium globosum CBS 148.51]
 gi|88175534|gb|EAQ83002.1| hypothetical protein CHGG_10820 [Chaetomium globosum CBS 148.51]
          Length = 461

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 102/423 (24%), Positives = 170/423 (40%), Gaps = 76/423 (17%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMK-----------------FWINPD----TVAASEIGSLC 129
           G G+ AQ+DIA + V+  +P                   F +  D    T + S+  S  
Sbjct: 46  GRGITAQQDIAADTVLFTIPRTAIICTATSGLKDKIPGIFDLESDDAGHTDSDSDSESTR 105

Query: 130 SGLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTL 188
           S    W  + L +I E ++ D S W+ YLD+LP   D+ +FWS  EL +LQ + L++  +
Sbjct: 106 SSQDSWTLLILAMIYEHQQGDASRWKPYLDVLPPAFDTPMFWSPAELSQLQASALIA-KV 164

Query: 189 GVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-----------SRL 237
           G  E  +    K+   ++L         I      +AF + +                R 
Sbjct: 165 GKDEADRMIQAKI-VSVLLHLAHRMGSTI----MAYAFDLEKDEDEEGEEEEEGWVEDRE 219

Query: 238 RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
               L ++P+AD++N          A+   G    +   L       P++AGE+VL  Y 
Sbjct: 220 GKTMLGMVPMADILNADAAFN----AHINHGGDALTATAL------RPIRAGEEVLNYYG 269

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLE------ISESDPFFGDKL-------DIAETN 344
              +N EL   YG+ E +          E      + ++D  F +         D   T 
Sbjct: 270 -PLANGELLRRYGYSELRERVGGRVGVAEWEGVRRLLQADEEFEESFVLERSSEDPDSTG 328

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL 404
            LG  A F   L   L      +L+ +   G     +++           L      ++L
Sbjct: 329 QLGSEAVF-TGLPDELGEQFKTFLKAMKKVGNVELAVQA-----------LSDKDTRKDL 376

Query: 405 ICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
             R V  A  +    + T++EEDEQ+     L  R R+A+ VR GEK+VL++  + + +R
Sbjct: 377 YLRTVLRALHAREKQYATSLEEDEQVANAEQLSGRERMALWVRRGEKQVLREA-QAWVQR 435

Query: 465 ELE 467
           E++
Sbjct: 436 EID 438


>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
 gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
          Length = 536

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 142/329 (43%), Gaps = 36/329 (10%)

Query: 80  KSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISV 138
           KS ++ A F   G G++A   I   ++ LE+P    I+ + +  SE+           S 
Sbjct: 167 KSRLQIAYFQGAGRGMIASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSE 226

Query: 139 ALFLI---REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
            + L+   RE+    S ++ Y D LP   ++ + +  + L  L+GT L    +  +++++
Sbjct: 227 TMLLLWSMRERYNLGSKFKPYFDTLPANFNTGLSFGIDALAALEGTLLFDEIIQARQHLR 286

Query: 196 NEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQ---NLVLIPLADLI 251
            +Y ++   +     ++F + + T DDFLWA  +  S +   +      +  L+P+A L+
Sbjct: 287 QQYDELFPLLCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLL 346

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NHS      +Y    +     ++ L F L    P  AGEQ  + Y      + L   YGF
Sbjct: 347 NHSVSPHILNYGRVDEA----TKSLKFPLSR--PCDAGEQCFLSYG-KHPGSHLVTFYGF 399

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR-----------TL 360
           +  + D     + L++  S     D  DIA  +    S    +V G             L
Sbjct: 400 L-PRGDNPYDVIPLDLDTS----VDDEDIAAQSSATTSQTTHMVRGTWLSTSGGFPTYGL 454

Query: 361 PPAMLQYLRLVALGG----TDAFLLESIF 385
           P  +L +LR  ALG     +D  +LE+I 
Sbjct: 455 PQPLLTHLR-AALGCEVKESDKVVLETIL 482


>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
           melanoleuca]
          Length = 440

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 119/263 (45%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     + + PA FP  G GL+++  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRKF--EDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPKAYTCPVCL-EPEVVNLFPKP 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ  +    +      + LF   +    +    LWA+  + +RA + + 
Sbjct: 155 LKAKAEEQRARVQGFFSSSRDFFSSL-QPLFSEAVESIFSYSALLWAWCTVNTRAVYVKH 213

Query: 238 RGQN--------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +           L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 214 RQEQCFSTEPNTCALAPYLDLLNHSPRVQVK--------AAFNEETRCYEIRTASGCRKH 265

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+V I Y  +  N +L L+YGF+
Sbjct: 266 EEVFICYGPH-DNQQLLLEYGFV 287


>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
          Length = 915

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 160/378 (42%), Gaps = 86/378 (22%)

Query: 134 PW-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
           PW I + + +    K ++S W+ Y+D+LP   ++ +FWS  EL ELQ +    T +G  +
Sbjct: 541 PWSILIIIMMFEYFKGDESKWKPYMDVLPASFETPMFWSGAELDELQAS-ATRTKVGKAD 599

Query: 193 YVQNEYLKVEEEIILPNKQLFPRPITL-DDFLWAFG-----ILRSRAFS----------- 235
             +  + KV   +I  N ++FP   +  DD L          + S AF            
Sbjct: 600 AEEMFHAKV-LPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEEDEED 658

Query: 236 -------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
                  R     + ++P+AD++N       E  A+   G    +   + +LRT   +KA
Sbjct: 659 EEEWVEDRESKSTMGMVPMADILN----ADAEYNAHVNYGDDALT---VTALRT---IKA 708

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDR------------NAYTLTLEISES------ 330
           GE++L  Y  +  N+EL   YG++  K  R            N+   +L +SE       
Sbjct: 709 GEEILNYYGPH-PNSELLRRYGYVTPKHSRYDVVELPWKLVENSLAASLGLSEQQLDNAR 767

Query: 331 ---------DPFFGDK-LDIAETNG--LGESAYFDIVLGRTLPPAMLQYLRLVALGGTDA 378
                    D F  D+  D    +G   G + + DI      P  +   L+L        
Sbjct: 768 EYLDMDEIEDTFVLDRESDEPNPDGTFTGSARFSDI------PEDLRDQLKL-------- 813

Query: 379 FLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDP 438
            LL+++ +       D       +E+   V+  A  +  S + TTI EDE++L G ++  
Sbjct: 814 -LLKAVRKVDPSSVAD---KRKRDEIQHSVLIKALDALASQYPTTIPEDERILSGSDISE 869

Query: 439 RLRIAVGVRAGEKRVLQQ 456
           R R AV VR GEKR++++
Sbjct: 870 RQRAAVTVRLGEKRLIEE 887


>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
          Length = 539

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 89/179 (49%), Gaps = 14/179 (7%)

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           + GEQ+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+
Sbjct: 246 RPGEQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 304

Query: 347 GESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLESIFRNT------IWGHLDLPV 397
             S+ F   L  T PP    +L +LR+  +  T+  L E +  ++        G+ + PV
Sbjct: 305 PTSSVF--ALHFTDPPISAQLLAFLRVFCM--TEEELKEHLLGDSAIDRIFTLGNSEFPV 360

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           S  NE  +   + D     L  + TTIEED+ +L+  +L  R  +A+ +R GEK +L++
Sbjct: 361 SWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLGEKEILER 419



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALFLIR 144
           EG GL A R+I   E+ L VP K  +  ++   S +G L S  +       I++A  L+ 
Sbjct: 104 EGFGLRATREIKAEELFLWVPRKLLMTVESAKNSVLGPLYSQDRILQAMGNIALAFHLLC 163

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           E+   +S W+ Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y
Sbjct: 164 ERASPNSFWQPYIQTLPSEYDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217


>gi|440802833|gb|ELR23759.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 518

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/318 (25%), Positives = 129/318 (40%), Gaps = 66/318 (20%)

Query: 64  VETFWQWLRDQ--KVVSPKS-PIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           V  F+ WL      V  P     R ATFP  G G  A   I + +V+  VP +  ++  T
Sbjct: 8   VAAFYTWLATNGLDVECPDHVKFRLATFPWTGRGTRATVPIHEEDVLASVPRQLILSAVT 67

Query: 120 VAASEI---------GSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDIL---PEC--- 163
              S +         G++ +   P +++AL L+ EK     S W  Y  IL   P C   
Sbjct: 68  ATDSRVVRDVLPAVEGTILA--DPMVALALLLLYEKNLGPASFWAPYFHILLLPPLCRFV 125

Query: 164 --------------------TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
                                +  +FWS E+LV L+     +  L     ++   L++  
Sbjct: 126 PLLAWWQQTKRNKPVQWIVYYNLPIFWSSEDLVLLEEAH--TDILPHSRNMRTSILRLYF 183

Query: 204 EIILPNKQLF--------------PRPITLDDFLWAFGILRSRAFSRLRGQNLV--LIPL 247
             +LP   L               P   T D+ +WAF  + SR +  L G + +  ++PL
Sbjct: 184 GFLLPLFHLLIFISIFKDYPDMFSPAVHTCDELMWAFATIWSRGY-WLDGDDTMPAIVPL 242

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
           AD++NH+     E  A+    A       +F + +    + G+QVL  Y  NK+N     
Sbjct: 243 ADMLNHNTEKGGERVAHYFYDADA----QIFKVISKTSYEPGQQVLTHYG-NKANGNFLE 297

Query: 308 DYGFIESKSDRNAYTLTL 325
           DYGF+   +D+N + L +
Sbjct: 298 DYGFVYMNNDQNEFYLPI 315


>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
 gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
          Length = 539

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/317 (23%), Positives = 136/317 (42%), Gaps = 49/317 (15%)

Query: 44  FAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAK 102
            AA  +  +A+A      A+V  F +W R   V +    I   TFP   LGL   RD+A+
Sbjct: 99  LAACPAEDAANANEQKRLAKVAAFNEWARAGGVQTDCVEI--TTFPGYQLGLRVTRDLAE 156

Query: 103 NEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILP 161
            E+VL VP +   + + +  ++   L       ++V   LI EK +   S W+ ++D LP
Sbjct: 157 GELVLTVPRQLIFSEELLPEAQ-RKLFIDFPTHLNVTYMLIIEKVRGAASNWQPFIDTLP 215

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV--------QNEYLKVEEEIILPNKQLF 213
              ++ ++++ E++  L+GT   S  +     +        +  Y++ ++ ++     LF
Sbjct: 216 TRYNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARIYASMYKCAYMQPDDSVMAGMANLF 275

Query: 214 PR-PITLDDFLWAFGILRSRAFSRLRGQNLV------------------LIPLADLINHS 254
               +  + + WA   + +R       QNLV                  LIP  D+ NH 
Sbjct: 276 TEYGLCYELYRWAVSTVTTR-------QNLVPRQLATDSDGVRNSPMSALIPFWDMANHR 328

Query: 255 PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
            G  T  Y    +     +++           KAGEQ  I Y  ++ NA+  + +GF++ 
Sbjct: 329 CGKITSYYKPSAQQMECIAQEAF---------KAGEQFFIYYG-DRCNADRLVHHGFLDM 378

Query: 315 KSDRNAYTLTLEISESD 331
            + ++   + L +S +D
Sbjct: 379 NNLKDYVHIRLGLSPTD 395


>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
 gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
          Length = 469

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 110/431 (25%), Positives = 176/431 (40%), Gaps = 96/431 (22%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN----------PDTVAAS--EIGSLCSGLKPWISV 138
           G G++A RDI +   +  +P +  IN          P     S  E+      L  W  +
Sbjct: 38  GRGIIATRDIPEETTLFTIPRQAIINVLTSELPQKLPQVFDGSIDEMDDNAEPLDSWGQL 97

Query: 139 ALFLIREKKKEDSP-WRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
            L ++ E  + DS  W+ Y DILP+  D+ +FWS+ EL+ELQG     T+L  ++  + E
Sbjct: 98  ILVMLYEVLQGDSSRWKPYFDILPQQFDTPIFWSDGELLELQG-----TSLTAEKIGKVE 152

Query: 198 YLKVEEEIILPNKQLFP---------RPITLDDFL------------WAFGILRSRAFSR 236
              +    ILP  Q  P         +P T D+ L            +AF +        
Sbjct: 153 SDAMFRSKILPIVQANPAIFYPEGAAQP-TEDELLHLAHRMGSTIMAYAFDLENDDENEN 211

Query: 237 L--------RGQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVK 287
                     G+ ++ ++P+AD +N +     E  A+   G  L +  +   +R      
Sbjct: 212 EEDGWVEDREGRTMLGMVPMADTLNAN----AEFNAHINHGESLEATAIRADIR------ 261

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRN-----AYTLTLEI--------SESDPFF 334
           AG+QVL  Y      +EL   YG++  +  R       +TL  E+        +E+    
Sbjct: 262 AGDQVLNYYG-PLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKEVIVSCLSLSAEAWKQV 320

Query: 335 GDKLDIAETNGLGESAYFDIVL--------GRTLPPAMLQYLRLVALGGTDAFLLESIFR 386
             ++D  E        YF I          GR   PA+L+ +    +     FL      
Sbjct: 321 ESQIDDEEIED-----YFVIERDSGEPGPDGRFTAPAVLREVSPELVEQLKEFL------ 369

Query: 387 NTIWGHLD---LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIA 443
                 LD   +P     +E+   V+ +  K  L+ + T+IE DE+LL   +L  R R+A
Sbjct: 370 -KAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRRMA 428

Query: 444 VGVRAGEKRVL 454
           V VR GEK++L
Sbjct: 429 VVVRLGEKKLL 439


>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
           chloroplastic-like, partial [Cucumis sativus]
          Length = 923

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 91/201 (45%), Gaps = 25/201 (12%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA--- 121
           +TF  WL  +      S +       G  L A   I   + +L+VP    I+PD++    
Sbjct: 706 DTFLPWLERKAETKISSVLSIGKSSIGRFLFASETIRAGDCILKVPFNVQISPDSLPLPI 765

Query: 122 ----ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPEC--TDSTVFWSEEEL 175
                +EIG++           + L+  K    S W  Y+  LP+     +T+FW E EL
Sbjct: 766 RDLLGNEIGNVAK------LAVVVLLEHKLGLGSEWAPYIIRLPQPWEMHNTIFWKESEL 819

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR---PITLDDFLWAFGILRSR 232
             ++ + L   +L  +  ++ E+L + + +     + FP     I+ DDF+ A+ ++ SR
Sbjct: 820 EMIRKSSLYEESLNQRSQIKREFLAIRKAL-----EAFPEIIDRISCDDFMHAYALVTSR 874

Query: 233 AFSRLRGQNLVLIPLADLINH 253
           A+    G +  LIP AD +NH
Sbjct: 875 AWRSTEGVS--LIPFADFLNH 893


>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 753

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/232 (29%), Positives = 104/232 (44%), Gaps = 27/232 (11%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS--LCSGLKPWISVALFLI------R 144
           G+ A+  I  N + + +P    I  +   A+ IG   L S L+      +FL+      R
Sbjct: 126 GVHAKTSIPPNTICVSIPKSCLITVEMGQATPIGRKILTSDLELDAPKHIFLMIYILWDR 185

Query: 145 EKKKEDSPWRVYLDILPEC-TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           +   E S +  Y  ILPE   +  +FW+ EEL  L+G+ LL       E ++ +Y+ +  
Sbjct: 186 KVNGETSFFAPYYKILPETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICS 245

Query: 204 EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL--RGQNLVLIPLADLINH-SPGITTE 260
             I P    F    TL++F WA  I+ SR F  L    +   L+P AD++NH  P  T  
Sbjct: 246 --IAPE---FGDIATLEEFQWARMIVCSRNFGLLINGHRTSALVPHADMLNHLRPRETKW 300

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
            ++ E +          F++ T   +  GEQV   Y   K N    L+YGF 
Sbjct: 301 TFSEESQS---------FTITTLQEIGMGEQVFDSYG-QKCNHRFLLNYGFC 342


>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
          Length = 415

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 119/263 (45%), Gaps = 32/263 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+K     + + PA FP  G GL+++  + + ++++ +P    +  DTV  S +G+
Sbjct: 14  KWLKDRKF--EDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLLTTDTVIRSYLGA 71

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 72  YIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPKAYTCPVCL-EPEVVNLFPKP 130

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ  +    +      + LF   +    +    LWA+  + +RA + + 
Sbjct: 131 LKAKAEEQRARVQGFFSSSRDFFSSL-QPLFSEAVESIFSYSALLWAWCTVNTRAVYVKH 189

Query: 238 RGQN--------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           R +           L P  DL+NHSP +  +        A        + +RT    +  
Sbjct: 190 RQEQCFSTEPNTCALAPYLDLLNHSPRVQVK--------AAFNEETRCYEIRTASGCRKH 241

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+V I Y  +  N +L L+YGF+
Sbjct: 242 EEVFICYGPH-DNQQLLLEYGFV 263


>gi|154312088|ref|XP_001555372.1| predicted protein [Botryotinia fuckeliana B05.10]
          Length = 222

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 77/155 (49%), Gaps = 9/155 (5%)

Query: 62  AQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A+  TF  WL++  +  +PK  +      EG G   + DI  +E++  +P    +N    
Sbjct: 8   ARTATFSAWLQEMGIRTNPKMALVD-LRQEGRGRGVE-DIDDDEIIFSIPRSAVLNAQNA 65

Query: 121 AASEIGS-LCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
               I   L   +  W+++   L+ E + +DS W  YL ILPE  +S VFWS+ EL ELQ
Sbjct: 66  KPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAILPEQLNSLVFWSDSELAELQ 125

Query: 180 GTQLLSTTLGVKEYVQNEYLKV--EEEIILPNKQL 212
            + +L+  L  +E   NEY     E+E +L N  L
Sbjct: 126 ASAVLAILLQARE---NEYSTTVEEDEDLLKNADL 157



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 27/36 (75%)

Query: 420 FHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           + TT+EEDE LL+  +L  R  +A+ VR+GEK+VL+
Sbjct: 141 YSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLR 176


>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 484

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 109/434 (25%), Positives = 178/434 (41%), Gaps = 85/434 (19%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-ASEIGSLCS---GLKP------------ 134
           G G+VA RDIA   V+  +P +  IN +T    S++  L S   GL P            
Sbjct: 39  GRGIVALRDIAPETVLFTIPRQSIINVETSGLRSQLPQLFSDEEGLAPQHGVADDDPLSS 98

Query: 135 -----WISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTL 188
                W ++ L L+ E  +   S WR YLD+LP   ++ +FW+  EL  LQ       T 
Sbjct: 99  SPLDAWGALILVLLYEHLRGAASAWRPYLDVLPATFETPMFWTGAELGALQA----GATA 154

Query: 189 GV--KEYVQNEYLKVEEEIILPNKQLFPRPITL-DDFLWAFG-----ILRSRAFSRLR-- 238
           G   +E  ++ +  +   ++  +  +F     L D+ L A        + + AF      
Sbjct: 155 GKVGRESAEDTFRGILLPVVRAHPDVFQGSAALSDEALVALAHRMGSTIMAYAFDLENDE 214

Query: 239 ----------------GQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR 281
                           G+ ++ ++P+AD++N       E  A+   G    +   + +LR
Sbjct: 215 EREDEEDEDGWVEDRDGKAMMGMVPMADILN----ADAEFNAHVNHGDNELT---VTALR 267

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESDPFFGDKL-- 338
              P+KAGE++L  Y  +  N+EL   YG++  +  R +   +  E+ E+       L  
Sbjct: 268 ---PIKAGEEILNYYGPH-PNSELLRRYGYVTERHSRYDVVEIPWELVEAATVSQLGLSQ 323

Query: 339 ------------DIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFR 386
                       D  E   + E    DI+     PP+   +     L G     L    R
Sbjct: 324 EAWAKVRGQLDEDEMEDTFVLERECPDILPTGLFPPSPPAH--FAGLPGDLQEQLHEFLR 381

Query: 387 --NTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN--LDPRLRI 442
               +   L        +EL   +V  A  +  + + TT+ +DEQLL GG   L  + R+
Sbjct: 382 AVRAVDAELVPDRRRKRDELQTTIVARALAALEAQYPTTLAQDEQLLAGGGPPLSDKERM 441

Query: 443 AVGVRAGEKRVLQQ 456
           AV VR GEKR+L++
Sbjct: 442 AVVVRVGEKRLLRE 455


>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 425

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 92/380 (24%), Positives = 155/380 (40%), Gaps = 41/380 (10%)

Query: 34  KPSFRLKSRAFAATC---SLHSASATTNPPT-------AQVETFWQWLRDQKVVSPKSPI 83
           +P  R  SR   A C   +L S+S    PP+       +    F  WL    +   K  I
Sbjct: 41  QPGSRSCSRLRLAACHADTLLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAI 100

Query: 84  RPATFPEGLG--------LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGL 132
                P   G        + A +D+   +V  EVPM   +  + V   E +  L   + L
Sbjct: 101 LERPVPCSRGGRDRPLHFVAAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKL 160

Query: 133 KPWISVALFLIREKKK-EDSPWRVYLDILPE-------CTDSTVFWSEEELVELQGTQLL 184
                +AL+L+ EKK+  DS W  Y+  L           +S + W+E EL  L G+ + 
Sbjct: 161 SELACLALYLMYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMR 220

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG----- 239
              +   E ++ EY +++    +        P  +    + F I + +AF  ++      
Sbjct: 221 DEVVVRDEGIKKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK-QAFVAVQSCVVHL 279

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
           Q + L     L+   P + T  Y    K A L + D    L    P KAGE +++ +   
Sbjct: 280 QKVSLARRFALVPLGPPLLT--YKSNCK-AMLTAVDGSVRLLVDRPYKAGEPIIV-WCGP 335

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG-- 357
           + N+ L L+YGF++  +  +   +   ++  DP + +K  +A+ NG      F + +G  
Sbjct: 336 QPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKE 395

Query: 358 RTLPPAMLQYLRLVALGGTD 377
           +     ML YLRL  +   D
Sbjct: 396 KQTISEMLPYLRLGYISDPD 415


>gi|145354487|ref|XP_001421515.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581752|gb|ABO99808.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 431

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 82/337 (24%), Positives = 133/337 (39%), Gaps = 58/337 (17%)

Query: 153 WRVYL-DILPECTDSTVF-WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           WR Y+ D LP+  DS V  W+  E+ E+   + +      K        K+ E+      
Sbjct: 111 WRAYVVDALPQTVDSPVTTWTNAEIAEMAFARGIGNCAKAKARDAGTIKKLNEK------ 164

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAG 270
             +        F WA   +RSRA    RG++  L P+ D++NHS G     +     G  
Sbjct: 165 --YGDASWTKHFAWALSCVRSRAVDLERGESF-LAPMLDMLNHSHGAANVKWDASDDGEA 221

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISES 330
           +    +L +L+T   +  GE++L QY    + + L L  GF+   +  +   L   + E+
Sbjct: 222 V----VLRALKT---IDEGEELLTQYACEPAESFL-LYMGFVGGMNPYDRVELWSSLGEA 273

Query: 331 DPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLR---------LVALGGTDAFLL 381
             +F +          G     D   GR++   +    R         L   GGT     
Sbjct: 274 ADWFAETF-----RSEGGEVVLDKAAGRSIANELEAQQRARERASIVELSGGGGTADDAA 328

Query: 382 ESIFRN---TIWG------------------HLD---LPVSHANEELICRVVRDACKSAL 417
           E   R+   T+ G                  H     +P  +A+ +++   ++  C+  L
Sbjct: 329 ELRLRDIQSTLMGPSVGWRTDYDEALWLMFRHFTQALMPDENAD-QIVMHAIKKRCEEVL 387

Query: 418 SGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
               TTIEEDE+L    +  PRLR AV  R  +K +L
Sbjct: 388 LSMKTTIEEDEKLFSAEDTPPRLRAAVEYRLFKKCLL 424


>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 648

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 91/408 (22%), Positives = 161/408 (39%), Gaps = 60/408 (14%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG---------------SLCSGLKP--- 134
           G  A +DI  N  +  +P    ++   V ASEIG                +   +K    
Sbjct: 66  GAYASKDIPPNSEICFIPSTILLSESDVRASEIGKAILTYIDEHQDAKQKISDKIKHPHA 125

Query: 135 --WISVALFLIREKK--KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQLLSTTLG 189
              +++A F++ +      DS W  YL  LP+     + W+ + +   L GT LL   + 
Sbjct: 126 EILLAMAAFIVHQVSLPTADSHWLPYLASLPKNYALPLMWTRDRIQNLLGGTSLLYMMIE 185

Query: 190 VKEYVQNEYLKVEEEIILPNKQLFPR-PITLDDFLWAFGILRSRAFSRLR-------GQN 241
             E++QN    VE          FP   +T+    WA   + SRAF + +       G +
Sbjct: 186 RLEWIQNSTKVVENAC----GHYFPTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSH 241

Query: 242 -----------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                      + L P+ D+ NH  G   E   + +   G+       S  TP  +  G 
Sbjct: 242 QDVQDWIGLSEICLFPILDMFNHKRGYRVE---WRMTEKGV-------SFITPDGICKGS 291

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
           ++L  Y   K N  L  +YGF+   +  + + + L + + DP +  K  + E     + +
Sbjct: 292 ELLNNYG-PKGNENLLSNYGFVIENNPEDYFKVFLGLQQEDPLYTAKKAVLEVVSENDLS 350

Query: 351 YFDIVLGRTLPPAMLQYLRLVALGGTDAFLLES-IFRNTIWGHLDLPVSHANEELICRVV 409
           +  + L   LP  M+   R++     +  +LE+ +  +T       P+   NE L    +
Sbjct: 351 HL-VFLKDDLPTNMISISRVLVANSWELAILEARLTSSTKVSPHSTPICLHNEILALSTL 409

Query: 410 RDACKSALSGFHTTIEEDEQ-LLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
            +   S L    +T    EQ  ++  N D   ++    RAG++ +L+ 
Sbjct: 410 YNLLNSKLKVLKSTTPNYEQDTIEFPNDDEARKLVHIYRAGQQSILEH 457


>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
           98AG31]
          Length = 509

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 111/483 (22%), Positives = 186/483 (38%), Gaps = 112/483 (23%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD--T 119
            ++E F  WL+D+ V+   S        +G GL+A  DI    ++  +P     N    T
Sbjct: 7   TRIEAFNHWLKDRGVIQHPSISIHHFGHQGHGLIATEDIEAGTILFSIPRPPVSNSPLLT 66

Query: 120 VAASEIGSLCSGL------KPWISVALFLIREKKK-------EDSPWRVYLDILPECTDS 166
           +  S+  S           + WI + + ++ E+ +           WR Y +++P   D+
Sbjct: 67  IGTSDFLSKLGTSDAEKISRNWIPLLMTMMWERARGYDQSVPSHMSWRPYFEMMPTEFDT 126

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-------TL 219
            +FWS++EL ELQ     ST LG KE  + +Y ++   +I     LFP P        T 
Sbjct: 127 LMFWSDDELKELQA----STVLG-KEEAEADYHQLVAPLIRSRSDLFPIPTSNQGKVWTW 181

Query: 220 DDF--LWAFGILRSRAFSRLRGQNLVLIPLADLINH------------------------ 253
           DDF  L  + ++ S A S  R   + ++P  D  NH                        
Sbjct: 182 DDFYGLQIYHLMGSLALS--RSFEVDVVPETD--NHDEEEEVDDDKSIEDQDENGDISMT 237

Query: 254 -----SPGITTEDYAYE-------------------IKGAGLFSRDLLFSLRTPVPVKAG 289
                S GI  E+                        + A LF      ++ T   ++ G
Sbjct: 238 AASASSTGIIEEEVGLNTSEGVAMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKG 297

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE---------ISESDPFFGDKLDI 340
           EQ+   Y  +  NA+L   YG ++ ++  +   ++LE         +  SDP   + LD 
Sbjct: 298 EQIYNTY-ADPPNADLLRRYGHVDDENPFDLAEVSLELCIRLAAESLHPSDPQNQNTLDE 356

Query: 341 AETN---GLGESAYFDIVLGRT---------LPPAMLQYLRLVALGGTDAFLLESIFRNT 388
            ++     L  S   +I +  T         LP  ++  LR++ L   + F         
Sbjct: 357 LKSRAKWALEVSDIDEIFMLPTKSQREPKEILPDELVIMLRIL-LSTEEEF--------Q 407

Query: 389 IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRA 448
            W         A  E I ++      + L+ + TTI+ D+ LL+  +L  R   ++ VR 
Sbjct: 408 TWKSKGKVPKPAMSEPIAQLAIQILSNRLNQYSTTIQNDQDLLKDQSLSRRKLKSIKVRL 467

Query: 449 GEK 451
           GEK
Sbjct: 468 GEK 470


>gi|408397548|gb|EKJ76689.1| hypothetical protein FPSE_03100 [Fusarium pseudograminearum CS3096]
          Length = 467

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 92/199 (46%), Gaps = 13/199 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV-EEEIILP- 208
           +PW  YL  LP        WSE E   LQGT L +        +  E+  + E+  +LP 
Sbjct: 133 TPWTEYLKFLPRDVPVPTMWSEVERALLQGTSLEAALEAKFASLSKEFEDLTEKSSVLPF 192

Query: 209 -NKQLFPR-PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            N   + +  + + D++      RSR     RG + V++P  D+ NHS   T   Y  + 
Sbjct: 193 WNSLFWEKGTVAIQDWILVDAWYRSRCLELPRGGD-VMVPGLDMANHSHHPTAY-YDEDD 250

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           K       D++  +R    V AGE+V I Y  +K+ AE+   YGFI+++S      L ++
Sbjct: 251 KD------DVVLLVRPGTKVSAGEEVNISYG-DKNPAEMLFSYGFIDNESTVEGLNLPVK 303

Query: 327 ISESDPFFGDKLDIAETNG 345
           +   DP    KL I  ++G
Sbjct: 304 VLPDDPLGKAKLHIFGSSG 322


>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 492

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 118/494 (23%), Positives = 194/494 (39%), Gaps = 120/494 (24%)

Query: 54  SATTNPPTA-----QVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEV 105
           S+TT+ P +     Q + F  WL  +  V   S I  A       G G+VA+ DI   E 
Sbjct: 2   SSTTHFPDSDSFQRQSDEFITWLAGKPGVKINSKISIADLRSKSAGRGVVARSDIFDGEE 61

Query: 106 VLEVPMKFWINPDTVAASEIGSLCSG----LKPWISVALFLIREKK-KEDSPWRVYLDIL 160
           +  +P    +   +   S++  L S     L PW+S+ L ++ E    E S W  Y  IL
Sbjct: 62  LFSIPRGLVL---SAQNSKLKDLLSQDLEELGPWLSLILVMMYEYLLGEQSAWAPYFKIL 118

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLD 220
           P+  D+ +FWS  EL ELQG+ ++S  +G KE  ++  +++   ++  N  LFP   ++D
Sbjct: 119 PKSFDTLMFWSPSELRELQGSAIVS-KIG-KEGAEDSIMQMIAPVVRANPSLFP---SVD 173

Query: 221 DFLWAFGILRSRAFSRL---RGQNLV---------------------------------- 243
                 G   S A  RL    G  ++                                  
Sbjct: 174 GLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVEDEDDENNDEEDGYVTDDEQDQSSK 233

Query: 244 -LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            ++PLAD++N            +   A LF  D    ++   P++ GE++   Y      
Sbjct: 234 GMVPLADILNADA---------DRNNARLFQEDDSLVMKAIKPIRVGEEIFNDYG-ELPR 283

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLD-IAETNGLGESAYFDIVLGRTLP 361
           A+L   YG++   +D  A    +E+S         LD I  + GL  +   DI    + P
Sbjct: 284 ADLLRRYGYV---TDNYAQYDVVELS---------LDQICRSAGLQNA---DI---ESYP 325

Query: 362 P-AMLQYLRLVALGG-----------TDAFLLESIFRNTIWGHLDLPVSHANEE------ 403
           P A L+ L L+  G            TD    E +            + H   +      
Sbjct: 326 PLAFLEDLELLDDGYVIPRPSPEDSLTDFLPDELLVLVKTLTLSSEQLEHQKSKSKAPKP 385

Query: 404 ----LICRVVRDACKSALSGFHTTIEEDEQLLQG----------GNLDPRLRIAVGVRAG 449
                   ++  A +   S + TT+ +DE++L G             D R ++A+ VR G
Sbjct: 386 SFGHAEAAILLKAVQLVGSQYPTTVAQDEEILSGLVQSEASQPLNQTDRRQKMAIQVRLG 445

Query: 450 EKRVLQQIDEIFKE 463
           EK +LQ +  + ++
Sbjct: 446 EKYILQTLANMLEK 459


>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
 gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
          Length = 440

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 112/267 (41%), Gaps = 40/267 (14%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D++     + + PA FP  G GL+++  + + + ++ +P    +  DTV  S +G+
Sbjct: 38  KWLKDRRF--EDTTLIPAHFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  +    P +++  FL+ EK   D SPW+ YL++LP+     V       +E +   
Sbjct: 96  YIAKWQPPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPKAYTCPV------CLEPEVVN 149

Query: 183 LLSTTLGVKEYVQNEYL----KVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRAF 234
           LL   L  K + Q  ++              + LF   +    +     WA+  + +RA 
Sbjct: 150 LLPNPLKTKAWEQRSHVWEFFSSSRGFFSSLQPLFSEAVETIFSYRALRWAWCAVNTRAV 209

Query: 235 SRLR---------GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP 285
              R              L P  DL+NHSP +  +        A        + +RT   
Sbjct: 210 YMKRPPLLCLSPEPDTCALAPYLDLLNHSPDVQVK--------AAFNEETRCYEIRTATR 261

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGFI 312
               ++V I Y  +  N  L L+YGF+
Sbjct: 262 CGKHKEVFICYGPH-DNHRLLLEYGFV 287


>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 471

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 99/421 (23%), Positives = 175/421 (41%), Gaps = 71/421 (16%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-ASEIGSLCSG------------LKPWIS 137
           G G++A +DI     +  +P +  IN +T     +I  + +G            L  W S
Sbjct: 39  GRGIIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGS 98

Query: 138 VALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           + L +I E  +   SPW+ Y ++LPE   + +FW   +L  L+G+ +LS  +G KE    
Sbjct: 99  LILVMIYEFLQGAASPWKPYFEVLPEKFHTLMFWESSDLENLKGSAVLS-KIG-KEEADE 156

Query: 197 EYLKVEEEIILPNKQLF----PRPITLDDFLW----AFGILRSRAF-------------- 234
            +      +I  N  +F      P+   + L        I+ + AF              
Sbjct: 157 MFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEEDD 216

Query: 235 ----SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                R     L ++P+AD++N                A +   D   ++    P+ AGE
Sbjct: 217 EWIEDRDGKTMLGMVPMADILNADAEFN----------AHVNHGDDELTVTALRPIPAGE 266

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESDPFFGDKLDIAETNGLGES 349
           ++L  Y  +  N+EL   YG++  K  R +   +  ++ ++     + L I +       
Sbjct: 267 EILNYYGPH-PNSELLRRYGYVTPKHSRYDVVEIPWDLVQAS--VSEHLKIGDDVWKQVQ 323

Query: 350 AYFD-------IVLGRTL--PPAMLQYLRLVALGGTDAFL---LESIFR--NTIWGHLDL 395
            Y D        VL R    P +  Q+  +  +    A L   L+++ +    I G L +
Sbjct: 324 EYVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKAIKKINGDL-I 382

Query: 396 PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           P     +E+   V+    +  LS + T+ +EDE LL   +L  R R+A+ VR GEK++L+
Sbjct: 383 PDKRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLK 442

Query: 456 Q 456
           +
Sbjct: 443 E 443


>gi|385303944|gb|EIF47986.1| ribosomal n-lysine [Dekkera bruxellensis AWRI1499]
          Length = 462

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 85/165 (51%), Gaps = 20/165 (12%)

Query: 156 YLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF 213
           YLD+LP+  +  S + ++E+EL+ L+GT L   T  V   V+ EY    +E I   K + 
Sbjct: 113 YLDLLPKAEEIRSPLIYNEDELLLLKGTNLFKGTQVVLAQVKREY----QEFIDATKSVL 168

Query: 214 PRPITLDDFLWAFGILRSRAFS-RLRGQ-----NLVLIPLADLINHSPGITTEDYAYEIK 267
            R I+  D+LW F IL SR+F  RL  +      ++L+PL D +NH P +T   ++++  
Sbjct: 169 SRTISFHDYLWGFLILYSRSFPLRLVEKECDPAEVMLVPLLDFMNHKP-LTKVTWSFDGX 227

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
             G+       S +  +   +G+  +      K N EL + YGF+
Sbjct: 228 SFGV-------SSQIELANGSGKYEVYNNYGPKGNEELLMAYGFV 265


>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
          Length = 533

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 102/410 (24%), Positives = 174/410 (42%), Gaps = 55/410 (13%)

Query: 59  PPTAQ--VETFWQWLRDQKVVSPKSPIRPATF------PEGLGLVAQRDIAKNEV--VLE 108
           PP  Q  ++ F QW  ++ +   +S     T         GL + A + I  +    +L 
Sbjct: 58  PPLMQDALDEFLQWASEEGIEVGESDAGAKTLELRLHPTMGLSIFASQAIEASTTTPLLS 117

Query: 109 VPMKFWINPDTVAASEIGSLCSGLKPWI----SVALFLIREKKKEDSPWRVYLDILPECT 164
           VP+  +    T+  S + +  + ++P       ++L  + E    DS W+ +  + P   
Sbjct: 118 VPLSTFFARFTLLDSPMMAALA-VRPVAREEAKLSLLFLYEYFDPDSFWQPWFQLFPREL 176

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQL-FPRP-----IT 218
           D   FW +  L+EL  T +      ++  ++ EY    +++ LP  +L FP        +
Sbjct: 177 DCAGFWDDLLLMELDNTSIRDAIRQLEALIEYEY----DQLDLPALRLRFPDSFVADRFS 232

Query: 219 LDDFLWAFGILRSRAF--SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFS--- 273
            DDF WAF +L SR    S       V+IP  D  NH+ G  +  ++Y  + AG  S   
Sbjct: 233 YDDFKWAFMVLASRGLTMSVNNAPCTVMIPFVDFFNHN-GAKSIAFSY-TRRAGDASDVS 290

Query: 274 ----RDLLFSLRTPV-----PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
                D + +L   V         GEQ+ + Y  + SN  L L YGF    ++ + + + 
Sbjct: 291 SGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAH-SNEVLLLHYGFALPHNEHDTFLVR 349

Query: 325 LEISE---SDPFFGDKLDIAETNGLGESAYFDIV-LGRTLPPAMLQYLR-LVALGGTDAF 379
           L       +DP    +  + E  G+ E+  F +   G  + P +L  LR ++A      F
Sbjct: 350 LHFDREKTNDPLMDLREHLLELRGIQENHPFLLRWHGDIIDPDILFALRVMIATKDQLDF 409

Query: 380 LL-ESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDE 428
           LL E I  N+       P S  +E +   +++ + +   + F TTIE DE
Sbjct: 410 LLEEGIASNS-------PFSVDSELVATVLLQRSLERNQAKFATTIESDE 452


>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
 gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
          Length = 490

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 109/241 (45%), Gaps = 15/241 (6%)

Query: 80  KSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISV 138
           KS ++ A F   G G+VA   I   E+ LE+P    I+ + +  SE+           S 
Sbjct: 167 KSILQIAFFQGAGRGMVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSE 226

Query: 139 ALFLI---REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
            + L+   RE+    S ++ Y D LP   ++ + +  + L  L+GT L    +  K++++
Sbjct: 227 TMLLLWSMRERYNLASKFKPYFDTLPANFNTGLSFGIDGLAALEGTLLFDEIMQAKQHLR 286

Query: 196 NEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQ---NLVLIPLADLI 251
            +Y ++   +     ++F + + T D+FLWA  +  S +   +      +  L+P+A L+
Sbjct: 287 QQYDELFPLLCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLSTCLVPVAGLL 346

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NHS      +Y    +     ++ L F L    P  AGEQ  + Y      + L   YGF
Sbjct: 347 NHSVSPHILNYGRVDEA----TKSLKFPLSR--PCDAGEQCFLSYG-KHPGSHLVTFYGF 399

Query: 312 I 312
           +
Sbjct: 400 L 400


>gi|302819975|ref|XP_002991656.1| hypothetical protein SELMODRAFT_236359 [Selaginella moellendorffii]
 gi|300140505|gb|EFJ07227.1| hypothetical protein SELMODRAFT_236359 [Selaginella moellendorffii]
          Length = 428

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 102/398 (25%), Positives = 157/398 (39%), Gaps = 39/398 (9%)

Query: 86  ATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-EIGSLCSGLKPWISVAL-- 140
           A FP+  G GL   R++ + E++L VP    I         E G +        SV +  
Sbjct: 32  AVFPDQGGRGLGVARNVEQGEMILRVPFAALIGVHCAREDPEFGKVLVDFAHLSSVQILT 91

Query: 141 -FLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
            +L+ E  K   S W  YL   P+   S   +S  E  ELQ    +S      E  Q ++
Sbjct: 92  AYLLSEVAKSCSSRWFSYLRHNPQVHHSLPHFSAMEAEELQVEDAISMAKSSLEDTQRQW 151

Query: 199 LKVEEEIILPNKQLFPRPIT-LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
               E   L ++   PR  T    +LWA   + SR          VL P+ DL N+   I
Sbjct: 152 ---RETSSLLSRLRLPRKFTTFKAWLWAAATISSRTLHVPWDDAGVLCPIGDLFNYDAPI 208

Query: 258 ------TTED----YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
                   ED    +   +   G  +    +        K G+Q LI Y    +N EL  
Sbjct: 209 ERTMSSRNEDDELEFTNRLTDGGYETSISSYCFYARRSYKKGQQALICYG-QYTNLELLE 267

Query: 308 DYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
            YGF+   +  +   + L   E    FG K    ++      A + I        ++LQ 
Sbjct: 268 HYGFLLPDNPCDVIYIPLPSPEE---FGLKSTGDKSERQHNLAAYCIEASGKPSFSLLQQ 324

Query: 368 LRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
           LRL A+    A L +S   +       LP+S  +++L+   ++  C+  L    TTIE+D
Sbjct: 325 LRLRAV---PASLRKS---HGYMASQGLPISRESDQLVFLWLQKKCQKLLEKLPTTIEQD 378

Query: 428 EQLLQ--------GGNLDPRLRIAVGVRAGEKRVLQQI 457
           E+ L+        G     R ++A+  R G K +LQ+ 
Sbjct: 379 ERALEQLEKSTSSGSCGVERHKLAIKWRLGYKTILQRC 416


>gi|156374449|ref|XP_001629819.1| predicted protein [Nematostella vectensis]
 gi|156216828|gb|EDO37756.1| predicted protein [Nematostella vectensis]
          Length = 281

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 70/278 (25%), Positives = 122/278 (43%), Gaps = 35/278 (12%)

Query: 62  AQVETFWQWLRDQ--KVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           + + +F  W  D   K+ +  S ++  +     G+VA  DI+ +E + +VP    + P T
Sbjct: 14  SAISSFLLWCHDNDLKLNNKVSSMQKGSC-HRYGMVAMEDISPDECLFKVPRGLLLEPKT 72

Query: 120 VAASE------IGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILP--ECTDSTVFWS 171
              S+      I ++ S  + W+ + L L+ E     S W+ Y+DI+P  +  D  +FW 
Sbjct: 73  CGISKILTGKVIQNMLSQHEGWVPLLLALMYEYTNPTSLWKPYMDIVPGIDILDQPMFWP 132

Query: 172 EEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEII---LPNKQLFPRPITLDDFLWAFG 227
           +E     LQGT         K+ ++ +Y  V   I+        L    ++L   + AF 
Sbjct: 133 DETRQSLLQGTGFEDDVEDDKQRIERQYFTVAVPIMKKFKKFFDLKRHSLSLYKHMAAF- 191

Query: 228 ILRSRAFSR----LRGQNL-VLIPLADLINHSPGITTEDYAYEIKGAGL-FSRDLLFSLR 281
            + + +F+       G N+ V++P+AD++NH          +    A L F  + L  + 
Sbjct: 192 -IMAYSFTEDSPSFHGNNVPVMVPMADILNH----------HSNNNARLEFGEEELSMVS 240

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
           T   +K GE V   Y    +N  L   YGF+E   + N
Sbjct: 241 TQHILKGGE-VFNTYG-QLANCHLLQSYGFVEGPDNPN 276


>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 640

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 104/229 (45%), Gaps = 16/229 (6%)

Query: 105 VVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT 164
           V+ E  ++   NP  +A        S   P  ++A+ L+ E  +  S WR ++  LP+  
Sbjct: 220 VMFETYLRTLENPMILAIDRRFKTMSVPDPSYALAMALLYESYEPKSMWREWISSLPQTL 279

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFL 223
           DSTVFWS EE   LQ   L   T  ++ ++Q  Y      ++     +F     + + F 
Sbjct: 280 DSTVFWSAEEQDALQSLPLKRKTQILERHLQQLYNATTPRLLAAFPHIFAGGNYSYEMFK 339

Query: 224 WAFGILRSRAFSRLRGQN----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL-- 277
           WA+ I+ SR+ +   G +    ++L PL DL++H P  T           G+   ++L  
Sbjct: 340 WAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLHHDPVQTNIQL-------GVHPEEVLGF 392

Query: 278 -FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
             SL+T   +K GE  L+++     N +L L +G    ++    Y + L
Sbjct: 393 EISLKTTRAIKKGEP-LVRHIGELPNHQLLLRFGLAMPRNPYEFYPILL 440


>gi|406603886|emb|CCH44637.1| hypothetical protein BN7_4206 [Wickerhamomyces ciferrii]
          Length = 477

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 139/332 (41%), Gaps = 53/332 (15%)

Query: 61  TAQVETFWQWLRDQKV-VSPKSPIRPATF-PEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           T     F  WL +  V +SPK  I    +  +G GL++ +DI ++E++ ++P    +N +
Sbjct: 4   TEDTHNFQNWLINSGVQISPKIKIEDLRYLSQGRGLISLQDINQDEILFKIPRNVLLNIE 63

Query: 119 TVAASEIGS----LCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEE 173
           T + S+I +    L +    W  + L ++ E     +S W  Y  ILP    S +FW ++
Sbjct: 64  TGSLSQINNNKEKLLTNYDHWEGLILTILYELSLGNESKWFQYFKILPNEFHSLMFWEKD 123

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII---LPNKQLFPRPITLDDFLWAFGILR 230
           EL      +LL  +L +    Q + L+   ++I   L +  +    I+LD F      + 
Sbjct: 124 EL------ELLKPSLVLDRIGQEKALETFNKLIPNALVDLGINHLNISLDLFHKVASTIL 177

Query: 231 SRAFSRLR----------------GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR 274
           S +F   R                G    ++ LADL+N    ++          A LF  
Sbjct: 178 SYSFDVERPDFNEDMEDDEQVQYDGYFKSMVTLADLLNADTNLS---------NANLFYE 228

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY----------TLT 324
                +++  P+  G+Q+   Y  +  N+EL   YG++E    +  +          TL 
Sbjct: 229 TEFLIMKSIKPIPQGQQIYNTYG-DHPNSELLRRYGYVEYNGSKFDFGELPISTIKKTLN 287

Query: 325 LEISESDPFFGDKLDIAETNGLGESAYFDIVL 356
           L  + S  F    LDI   + + E    DIVL
Sbjct: 288 LHFNLSFEFIDQILDIISQSDVEELEE-DIVL 318


>gi|367024041|ref|XP_003661305.1| hypothetical protein MYCTH_107134, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347008573|gb|AEO56060.1| hypothetical protein MYCTH_107134, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 506

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 63/215 (29%), Positives = 96/215 (44%), Gaps = 26/215 (12%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  YL  LPE       W+EEE + L+GT L +        +  E+  V E   +I+ 
Sbjct: 118 NPWTEYLKFLPETVPVPTLWTEEERLLLRGTSLETAVDAKISALDAEFGLVREKSSDIVG 177

Query: 208 PNKQLF----PRPITLDDFLWAFGILRSRAFSRLR-GQNLVLIPLADLINHSPGITTEDY 262
            N  L+       ++  D++    + RSR     R G+++V  P  D+INHS    T   
Sbjct: 178 WNDLLWGAEGASAVSFMDWIRLDALYRSRCLELPRSGESMV--PCIDMINHS---ATPSA 232

Query: 263 AYEIKGAGLFSRD-LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
            Y+        RD ++  LR    +  G+++ I Y   KS AE+   YGFI+ ++  N+ 
Sbjct: 233 YYDEN-----PRDEVVLRLRPGTTISRGDEITINYGDAKSAAEMLFSYGFIDRETAINSL 287

Query: 322 TLTLEISESDPF-------FGDKLDIAETNGLGES 349
            L L     DP        FG+   I  + G GES
Sbjct: 288 VLPLAPFPDDPLAKAKLVAFGEPPRIHVSRGRGES 322


>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
          Length = 543

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 116/238 (48%), Gaps = 33/238 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC---SGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+P+   I+ + V  S++  +     G+     + L+ +REK 
Sbjct: 175 GRGAIASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKH 234

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII- 206
             DS ++ Y D L E   + + +  + ++EL GT LL   +  KE ++  Y    +E+I 
Sbjct: 235 NLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERY----DELIP 290

Query: 207 -LPN-KQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGI 257
            L N +++FP  + T + +LWA  +  S +        +L+     LIP+A  +NHS   
Sbjct: 291 LLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLK---TCLIPVAGFLNHSI-- 345

Query: 258 TTEDYAYEIKGAGLFSRDL-LFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFI 312
               Y + +K   +   D+   SL+ PV  P   GEQ  + Y  N S++ L   YGF+
Sbjct: 346 ----YPHIVKYGKV---DIETSSLKFPVSRPCNKGEQCFLSYG-NYSSSHLLTFYGFL 395


>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
 gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
 gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 543

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 116/238 (48%), Gaps = 33/238 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC---SGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+P+   I+ + V  S++  +     G+     + L+ +REK 
Sbjct: 175 GRGAIASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKH 234

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII- 206
             DS ++ Y D L E   + + +  + ++EL GT LL   +  KE ++  Y    +E+I 
Sbjct: 235 NLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERY----DELIP 290

Query: 207 -LPN-KQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGI 257
            L N +++FP  + T + +LWA  +  S +        +L+     LIP+A  +NHS   
Sbjct: 291 LLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLK---TCLIPVAGFLNHSI-- 345

Query: 258 TTEDYAYEIKGAGLFSRDL-LFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFI 312
               Y + +K   +   D+   SL+ PV  P   GEQ  + Y  N S++ L   YGF+
Sbjct: 346 ----YPHIVKYGKV---DIETSSLKFPVSRPCNKGEQCFLSYG-NYSSSHLLTFYGFL 395


>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
 gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
          Length = 558

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 117/253 (46%), Gaps = 23/253 (9%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL---CSGLKPWISVALFLIREKK 147
           G G +A  D+   ++ LE+P+   I+ + V  S++  +     G+     + L+ ++E+ 
Sbjct: 185 GRGAIATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERH 244

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
             +S  ++Y D LP+  ++ + +  + ++   GT L    +  KE+++ +Y ++   +  
Sbjct: 245 NCNSKSKIYFDTLPKEFNTGLSFGVDAIMASDGTLLFDEIMQAKEHLRVQYDELVPALCN 304

Query: 208 PNKQLFPRPI-TLDDFLWAFGILRSRAF--SRLRGQ-NLVLIPLADLINHS--PGITTED 261
               +FP  + T + FLWA  +  S +     L G+    LIP+A  +NHS  P I    
Sbjct: 305 NYPDVFPPELYTWEQFLWACELWYSNSMKIKFLDGKLRTCLIPIAGFLNHSLHPHIIHYG 364

Query: 262 YAYEIKGAGLFSRDLLFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR- 318
               I            +L+ P+  P + GEQ  + Y  N S A L   YGF+    +R 
Sbjct: 365 KVDSITN----------TLKFPLSRPCRVGEQCCLSYG-NFSGAHLITFYGFLPQGDNRY 413

Query: 319 NAYTLTLEISESD 331
           +   L ++  E+D
Sbjct: 414 DIIPLDIDAGEAD 426


>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
 gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
          Length = 525

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 83/173 (47%), Gaps = 4/173 (2%)

Query: 87  TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREK 146
           T   GLG+++ +D+  N +V ++P    ++  T + S I +  + ++  I+ A+ LI E 
Sbjct: 75  TIISGLGIISNKDLKVNNIVAKIPKDIILSIHTSSISNILTKYT-MERNIATAIALIYEA 133

Query: 147 K-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI 205
              E S W  Y+  LP   D  + W +E    L GT +          + + Y  + E +
Sbjct: 134 SIGEKSKWYGYISSLPLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADIVESL 193

Query: 206 ILPNK-QLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           ++ N  + F + I + ++F  A  I+ SRAF         L+PLAD+ NH  G
Sbjct: 194 LIKNHPEYFSKEIFSFENFKIANSIVSSRAFCIDSYHGDSLVPLADIFNHKTG 246


>gi|428177623|gb|EKX46502.1| hypothetical protein GUITHDRAFT_138238 [Guillardia theta CCMP2712]
          Length = 486

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 74/289 (25%), Positives = 122/289 (42%), Gaps = 45/289 (15%)

Query: 53  ASATTNPPTAQ--VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVP 110
           A +T  P + Q   E   +W R+  +   K  ++   F  G  + A  D+ ++EV+  VP
Sbjct: 157 ADSTCGPSSEQERREKLLEWAREHGIGFEKISLQEDEFG-GTAMFASEDLEEDEVIGVVP 215

Query: 111 MKFWINPDTVAASEIGSLCSGLK------PWISVALFLIREKKKEDSPWRVYLDILPECT 164
               I  +++  S  G L   L       P +   +FL+ E++   S +R YLD+LP  +
Sbjct: 216 FSISIGRESLWRSRHGELLGQLYEDERTPPDLISCIFLLLERRSSSSFFRPYLDMLPTPS 275

Query: 165 D-STVF-WSEEELVEL----QGTQLLSTTLGVKEYVQNEYLKV---EEEI---ILPNKQL 212
             S VF W    L       +   L +  L + E     Y  V    EE       ++++
Sbjct: 276 GVSNVFHWDAHALSAFSPHEEARSLAAAHLSLFERTYQRYFTVVNKNEEFQRQFGKHQEI 335

Query: 213 FPRPITLDDFLWAFGILRSRAFSR----LRGQNLVLIPLADLINHSPGITTEDYAYEIKG 268
           F R    D  LWA+ +L SRA+       R     ++P+AD+ NH    T         G
Sbjct: 336 FSR----DQVLWAYSLLISRAWEHPDYNYRTSFHRMLPIADIANHKMSPT---------G 382

Query: 269 AGLFSRDL------LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           +G  S +       +F +     ++ G++++  Y  N  NA L + YGF
Sbjct: 383 SGWMSVEFRNQQGAVFLVTRGGAIRRGQEIVTSYS-NAGNALLLVQYGF 430


>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 536

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 116/238 (48%), Gaps = 33/238 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC---SGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+P+   I+ + V  S++  +     G+     + L+ +REK 
Sbjct: 175 GRGAIASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKH 234

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII- 206
             DS ++ Y D L E   + + +  + ++EL GT LL   +  KE ++  Y    +E+I 
Sbjct: 235 NLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERY----DELIP 290

Query: 207 -LPN-KQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGI 257
            L N +++FP  + T + +LWA  +  S +        +L+     LIP+A  +NHS   
Sbjct: 291 LLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLK---TCLIPVAGFLNHSI-- 345

Query: 258 TTEDYAYEIKGAGLFSRDL-LFSLRTPV--PVKAGEQVLIQYDLNKSNAELALDYGFI 312
               Y + +K   +   D+   SL+ PV  P   GEQ  + Y  N S++ L   YGF+
Sbjct: 346 ----YPHIVKYGKV---DIETSSLKFPVSRPCNKGEQCFLSYG-NYSSSHLLTFYGFL 395


>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
          Length = 353

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/287 (23%), Positives = 121/287 (42%), Gaps = 33/287 (11%)

Query: 83  IRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS-----GLKPWI 136
           ++PA F E G GL+A +     + V+ +P    I    V +S +G          L P  
Sbjct: 48  LKPANFKETGRGLMATKPFQIGDQVISIPEMLLITTQNVLSSYLGDFIKQQTRPKLSPMQ 107

Query: 137 SVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
            +  +LI E+ +++DS W  Y+ +LP+   + V+++ EE+  L   ++        E + 
Sbjct: 108 VICTYLICERSRQKDSFWYNYIKVLPKSYSNPVYFTNEEINWL-PRRIKRKVFDECEKIN 166

Query: 196 NEYLKVEE--EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN---------LVL 244
             Y +++    I+      F        F WA+  + +R+   L+ QN           L
Sbjct: 167 TAYRELKNLFSILESTFVSFKGIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERDHYAL 226

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
            P  DL+NH+  +       E+K A        + + T    K  +Q+ I Y  +  N +
Sbjct: 227 APFLDLLNHTNTV-------EVK-ASYNPVSKCYEIFTCTACKKYDQMFIYYGPH-DNVK 277

Query: 305 LALDYGFIESKSDRNAYTLTLE-----ISESDPFFGDKLDIAETNGL 346
           L ++YGF+  ++  N   L  E       E      +KLD  +++ L
Sbjct: 278 LFIEYGFVLPQNQHNVVELDFEDIYCKTCEERKLLRNKLDFFQSHNL 324


>gi|148671822|gb|EDL03769.1| SET domain containing 4, isoform CRA_c [Mus musculus]
          Length = 269

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 100/208 (48%), Gaps = 23/208 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K     + + PA+FP  G GL+++  + + +V++ +P    +  DTV  S +G 
Sbjct: 37  KWLKERKFED--TDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIRSSLGP 94

Query: 128 LCSGLKPWIS----VALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FL+ EK     S W+ YLDILP+     V   E E+V+L  + 
Sbjct: 95  YIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKSYTCPVCL-EPEVVDLLPSP 153

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +           + LF  P+    +   FLWA+  + +RA + R 
Sbjct: 154 LKAKAEEQRARVQDLFTSA-RGFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRS 212

Query: 238 RGQNLV--------LIPLADLINHSPGI 257
           R Q  +        L P  DL+NHSP +
Sbjct: 213 RRQECLSAEPDTCALAPFLDLLNHSPHV 240


>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
 gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
          Length = 440

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 97/420 (23%), Positives = 158/420 (37%), Gaps = 51/420 (12%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           AQ     +WL ++   S    + P  F + G GL+A + I     V+ +P +  +   TV
Sbjct: 32  AQFVLLRRWLNERGFTS--QSLIPVNFHDTGRGLMATQTIKAKNSVISLPEECLLTTSTV 89

Query: 121 AASEIGSLCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEEL 175
             S +        P IS    +  FLI E+   E S W  Y+DILP+ T +   +  + +
Sbjct: 90  LKSYMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK-TYTCPLYFPDNV 148

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP----ITLDDFLWAFGILRS 231
           +EL    L       KE  Q E     +      + LF +P     + D   WA+  + +
Sbjct: 149 IELLPRSLQKKATQQKEQFQ-ELFSSSQTFFHSLQPLFNQPTEELFSQDALRWAWCSVNT 207

Query: 232 RAFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           R       Q+           L P  DL+NH P +  E        AG       + +R+
Sbjct: 208 RTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVE--------AGFNKETRCYEIRS 259

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE-----ISESDPFFGDK 337
               K  +Q  I Y  +  N  L L+YGF+   +  +   + LE     + E D    +K
Sbjct: 260 VNGCKKFQQAFINYGPH-DNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEK 318

Query: 338 LDIAETNGLGESAYFDIVLGRTLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP 396
           L   + N        ++  G   P   ++  LRL++L        +S+        L   
Sbjct: 319 LLYLKDNDFLR----NLTFGMDGPSWRLMTALRLLSLKPQQYTRWKSVL-------LGAA 367

Query: 397 VSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLD--PRLRIAVGVRAGEKRVL 454
           VS   E+         C +        +E   QL +G N     +L +   +R  E+ +L
Sbjct: 368 VSQDREDWCIESALKLCNNLTEDNVKALERLAQLKEGANQSGLEQLCVVESLRREEQNIL 427


>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
 gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
          Length = 537

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 118/272 (43%), Gaps = 40/272 (14%)

Query: 95  VAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP----WISVALFLIREKKKED 150
           +A +DI  ++V++  P  + I     A +E+  L           I    F +   K E 
Sbjct: 34  IASKDIDGDQVLISCPESYIITLQK-AKNELCRLSPKFADEKMHTIVCTFFALERLKGEK 92

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S W  Y++ LP+  D+ ++++++EL  L+ T +       K   + E+    +  +L N 
Sbjct: 93  SQWAKYIEYLPKTFDTPLYFTDDELKSLEHTNIFYGCNDRKRIWKEEHATAAK--LLDN- 149

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSR-LRGQN------LVLIPLADLINHSP-------G 256
              P   + + +LWA  +  SR FS  L G+        +LIPL D +NH P        
Sbjct: 150 ---PDNFSWNMYLWAATVFSSRCFSSALLGEEDTDDAAPILIPLVDSLNHKPRCPIIWNK 206

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
           +T E +A +           L S++   P+ +G QV   Y   K N EL + YGF    +
Sbjct: 207 VTKESHAVQ-----------LVSVK---PISSGGQVYNNYG-PKGNEELLMGYGFCLPNN 251

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETNGLGE 348
           +   + L L + ++      K  I  ++GL +
Sbjct: 252 EFETFALRLSLDKAVYNSEKKRSILASHGLSK 283


>gi|312098619|ref|XP_003149111.1| hypothetical protein LOAG_13557 [Loa loa]
 gi|307755724|gb|EFO14958.1| hypothetical protein LOAG_13557 [Loa loa]
          Length = 288

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 69/294 (23%), Positives = 116/294 (39%), Gaps = 54/294 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +W+ +         IR  +   G GL A  D  +NE V+ +PM+  I    VA  E+ 
Sbjct: 6   FMEWVIENGGEHFGVDIRDCSNEGGKGLYATTDFRENETVICIPMEIIITAGFVA--EMP 63

Query: 127 SLCS-----GLKPWISVALFLIREKKKEDSPWRVYLDILPE------------------- 162
             C       LKP+ ++  F + EK+K +S W  Y  +LP+                   
Sbjct: 64  GYCDVFKRYRLKPFEALVYFFLVEKEK-NSKWDPYFKVLPKSFSTPASLHPVLEPEDFPY 122

Query: 163 CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF 222
           C        + EL  +       T     E+V +     +   IL +  ++        F
Sbjct: 123 CLRKQWCIQKNELKTMYEKARFVTEGTAGEFVPHNRFYSQFVAILADNTIWGH------F 176

Query: 223 LWAFGILRSRAFSR----------LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
           LWA+ I+ +R   R            G +L ++PL D++NHS          + +   ++
Sbjct: 177 LWAWHIVNTRCIYRDNKPHPLIDNTEGDSLAIVPLIDMLNHSN---------DSQCCAIW 227

Query: 273 SRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
              L L+      P+  GEQ+ I Y  + +N  L ++YGF    +  N   ++L
Sbjct: 228 DSKLNLYKAIVTRPIHEGEQIFICYG-SHTNGSLWIEYGFYLKDNICNKVEISL 280


>gi|346970168|gb|EGY13620.1| SET domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 485

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 79/192 (41%), Gaps = 20/192 (10%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           +PW  Y+  LP        WSE+E   LQGT L    L V   +Q   L  E E +    
Sbjct: 117 TPWTEYVKFLPREVPVPTMWSEQERELLQGTSL---ELAVSAKIQ--ALTSEFEALREKS 171

Query: 211 QLFP---------RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTED 261
              P           ++L D+       RSR+   L    + ++P+ DL NH+P  +   
Sbjct: 172 SDLPFWHAIFWDTNNVSLADWFLVDAWYRSRSL-ELPSAGVSMVPVLDLANHAPAPSAY- 229

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
           Y    +  G    D+   LR    + AG++V I Y   KS AE+   YGFI+     +  
Sbjct: 230 YEESARREG----DVELRLRPGSTLAAGDEVTISYGAGKSGAEMLFSYGFIDPARSTDTV 285

Query: 322 TLTLEISESDPF 333
            L L   E DP 
Sbjct: 286 ALPLAPLEDDPL 297


>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
          Length = 606

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 78/311 (25%), Positives = 122/311 (39%), Gaps = 49/311 (15%)

Query: 140 LFLIRE--KKKEDSPWRVYLDILPECTDSTVFWSEEEL------VELQGTQLLSTTLGVK 191
           LFLI E   K+E S W+ Y D LP   ++ + + EEE        EL GT  +      +
Sbjct: 280 LFLIHEMKTKRETSRWKTYFDFLPGKFETGICFEEEEGGGLNLDEELAGTGFVQKRWKER 339

Query: 192 EYVQNEYLKVEEEIILPNKQLFPRP-ITLDDFLWAFGILRSRAFS--------------- 235
           E V++ Y  +   +     Q+F R       F+WA G+  +R  +               
Sbjct: 340 EVVEHTYNMLFPWLTEEFPQVFDREHFDFQSFMWARGVFDTRCVTVKFPAEKTGKVGVDN 399

Query: 236 RLRGQN-----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
              G+        L+P AD+ NH P      YA ++    L          T  P+K G 
Sbjct: 400 NGEGEKGTRDVTCLVPWADMCNHHP------YA-QLNKPSLDPTRKFLQFCTMAPIKQGS 452

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESA 350
           QV + Y     N +L L YG+ E  +    Y + LE+ +      D+L + +   L    
Sbjct: 453 QVFLNYG-PLDNTQLLLYYGYAEQDNPYQTYAIELELPD------DELRLMKQLMLVHYR 505

Query: 351 YFDIVLGRT---LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICR 407
           + D    R    +P A+L  LR+  +   D   ++   R      L+  +S  NEE + R
Sbjct: 506 FHDSHHLRDRGEIPAALLAALRICLMDKED---MDRGAREGEANPLEGRISKRNEEDVLR 562

Query: 408 VVRDACKSALS 418
            + D     L 
Sbjct: 563 TLEDTVSYVLG 573


>gi|225424368|ref|XP_002281246.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Vitis
           vinifera]
 gi|297737636|emb|CBI26837.3| unnamed protein product [Vitis vinifera]
          Length = 483

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 107/447 (23%), Positives = 181/447 (40%), Gaps = 59/447 (13%)

Query: 51  HSASATTNPPTAQVETFWQWLRDQK-VVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEV 109
           H      +PP        +W+R +   V     I P+    GLGLVA +DI K   ++ +
Sbjct: 29  HPGRLVPHPPD-----LIKWVRREGGFVHQAVTIAPSADSCGLGLVASQDIPKGSDLIAL 83

Query: 110 PMKFWINPDTVAASEIGSLCSGL---------KPW-ISVALFLIREKKKEDSPWRVYLDI 159
           P    +   ++ +  + ++ S L         + W + + L L++E+    S W  Y+  
Sbjct: 84  PHHIPLRFSSLESEGVDTIDSVLVNLARQVPEELWAMRLGLKLLQERASIGSFWWAYISN 143

Query: 160 LPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP----- 214
           LPE     +F+  E++  LQ   LL        ++    L  E+E+    K L P     
Sbjct: 144 LPETYSVPIFFPGEDIKNLQYAPLLYQVNKRCRFL----LDFEKEVKRVLKNLKPDDHPF 199

Query: 215 --RPITLDDFLWAFGILRSRAFSRLRGQNL---------VLIPLADLINHSPGITTEDYA 263
             + +      WA   + SRAF RL G+ L         +++PL D+ NHS     +   
Sbjct: 200 RGQDVDASSLGWAMSAVSSRAF-RLHGKKLSDGTHVDVPMMLPLIDMCNHSFNPNAQIVQ 258

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF---------IES 314
            +  G    S ++L  +     +K  + +++ Y    +N    LDYGF         IE 
Sbjct: 259 EQDAG----STNMLIKVVAETQIKQDDNLVLNYGC-LNNDFFLLDYGFVIPSNPYDCIEL 313

Query: 315 KSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYL 368
           K D    +A ++   +     S P    +  +++ N  GE+    + LG +         
Sbjct: 314 KYDGALLDAASMAAGVMSPNFSSPAPWQQQILSQLNLDGEAPVLKVNLGGSELVEGRLLA 373

Query: 369 RLVALGGTDAFLLESIFRNTIWG-HLDLPVSHANEELICRVVRDACKSALSGFHTTIEED 427
            L  L  +D   ++    NT+     + P+  +NE    R V   C  AL  F T I ED
Sbjct: 374 ALRVLLASDLETVQKHDLNTLKSLAAEAPLGISNEVAALRTVIALCVIALGHFPTKIMED 433

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           E LL+ G +     +A+  R  +K V+
Sbjct: 434 EALLKQG-VSGSAELAIQFRIQKKSVI 459


>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
 gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
          Length = 491

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 109/433 (25%), Positives = 174/433 (40%), Gaps = 105/433 (24%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLCSGLKPWISVALFLIREK-KKEDS 151
           L A RDIA++E +  +P    ++ +   A E +G   + L PW+S+ + +I E  + E S
Sbjct: 61  LGAVRDIAEDEELFVIPEDLILSVENSKAREALGLNETQLGPWLSLIIVMIYEYYQGEQS 120

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            W  Y  ILP   D+ +FW+E +L ELQG  ++   +G K       L+    +I  N  
Sbjct: 121 RWEPYFHILPTSFDTLMFWTEAQLQELQGCAVVD-KIG-KSAADEAILQKVVPLIQANPH 178

Query: 212 LFPR-----PITLDDFL-------------WAFGILRSRAFSRLRGQNLV---------- 243
            FP      P+  +D L             +AF I ++        ++            
Sbjct: 179 HFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTDDEDEPAK 238

Query: 244 -LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
            ++PLAD+ N            +   A LF  +  F ++    ++AGE++   Y      
Sbjct: 239 GMVPLADIFNADA---------QRNNARLFQEEGSFVMKAIRNIQAGEEIFNDYG-ELPR 288

Query: 303 AELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLD-IAETNGLGESAYFDIVLGRTLP 361
           A+L   YG++   +D  A    +E S         LD I    GL +S       G + P
Sbjct: 289 ADLLRRYGYV---TDNYAQYDVVEFS---------LDSICNAAGLPDSH-----PGPSNP 331

Query: 362 PAMLQYLRLVALGG------TDAFLLES-------IFRNTIWGHLDLPVSHANEELICRV 408
              L     +   G      +D+  L         I  NT    L LPV++ N+    R 
Sbjct: 332 QLELLDNLGLLEDGYSIPRISDSESLNDAIPEDFLILLNT----LTLPVANLND---LRA 384

Query: 409 VRDACKSALSG----------------FHTTIEEDEQLLQG--GNLDP------RLRIAV 444
              A K   S                 + TTI+ D+++LQ    N +P      R+++AV
Sbjct: 385 RNKAPKPEFSAASAAVLLPLAVQRLCEYPTTIQSDKKMLQCLEQNFNPNDSNSRRMKMAV 444

Query: 445 GVRAGEKRVLQQI 457
            VR GEK +L Q+
Sbjct: 445 QVRMGEKEILAQV 457


>gi|358395796|gb|EHK45183.1| hypothetical protein TRIATDRAFT_39811 [Trichoderma atroviride IMI
           206040]
          Length = 484

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 122/309 (39%), Gaps = 37/309 (11%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDI-------AKNEVVLEVPMK 112
           P   +E F  W     V    + +R +   +G GLVA++DI       +  E ++ +P  
Sbjct: 4   PQLPIEAFPAWALLNDVEFKSAEVR-SIEGKGFGLVAKKDIPGVSDDSSSTEAIIRIPRD 62

Query: 113 FWINPDTVAAS-----------EIGSLCSGLKPWISVALFLIREKKKEDS--------PW 153
             ++ + V A            E+    S     +   L  +   K+  S        PW
Sbjct: 63  LVLSAEAVEAYAKVDQHFRQLLEVAGHQSTRGDILLYLLTHLILSKRNSSGSKGCASTPW 122

Query: 154 RVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI-ILPN-KQ 211
             Y+  LP        W+ EE   LQGT L S+       +  EY ++ E+   LP    
Sbjct: 123 TEYIKFLPRSISVPTMWTSEEREFLQGTSLESSVNAKLSVLSREYDELSEKASTLPFWND 182

Query: 212 LFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGL 271
           L      L+D++ A  + RSR    L    L ++P  D+ NHSP        Y+    G 
Sbjct: 183 LLSESGMLEDWILADALYRSRCL-ELPHAGLAMVPGLDMANHSPKYLAR---YDETPEG- 237

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
              D++    +   V +GE++ I Y   KS AE+   YGFI+ +S      L L+    D
Sbjct: 238 ---DVVLLPSSGSGVSSGEEITISYGEAKSAAEMLFSYGFIDQESGVKELVLHLDALPDD 294

Query: 332 PFFGDKLDI 340
           P    K  I
Sbjct: 295 PLGKAKFHI 303


>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
          Length = 352

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 81/321 (25%), Positives = 138/321 (42%), Gaps = 31/321 (9%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV- 194
           + + L L++E+ K DS W  Y+  LPE     +F+  E++  LQ   +L        ++ 
Sbjct: 1   MKLGLRLLQERAKSDSFWWPYIANLPETFTVPIFFPGEDIKNLQYAPILHQVNKRCRFLL 60

Query: 195 --QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLIN 252
             + E  +    + L +   + + +      WA     SRAF RL G+  +L+PL D+ N
Sbjct: 61  EFEKEVQQKLHTVPLVDHPFYGQDVNSSSLGWAMSAASSRAF-RLHGEVPMLLPLIDMCN 119

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           HS        A  ++   + S D+   +     +K  E + + Y    ++  L LDYGF+
Sbjct: 120 HS----FNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFL-LDYGFV 174

Query: 313 ESKSDRNAYTL----------TLEISESDPFFGD----KLDI-AETNGLGESAYFDIVLG 357
            +++  +   L          ++    S P F      + DI ++ N  GE A   + LG
Sbjct: 175 ITQNPYDQVELSYDGALLDAASMAAGVSSPNFSAPAKWQQDILSQLNLHGEGAVLKVSLG 234

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFR---NTIWG-HLDLPVSHANEELICRVVRDAC 413
               P ++    L AL    A   E++ +   NT+    + +P+    E    R V   C
Sbjct: 235 G---PDVVDGRLLAALRVLLADDPEAVHKHDLNTLMSLDVQVPLGPTVEASALRTVLALC 291

Query: 414 KSALSGFHTTIEEDEQLLQGG 434
             AL  FHT I +D+ +L GG
Sbjct: 292 AIALQHFHTKIMDDQAILGGG 312


>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
          Length = 510

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 54/214 (25%), Positives = 96/214 (44%), Gaps = 21/214 (9%)

Query: 63  QVETFWQWLRDQ---------KVVSPKSPIRPAT--------FPEGLGLVAQRDIAKNEV 105
           Q++   +WL D          ++V  K  +   T          EGLG++A +D+  +  
Sbjct: 11  QLDIVVKWLDDNGVKINHKLIEIVCQKQSVDDVTNKNTPHEQVVEGLGVIALQDLKIDHT 70

Query: 106 VLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECT 164
           V  +P    + P T + S        +K   + ++ L+ E      S W  Y+  LP   
Sbjct: 71  VAIIPKSCLLTPHTTSISAYLKK-YKIKDATATSIALLYEASIGSQSKWYGYIKSLPLSV 129

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK-VEEEIILPNKQLFPRPI-TLDDF 222
           D  + W++ +L  L+GT + +     KE V   Y K ++ ++I  +  +F   + +LD+F
Sbjct: 130 DLPILWNDADLKNLKGTSIETVVYENKETVDATYNKYIKSKLIANHPDVFNEHVFSLDNF 189

Query: 223 LWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
             A  ++ SRAF+        ++PLAD+ NH  G
Sbjct: 190 KRASCLVSSRAFNIDTYHGDSMVPLADIFNHRTG 223


>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 724

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 89/352 (25%), Positives = 139/352 (39%), Gaps = 79/352 (22%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI- 125
           F +W     + SP   +R A F    G+ A  +IAK EV++ +P+   +       S++ 
Sbjct: 77  FQEWALQSGITSPS--LRLAEFAGLRGMAAADNIAKGEVLVSLPVAAALVVSPKERSQLP 134

Query: 126 GSLCSGL----KPW-ISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           G+ CS      KPW + +AL L+ E++    S    Y+  LP    + + WSE +L  L 
Sbjct: 135 GTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALPVDFSTPLSWSEAQLQALC 194

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF----- 234
             QL+      +E ++   L  E  +  P       PIT  D +WA   +RSRAF     
Sbjct: 195 YPQLIREVATQREGLKR--LHAELAVSTPGT-----PITEQDLIWALQAVRSRAFSGPYA 247

Query: 235 -----SRLRG---------------------------------------QNLVLIPLADL 250
                SRL+                                        +   + P+ D 
Sbjct: 248 GPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDF 307

Query: 251 INHSPGITTE-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDY 309
           +NH   + +E +Y Y       F+    FS+R       GEQV I Y   +SN  L   Y
Sbjct: 308 LNHKSTVQSEVEYEY-------FADR--FSVRCQSYFSKGEQVFISYG-KQSNDSLLQYY 357

Query: 310 GFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           GF+E     + YT+  ++  +     D   +    GL +S+    +L RT P
Sbjct: 358 GFVEPGIPHDTYTIP-DLRAAALALSDTSKVPGIAGLPQSS--KGILTRTGP 406


>gi|326427686|gb|EGD73256.1| hypothetical protein PTSG_04969 [Salpingoeca sp. ATCC 50818]
          Length = 455

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 91/314 (28%), Positives = 142/314 (45%), Gaps = 52/314 (16%)

Query: 52  SASATTNPPTAQVETFWQWLRDQ-KVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVP 110
            A  T  P  A V+   +W R Q  V++P   IR +    G G+ A  D+  +  V  VP
Sbjct: 22  GALGTATP--AGVDGLVEWARAQGAVIAPAVAIRTSPL-TGNGVYATADMKAHTTVFAVP 78

Query: 111 MKFWINPD-TVAASEIGSLCSGLKPWIS----VALFLIREKKKEDSPWRVYLDIL-PECT 164
               +N +  +   ++G L   L P +S    +A FL  E  +    W+ YL  L P  T
Sbjct: 79  FSLMMNVEHALVDPDLGRLWDML-PDLSDLEVLAGFLAFESLRGTGFWQPYLASLGPPPT 137

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE--EIILP------NK------ 210
             T+F S+EE+       LL+ +  V +  Q  +L + E  ++I+       NK      
Sbjct: 138 TPTLF-SQEEM------DLLAPSAAVFDIAQQRHLDLSEVYDLIVKAATNGLNKKEKEAW 190

Query: 211 -QLFPRPITLDDFLWAFGILRSRA----FSRLRGQ---NLVLIPLADLINHSPGITTEDY 262
            QL  R     D+LWA  +LRSR+         GQ    + L+PLADL N      T + 
Sbjct: 191 QQLGMRK---SDYLWAHVVLRSRSHKLSIKDAVGQWHDAMCLVPLADLFNTDLRNNTANV 247

Query: 263 AYEI--KGAGLFSRDLLFSLRTPVPVKAGEQVLIQY---DLNKSNAELALDYGFIESKSD 317
           A     +G G  S    F   T   +   E++L++Y    + +S+ +L LDYGF+ +  D
Sbjct: 248 ACYTGEEGHGTAST---FYCETTRDINHSEELLVEYIGDAMRRSSGKLLLDYGFVPTTHD 304

Query: 318 RNAYTLTL-EISES 330
            ++  L L ++SE+
Sbjct: 305 SDSVLLHLPKLSET 318


>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 492

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 18/174 (10%)

Query: 54  SATTNPPTA-----QVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEV 105
           S+TT+ P +     Q + F  WL  +  V   S I  A       G G+VA+ DI   E 
Sbjct: 2   SSTTHFPDSDSFQRQSDEFITWLAGKPGVKINSKISIADLRSKSAGRGVVARSDIFDGEE 61

Query: 106 VLEVPMKFWINPDTVAASEIGSLCSG----LKPWISVALFLIREKK-KEDSPWRVYLDIL 160
           +  +P    +   +   S++  L S     L PW+S+ L ++ E    E S W  Y  IL
Sbjct: 62  LFSIPRGLVL---SAQNSKLKDLLSQDLEELGPWLSLILVMMYEYLLGEQSAWAPYFKIL 118

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP 214
           P+  D+ +FWS  EL ELQG+ ++S  +G KE  ++  +++   ++  N  LFP
Sbjct: 119 PKSFDTLMFWSPSELRELQGSAIVS-KIG-KEGAEDSIMQMIAPVVRANPSLFP 170


>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
 gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
          Length = 619

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 75/303 (24%), Positives = 126/303 (41%), Gaps = 51/303 (16%)

Query: 136 ISVALFLIRE--KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEY 193
           I   +FLI E   +KE S    YL++LP    + +++ E+E+  L+ T L  +   +++ 
Sbjct: 107 IVFYMFLIYELHVEKEKSTHFPYLNLLPREFTTALYFDEDEMAALRSTNLYKSVQSIRQN 166

Query: 194 VQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLV--L 244
           ++  Y    E ++    Q F R + + ++F+WAF  + SR F          G  +V  L
Sbjct: 167 LKQIYETKVEYLMNKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEYPAENGEGVEIVPTL 226

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           +P  D++NH          +  K       D  F L+T   +K+G+ V   Y   KSN  
Sbjct: 227 LPTVDILNH---------KFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGA-KSNDS 276

Query: 305 LALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG-----RT 359
             L YGF+   +  +   +   IS+              +G  E   F   LG     + 
Sbjct: 277 FLLSYGFVIPNNSEDTLYVQFGISD--------------DGNEELVQFKKNLGFYLKKQQ 322

Query: 360 LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
           LP  ML  +R+  +   D +L +   ++TI  H         E+    V R + K+    
Sbjct: 323 LPEDMLDSIRICMMTEDDFYLAK---QSTISQH--------REQSKDSVYRFSVKNEFKM 371

Query: 420 FHT 422
            HT
Sbjct: 372 LHT 374


>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 561

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 102/442 (23%), Positives = 191/442 (43%), Gaps = 61/442 (13%)

Query: 70  WLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS-- 127
           WL++Q   + K  ++    PE  G+  + ++   E +L +P    I  +    +EIG   
Sbjct: 123 WLQNQGAETKKLMLQQYA-PEVRGVHCRNELVPGERILFIPKNCLITVEMGKQTEIGQKV 181

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDST-VFWSEEELVELQGT 181
           L   ++      I + L+L+ + +K+D + ++ Y   LP    +  +FWS++EL  L+G+
Sbjct: 182 LAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYYSTLPSTLKNMPIFWSDQELSWLKGS 241

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRG- 239
            +L      K  ++ +Y     + I      F R  +L+ F WA  I+ SR F   + G 
Sbjct: 242 YILHQIQERKAAIRKDY-----DAICRADPSFSR-FSLERFSWARMIVCSRNFGLTIDGV 295

Query: 240 QNLVLIPLADLINH-SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
           +   L+P AD++NH  P  T+  +  ++ G         F++ +   + +G QV   Y  
Sbjct: 296 KTAALVPFADMLNHYRPRETSWTFDQKLDG---------FTITSLESICSGAQVYDSYG- 345

Query: 299 NKSNAELALDYGFI------ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYF 352
            K N    L+YGF       E  S+ N   +  ++   DP  G +L   +T  L ES  +
Sbjct: 346 KKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQL---DPGDG-QLLYDKTAYLYESGIY 401

Query: 353 DI----VLGRTLPPAM--LQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
            +        + P       + RL+A    D F   S  +     H   P+S  NE    
Sbjct: 402 TMNARLSCSHSDPSTREGFSFARLIA-ATEDEF---SSMKMRSPAHASPPISFRNEIAAL 457

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGN---LDPRLRIAVGVRAGEKRVLQQIDEIFKE 463
            +++    + L  + T+++E E +L+         R++    +R GEK+V      + ++
Sbjct: 458 NLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFFIR-GEKQVCMHYKHLAEK 516

Query: 464 ---------RELELDELEYYQE 476
                    +E++  E +Y+ +
Sbjct: 517 VVPLFSLTMKEMKESEQQYFDD 538


>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
          Length = 537

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 90/376 (23%), Positives = 161/376 (42%), Gaps = 42/376 (11%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK-E 149
           + A  D+ K +V   VP    +  + V  +E I  L   + L     +AL+L+ EKK+ +
Sbjct: 115 VAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 174

Query: 150 DSPWRVYLDILPEC-------TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            S W  Y+  L           +S + WSE EL  L G+   +  L   E ++ EY +++
Sbjct: 175 KSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNELD 234

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQ----NLVLIPLADLINHSPGIT 258
               +        P  +    ++F I + +AF  ++       +VL+  ++L  ++   T
Sbjct: 235 TVWFMAGSLFQQYPFDIPTEAFSFEIFK-QAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293

Query: 259 TE-----------------DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
                               Y    K A L + D    L    P KAG+ +++ +   + 
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCK-AMLTAVDGAVELVVDRPYKAGDPIVV-WCGPQP 351

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG--RT 359
           NA+L L+YGF++  +  +   +   ++  DP + DK  +A+ NG      F + +G  R 
Sbjct: 352 NAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKERE 411

Query: 360 LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
               ML YLRL  +  +D   ++S+  +   G +  P+S   E  +   + +     LSG
Sbjct: 412 AVQDMLPYLRLGYM--SDPSEMQSVISSQ--GPV-CPMSPCMERAVLDQLANYFMRRLSG 466

Query: 420 FHTTIEEDEQLLQGGN 435
           + TT +ED+ L    N
Sbjct: 467 YPTTPKEDDALEASCN 482


>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
          Length = 495

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 102/210 (48%), Gaps = 9/210 (4%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIA---KNEVVLEV 109
           ASA+T+    ++  F +W+RD  VV   + +R     +G G V  R +A   + ++V  +
Sbjct: 3   ASASTSTAR-RLRAFRRWMRDHGVVCSNA-LRLDAAEDGGGGVYVRALAALREGDLVATI 60

Query: 110 PMKFWINPDTVAASEIGSLCSGLKPW-ISVALFLIREKKKEDSPWRVYLDILPECTDSTV 168
           P    + P T  A+E         P  ++VA+   R +  E SPW  YL ++PE     +
Sbjct: 61  PRGACLTPRTSGAAEAIEAAELGGPLALAVAVMYERARGAE-SPWDAYLRLIPEREPVPL 119

Query: 169 FWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAF 226
            W  +E    L GT+L       ++++  ++ +  E +IL  + ++ P   +L+++  A 
Sbjct: 120 VWPADEAERLLAGTELDKIVKQDRQFICEDWKECIEPLILSGELEVDPDDFSLENYFSAK 179

Query: 227 GILRSRAFSRLRGQNLVLIPLADLINHSPG 256
            +L SR+F         ++PLADL NH  G
Sbjct: 180 SLLSSRSFRIDSYHGSGMVPLADLFNHKTG 209


>gi|402862437|ref|XP_003895567.1| PREDICTED: SET domain-containing protein 4 [Papio anubis]
          Length = 456

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 134/330 (40%), Gaps = 65/330 (19%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+ +K     S + PA FP  G GL++Q  + + ++++ +P    +  DTV  S +G+
Sbjct: 38  KWLKARKFQD--SHLVPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTADTVIRSYLGA 95

Query: 128 LCSGLK----PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
             +  K    P +++  FL+ EK   D S W+ YL+ILP+     V   E E+V L    
Sbjct: 96  YITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKAYTCPVCL-EPEVVNLLPKS 154

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA----- 233
           L +     + +VQ E+     +     + LF   +    +    LWA+  + +RA     
Sbjct: 155 LKAKAEEQRAHVQ-EFFASSRDFFSSLQPLFVEAVDSIFSYSALLWAWCTVNTRAVYLRP 213

Query: 234 ----------------------FSRLRG---QNLVLIPLADLINH-SPGITTEDYAYEIK 267
                                 FS L     QNL+L PL+ L          E ++YEI 
Sbjct: 214 RQRECLSAEPDTCALAPACLCPFSPLCSSGLQNLMLHPLSGLSQQVKAAFNEETHSYEI- 272

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
                        RT    +  E+V I Y  +  N  L L+YGF+   +      ++ EI
Sbjct: 273 -------------RTTSRWRKHEEVFICYGPH-DNQRLFLEYGFVSVHNPHACVYVSREI 318

Query: 328 -----SESDPFFGDKLDIAETNGLGESAYF 352
                   D     K+ I + +G  E+  F
Sbjct: 319 LVKYLPSRDKQMDKKISILKDHGYIENLTF 348


>gi|145354720|ref|XP_001421625.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581863|gb|ABO99918.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 375

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 89/196 (45%), Gaps = 28/196 (14%)

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEEL-VELQGTQLLSTTLGVKEYVQN 196
           +  F+I  ++ E+S W+ Y+D LP   D+ + +S+EEL  EL GT + +     K +V+ 
Sbjct: 40  MCFFMIERRRGEESAWKEYIDSLPRAYDAPLSFSDEELERELSGTTVYAPVKAQKAHVK- 98

Query: 197 EYLKVEEEIILP--------------NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL 242
              K+ EE + P              +  + P  ++  +F WAF    SRA +   G   
Sbjct: 99  ---KMFEECVRPAMRELTQADNAAGSSLHMLP-DVSEKEFAWAFQTFWSRALAIPVGAGG 154

Query: 243 -----VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP--VPVKAGEQVLIQ 295
                 ++P  D++NH+P          ++ +       +  +  P    +K GE++ I 
Sbjct: 155 SVTVDSVVPGVDMVNHAPRARANARWEHVEDSSRPDGGYVALVSAPPNRTMKDGEEIFIN 214

Query: 296 YDLNKSNAELALDYGF 311
           Y  +KSN EL   YGF
Sbjct: 215 YG-DKSNEELLFTYGF 229


>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
          Length = 482

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 70/281 (24%), Positives = 107/281 (38%), Gaps = 40/281 (14%)

Query: 90  EGLGLVAQRDIAKNE-------VVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVA--- 139
           +G GLVA+ DI            +L +P    ++ + V   E   +    K  + VA   
Sbjct: 31  KGFGLVAKHDITDESRDASGPATILRIPRDLVLSAEAV--EEYAKVDQNFKQLLDVAGHQ 88

Query: 140 -----------LFLIREKKKE-------DSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
                        L++ K           +PW  Y+  LP        W+ +E   L+GT
Sbjct: 89  STRGDIMLYLLTHLVQSKATSPGTRAFASTPWTEYIRFLPRPIPVPTMWTNDERELLKGT 148

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPN--KQLFPRPITLDDFLWAFGILRSRAFSRLRG 239
            L +        + +EY K+ EE    +    L     TL+D++ A    RSR     R 
Sbjct: 149 SLEAAVSAKLSALSSEYDKLCEEASALSFWSTLLSESATLEDWVLADAWYRSRCLELPRA 208

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
            +  ++P  D+ NHS    +    Y+    G    D++   R    + AG ++ I Y   
Sbjct: 209 GH-AMVPGLDMANHS---QSHSAYYDESSDG----DVVLLPRPGSKIPAGAEITISYGEA 260

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
           K  AE+   YGFI+  S     TL LE    DP    K  I
Sbjct: 261 KPAAEMLFSYGFIDKDSTVKELTLHLEALPDDPLGRAKFHI 301


>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
          Length = 467

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 110/488 (22%), Positives = 178/488 (36%), Gaps = 133/488 (27%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWIN- 116
           TA+   F QW +     +    I          G G+ A RDI  +  +  +P    IN 
Sbjct: 7   TARTAAFLQWFKALPGATFSDAIEIVDLRSRDAGRGITALRDIPADTTLFTIPRDAIINS 66

Query: 117 ---------PDTVAASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDS 166
                    PD   +         L  W ++ L ++ E    ++S W+ Y+D+LP   D+
Sbjct: 67  ETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMMYEFFLGDESKWKPYIDVLPLTFDT 126

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAF 226
            +FWSEEEL  LQ +  +           N+  K + E      ++F             
Sbjct: 127 PMFWSEEELSYLQASATV-----------NKIGKADAE------EMF------------- 156

Query: 227 GILRSRAFSRLRGQNLVLIPLADL-------INHSPGITTEDYAYE-------------- 265
              R+R    +RG   V +   D        + H  G T   YA++              
Sbjct: 157 ---RTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDG 213

Query: 266 -------------IKGAGLFSRDLLF-----------SLRTPVPVKAGEQVLIQYDLNKS 301
                        +  A + + D  F           ++ +   +KAGE++L  Y  +  
Sbjct: 214 WVEDREGKSMMGMVAMADILNADAEFNAHVNHGDEELTVTSIRDIKAGEEILNYYGPH-P 272

Query: 302 NAELALDYGFIESKSDR-----------------------NAYTLTLEISESDPF---FG 335
           N+EL   YG+I  K  R                       +    T+E  + D     F 
Sbjct: 273 NSELLRRYGYITEKHSRYDVVEIPWDAVQHSLMSELGVPQDIMAETMERMDQDDLEDIFV 332

Query: 336 DKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL 395
            + D  E N  G  A   +V G  +PP + + L+          LL+ +  N I      
Sbjct: 333 LERDSGEPNPDGTFAGPAVVDG--MPPDLKEQLKATIK------LLQKLDGNLISD---- 380

Query: 396 PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
                 ++++   + +  +   S + TTI EDE LL   +L  R R+AV VR GEK++LQ
Sbjct: 381 --KRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVRLGEKKLLQ 438

Query: 456 QIDEIFKE 463
           +  + F E
Sbjct: 439 EACDHFSE 446


>gi|412985257|emb|CCO20282.1| SET domain-containing protein-like (ISS) [Bathycoccus prasinos]
          Length = 615

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 136/310 (43%), Gaps = 66/310 (21%)

Query: 60  PTAQVETFWQWLRDQ-KVVSPKSPIRPATFPEGLGLVAQRDIAKNE------------VV 106
           P  +V     +L D+ ++++   P++ A  P   GL +  +I++ E            +V
Sbjct: 115 PHEKVAKLHHFLGDENRILAHAKPLKSADAPVHFGLFSDGEISRKEKRRSVRKESGETIV 174

Query: 107 LEVPMKFWINPDTVAASE-IGSLCSGL------KPWISVALFLIREKK-KEDSPWRVYLD 158
           L VP  + ++  T A+ E +G   + +          +  +F++ +++ K +S W+ Y+D
Sbjct: 175 LNVPDAYIVSASTAASDEDLGETFTKMLDENKINTRQATQMFVLAQRRLKSESGWKAYVD 234

Query: 159 ILPECTDST-VFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEI----ILPN-KQ 211
            LP   D   +FW+E E+   L+GT L       K  ++ EY  V ++     +LP  K+
Sbjct: 235 FLPRRMDLVPMFWTEREIERGLKGTVLYEMVKTQKARLKEEYETVVKDAFDANVLPKLKE 294

Query: 212 LFPR------------------PITLDDFLWAFGILRSRAFS--RLRGQNLV--LIPLAD 249
           + P                   P++ ++FLWA  +  +RA +     G+ +V  L+PL D
Sbjct: 295 IIPSSSSSVFVSLFGGNANDTSPLSFEEFLWAKALFWTRALTIPIENGRVIVEALVPLVD 354

Query: 250 LINHSPGITTEDYAYEIKGAGLFSRDLL-FSLRTPVPVK----AGEQVLIQYDLNKSNAE 304
             NHS       Y          S DL    LR P  ++    + +++ I Y +   N E
Sbjct: 355 ACNHSTKKPNARYQ--------LSNDLKSVELRVPSKIEDNMTSEDEIKITYGV--ENLE 404

Query: 305 LA-LDYGFIE 313
            A   YGF++
Sbjct: 405 RAFFTYGFVD 414


>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
 gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
          Length = 454

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 118/265 (44%), Gaps = 42/265 (15%)

Query: 77  VSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWI----NPDTVAASEIGSLCSG 131
           +SPK  I    +  +G G++A  DI ++EV+ ++P   ++    +PD +           
Sbjct: 24  ISPKIAIHDYRSDHQGRGVIASEDIEEDEVLFKIPRSSFLSVENDPDFIKQV---PEAKK 80

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVK 191
           L  W+ + L++++      + W+ Y D+LP   DS + W+++EL  L+G+ ++   +G K
Sbjct: 81  LNSWLQLILYMMKAGSM--TKWKPYFDVLPTQLDSLMMWTDDELEGLKGSMIVK-KIG-K 136

Query: 192 EYVQNEYLKVEEEIILPNKQLFPRPIT-LDDFLWAFGILRSRAFS--------------- 235
              + +Y +  + II  + + F    T L+ F    G++ + +F                
Sbjct: 137 AGAEEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAPDSFSEDEEDDEDI 196

Query: 236 ---RLRGQNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
               L  + LV  ++PLAD +N        +   E  G         FS+    P+K GE
Sbjct: 197 EHDDLYNEGLVKAMVPLADTLNAHTRFCNANLIAEDDGG--------FSMTAIQPIKKGE 248

Query: 291 QVLIQYDLNKSNAELALDYGFIESK 315
           QV   Y     N +    YG++E++
Sbjct: 249 QVYNTYG-ELPNCDFLRRYGYVENE 272


>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 595

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 95/208 (45%), Gaps = 18/208 (8%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLST------TLGVKEYVQNE 197
           R K    +PW  Y+ +LP        W+E E + L GT L +       +LG KE+  + 
Sbjct: 214 RPKGLSPTPWTEYIRLLPRPIPVPTMWTEPERLLLNGTSLEAALEAKLLSLG-KEF--DT 270

Query: 198 YLKVEEEIILPNKQLFP-RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
             +V E+    N+ L+    ++L+D++      RSR     R     ++P  D++NHS  
Sbjct: 271 LREVSEDFPFWNEFLWSGEEVSLEDWVLVDAWYRSRCLELPRS-GTAMVPGLDMVNHSSK 329

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            T     YE         +++  +R   PV++GE+V I Y   K  +E+   YGFI+  +
Sbjct: 330 ATA---YYEEDD----HDNVVLLIRPGCPVRSGEEVTISYGDAKPASEMLFSYGFIDPNN 382

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETN 344
             +  TL L+    DP    KL I+ + 
Sbjct: 383 IVDKLTLRLDPFPDDPLARAKLRISNSG 410


>gi|224080099|ref|XP_002306017.1| SET domain-containing protein [Populus trichocarpa]
 gi|222848981|gb|EEE86528.1| SET domain-containing protein [Populus trichocarpa]
          Length = 468

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 70/307 (22%), Positives = 132/307 (42%), Gaps = 46/307 (14%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           + ++  F +W+  Q +      ++    PE G+ + A  D+ + +++ ++P    +   T
Sbjct: 3   SRRLRAFKRWMISQGIEWSNDTLQFIDNPEEGISVNAFWDLKEGDLIAKIPKTACLTIKT 62

Query: 120 VAASEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEEL-VE 177
             A ++    +GL  ++ +++ L+ EK    DSPW  YL +LP+C    + WS +E+ + 
Sbjct: 63  SGAQDLIE-STGLDGYLGLSVALMYEKSLGGDSPWAGYLQVLPDCECLPLVWSLDEVDLL 121

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILP-----NKQLFPRPITLDDFLWAFGILRSR 232
           L+GT+L       K  +  ++    +E ILP        + P+  +++ +  A  ++ SR
Sbjct: 122 LRGTELHKIVKEDKALIYEDW----KESILPLLDSLPSNIDPKFFSVEQYFAAKSLIASR 177

Query: 233 AFSRLRGQNLVLIPLADLINHSPG-------------------------------ITTED 261
           +F         ++PLADL NH  G                               I  ++
Sbjct: 178 SFEIDDYHGFGMVPLADLFNHKTGAEDVHFTSTSSHSESDDDSDNSDTVDLDADNIGNKE 237

Query: 262 YAYEIKGAGLFSRD-LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
            + E+  + +   D L+  +     VKAG +V   Y L   NA L   YGF E  +    
Sbjct: 238 PSSELDCSSVTGDDPLVLEMIMVKDVKAGVEVFNTYGL-LGNAALLHRYGFTEPDNSFGI 296

Query: 321 YTLTLEI 327
             + LE+
Sbjct: 297 VNIDLEL 303


>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
          Length = 495

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 102/210 (48%), Gaps = 9/210 (4%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIA---KNEVVLEV 109
           ASA+T+    ++  F +W+RD  VV   + +R     +G G V  R +A   + ++V  +
Sbjct: 3   ASASTSTAR-RLRAFRRWMRDHGVVCSDA-LRLDAAEDGGGGVYVRALAALREGDLVATI 60

Query: 110 PMKFWINPDTVAASEIGSLCSGLKPW-ISVALFLIREKKKEDSPWRVYLDILPECTDSTV 168
           P    + P T  A+E         P  ++VA+   R +  E SPW  YL ++PE     +
Sbjct: 61  PRGACLTPRTSGAAEAIEAAELGGPLALAVAVMYERARGAE-SPWDAYLRLIPEREPVPL 119

Query: 169 FWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAF 226
            W  +E    L GT+L       ++++  ++ +  E +IL  + ++ P   +L+++  A 
Sbjct: 120 VWPADEAERLLAGTELDKIVKQDRQFICEDWKECIEPLILSGELEVDPDDFSLENYFSAK 179

Query: 227 GILRSRAFSRLRGQNLVLIPLADLINHSPG 256
            +L SR+F         ++PLADL NH  G
Sbjct: 180 SLLSSRSFRIDSYHGSGMVPLADLFNHKTG 209


>gi|302921343|ref|XP_003053266.1| hypothetical protein NECHADRAFT_105995 [Nectria haematococca mpVI
           77-13-4]
 gi|256734206|gb|EEU47553.1| hypothetical protein NECHADRAFT_105995 [Nectria haematococca mpVI
           77-13-4]
          Length = 371

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 108/245 (44%), Gaps = 43/245 (17%)

Query: 83  IRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALF 141
           I P   P  G+G+VA RDI +NE +L VPMK     DTV   +I     G+     +A  
Sbjct: 21  IEPRILPGRGIGVVATRDIRENEAILTVPMKALRTIDTV-PKQISKALHGVSVHGILAAE 79

Query: 142 LIREKKKEDSPWRVYLDILPECTD----STVFWSEE--ELVELQGTQLLSTTLGVKEYVQ 195
           +  +K  + + W+    +LP   D      + W  E   L+  +   LL          Q
Sbjct: 80  IALDKSDDFAVWKT---VLPTKEDLESGMPMMWPSELQLLLPRRAKNLLDK--------Q 128

Query: 196 NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF--------SRLRGQNLVLIPL 247
               + E EI+L     FP  +T DD+L+A+ ++ +R F        +  +   LV +P 
Sbjct: 129 TTTFRRECEIVL---NAFPN-LTRDDYLYAWVLINTRTFYNSMPKMKAYAQADRLVCMPA 184

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
           ADL NH+            +G  L    L ++++     + GE+V + Y  + SN  L  
Sbjct: 185 ADLFNHAD-----------QGCQLSFSPLGYTIKADRVYRQGEEVYVSYGPH-SNDFLLT 232

Query: 308 DYGFI 312
           +YGFI
Sbjct: 233 EYGFI 237


>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
          Length = 253

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 78/155 (50%), Gaps = 14/155 (9%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPE----GLGLVAQRDIAKNEVVLE 108
           +S       A +E F +W  +       + +   +  E    GLG+ A  DIA++ +V+ 
Sbjct: 76  SSVDNRASAATIENFLKWANENG-----AQLNGCSIDEFEGYGLGVKANVDIAESSLVIA 130

Query: 109 VPMKFWINPDTVAASEIGSLCS-----GLKPWISVALFLIREKKKEDSPWRVYLDILPEC 163
           VP K  ++ +    S +  L       G  P +++++FL+ EK K DS W+ Y+DILP+ 
Sbjct: 131 VPRKLMMSVENAKESVLKDLIEKDKILGSMPNVALSIFLLLEKYKGDSFWKPYIDILPKT 190

Query: 164 TDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
             + +++S +EL EL+G+  L   L   + +  +Y
Sbjct: 191 YTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225


>gi|46136815|ref|XP_390099.1| hypothetical protein FG09923.1 [Gibberella zeae PH-1]
          Length = 484

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 89/194 (45%), Gaps = 13/194 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV-EEEIILP- 208
           +PW  YL  LP        WSE E   LQGT L +        +  E+  + ++  +LP 
Sbjct: 133 TPWTEYLKFLPRDVPVPTMWSEVERALLQGTSLEAALEAKFASLSKEFEDLTDKSSVLPF 192

Query: 209 -NKQLFPR-PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            N   + +  +T+ D++      RSR     RG +  ++P  D+ NHS   T   Y  + 
Sbjct: 193 WNSLFWEKGTVTIQDWILVDAWYRSRCLELPRGGD-AMVPGLDMANHSHHPTAY-YDEDD 250

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           K       D++  +R    V AGE+V I Y  +K+ AE+   YGFI+++S      L ++
Sbjct: 251 KD------DVVLLVRPETTVSAGEEVNISYG-DKNPAEMLFSYGFIDNESTVEGLNLPVK 303

Query: 327 ISESDPFFGDKLDI 340
           +   DP    KL I
Sbjct: 304 VLPDDPLGKAKLHI 317


>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
 gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
          Length = 604

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/228 (27%), Positives = 104/228 (45%), Gaps = 32/228 (14%)

Query: 138 VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           ++LFL+ EK K +DS W  Y+  LP    + V+++E EL  L  + L      +K+ + +
Sbjct: 239 LSLFLLLEKNKGKDSFWYPYIRSLPNSFTTPVYFTESELNALSPS-LQEKARDLKKELLH 297

Query: 197 EYLKVEEEI--ILPNKQLFPRPITLDDFLWAFGILRSRAF----------SRLRGQNLVL 244
            +  +E  +   LP         T D F WA+ +L++R            S    Q   L
Sbjct: 298 AFNDLEPFVTSCLPE---LDSTFTFDAFRWAWSVLKTRTLYQEDCRSPYLSNKEPQTSTL 354

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           +P+ DLINHSP      + Y +  +        + +R   P +  +QV I Y   + N E
Sbjct: 355 VPMLDLINHSPSAKAR-FGYNVNTS-------CYEVRVLEPYRKYDQVFISYGF-EENTE 405

Query: 305 LALDYGFIESKSDRNAYTLTL-EISESDPFFGD-----KLDIAETNGL 346
           L L +GF   ++ ++   + L E+ ES P   D     K+D+   +GL
Sbjct: 406 LMLKFGFFVPENPKDFMKINLSEMLESLPQINDEERKNKVDLLFDSGL 453


>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Entamoeba nuttalli P19]
          Length = 791

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 126/316 (39%), Gaps = 60/316 (18%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN-------------PDTVAASEIGSLCSGLKPWIS 137
           G GL A ++  K+E+++ +P    IN             P      +     +GL     
Sbjct: 27  GRGLCANKEFKKDEIIMSIPYSIQINRINLNHIWPEVKLPKFNEGDDDRDDLNGL----- 81

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT-------LGV 190
           V L+L   K         Y+++LPE  D  + ++ +EL  ++GT+L +         + V
Sbjct: 82  VYLYLAVNKTNPKCFHWPYINVLPETYDCPLSYTIDELNLMKGTKLYAAVEKINAFLMKV 141

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR----GQNLVLIP 246
            +Y  N+ ++   +   P   LF R        WA     SRAF  +     G+   LIP
Sbjct: 142 VDYYNNKLIQQFPQYFQPFDDLFKR------LQWAHQSFWSRAFLVIYPQPFGEVGSLIP 195

Query: 247 LADLINHS--------PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
             D  NH             TE ++++   A                VK GEQ+   Y +
Sbjct: 196 FCDFSNHCTQAKVTYISNTQTETFSFQTNEA---------------LVKPGEQIFNNYRI 240

Query: 299 NKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR 358
            +SN +L L YGF+E  +  +   L +     D  + +  +I +   +    +F + L  
Sbjct: 241 -RSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF-LKLDE 298

Query: 359 TLPPAMLQYLRLVALG 374
            +P  +++ LR+V L 
Sbjct: 299 DIPLELMRILRIVNLS 314


>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 498

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 88/190 (46%), Gaps = 13/190 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LPN 209
           +PW  Y+  +P       F++E EL  L+GT L +        ++ E+ ++ +    +P 
Sbjct: 132 NPWTEYIRFMPPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQSTEGIPW 191

Query: 210 KQLF-----PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
            Q +        +T DD+ +   + RSR    L      ++P  D+ NH+    +ED + 
Sbjct: 192 CQKYWWDEDTGRLTFDDWKYVDAVYRSRVVE-LPESGHAIVPCVDMANHA----SED-SV 245

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAYTL 323
           + +     + D L  LR    + +GE+V I Y   K  +E+   YGF+E+ ++D     L
Sbjct: 246 KARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTDAKQIFL 305

Query: 324 TLEISESDPF 333
            LEI + DP 
Sbjct: 306 DLEIPDDDPL 315


>gi|238011186|gb|ACR36628.1| unknown [Zea mays]
          Length = 200

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 76/142 (53%), Gaps = 11/142 (7%)

Query: 105 VVLEVPMKFWINPDTVAASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYL 157
           V + VP+   I P  V     +G  C  L         + V LFL+ E+++  S W+ YL
Sbjct: 8   VAMVVPLDLAITPMRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRRPGSLWKPYL 67

Query: 158 DILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY-LKVE---EEIILPNKQLF 213
           D+LP    S+++++EEEL EL+GT L   TL  ++ +Q+ +  KV+   EE++  ++   
Sbjct: 68  DMLPSTFGSSLWFTEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEELLHVDESAS 127

Query: 214 PRPITLDDFLWAFGILRSRAFS 235
              +  +DFLWA  I  +RA +
Sbjct: 128 SVEVLFEDFLWANSIFWTRALN 149


>gi|402077734|gb|EJT73083.1| SET domain-containing protein 8 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 403

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 83/187 (44%), Gaps = 11/187 (5%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IIL 207
           +PW  Y+  LP        WS+ E + L GT L +        ++ E+  ++E+   +  
Sbjct: 39  NPWTQYIKYLPRAVPLPTLWSDAERLLLNGTSLETAVQAKLLALEREFDDLKEKSSALAT 98

Query: 208 PNKQLFPR-PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            N+ L+   P+ L D++      RSR     RG +  ++P  D+INHS   T        
Sbjct: 99  WNEILWENGPVKLRDWVVLDAWYRSRCMELPRGGS-SMVPCIDMINHSRQWTASYTQNPD 157

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
            G    S DL  S  T V V  G ++ I Y   K+ AE+   YGF++ +       L +E
Sbjct: 158 DG----SVDLCLSRDTDVAV--GTEITISYGNEKAPAEMLFSYGFVDPEGGPEMVVLPVE 211

Query: 327 ISESDPF 333
           + E DP 
Sbjct: 212 VPEDDPL 218


>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
          Length = 371

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 108/245 (44%), Gaps = 43/245 (17%)

Query: 83  IRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALF 141
           + P   P  G+G+VA RDI  NE +L VPMK     DTV  + I     G+     +A  
Sbjct: 21  VEPRILPGRGIGIVATRDIKPNETILSVPMKALRTIDTVPKN-ITEALQGVSVHGILAAE 79

Query: 142 LIREKKKEDSPWRVYLDILPECTD----STVFWSEE--ELVELQGTQLLSTTLGVKEYVQ 195
           +  +K  + S W+    +LP   D      + W  E   L+  +   +L          Q
Sbjct: 80  IALDKSDDFSVWKT---VLPTREDLEAGVPMMWPSELQALLPKRAKDILDN--------Q 128

Query: 196 NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF--------SRLRGQNLVLIPL 247
           N   + E EI+L   + FP+ +T D++L+++ ++ +R F        S      LV +P 
Sbjct: 129 NTTFRRECEIVL---KAFPK-LTRDEYLYSWVLINTRTFYNSMPKMKSYAHVDRLVCMPT 184

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
           ADL NH+            +G  L    L +S++       GE+V + Y  + SN  L  
Sbjct: 185 ADLFNHAD-----------QGCKLAYSALGYSVQADRVYHQGEEVYVSYGPH-SNDFLLS 232

Query: 308 DYGFI 312
           +YGFI
Sbjct: 233 EYGFI 237


>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 449

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/281 (28%), Positives = 124/281 (44%), Gaps = 49/281 (17%)

Query: 65  ETFWQWLRDQ-KVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           + F  WL++    +SPK  I+ A   +   G G+VA  D+  +E +  +P    +  +T 
Sbjct: 11  DKFLTWLKNTGATISPK--IQLADLRDRAAGRGVVATSDLTSDEEIFRIPRTSILTTETT 68

Query: 121 A-ASEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
               EI  L     PW+S+ L +I E     +S ++ YLDILPE  ++ +FW++ EL  L
Sbjct: 69  DLPQEI--LQQLTDPWLSLILAMIFEYLLGTNSRFKPYLDILPESFNTLMFWTDNELQYL 126

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAF-----GILRSRA 233
           QG+ +LS  +G KE   N + +    II  N ++F      +  L A       I+ S A
Sbjct: 127 QGSAILS-KIG-KEEADNTFSEQLLPIITKNPEIFKIGTCNNQDLLALCHRMGSIIMSYA 184

Query: 234 FSRLRGQNL----------------------VLIPLADLINHSPGITTEDYAYEIKGAGL 271
           F                               LIPLAD++N +  +T          + L
Sbjct: 185 FDLDPPPTTTTSSSEEWESDSDSENEKISPKALIPLADMLNANGDLT---------NSKL 235

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           F     F ++T  PV AGE++L  +      A+L   YGF+
Sbjct: 236 FFSSDSFIMKTLQPVAAGEELLNDFG-PLPPADLLRRYGFV 275


>gi|389629438|ref|XP_003712372.1| hypothetical protein MGG_04887 [Magnaporthe oryzae 70-15]
 gi|351644704|gb|EHA52565.1| hypothetical protein MGG_04887 [Magnaporthe oryzae 70-15]
          Length = 499

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 108/481 (22%), Positives = 192/481 (39%), Gaps = 115/481 (23%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMK-----------------FWINPDT-------VAASEIG 126
           G G+VA  DI  + V+  +P K                  ++N  +              
Sbjct: 40  GRGIVATADIEPDTVLFTIPRKDIICLENSQLFKEVDSSIFVNRGSEPLDEDDEDQDGGS 99

Query: 127 SLCSGLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
           S   G   W ++ L ++ E  + D SPWR YLD+LP   ++ +FW+ EE+ ELQ + +++
Sbjct: 100 SHQRGQNSWTTLILVMMYEHLRRDPSPWRPYLDVLPTEFETPMFWTSEEIAELQASPVVA 159

Query: 186 TTLGVKE---YVQNEYLKVEEEIILPNKQLFPRPITLDD--------------FLWAFGI 228
            ++G +E    ++ + L V    I  N  +F      DD                ++F +
Sbjct: 160 -SIGREEADVMIRTKILPV----IKENAAVFGGVTDADDERLVQLAHQMGSTIMAYSFDL 214

Query: 229 LRSRAFSRL---------RGQNLV-LIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLL 277
                              G+ ++ ++P+AD++N         +++ E   A    +   
Sbjct: 215 EGEGDEEDEDEDGWVEDREGRTMMGMVPMADILNADAEFNAHINHSEEALVATALRK--- 271

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR------NAYTLT------L 325
                   + AGE++L  Y     N +L   YG++  K  R       A ++T      L
Sbjct: 272 --------IPAGEEILNYYG-PLPNGQLLRRYGYVTEKHSRYDVVELTAASITSSIRSRL 322

Query: 326 EISESDPFFGDKLDIAETNGL-------GESAYFDIVLGRTLPPAMLQYLRLVALGGTDA 378
            +++ D    DK+D A    +          A  ++     L  A         L GT +
Sbjct: 323 NLAQQD---WDKIDEARAKSMFGDDSEQVHGAMVEVEDSYPLERACEDPDEFGQLQGTPS 379

Query: 379 F---------LLESIFRNTIWGHLD-LPVSHANEELICRVVRDACKSALSGFHTTIEEDE 428
           F         + ++ F+       D +P     + +I   VR+A  + L+ + TT+EED 
Sbjct: 380 FSELPEELLDMAKARFKEIKTIRPDAVPDKKFRDRIIYSAVREAASAKLAEYGTTLEED- 438

Query: 429 QLLQGGNLD---PRLRIAVGVRAGEKRVLQQ--------IDEIFKERELELDELEYYQER 477
            L +   LD   PRLR+A+ VR GEK++L +        I +     ++E+DE    ++R
Sbjct: 439 -LAKLSTLDSNQPRLRMAMEVRIGEKKLLHEAIAWLDGKIGDGAANGDMEVDEPASKRQR 497

Query: 478 R 478
           R
Sbjct: 498 R 498


>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 247

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 69/236 (29%), Positives = 97/236 (41%), Gaps = 35/236 (14%)

Query: 240 QNLVLIPLADLINHSPGITTE---DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
           +   L P+ DLINH  GI ++   +Y Y             F++ T     AGEQVLI Y
Sbjct: 8   KRYALTPVVDLINHQSGIDSDVSYNYFYG-----------YFAVTTQRGWTAGEQVLISY 56

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISE-SDPFFGDKLDIAETNGLGESAYFDIV 355
              +SN  L   YGF+E  +  + Y +T  I + SD    D + +   +G       D  
Sbjct: 57  G-PRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVRVLRESGGKLGTTGDNA 115

Query: 356 LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP---------------VSHA 400
            GR  P   +   R   LG  +   +  +FR  +     LP               +S A
Sbjct: 116 EGR--PVESVTVGRSGLLGEKEEGRVMPVFRLAVVKDDQLPEGKAAGISLKDFSNEISPA 173

Query: 401 NEELICRVVRDACKSALSGFHTTIEEDEQLLQ--GGNLDPRLRIAVGVRAGEKRVL 454
           NE      +R  C     GF TT+ EDE  L   G +L  + R+A   R  +KRVL
Sbjct: 174 NEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVAFSFRMEKKRVL 229


>gi|405118508|gb|AFR93282.1| hypothetical protein CNAG_03776 [Cryptococcus neoformans var.
           grubii H99]
          Length = 532

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/235 (27%), Positives = 107/235 (45%), Gaps = 27/235 (11%)

Query: 59  PPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           PP ++     +WL    V  SP   I+P +   G  +VA+RDI   E++L +P    ++ 
Sbjct: 10  PPPSRA-CLLEWLAANLVYTSPSLTIQPMSASSGYRIVAERDIEIGELLLSLPKYSILSH 68

Query: 118 DTVAASEIGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDST--VFWSEEE 174
            T + S I  L S     ++++L L+ E +   +S +  YL  LP   ++   VFW   +
Sbjct: 69  QTASLSAITHLASASHTILNLSLCLLHEIRLFTNSRFYGYLQSLPRDMEAGLPVFWKIGK 128

Query: 175 LVELQ----GTQLLSTTLGVKEYVQNE------------YLKVEEEIILPNKQLFPRPIT 218
            VE++    G Q L +T   KE  ++E            YL+     +LP     P P  
Sbjct: 129 GVEVEDGERGLQWLKSTEAEKELRKSEKQGLSLSDIYAFYLRTSH--LLPPTSTDPLPSP 186

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP----GITTEDYAYEIKGA 269
              F+ A+ ++ +RAF         L P ADL+NHS      + ++D+   + G+
Sbjct: 187 FLAFVHAYTLISTRAFLIDLYHLTALCPFADLLNHSAFPHTCLASDDFVCYMCGS 241


>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
 gi|194701488|gb|ACF84828.1| unknown [Zea mays]
 gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
          Length = 495

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 91/190 (47%), Gaps = 4/190 (2%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +W+R   VV   +     + P G+ + A   +   ++V  +P    + P T  A+   
Sbjct: 15  FKRWMRAHGVVCSDALSLDVSDPLGVHVRAATPLRDGDLVATIPRGACLTPRTTGAAAAI 74

Query: 127 SLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFW-SEEELVELQGTQLL 184
              + L   +++ + ++ E+ +  DSPW  YL +LP+C    + W + E    L GT+L 
Sbjct: 75  EA-AELGGCLALTVAVMYERAQGADSPWDAYLQLLPDCESVPLVWPAGEAECLLAGTELD 133

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
                 KE++  ++ +  E ++L  +  + P   +L+ +L A  ++ SR+F         
Sbjct: 134 KIVKQDKEFLCEDWKECIEPLMLSGELDVDPDDFSLEKYLSAKTLVSSRSFQIDSYHGSG 193

Query: 244 LIPLADLINH 253
           ++PLADL NH
Sbjct: 194 MVPLADLFNH 203


>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
 gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
 gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
          Length = 489

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 127/307 (41%), Gaps = 64/307 (20%)

Query: 54  SATTNPP-----TAQVETFWQWLRDQKVVSPKSPIRPA---TFPEGLGLVAQRDIAKNEV 105
           S+TT+ P       Q   F  WL  +  V     I+ A   +   G G+VAQ D+ + E 
Sbjct: 2   SSTTHFPDPGNFQHQSNEFLTWLAGKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEE 61

Query: 106 VLEVPMKFWINPDTVAASEIGSLCS----GLKPWISVALFLIREKKKED-SPWRVYLDIL 160
           +  +P    +   +V  S + +L S     L PW+S+ + +I E  + D S W  Y  +L
Sbjct: 62  LFTIPRAHVL---SVQNSNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVL 118

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI--- 217
           P   D+ +FWS  EL ELQG+ ++   +G K+  +   L+    I+  N  LFP PI   
Sbjct: 119 PRNFDTLMFWSASELEELQGSAIVE-KIG-KQGAEESILETIAPIVRANPALFP-PIDGV 175

Query: 218 -----------------TLDDFL--WAFGILRSRAFSRLRGQNLV-------------LI 245
                            T+   +  +AF I +       R                  ++
Sbjct: 176 ASYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEEEEQSSKGMV 235

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           PLADL+N            +   A LF  + +  ++   P+KAGE++   Y      ++L
Sbjct: 236 PLADLLNAD---------ADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDYG-EIPRSDL 285

Query: 306 ALDYGFI 312
              YG++
Sbjct: 286 LRRYGYV 292


>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/309 (23%), Positives = 123/309 (39%), Gaps = 61/309 (19%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           Q + F  WL  +  V   S I  A       G G+VA+ DI   E +  +P    +   +
Sbjct: 16  QSDEFITWLAGKPGVRINSKINIADLRSKSAGRGVVARSDIFDGEELFSIPRGLVL---S 72

Query: 120 VAASEIGSLCSG----LKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEE 174
              S++  L S     L PW+S+ L ++ E    E S W  Y  +LP+  D+ +FWS  E
Sbjct: 73  AQNSKLKDLLSQDLEELGPWLSLILVMMYEYLLGEQSAWAPYFKVLPKSFDTLMFWSPSE 132

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF 234
           L ELQG+ ++S     KE  ++  +++   ++  N  LFP    L  +    G       
Sbjct: 133 LQELQGSAIVSKI--GKEGAEDSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGL 190

Query: 235 SRLRGQNLV-----------------------------------LIPLADLINHSPGITT 259
           + + G  ++                                   ++PLAD++N       
Sbjct: 191 AHIMGSLIMAYAFDIEKAEDEDDEDNDEEEGYVTDDEQDQSSKGMVPLADILNADA---- 246

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
                +   A LF  +    ++   P+ AGE++   Y      A+L   YG++   +D  
Sbjct: 247 -----DRNNARLFQEEDSLVMKAIKPIHAGEEIFNDYG-ELPRADLLRRYGYV---TDNY 297

Query: 320 AYTLTLEIS 328
           A+   +E+S
Sbjct: 298 AHYDVVELS 306


>gi|325186836|emb|CCA21381.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 473

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 126/286 (44%), Gaps = 50/286 (17%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASE---------IGSLCSGL--KPWISVALF 141
           G+    DIA   VV ++P+    +  T++  E         + S+ + +  +  + +A+ 
Sbjct: 63  GVYIAEDIAPQTVVFQIPLTSLYSLATISKEEPSYEIIEVFVNSISTDIEEREEMLLAII 122

Query: 142 LIREKK--KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
           L+ E    + +S W  +L+ILP+   + +++S +E+  L GT L      ++E +  +Y 
Sbjct: 123 LLFEMYVLQSESKWAHHLEILPKEHRNLLYYSSDEVKALDGTNLYYVAHEMQERLHEDYE 182

Query: 200 KVEEEIILPNKQLFPRPI----------TLDDFLWAFGILRSRAFSRLRGQNLV------ 243
            +E  ++   K +    +          +  ++ WA  I+ SR  S    Q LV      
Sbjct: 183 FIETRVLPELKHILKHILSPSVSATTVFSFANYKWALSIIWSRFVSIEIDQELVSTLPFT 242

Query: 244 -----------LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                      ++P+ D++NH P      + Y+           +F L T   + AG Q+
Sbjct: 243 IDPTKKHCVKAMVPVFDMLNHDPKAEM-THKYDAASG-------MFQLTTHQHLAAGTQL 294

Query: 293 LIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-SESDPFFGDK 337
            I Y    ++A LAL YGF+ S +  +   + L++ S++  F+ +K
Sbjct: 295 HINYGPLSNHALLAL-YGFMHSHNPHDTVEVHLQMESDNTSFYEEK 339


>gi|212532027|ref|XP_002146170.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210071534|gb|EEA25623.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 497

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 85/188 (45%), Gaps = 13/188 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  Y+  LP  T     W+E+ELV L GT L          ++ E+ ++ +    I  
Sbjct: 122 NPWSEYIKFLPSETLLPTLWTEDELVLLYGTSLKDAVDHKLSALEAEFDRLRDATRSIAW 181

Query: 208 PNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
             ++ +      +TLDD+     + RSRA   L G   V++P  D+ NH+ G  T    Y
Sbjct: 182 CEREWWDEETGQLTLDDWKIVDAMYRSRALD-LPGSGHVMVPCVDMANHASGEETVAL-Y 239

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE-SKSDRNAYTL 323
           E        R+ +  LR    +K  E+V I Y   K  +E+   YGF+E S  D     L
Sbjct: 240 ETDK----ERNAVLQLRWGKKLKREEEVTITYGDEKGASEMVFSYGFLENSVEDARQLFL 295

Query: 324 TLEISESD 331
            ++I + D
Sbjct: 296 PIDIPDDD 303


>gi|303322220|ref|XP_003071103.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240110802|gb|EER28958.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
          Length = 469

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 109/441 (24%), Positives = 182/441 (41%), Gaps = 81/441 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           F  WL+    V     IR A       G G+VA  +I ++E +  +P    +   +VA S
Sbjct: 16  FMGWLKQHPGVRVNPKIRIADLRPTGAGRGVVACEEIVQDEELFAIPEDLVL---SVANS 72

Query: 124 EIGSLCS----GLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVEL 178
           +I    +        W+S+ + +I E  +   S W  Y  +LP   D+ +FW+E EL EL
Sbjct: 73  KIKDRINFADENFDTWLSLIVTMIFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENELREL 132

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF---------------- 222
           QG+ +L   +G +E  Q    KV   ++L +  LFP    L  F                
Sbjct: 133 QGSSVLD-KIGKQEADQVILDKV-LPVVLEHPDLFPPVNGLASFDSPCGKEVVLQLAHRM 190

Query: 223 -----LWAFGILRSRAFSRLRGQNLV----------LIPLADLINHSPGITTEDYAYEIK 267
                 +AF I   +   +      V          ++PLADL+N               
Sbjct: 191 GTLIMAYAFDIEMDQDEDQDGEDGYVTDDEQEKAKGMVPLADLLNADA---------HRN 241

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFI-ESKSDRNAYTLTL 325
            A LF  D  F +R+  P+    ++   Y +L +S+  L   YG+I E+ +  +   ++L
Sbjct: 242 NARLFQEDGYFIMRSIAPISIEMEIFNDYGELPRSD--LLRRYGYITENYAPYDVVEISL 299

Query: 326 EISESDPFFGD---KLDIAETNGLGESAYF------DIVLGRTLPPAMLQYLRLVALGGT 376
           E   +     +   +L++ E  G+ E  Y       D +    +P  +L  LR +     
Sbjct: 300 EAICNIAGVEEGCCQLELLEDAGVLEDGYALSRPEGDAITTEAIPAELLILLRTLRSPRK 359

Query: 377 DAFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ-- 432
           D   L+ + +        LD P +     L+  ++++  K     + TTI +D++LLQ  
Sbjct: 360 D---LDQMRKKGKLPKAELDKPCAR----LLVEILQNRQKD----YPTTIAQDDELLQAL 408

Query: 433 GGNLDPRLRIAVGVRAGEKRV 453
           G +   R  +AV VR GEK +
Sbjct: 409 GSSQVHRREMAVRVRKGEKEI 429


>gi|320034953|gb|EFW16895.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 469

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 109/441 (24%), Positives = 182/441 (41%), Gaps = 81/441 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           F  WL+    V     IR A       G G+VA  +I ++E +  +P    +   +VA S
Sbjct: 16  FMGWLKQHPGVRVNPKIRIADLRPTGAGRGVVACEEIVQDEELFAIPEDLVL---SVANS 72

Query: 124 EIGSLCS----GLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVEL 178
           +I    +        W+S+ + +I E  +   S W  Y  +LP   D+ +FW+E EL EL
Sbjct: 73  KIKDRINFADENFDTWLSLIVTMIFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENELREL 132

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF---------------- 222
           QG+ +L   +G +E  Q    KV   ++L +  LFP    L  F                
Sbjct: 133 QGSSVLD-KIGKQETDQVILDKV-LPVVLEHPDLFPPVNGLASFDSPSGKEVVLQLAHRM 190

Query: 223 -----LWAFGILRSRAFSRLRGQNLV----------LIPLADLINHSPGITTEDYAYEIK 267
                 +AF I   +   +      V          ++PLADL+N               
Sbjct: 191 GTLIMAYAFDIEMDQDEDQDGEDGYVTDDEQEKAKGMVPLADLLNADA---------HRN 241

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFI-ESKSDRNAYTLTL 325
            A LF  D  F +R+  P+    ++   Y +L +S+  L   YG+I E+ +  +   ++L
Sbjct: 242 NARLFQEDGYFIMRSIAPISIEMEIFNDYGELPRSD--LLRRYGYITENYAPYDVVEISL 299

Query: 326 EISESDPFFGD---KLDIAETNGLGESAYF------DIVLGRTLPPAMLQYLRLVALGGT 376
           E   +     +   +L++ E  G+ E  Y       D +    +P  +L  LR +     
Sbjct: 300 EAICNIAGVEEGCCQLELLEDAGVLEDGYALSRPEGDAITTEAIPAELLILLRTLRSPRK 359

Query: 377 DAFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ-- 432
           D   L+ + +        LD P +     L+  ++++  K     + TTI +D++LLQ  
Sbjct: 360 D---LDQMRKKGKLPKAELDKPCAR----LLVEILQNRQKD----YPTTIAQDDELLQAL 408

Query: 433 GGNLDPRLRIAVGVRAGEKRV 453
           G +   R  +AV VR GEK +
Sbjct: 409 GSSQVHRREMAVRVRKGEKEI 429


>gi|42820762|emb|CAF32075.1| SET domain protein, putative [Aspergillus fumigatus]
          Length = 530

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 91/203 (44%), Gaps = 19/203 (9%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           R+K    +PW  Y+  +P       F+S EE   L+GT L   +   LG  E   +   +
Sbjct: 153 RQKIGISNPWTEYIRFMPASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQ 212

Query: 201 VEEEIILPNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
             EEI    +  +       T DD+ +   + RSR     R  +  ++P  D+ NH+   
Sbjct: 213 ATEEIPWCQEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSGH-AIVPCVDMANHACED 271

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQ------VLIQYDLNKSNAELALDYGF 311
           + +   Y+ +GAG    + +  LRT   ++ GE+      V  +Y   K  +E+   YGF
Sbjct: 272 SVK-ARYDEEGAG----NAVLQLRTGKKLRVGEEKLHADAVACRYGDEKPASEMVFSYGF 326

Query: 312 IES-KSDRNAYTLTLEISESDPF 333
           +E+ ++D     L LEI + DP 
Sbjct: 327 VENERTDAKQIFLDLEIPDDDPL 349


>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 488

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 115/484 (23%), Positives = 189/484 (39%), Gaps = 102/484 (21%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           E F  WL+    V     I+ A       G G+VA  DI K E +  +P    ++     
Sbjct: 18  EEFLDWLKQSPGVRVNPKIKIADLRSEGAGRGIVAYDDINKEEELFAIPQGLVLSFQNSK 77

Query: 122 ASEIGSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
             ++  +    L  W+ + L +I E  +   SPW  Y  +LP   D+ +FW++ EL+EL+
Sbjct: 78  LKDLMEINERDLGQWLCLILVMIYEYLQGVASPWAPYFKVLPTDFDTLMFWTDAELLELK 137

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF----------------- 222
           G+ +L   +G K   +  +L+    ++  N +LFP    L  +                 
Sbjct: 138 GSAVLG-RIG-KSAAEEVFLRDLLPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMG 195

Query: 223 ----LWAFGILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDYAYEIK 267
                +AF +    A         V           +IPLADL+N            +  
Sbjct: 196 SLIMSYAFDVKNDEAEEVEGEGGYVTDDEERQLPKGMIPLADLLNADA---------DRN 246

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--------ESKSDRN 319
            A LF  D   ++++   ++ GE++   Y      AEL   YG++        E++    
Sbjct: 247 NACLFQEDGYLAMKSIKSIRKGEEIFNDYG-ELPRAELLRRYGYVTDNYAQYDEAEVPIQ 305

Query: 320 AYTLTLEISESDPF-------FGDKLDIAETNGLG-ESAYFDIVLGRTLPPAMLQYLRLV 371
                  +  S P        F D L++ + +G G         L  TLP  +L  L ++
Sbjct: 306 TICKVAGLKSSTPGPDEPRLEFLDDLEVLD-DGYGIPRPDRSTPLAETLPTELLVVLNIL 364

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLP---VSHANEELICRVVRDACKSALSGFHTTIEEDE 428
            +       LE    N +     +P   +  A   L+  VVR      L  + TT+ +D+
Sbjct: 365 VMP------LEQF--NQLKQKSKVPKPALGIAEATLLDEVVR----LILGEYPTTVAQDK 412

Query: 429 QLLQG-----GNLDP----RLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYY--QER 477
           +LL       G+  P    RL++A+ VR GEK +L  +          L ELE +  Q R
Sbjct: 413 ELLASCANNQGSTSPKSAGRLKMALQVRKGEKEILNAV----------LSELEDFIVQHR 462

Query: 478 RLKD 481
           R+ D
Sbjct: 463 RVSD 466


>gi|440465458|gb|ELQ34778.1| hypothetical protein OOU_Y34scaffold00745g53 [Magnaporthe oryzae
           Y34]
 gi|440487677|gb|ELQ67452.1| hypothetical protein OOW_P131scaffold00314g25 [Magnaporthe oryzae
           P131]
          Length = 544

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 163/387 (42%), Gaps = 83/387 (21%)

Query: 131 GLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG 189
           G   W ++ L ++ E  + D SPWR YLD+LP   ++ +FW+ EE+ ELQ + +++ ++G
Sbjct: 69  GQNSWTTLILVMMYEHLRRDPSPWRPYLDVLPTEFETPMFWTSEEIAELQASPVVA-SIG 127

Query: 190 VKE---YVQNEYLKVEEEIILPNKQLFPRPITLDD--------------FLWAFGILRSR 232
            +E    ++ + L V +E    N  +F      DD                ++F +    
Sbjct: 128 REEADVMIRTKILPVIKE----NAAVFGGVTDADDERLVQLAHQMGSTIMAYSFDLEGEG 183

Query: 233 AFSRL---------RGQNLV-LIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLLFSLR 281
                          G+ ++ ++P+AD++N         +++ E   A    +       
Sbjct: 184 DEEDEDEDGWVEDREGRTMMGMVPMADILNADAEFNAHINHSEEALVATALRK------- 236

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR------NAYTLT------LEISE 329
               + AGE++L  Y     N +L   YG++  K  R       A ++T      L +++
Sbjct: 237 ----IPAGEEILNYYG-PLPNGQLLRRYGYVTEKHSRYDVVELTAASITSSIRSRLNLAQ 291

Query: 330 SDPFFGDKLDIAETNGL-------GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAF--- 379
            D    DK+D A    +          A  ++     L  A         L GT +F   
Sbjct: 292 QD---WDKIDEARAKSMFGDDSEQVHGAMVEVEDSYPLERACEDPDEFGQLQGTPSFSEL 348

Query: 380 ------LLESIFRNTIWGHLD-LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
                 + ++ F+       D +P     + +I   VR+A  + L+ + TT+EED  L +
Sbjct: 349 PEELLDMAKARFKEIKTIRPDAVPDKKFRDRIIYSAVREAASAKLAEYGTTLEED--LAK 406

Query: 433 GGNLD---PRLRIAVGVRAGEKRVLQQ 456
              LD   PRLR+A+ VR GEK++L +
Sbjct: 407 LSTLDSNQPRLRMAMEVRIGEKKLLHE 433


>gi|413951743|gb|AFW84392.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
          Length = 281

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 91/190 (47%), Gaps = 4/190 (2%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +W+R   VV   +     + P G+ + A   +   ++V  +P    + P T  A+   
Sbjct: 15  FKRWMRAHGVVCSDALSLDVSDPLGVHVRAATPLRDGDLVATIPRGACLTPRTTGAAAAI 74

Query: 127 SLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFW-SEEELVELQGTQLL 184
              + L   +++ + ++ E+ +  DSPW  YL +LP+C    + W + E    L GT+L 
Sbjct: 75  EA-AELGGCLALTVAVMYERAQGADSPWDAYLQLLPDCESVPLVWPAGEAECLLAGTELD 133

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
                 KE++  ++ +  E ++L  +  + P   +L+ +L A  ++ SR+F         
Sbjct: 134 KIVKQDKEFLCEDWKECIEPLMLSGELDVDPDDFSLEKYLSAKTLVSSRSFQIDSYHGSG 193

Query: 244 LIPLADLINH 253
           ++PLADL NH
Sbjct: 194 MVPLADLFNH 203


>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
           972h-]
 gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
 gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
           pombe]
          Length = 547

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 70/265 (26%), Positives = 114/265 (43%), Gaps = 35/265 (13%)

Query: 95  VAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK---PWISVALFLIREK-KKED 150
           VAQ DIA +++++  P ++ I  +  A  E+  L    +   P I++  FL  E  K   
Sbjct: 37  VAQNDIAPDQLLISCPFEYAITYNK-AKEELKKLNPNFESCNPHITLCTFLALESLKGIQ 95

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           S W  Y++ LP+  ++ ++++E +   L  T   S         ++EY   +E +     
Sbjct: 96  SKWYGYIEYLPKTFNTPLYFNENDNAFLISTNAYSAAQERLHIWKHEY---QEAL----- 147

Query: 211 QLFPRP---ITLDDFLWAFGILRSRAFS-----RLRGQNLVLIPLADLINHSPG---ITT 259
            L P P    T D ++W+  +  SR FS     +      +L+PL D +NH P    +  
Sbjct: 148 SLHPSPTERFTFDLYIWSATVFSSRCFSSNLIYKDSESTPILLPLIDSLNHKPKQPILWN 207

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
            D+  E K   L S++L         V  G Q+   Y   K N EL + YGF    +  +
Sbjct: 208 SDFQDE-KSVQLISQEL---------VAKGNQLFNNYG-PKGNEELLMGYGFCLPDNPFD 256

Query: 320 AYTLTLEISESDPFFGDKLDIAETN 344
             TL + I    P    K  I E +
Sbjct: 257 TVTLKVAIHPDLPHKDQKAAILEND 281


>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 488

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 115/484 (23%), Positives = 198/484 (40%), Gaps = 102/484 (21%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           E F  WL+    V     I+ A       G G+VA  DI + E +  +P    ++     
Sbjct: 18  EEFMDWLKQSPGVRVNPKIKIADLRSEGAGRGIVAYDDINEEEELFAIPQGLVLSFQNSK 77

Query: 122 ASEIGSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
             ++  +    L  W+ + L +I E  +   SPW  Y  +LP   D+ +FW++ EL+EL+
Sbjct: 78  LKDLMEINERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTDFDTLMFWTDAELLELK 137

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG------------ 227
           G+ +L   +G K   +  +L+    ++  N +LFP    L  +    G            
Sbjct: 138 GSAVLG-RIG-KSTAEEVFLRDLLPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMG 195

Query: 228 -ILRSRAFS-------RLRGQNLV------------LIPLADLINHSPGITTEDYAYEIK 267
            ++ S AF         + G++              +IPLADL+N            +  
Sbjct: 196 SLIMSYAFDVENDEAEEVEGEDGYVTDDEERQLPKGMIPLADLLNADA---------DRN 246

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ESKSDRNAYTLTLE 326
            A LF  D   S+++   ++ GE++   Y      AEL   YG++ +S +  +   + ++
Sbjct: 247 NARLFQEDGYLSMKSIKSIRKGEEIFNDYG-ELPRAELLRRYGYVTDSYAQYDEAEVPIQ 305

Query: 327 -------ISESDPF-------FGDKLDIAETNGLGESAY-FDIVLGRTLPPAMLQYLRLV 371
                  +  S P        F D L++ + +G G   +     L  TLP  +L  L ++
Sbjct: 306 TICRVAGLKSSTPGPDEPRLEFLDDLEVLD-DGYGIPRHDRSTPLAETLPTELLVVLNIL 364

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLP---VSHANEELICRVVRDACKSALSGFHTTIEEDE 428
            +       LE    N +     +P   +  A   L+  VVR      L  + TT+ +D+
Sbjct: 365 VMP------LEQF--NQLKQKSKVPKPALGIAEATLLDEVVR----LILGEYPTTVAQDK 412

Query: 429 QLLQG-----GNLDP----RLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYY--QER 477
           +LL       G+  P    RL++A+ VR GEK +L  +          L ELE +  Q R
Sbjct: 413 ELLASCANYQGSTSPISAGRLKMALQVRKGEKEILNAV----------LSELEDFIAQHR 462

Query: 478 RLKD 481
           ++ D
Sbjct: 463 QVSD 466


>gi|363749087|ref|XP_003644761.1| hypothetical protein Ecym_2195 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356888394|gb|AET37944.1| Hypothetical protein Ecym_2195 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 589

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 90/384 (23%), Positives = 151/384 (39%), Gaps = 96/384 (25%)

Query: 156 YLDILPECTDSTVFWSEEELVELQGTQL-------LSTTLGVKEYVQNEY-----LKVEE 203
           Y++ LPE  DS + W  +EL  L GT +       L   +    YV NE        V+E
Sbjct: 102 YMNCLPEEVDSPLIWKPDELELLDGTNIGGSISEKLDLIVNDWRYVINELGFDVPQAVQE 161

Query: 204 EIILPNKQLF----------------------PRPITLDDFLWAFGILRSRAF------S 235
           +++   + L                       P  ++   F+W+  I  SRAF      S
Sbjct: 162 QLVFAARMLAEPNDVSKDEIYNMLIKHPKDGEPHWLSFQAFIWSHLIFTSRAFPERILNS 221

Query: 236 RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQ 295
                N++L+P+ DL+NHS          +I+ AG    + +FS R   PV+ G+++   
Sbjct: 222 TCEISNVILLPILDLLNHSQ-------HSKIEWAG---DNGVFSFRKLEPVEVGDEIFNN 271

Query: 296 YDLNKSNAELALDYGFI--ESKSDRNAYT---------------LTLEISESDPFFGDKL 338
           Y   KSN EL + YGF+  ++K D  A                 L+L I +    F    
Sbjct: 272 YG-GKSNEELLVGYGFVIEDNKCDYLALKIKPPFPVLKSMLKAGLSLPILDDYTTFAFDT 330

Query: 339 DIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVS 398
            I + N    S Y D +L                      +L+       +W   DL   
Sbjct: 331 SIEKNNTTDISRYSDGIL----------------------YLINKHNDQLLW---DLLSV 365

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQID 458
           ++  E       ++ +  L G     E  EQ L+  N+ PR R +  ++        +  
Sbjct: 366 YSFLEAGDEEDHNSLRCQLQGIQNLRESIEQRLKTTNVAPRCRASDSLKYQILSRRARYA 425

Query: 459 EIFKERELELDELEYYQERRLKDL 482
           E+++++++++ +   Y  +RLKDL
Sbjct: 426 EVYRQQQVQILK---YSIQRLKDL 446


>gi|19114522|ref|NP_593610.1| lysine methyltransferase Set8 (predicted) [Schizosaccharomyces
           pombe 972h-]
 gi|74626633|sp|O14135.1|SET8_SCHPO RecName: Full=SET domain-containing protein 8
 gi|2462684|emb|CAB16739.1| lysine methyltransferase Set8 (predicted) [Schizosaccharomyces
           pombe]
          Length = 429

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 114/268 (42%), Gaps = 34/268 (12%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV-----AASEIGSLCSGLKPWISVAL---- 140
           +G G+ +        V+LEVP++  I   TV     +  +  S+ + L+ W  ++     
Sbjct: 27  KGNGIFSLNRYTSGTVLLEVPLENIICRKTVEQFRNSCDKFASIAT-LEEWNDMSFRTQA 85

Query: 141 -----FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
                +L    +   + W  +L +LP   ++   W E+E+  LQGT + +     ++ +Q
Sbjct: 86  MLFLCYLWLGIQPRTNKWDKFLTVLPLSINTPAQWPEKEVYSLQGTSIFNPVCVKRKILQ 145

Query: 196 NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP 255
            E+L + +         +P  ITL  ++ A  +  SR       ++ VL P+ DL NHS 
Sbjct: 146 QEWLSLNQRY----SDSWPSKITLPKWVHADALFHSRCLES-PFKDPVLAPVIDLCNHSS 200

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ES 314
                    +      FS D +  L     +   E+V I Y   K +AE    YGF+ E 
Sbjct: 201 ---------KSNAKWSFSEDAM-QLYLDKDIDENEEVTINYGSEKGSAEFLFSYGFLPEP 250

Query: 315 KSDRNAYTLTLEISESDPFFGDKLDIAE 342
           + DR    + L I E      D LD+A+
Sbjct: 251 EGDRITNVMKLLIPEDS---NDSLDLAK 275


>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
 gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
          Length = 393

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 95/194 (48%), Gaps = 21/194 (10%)

Query: 128 LCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVELQGTQLLS 185
           L S +KPW  +AL ++ E+ K  + W  Y+  LP+    D+T  W + EL  L+ + L  
Sbjct: 157 LPSTVKPWTKLALIVLMERYKGQAIWAPYISCLPQPAELDNTFRWEDTELSYLRASPLYG 216

Query: 186 TTLGVKEYVQNEYLKVE--------EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL 237
                 E +  E+ +V+        E+ +    QLF + ++L+D    +  + SR+ + +
Sbjct: 217 KARERLEMITTEFGQVQNDFCTCVLEQALDVWPQLFGK-VSLEDLKHVYATVFSRSLA-I 274

Query: 238 RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
              +  LIP+ D  NH+       +A ++   GL +  ++ + R        +Q+ I Y 
Sbjct: 275 GEDSTTLIPMLDFFNHN----ATSFA-KLSFNGLLNYAVVTADRD---YAENDQIWINYG 326

Query: 298 LNKSNAELALDYGF 311
            + SNAELALDYGF
Sbjct: 327 -DLSNAELALDYGF 339


>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 489

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 156/369 (42%), Gaps = 85/369 (23%)

Query: 54  SATTNPP-----TAQVETFWQWLRDQKVVSPKSPIRPA---TFPEGLGLVAQRDIAKNEV 105
           S+TT+ P       Q   F  WL  +  V     I+ A   +   G G+VAQ D+ + E 
Sbjct: 2   SSTTHFPDPGNFQHQSNEFLTWLAGKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEE 61

Query: 106 VLEVPMKFWINPDTVAASEIGSLCS----GLKPWISVALFLIREKKKED-SPWRVYLDIL 160
           +  +P    +   +V  S + +L S     L PW+S+ + +I E  + D S W  Y  +L
Sbjct: 62  LFTIPRAHVL---SVQNSNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVL 118

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI--- 217
           P   D+ +FWS  EL ELQG+ ++   +G K+  +   ++    I+  N  LFP PI   
Sbjct: 119 PRNFDTLMFWSASELEELQGSAIVE-KIG-KQGAEGSIIESIAPIVRANPALFP-PIDGV 175

Query: 218 -----------------TLDDFL--WAFGILRSRAFSRLR-GQNLVL------------I 245
                            T+   +  +AF I +       R G++  L            +
Sbjct: 176 ASYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEEGDRDGEDGYLTDEEEEQSSKGMV 235

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           PLADL+N            +   A LF  + +  ++   P+K+GE++   Y      ++L
Sbjct: 236 PLADLLNAD---------ADRNNARLFQEEEVLVMKAIKPIKSGEEIFNDYG-EIPRSDL 285

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLD-IAETNGLGESAYFDIVLGRTLPPAM 364
              YG++   +D  A    +E+S         LD I +  GLG +   DI    T PP  
Sbjct: 286 LRRYGYV---TDNYAQYDVVELS---------LDQICQAAGLGSA---DI---ETQPP-- 325

Query: 365 LQYLRLVAL 373
           LQ+L  + L
Sbjct: 326 LQFLEDLEL 334


>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
          Length = 591

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 124/281 (44%), Gaps = 26/281 (9%)

Query: 65  ETFWQWLRDQ-----KVVSP--KSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           + F  WLR+      K+  P  + P +P     G+ +++  DIA  E +  +P K  ++ 
Sbjct: 9   QRFLAWLREHGAKFDKLEWPVYRWPGKPHDGERGVRVIS--DIAPCEEMFSIPEKILMSR 66

Query: 118 DTVAASEIGSLCSGLKPWI-------SVALFLIREK--KKEDSPWRVYLDILPECTDSTV 168
            +  AS I  +    K  +       ++ L ++ EK  +   S W+  +DILP    +  
Sbjct: 67  KSCMASSIAHVFRKHKDVLFSSRDELALTLLILYEKLDQGNASFWKPMIDILPADPGAAS 126

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFLWAFG 227
            WSEEEL ELQ   L +  + V   +Q  Y +V   I++ +  +F     T ++F WA  
Sbjct: 127 KWSEEELQELQDESLKAEAMIVVASMQQTYQRVLRPILVQHGDVFSVDRYTWEEFRWALL 186

Query: 228 ILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVK 287
            + SR F R      + +P ADL+NH    T+  +  E + A       +         +
Sbjct: 187 CVESRTFGRFLPHPSI-VPFADLLNHVNVQTSYRWLPEERRAA-----YMCDASGEHVHR 240

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS 328
            GE+  + Y   +SNAEL L YGF    +   A  L   I+
Sbjct: 241 RGEEAFMSYG-PRSNAELLLHYGFALQSNRYEAVELNFRIN 280


>gi|405953717|gb|EKC21325.1| SET domain-containing protein 6 [Crassostrea gigas]
          Length = 384

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 67/280 (23%), Positives = 116/280 (41%), Gaps = 41/280 (14%)

Query: 61  TAQVETFWQWL--RDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           T +   F +W    D    S K  I P       G+VA  D+ + E +  +  K  ++P 
Sbjct: 27  TNRANAFLKWFSSNDDNFFSGKVTIGPDGSCAQNGMVAIADVQEGESLFRISRKILLHPK 86

Query: 119 TVAASEI--GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELV 176
           + + S +      +    W  + + +++E   +DSPW+ Y D+LPE  D  +FW+EEE  
Sbjct: 87  SSSISALFEKDPVNSESGWSELLICMMQEYNTKDSPWKPYFDVLPETVDLPMFWTEEERE 146

Query: 177 EL-QGTQLLSTT-------LGVKEYVQNEYLKVEEEII---LPNKQLFPRPITLDDFLWA 225
           +L  GT ++          L   + V + YLK  ++ I     + +L+ R ++   ++ A
Sbjct: 147 KLLTGTGVVEAVNRDNKKILTEFQSVVSPYLKKHKDTISESCDDLELYKRMVS---YVMA 203

Query: 226 FGILR----------SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGL-FSR 274
           +                         + ++P+AD++NH               A L F  
Sbjct: 204 YSFTEPPKDDDSDDFGEEDEEEEKSTIYMVPMADMLNHIAN----------NNAHLSFKP 253

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
           D L  + T   +K GE+V   Y    +N  L   YGF E+
Sbjct: 254 DCLEMIATK-DIKKGEEVFNTYG-ELANWHLLHMYGFSEA 291


>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
 gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
          Length = 573

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 79/334 (23%), Positives = 134/334 (40%), Gaps = 71/334 (21%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A+V  F +W R   V S    I   TFP   LGL A+RDIA  E+VL VP K   + + +
Sbjct: 125 AKVAAFNEWARAGGVQSDCVEI--TTFPGYQLGLRAKRDIAAEELVLSVPRKLIFSEELL 182

Query: 121 AASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
              +   L       ++V   LI EK +   S W+ ++D LP    + ++++ +++  L+
Sbjct: 183 PEWK-RELFRNFPTHLNVTYTLIIEKVRGAASAWQPFIDTLPTRYSTVLYFTVDQMQRLR 241

Query: 180 GTQLLSTTLG--------VKEYVQNEYLKVEEEIILPNKQLFPR-PITLDDFLWAFGILR 230
           GT   S  +              +  Y++  + ++     LF    +  + + WA   + 
Sbjct: 242 GTSACSAAMRHCLVIARLYASMYKCAYIQPGDNVMAAKANLFTEYGLCYELYRWAVSTVT 301

Query: 231 SRAFSRLRGQNLV---------------------------------------LIPLADLI 251
           +R       QNLV                                       LIP  D+ 
Sbjct: 302 TR-------QNLVPRELSTVGEVDQVCQLGGFEGTEIKRDAETGARNAPISALIPYWDMT 354

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH  G  T  Y    +     +++           KAGEQ  I Y  ++SNA+  + +GF
Sbjct: 355 NHRCGKITSYYDRAAQQMECTAQEAF---------KAGEQFFIYYG-DRSNADRLVHHGF 404

Query: 312 IESKSDRNAYTLTLEISESDPFFGDK-LDIAETN 344
           ++  + ++   + L +S +DP    + L +AE N
Sbjct: 405 LDMHNLKDYVQIRLGLSPTDPLVEQRSLLLAELN 438


>gi|149059900|gb|EDM10783.1| hypothetical protein RDA279, isoform CRA_c [Rattus norvegicus]
          Length = 314

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 100/217 (46%), Gaps = 22/217 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++K       + PA FP  G GL+++  + + +V++ +P    +  DTV  S +G 
Sbjct: 72  KWLKERKF-EDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRSSVGP 130

Query: 128 LCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
                KP +S    +  FL+ E+     S W+ YLDILP+     V   E E+V+L    
Sbjct: 131 YIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLPGP 189

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRL 237
           L +     +  VQ+ +     +     + LF   +    +   FLWA+  + +RA + + 
Sbjct: 190 LRAKAEEQRARVQDLFAS-SRDFFSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKS 248

Query: 238 RGQNLV--------LIPLADLINHSPGITTEDYAYEI 266
           R Q  +        L P  DL+NHSP +    +  E+
Sbjct: 249 RRQECLSSEPDTCALAPFLDLLNHSPHVQVRGWLGEV 285


>gi|119175557|ref|XP_001239983.1| hypothetical protein CIMG_09604 [Coccidioides immitis RS]
 gi|392864750|gb|EAS27347.2| SET domain-containing protein [Coccidioides immitis RS]
          Length = 502

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 89/199 (44%), Gaps = 13/199 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  Y+  LPE      F+++ E   LQGT L        + ++ E+ ++ E   +I  
Sbjct: 136 NPWTGYVQFLPESYTLPTFYTDGEFEILQGTSLKPALEAKLDSLEREFEQLRESTKDIAW 195

Query: 208 PNKQLF---PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
                +      +T  D+     + RSRA   + G    ++P  D+ NH+ G  T +  Y
Sbjct: 196 CKTNWWHGETGKLTFYDWKIVDSMYRSRAL-EVPGIGHAMVPCVDMANHASGEDT-NAMY 253

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-SDRNAYTL 323
           ++ G G     L +  R    ++ G++V I Y   K  AE+   YGF+E   SD     L
Sbjct: 254 DVDGNGNVVLQLHYGQR----LEEGDEVTITYGDEKGAAEMIFSYGFLEDTVSDARQLFL 309

Query: 324 TLEISESDPFFGDKLDIAE 342
            L+I + DP    KL + +
Sbjct: 310 DLQIPDDDPLKPAKLMVCD 328


>gi|303318433|ref|XP_003069216.1| conserved hypothetical protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240108902|gb|EER27071.1| conserved hypothetical protein [Coccidioides posadasii C735 delta
           SOWgp]
          Length = 502

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 90/199 (45%), Gaps = 13/199 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  Y+  LPE      F+++ E   LQGT L        + ++ E+ ++ E   +I  
Sbjct: 136 NPWTGYVQFLPESYTLPTFYTDGEFEILQGTSLKPALEAKLDSLEREFEQLRESTKDIAW 195

Query: 208 PNKQLF---PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
                +      +T  D+     + RSRA   + G    ++P  D+ NH+ G  T +  Y
Sbjct: 196 CKTNWWHGETGKLTFYDWKIVDSMYRSRAL-EVPGIGHAMVPCVDMANHASGEDT-NAMY 253

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-SDRNAYTL 323
           ++ G G    +++  L     ++ G++V I Y   K  AE+   YGF+E   SD     L
Sbjct: 254 DVDGNG----NVVLQLHYGQRLEEGDEVTITYGDEKGAAEMIFSYGFLEDTVSDARQLFL 309

Query: 324 TLEISESDPFFGDKLDIAE 342
            L+I + DP    KL + +
Sbjct: 310 DLQIPDDDPLKPAKLMVCD 328


>gi|168021415|ref|XP_001763237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685720|gb|EDQ72114.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 109/265 (41%), Gaps = 38/265 (14%)

Query: 83  IRPATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS--- 137
           IR ATFP+  G GL A RD+   E++L VP K  +N  +       +    L P +S   
Sbjct: 44  IRIATFPDAGGRGLAAARDLKLGELILRVPEKALMNGRSARLDAELTRALALYPSLSHVQ 103

Query: 138 -VALFLIREKKKEDSPWRV-YLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
            + + L+RE  K  +  R  YL  LP    +  F+S  E   LQ    +S   GV +  +
Sbjct: 104 VLCVHLLREIAKGRTSERFPYLVHLPRYYHTASFYSPFEAQALQVKDAVSMAEGVVQNSR 163

Query: 196 NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHS- 254
            E+L+     +L    L  R  TL  +LWAF  + SR       +   L P+ D  N++ 
Sbjct: 164 EEWLQARP--VLEKLGLGRRFCTLQGWLWAFATISSRTLYVPWDEAGTLCPVGDFFNYAC 221

Query: 255 PGI------TTEDYAYE----------IKGAGLFSRDLL-----------FSLRTPVPVK 287
           PG+      T +D                  G+  RD L           +        +
Sbjct: 222 PGVPYNLPPTAQDTQMREGDLISEEDVDTSGGIEIRDRLRDGGFEDERGEYCFYARQDYQ 281

Query: 288 AGEQVLIQYDLNKSNAELALDYGFI 312
            G+QVL+ Y    +N EL   YGF+
Sbjct: 282 EGQQVLLCYG-TYTNLELLEHYGFL 305


>gi|320039093|gb|EFW21028.1| hypothetical protein CPSG_02871 [Coccidioides posadasii str.
           Silveira]
          Length = 502

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 90/199 (45%), Gaps = 13/199 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  Y+  LPE      F+++ E   LQGT L        + ++ E+ ++ E   +I  
Sbjct: 136 NPWTGYVQFLPESYTLPTFYTDGEFEILQGTSLKPALEAKLDSLEREFEQLRESTKDIAW 195

Query: 208 PNKQLF---PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
                +      +T  D+     + RSRA   + G    ++P  D+ NH+ G  T +  Y
Sbjct: 196 CKTNWWHGETGKLTFYDWKIVDSMYRSRAL-EVPGIGHAMVPCVDMANHASGEDT-NAMY 253

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-SDRNAYTL 323
           ++ G G    +++  L     ++ G++V I Y   K  AE+   YGF+E   SD     L
Sbjct: 254 DVDGNG----NVVLQLHYGQRLEEGDEVTITYGDEKGAAEMIFSYGFLEDTVSDARQLFL 309

Query: 324 TLEISESDPFFGDKLDIAE 342
            L+I + DP    KL + +
Sbjct: 310 DLQIPDDDPLKPAKLMVCD 328


>gi|342879010|gb|EGU80287.1| hypothetical protein FOXB_09214 [Fusarium oxysporum Fo5176]
          Length = 530

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 85/195 (43%), Gaps = 15/195 (7%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK--VEEEIILP 208
           +PW  YL  LP        WSE E   LQGT L    L  K    N+     +E    LP
Sbjct: 156 TPWTEYLKFLPRDIPVPTMWSELERALLQGTSL-EVALDAKLSALNKEFDELIERSSALP 214

Query: 209 --NKQLFPR-PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
             N   + R  +T+DD++      RSR     R  +  ++P+ D+ NHS   T    AY 
Sbjct: 215 FWNSFFWEREAVTIDDWVLVDAWYRSRCLELPRSGH-AMVPVLDMANHSHSQT----AYY 269

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
            +       +++   R  + +  G++V I Y   KS AE+   YGFI+ +S     TL L
Sbjct: 270 DEDD---EDNVVLLPRPGMEISIGDEVTISYG-EKSPAEMIFSYGFIDRESTVEGLTLPL 325

Query: 326 EISESDPFFGDKLDI 340
           E    DP    KL I
Sbjct: 326 ESLADDPLGKAKLHI 340


>gi|119196663|ref|XP_001248935.1| hypothetical protein CIMG_02706 [Coccidioides immitis RS]
 gi|392861859|gb|EAS37551.2| SET domain-containing protein [Coccidioides immitis RS]
          Length = 469

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 108/441 (24%), Positives = 182/441 (41%), Gaps = 81/441 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           F  WL+    V     IR A       G G+VA  +I ++E +  +P    +   +VA S
Sbjct: 16  FMGWLKQHPGVRVNPKIRIADLRPTGAGRGVVACEEIVQDEELFAIPEDLVL---SVANS 72

Query: 124 EIGSLCS----GLKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVEL 178
           ++    +        W+S+ + +I E  +   S W  Y  +LP   D+ +FW+E EL EL
Sbjct: 73  KVKDRINFADENFDTWLSLIVTMIFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENELREL 132

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF---------------- 222
           QG+ +L   +G +E  Q    KV   ++L +  LFP    L  F                
Sbjct: 133 QGSSVLD-KIGKQEADQVILDKV-LPVVLEHPDLFPPVNGLASFDSPSGKEVVLQLAHRM 190

Query: 223 -----LWAFGILRSRAFSRLRGQNLV----------LIPLADLINHSPGITTEDYAYEIK 267
                 +AF I   +   +      V          ++PLADL+N               
Sbjct: 191 GTLIMAYAFDIEMDQDEDQDGEDGYVTDDEQEKAKGMVPLADLLNADA---------HRN 241

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGFI-ESKSDRNAYTLTL 325
            A LF  D  F +R+  P+    ++   Y +L +S+  L   YG+I E+ +  +   ++L
Sbjct: 242 NARLFQEDGYFIMRSIAPISIEMEIFNDYGELPRSD--LLRRYGYITENYAPYDVVEISL 299

Query: 326 EISESDPFFGD---KLDIAETNGLGESAYF------DIVLGRTLPPAMLQYLRLVALGGT 376
           E   +     +   +L++ E  G+ E  Y       D +    +P  +L  LR +     
Sbjct: 300 EAICNIAGVEEGCCQLELLEDAGVLEDGYALSRPEGDAITTEAIPAELLILLRTLRSPRK 359

Query: 377 DAFLLESIFRNTIW--GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ-- 432
           D   L+ + +        LD P +     L+  ++++  K     + TTI +D++LLQ  
Sbjct: 360 D---LDQMRKKGKLPKAELDKPCAR----LLVEILQNRQKD----YPTTIAQDDELLQAL 408

Query: 433 GGNLDPRLRIAVGVRAGEKRV 453
           G +   R  +AV VR GEK +
Sbjct: 409 GSSQVHRREMAVRVRKGEKEI 429


>gi|321462357|gb|EFX73381.1| hypothetical protein DAPPUDRAFT_58066 [Daphnia pulex]
          Length = 425

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 70/298 (23%), Positives = 118/298 (39%), Gaps = 54/298 (18%)

Query: 84  RPATF-PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFL 142
           +PA F   G GL+A  +IA N +++++P    I  + V A EI  L            F 
Sbjct: 53  KPALFNSTGRGLMAMSNIAPNHLLVQIPQSLLITKEKVLA-EISDLLQFSMTTAECLTFF 111

Query: 143 IREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           I    K +  +  Y+  LP+          +E+  L     L   +      QN  LK  
Sbjct: 112 ILN-SKFNGLYSSYISTLPKSFSVGGLCKSQEIAALPS--FLQEKIMCN---QNFVLKKY 165

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRA------------FSRLRG--QNLVLIPLA 248
           E+I    ++++   ++L+ F WA+  + +RA             +++ G   N+ L P  
Sbjct: 166 EKIFAIWRKIYGSTLSLELFQWAWFCVNTRAVFYQDSKQHSHGLNKVDGMENNMALAPYL 225

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
           D+ NH   +  E        AG       + +R+   +K  +QV I Y  +  N +L L+
Sbjct: 226 DMFNHDAEVVVE--------AGFNKTTQCYEIRSDRHIKKYQQVFINYGPH-DNMKLFLE 276

Query: 309 YGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ 366
           YGF+ +K+   A    +++                        FD+VLG    P  LQ
Sbjct: 277 YGFLATKNLHKAVEFDIDV-----------------------LFDLVLGCPSKPCCLQ 311


>gi|425767698|gb|EKV06264.1| hypothetical protein PDIG_78250 [Penicillium digitatum PHI26]
 gi|425780393|gb|EKV18400.1| hypothetical protein PDIP_26670 [Penicillium digitatum Pd1]
          Length = 494

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 86/192 (44%), Gaps = 17/192 (8%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IIL 207
           +PW  Y+  LP       F++ EE   L+GT L    +    +++ E+ ++ +    I  
Sbjct: 129 NPWTEYVKFLPPSFPLPTFYTAEEQELLRGTSLAEPLVAKLAFLEREFEQLRQATGGIAW 188

Query: 208 PNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH--SPGITTEDY 262
             +  +      +T+DD+ +     RSR    L G  L ++P  D+ NH  S G+     
Sbjct: 189 CQRSWWHERTGALTIDDWKYVDAAYRSRLLD-LPGSGLAMVPCIDMANHVSSDGVKA--- 244

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-SDRNAY 321
            Y+    G    + +  LR    ++ GE++ I Y   KS +E+   YGF+ES  +     
Sbjct: 245 LYDTDSEG----NAVLQLRWGKTIQPGEEITISYGNEKSASEMIFSYGFLESGITQAREM 300

Query: 322 TLTLEISESDPF 333
            L LEI E DP 
Sbjct: 301 FLNLEIPEDDPL 312


>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
           reading frame 18 (C21orf18)) [Danio rerio]
          Length = 440

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 95/420 (22%), Positives = 157/420 (37%), Gaps = 51/420 (12%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           AQ     +WL ++   S    + P  F   G GL++ + I     ++ +P +  +   TV
Sbjct: 32  AQFVLLRRWLNERGFTS--QSLIPVNFHGNGRGLMSTQTIKAKNSLISLPEECLLTTSTV 89

Query: 121 AASEIGSLCSGLKPWIS----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEEL 175
             S +        P IS    +  FLI E+   E S W  Y+DILP+ T +   +  + +
Sbjct: 90  LKSYMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK-TYTCPLYFPDNV 148

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP----ITLDDFLWAFGILRS 231
           +EL    L       KE  Q E     +      + LF +P     + D   WA+  + +
Sbjct: 149 IELLPRSLQKKATQQKEQFQ-ELFSSSQTFFHSLQPLFNQPTEELFSQDALRWAWCSVNT 207

Query: 232 RAFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
           R       Q+           L P  DL+NH P +  E        AG       + +R+
Sbjct: 208 RTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVE--------AGFNKETRCYEIRS 259

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE-----ISESDPFFGDK 337
               K  +Q  I Y  +  N  L L+YGF+   +  +   + LE     + E D    +K
Sbjct: 260 VNGCKKFQQAFINYGPH-DNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEK 318

Query: 338 LDIAETNGLGESAYFDIVLGRTLPP-AMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLP 396
           L   + N        ++  G   P   ++  LRL++L        +S+        L   
Sbjct: 319 LLYLKDNDFLR----NLTFGMDGPSWRLMTALRLLSLKPQQYTSWKSVL-------LGAA 367

Query: 397 VSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLD--PRLRIAVGVRAGEKRVL 454
           VS   E+         C +        +E   QL +G N     +L +   +R  E+ +L
Sbjct: 368 VSQDREDWCIESALKLCNNLTEDNVKALERLAQLKEGANQSGLEQLCVVESLRREEQNIL 427


>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 411

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/177 (25%), Positives = 86/177 (48%), Gaps = 10/177 (5%)

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLG 347
           AG ++L+  +  + N EL L  G ++  +  +  +    +  +D ++  K  + E+ G  
Sbjct: 180 AGSEMLL--NDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYMMKSQVLESMGYS 237

Query: 348 ESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICR 407
            +  F +   R +P  +L YLRL  +    A L +  F        D+ +S  NE  I +
Sbjct: 238 AAEEFPVYADR-MPIQLLAYLRLSRVADP-ALLAKCTF------EADVELSQMNEYEILQ 289

Query: 408 VVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           ++   C+  L+ +  + EED ++ Q  +L P+ R+AV +R GEKR++    E  + R
Sbjct: 290 ILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATMEAVRRR 346


>gi|397576179|gb|EJK50107.1| hypothetical protein THAOC_30954 [Thalassiosira oceanica]
          Length = 426

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 109/247 (44%), Gaps = 45/247 (18%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWI-------NPDTVAASEIGSLCSGLKPWISVALFLIRE 145
           G++A +DI+K + ++E+P +  +       +P   A + +   C+    W S A     +
Sbjct: 127 GVIALKDISKGDPIIEIPYRMALDLGRESADPTLPATAFLQKYCA----WRSGADVPPGD 182

Query: 146 KKKEDSPWRVYLDILP-----ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
           + + D     Y D+LP     +C  ST F+S+  L  LQ   +   TL  K+ V+  Y +
Sbjct: 183 RDRGD-----YFDMLPPFLSDDCLGSTDFFSDSALDMLQSPMVKEETLERKKLVKLRYER 237

Query: 201 VEEEIILPNKQLFP---RPITLDDFLWAFGILRSRAFSRLRG------QNLVLIPLADLI 251
             E +   +  L+       T D   WA  I+ SR  + ++G       N +LIPL D+ 
Sbjct: 238 DIESMTQMSSNLYQWDGEAATEDHLHWASWIITSRVLT-VQGPPESSFSNRLLIPLIDMC 296

Query: 252 NH---SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN-KSNAELAL 307
           NH   SP I T      + G        L  +   V VKAG+ + I Y    + N     
Sbjct: 297 NHSRESPHILT---GRAMPGG-------LLKVVAGVDVKAGDAIDICYGGGVEGNDRFIQ 346

Query: 308 DYGFIES 314
           DYGF++S
Sbjct: 347 DYGFLDS 353


>gi|367036851|ref|XP_003648806.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
 gi|346996067|gb|AEO62470.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
          Length = 479

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 89/193 (46%), Gaps = 14/193 (7%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI--ILP 208
           +PW  YL  LPE       W+E+E + LQGT L +        +  E+ ++ E+   I  
Sbjct: 118 NPWTEYLRFLPETVLVPTLWTEDERLLLQGTSLEAAVNAKISALDAEFAQIVEKSSDIAC 177

Query: 209 NKQLF--PRPITLDDFLWAFGILRSRAFS-RLRGQNLVLIPLADLINHSPGITTEDYAYE 265
            ++L      ++  D++    + RSR     + G+++V  P  D++NHS   T   Y  E
Sbjct: 178 WRELLWDGGTVSFTDWIRLDALYRSRCLELPVHGESMV--PCIDMLNHS--ATPSAYYDE 233

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
                    D++  LR  + +  G+++ I Y   KS AE+   YGFI+ +S  ++  L L
Sbjct: 234 NP-----QDDVVLLLRPGISLAEGDEITISYGDAKSAAEMLFSYGFIDPRSTADSLVLPL 288

Query: 326 EISESDPFFGDKL 338
                DP    KL
Sbjct: 289 SPFPDDPLAKAKL 301


>gi|332020872|gb|EGI61270.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 247

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 67/118 (56%), Gaps = 6/118 (5%)

Query: 86  ATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG----LKPWISVAL 140
           A FP   LGL A+ D A+N+++LE+P     +  T AASE+  L +       P +++A+
Sbjct: 103 AEFPGYDLGLKAETDFAENQLILEIPRALIFSTYT-AASELTILQNDPLVQHMPQVALAI 161

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
            L+ E+ KE+S W+ YLD+LP   ++ ++    +++EL+G+  L   L     +  +Y
Sbjct: 162 ALLIERYKENSKWKPYLDMLPSSYNTVLYMKTNDMIELKGSPTLEAALKQCRNIARQY 219


>gi|307107162|gb|EFN55406.1| hypothetical protein CHLNCDRAFT_134525 [Chlorella variabilis]
          Length = 705

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 50/183 (27%), Positives = 78/183 (42%), Gaps = 9/183 (4%)

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDS 151
           LG+ A RD+A+ E +  +P    ++  T   +++            V   +       +S
Sbjct: 39  LGVRAVRDVAEGERLCAIPKAACLSIRTTQLADVIEAEELGGGLGLVLAVMHEMSLGAES 98

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            W  Y   LP      +FWS+ +L  L GT+L  +    +E    ++    EE +LP   
Sbjct: 99  RWHGYFAALPPREYLPLFWSDAQLRLLAGTELEGSAESDREASAEDF----EEHVLPLLH 154

Query: 212 LFP---RP--ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
            +P   RP   TLD F  A   + SRAF         ++PLAD+ NH   +      YE+
Sbjct: 155 KYPGRLRPAACTLDRFRVAASFVGSRAFCVDEWHGDAMVPLADIFNHKASVVELGAGYEV 214

Query: 267 KGA 269
            GA
Sbjct: 215 HGA 217


>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
          Length = 255

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 66/256 (25%), Positives = 108/256 (42%), Gaps = 27/256 (10%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS-----LCSGLKPWISVALFLIRE 145
           G G++ +R +   + ++ +P    I   TV  S +G      L   L P  ++ +FL+ E
Sbjct: 5   GRGVMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPCRLSPTETLVIFLLCE 64

Query: 146 KKKEDSP-WRVYLDILPECTDSTVFWSEEELVEL-QGTQLLSTTLGVK-EYVQNEYLKVE 202
           + K  S  W+ Y+DILP      + W+ +E+  L + T+  +  L +K E   N      
Sbjct: 65  RNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLRLKAEESFNRLCNGF 124

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV----------LIPLADLIN 252
             +++     F    T D F WA+  + +R     + QN V          L P  DL+N
Sbjct: 125 LPLLVRQMPQFNGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKSALAPFLDLLN 184

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           H+  +           A        + + T    K  +QV I Y  + SN +L L+YGF 
Sbjct: 185 HTVDVEV--------NARFDDSSKSYKITTLTACKPYDQVFINYGPH-SNEKLLLEYGFT 235

Query: 313 ESKSDRNAYTLTLEIS 328
              +  N  +LTL  S
Sbjct: 236 LPCNPHNNISLTLSQS 251


>gi|170042895|ref|XP_001849144.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167866318|gb|EDS29701.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 474

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 75/320 (23%), Positives = 132/320 (41%), Gaps = 58/320 (18%)

Query: 83  IRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG---------- 131
           +R A F E G GL ++R     + ++ +P +  I+  ++   E    C            
Sbjct: 39  LRVANFEETGKGLFSRRGFKAGDCIISLPFESLISLKSIEEDEAFLACFNKDALADFSKG 98

Query: 132 -LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV 190
            L+    +AL+LI  + +++SP + YL+ +PE   +  F S+ E+  L    +L   +  
Sbjct: 99  ELQFQSLLALYLIYLRLEDNSPRKAYLNSIPESFTTPYFCSKLEMANLPNV-VLKQMVQQ 157

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS--------RLRGQ-- 240
            E ++  +  ++   IL N  L  + + LD F WA+  + +R+           L+G+  
Sbjct: 158 NEIIKQNFTLLQS--ILNNDSL--QAVDLDLFKWAYFAVNTRSVYLEPRVLKLLLKGKTT 213

Query: 241 ----------NLVLIPLADLINHSPG--------ITTEDYAYEIKGAGLFSRDLLFSLRT 282
                     N+ L P  D  NH+ G        I+ E    + K   +    L + L T
Sbjct: 214 FFAEKLKDEPNMALAPFLDFFNHNAGAETTSKLSISYESLTKQFKKNQI--THLYYELFT 271

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE----ISESDP------ 332
             PV    Q+ I Y    +N +L L+YGF    + ++   LTL+       +DP      
Sbjct: 272 SKPVPPFSQIFISYG-THNNTKLLLEYGFSLPANPQDFLELTLDDINAFIRADPELRPLK 330

Query: 333 FFGDKLDIAETNGLGESAYF 352
              +K      + LGE  +F
Sbjct: 331 IHREKYRFIADHSLGEQLFF 350


>gi|115492035|ref|XP_001210645.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114197505|gb|EAU39205.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 514

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 12/203 (5%)

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
           FL + K     PW  Y+  +P        +  EE+  LQGT L       K+Y++ E+  
Sbjct: 115 FLGQPKIGVSHPWAEYIKYMPSSVLLPTLYDHEEVELLQGTSLRLAVEAKKDYLEKEFEH 174

Query: 201 VE---EEIILPNKQLFPRP--ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP 255
           +    E I    ++ +     + +DD+ +   + RSR     R  +  ++P  D+ NH+ 
Sbjct: 175 LRQSTENIPWCEERWWGESGELDIDDWKYVDALYRSRLVDLPRSGH-AMVPCVDMANHAS 233

Query: 256 GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ES 314
             T +   YE    G    + L  LR    +  G++V I Y   K  AE+   YGF+ E 
Sbjct: 234 DDTVK-ALYEEDAEG----NALLQLREGQVLHPGDEVTISYGSEKPAAEMLFSYGFLPED 288

Query: 315 KSDRNAYTLTLEISESDPFFGDK 337
           K D     L L I + DP    K
Sbjct: 289 KEDAGQVFLDLSIPDDDPLRNHK 311


>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
          Length = 469

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 106/431 (24%), Positives = 171/431 (39%), Gaps = 96/431 (22%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN----------PDTVAAS--EIGSLCSGLKPWISV 138
           G G++A RDI +  ++  +P +  IN          P     S  E+      L  W  +
Sbjct: 38  GRGIIATRDIPEETILFTIPRQAIINVLTSELPQKLPQVFDGSIDEMDDNAEPLDSWGQL 97

Query: 139 ALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
            L ++ E  + D S W+ Y DILP+  D+ +FWS+ EL+E     L  T+L  ++  + E
Sbjct: 98  ILVMLYEVLQGDASRWKPYFDILPQQFDTPIFWSDGELLE-----LQGTSLTAEKIGKVE 152

Query: 198 YLKVEEEIILPNKQLFP---------RPITLDDFL------------WAFGILRSRAFSR 236
              +    ILP  Q  P         +P T D+ L            +AF +        
Sbjct: 153 SDAMFRSKILPIVQANPAIFYPEGAAQP-TEDELLHLAHRMGSTIMAYAFDLENDDENEN 211

Query: 237 L--------RGQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVK 287
                     G+ ++ ++P+AD +N +          E   A     D          +K
Sbjct: 212 EEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGESLEATAIRAD----------IK 261

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRN-----AYTLTLEI--------SESDPFF 334
           AG+Q+L  Y      +EL   YG++  +  R       +TL  E+        +E+    
Sbjct: 262 AGDQILNYYG-PLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKEVIVSSLSLSAEAWKQV 320

Query: 335 GDKLDIAETNGLGESAYFDIVL--------GRTLPPAMLQYLRLVALGGTDAFLLESIFR 386
             ++D  E        YF I          GR   PA+L+ +    +     FL      
Sbjct: 321 ESQIDDEEIED-----YFVIERDSGEPGPDGRFTAPAVLREVSPELVEQLKEFL------ 369

Query: 387 NTIWGHLD---LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIA 443
                 LD   +P     +E+   V+ +  K  L+ + T+IE DE+LL   +L  R R+A
Sbjct: 370 -KAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRRMA 428

Query: 444 VGVRAGEKRVL 454
           V VR GEK++L
Sbjct: 429 VVVRLGEKKLL 439


>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
 gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
          Length = 236

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 60/222 (27%), Positives = 99/222 (44%), Gaps = 36/222 (16%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSP 152
           GL A R +   E +LE+ +   I P     S++    S L+                 S 
Sbjct: 39  GLFASRPVRAGERMLEISLDLMIAP-----SDLPDQLSTLQ----------------SSA 77

Query: 153 WRVYLDILPECT--DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI-ILPN 209
           W  Y+  LPE    D+T  W + EL  L+ + L   T    E +  E+ +V+  + + P 
Sbjct: 78  WAPYISCLPEPAGLDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQVQNALDVWP- 136

Query: 210 KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
            QLF + ++++DF+  +  + SR  +      LV+IP+ D  NH+          ++   
Sbjct: 137 -QLFGK-VSVEDFMHVYATVFSRPLAIGEDSTLVMIPMLDFFNHNAASFA-----KLSFN 189

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           GL +  ++ + R        +Q+ I    + SNAELALDYGF
Sbjct: 190 GLLNYAVVTADRD---CAENDQIWINCG-DLSNAELALDYGF 227


>gi|357504157|ref|XP_003622367.1| SET domain-containing protein [Medicago truncatula]
 gi|355497382|gb|AES78585.1| SET domain-containing protein [Medicago truncatula]
          Length = 497

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 86/175 (49%), Gaps = 14/175 (8%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-GSLCSGLKPWISVALFLIREKK- 147
           EG+ + A  +I   +VV ++P K  +   T  A EI  + C G   ++ +A+ ++ E+  
Sbjct: 32  EGISVKALCEINAGDVVAKMPKKACLTIKTSGACEIIENACLG--GYLGLAVAIMYERSL 89

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
            E+SPW  YL +LP+     + WS EE+ + L GT+L  T    K  V  ++     E I
Sbjct: 90  AEESPWEGYLQLLPQQECLPLVWSVEEVDQLLCGTELHQTVQEDKALVYEDW----RENI 145

Query: 207 LP-----NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           LP       +L P    ++ +  A  ++ SR+F         ++PLADL NH  G
Sbjct: 146 LPLLDSEPSKLNPAFFGVEQYFAAKSLISSRSFEIDDYHGFGMVPLADLFNHKTG 200


>gi|78097104|ref|NP_001030295.1| N-lysine methyltransferase SETD6 [Mus musculus]
 gi|81904260|sp|Q9CWY3.1|SETD6_MOUSE RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
 gi|12845648|dbj|BAB26837.1| unnamed protein product [Mus musculus]
 gi|74198625|dbj|BAE39788.1| unnamed protein product [Mus musculus]
 gi|148679234|gb|EDL11181.1| RIKEN cDNA 0610039J04 [Mus musculus]
 gi|187951385|gb|AAI39199.1| SET domain containing 6 [Mus musculus]
 gi|187952351|gb|AAI39200.1| SET domain containing 6 [Mus musculus]
          Length = 473

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 79/297 (26%), Positives = 130/297 (43%), Gaps = 44/297 (14%)

Query: 42  RAFAATCSLHSASATTNPPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDI 100
           RA  A   L + S+   P    V  F +W R   + +SPK  +       G G+VA+  +
Sbjct: 24  RAARAPLPLPAGSSGGEPEGDAVAGFLRWCRRVGLELSPKVTVSRQGTVAGYGMVARESV 83

Query: 101 AKNEVVLEVPMKFWINPDTVAAS-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRV 155
              E++  VP    ++P T + S     E G+L S L  W+ + L L+ E +   SPW  
Sbjct: 84  RAGELLFAVPRSALLSPHTCSISGLLERERGALQS-LSGWVPLLLALLHELQAPASPWSP 142

Query: 156 YLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP- 208
           Y  + PE    +  +FW EEE +  L+GT       GV E V+ + + +  E   I+LP 
Sbjct: 143 YFALWPELGRLEHPMFWPEEERLRLLKGT-------GVPEAVEKDLVNIRSEYYSIVLPF 195

Query: 209 ---NKQLF-PRPITLDDFLWAFGILRSRAFS--------RLRGQNLVLIPLADLINHSPG 256
              +  LF P   +L+ +     ++ + +F              + +++P AD++NH   
Sbjct: 196 MEAHSDLFSPSVRSLELYQQLVALVMAYSFQEPLEEDDDEKEPNSPLMVPAADILNH--- 252

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           I   +   E      +S D L  + T  P+  G ++   Y    +N +L   YGF E
Sbjct: 253 IANHNANLE------YSADYLRMVAT-QPILEGHEIFNTYG-QMANWQLIHMYGFAE 301


>gi|198417784|ref|XP_002130734.1| PREDICTED: similar to SET domain-containing protein 4 [Ciona
           intestinalis]
          Length = 473

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 71/298 (23%), Positives = 124/298 (41%), Gaps = 47/298 (15%)

Query: 85  PATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTV---------AASEIGSLCSGLK- 133
           P  F + G G++A+  I + +V+L +P    +  ++           +S   S+  GLK 
Sbjct: 50  PFVFKDTGRGMMAKTRICEGDVILSIPQAAMVGVNSAFNLSKFAQSISSVYHSMHDGLKL 109

Query: 134 PWISV-ALFLIREKKK-----EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT 187
             I +  +FLI EK+K       S W  Y+ +LP+     ++W  EE+  L   QL    
Sbjct: 110 SGIQILCIFLIEEKRKLGKNKPSSTWGYYVKVLPQTFTHPLYWEMEEIHTL-PKQLQICV 168

Query: 188 LGVKEYVQNEYLKVEEEI---ILPNKQLFPRPITLDDFLWAFGILRSR------------ 232
               + V+ ++ ++ E I    L +   +   I+  ++ WA+  + +R            
Sbjct: 169 NKTIDCVKQQFKELNEMIKKLKLGSDLNYHEEISWIEYRWAWCCVNTRCVYSTHDDPTIM 228

Query: 233 --AFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
              +         L+P  DL+NHS  + T+        A   + +  F LRT    K   
Sbjct: 229 KCCYQSSAADKYFLVPYLDLLNHSNEVNTK--------AEFNNTNKCFELRTHCKFKRFA 280

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF--FGDKLDIAETNGL 346
           QV I Y    SN+ L ++YGF+  K+      + L++     F  +  K  I  +N L
Sbjct: 281 QVFISYGA-LSNSTLLVEYGFV-CKTPNKHDVVALDVGHVLSFIKYSVKTAICPSNSL 336


>gi|328866266|gb|EGG14651.1| hypothetical protein DFA_10909 [Dictyostelium fasciculatum]
          Length = 581

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 62/252 (24%), Positives = 102/252 (40%), Gaps = 67/252 (26%)

Query: 217 ITLDDFLWAFGILRSRAF-----------------------------SRLRGQNLVLIPL 247
           + L+ ++WA+G++++R +                             ++ +  N  L+PL
Sbjct: 301 VDLNMYVWAWGVIQTRTYYYNPRLDHQVSAAASSKKQQQQQLPKIHTTQDKNDNACLVPL 360

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
           ADL NH+P + T         A   + D  + + T    + GEQV I Y L+ +NA L  
Sbjct: 361 ADLFNHNPNVKTM--------ASYCAADRCYRVYTDTRFEKGEQVFISYGLH-NNATLLH 411

Query: 308 DYGFIESKSDRNAYTLTLEIS------------------ESDPFFGDKLDIAETNGLGES 349
            YGF+   +  +   +  E S                  E +     K +I   NGL + 
Sbjct: 412 YYGFVIDNNHLDGIEIDSEASLPPLRPSAFYHLPAKEQREYERLVERKENILIQNGLSQG 471

Query: 350 AYFDIVLGRTLPPA--MLQYLRLVALGGT--DAFLLESIFRNTIWGHLDLPVSHANEELI 405
            Y ++V   T+P     L  LR++ +  +  DA L   IF      HLD PVS  N + +
Sbjct: 472 KY-EVVNDPTMPFTWNYLTTLRVMMMTKSEIDANLQNHIF------HLDEPVSLQNNKRV 524

Query: 406 CRVVRDACKSAL 417
            + + D C   L
Sbjct: 525 SKFLTDLCDRHL 536



 Score = 41.6 bits (96), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 32/127 (25%), Positives = 63/127 (49%), Gaps = 17/127 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA-----SEIGSLCSGLKPWISVALFLIRE 145
           G G++A+R I   ++++EVP+   I+   + +       I ++   L    ++   LI E
Sbjct: 45  GRGVIAKRTIESGDLLVEVPLSLLIHSLPILSVVPPFEHIETVLKLLDSKQTICFQLIYE 104

Query: 146 K-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           +  +  S W  YLD +P+  ++TV +++ E+ EL              Y +NE  K+ +E
Sbjct: 105 RLIRNRSRWYGYLDCIPKEYNTTVSYTDAEIGELS-----------YPYYKNEATKLRKE 153

Query: 205 IILPNKQ 211
           ++  +KQ
Sbjct: 154 MLDSHKQ 160


>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 492

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 78/160 (48%), Gaps = 13/160 (8%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           Q + F  WL  +  V     IR A       G G+VAQ  I + E +  +P    ++ + 
Sbjct: 16  QSDEFITWLAQRPGVRISPKIRIADLRSQSAGRGVVAQSAIVEGEELFSIPRDLVLSTEN 75

Query: 120 VAASEIGSLCSG----LKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEE 174
              S++ SL S     L PW+S+ L +I E   +E S W  Y  I PE  D+ +FWS  E
Sbjct: 76  ---SKLKSLLSQDLGELGPWLSLMLVMIYEYLLREQSAWAPYYRIFPENFDTLMFWSPAE 132

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP 214
           L ELQG+ ++   +G ++  +   L++   ++  N  LFP
Sbjct: 133 LQELQGSAIVD-KIG-RQGAEESILQMIAPVVKANPSLFP 170


>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
           Pb03]
          Length = 488

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 114/484 (23%), Positives = 198/484 (40%), Gaps = 102/484 (21%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           E F  WL+    V     I+ A       G G+VA  DI + E +  +P    ++     
Sbjct: 18  EEFMDWLKQSPGVRVNPKIKIADLRSEGAGRGIVAYDDINEEEELFAIPQGLVLSFQNSK 77

Query: 122 ASEIGSLCS-GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
             ++  +    L  W+ + L +I E  +   SPW  Y  +LP   D+ +FW++ EL+EL+
Sbjct: 78  LKDLMEINERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTDFDTLMFWTDAELLELK 137

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFG------------ 227
           G+ +L   +G K   +  +L+    ++  N +LFP    L  +    G            
Sbjct: 138 GSAVLG-RIG-KSAAEEVFLRDLLPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMG 195

Query: 228 -ILRSRAFS-------RLRGQNLV------------LIPLADLINHSPGITTEDYAYEIK 267
            ++ S AF         + G++              +IPLADL+N            +  
Sbjct: 196 SLIMSYAFDVENDEAEEVEGEDGYVTDDEERQLPKGMIPLADLLNADA---------DRN 246

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI-ESKSDRNAYTLTLE 326
            A LF  D   ++++   ++ GE++   Y      AEL   YG++ +S +  +   + ++
Sbjct: 247 NARLFQEDGYLAMKSIKSIRKGEEIFNDYG-ELPRAELLRRYGYVTDSYAQYDEAEVPIQ 305

Query: 327 -------ISESDPF-------FGDKLDIAETNGLGESAY-FDIVLGRTLPPAMLQYLRLV 371
                  +  S P        F D L++ + +G G   +     L  TLP  +L  L ++
Sbjct: 306 TICRVAGLKSSTPGPDEPRLEFLDDLEVLD-DGYGIPRHDRSTPLAETLPTELLVVLNIL 364

Query: 372 ALGGTDAFLLESIFRNTIWGHLDLP---VSHANEELICRVVRDACKSALSGFHTTIEEDE 428
            +       LE    N +     +P   +  A   L+  VVR      L  + TT+ +D+
Sbjct: 365 VMP------LEQF--NQLKQKSKVPKPALGIAEATLLDEVVR----LILGEYPTTVAQDK 412

Query: 429 QLLQG-----GNLDP----RLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYY--QER 477
           +LL       G+  P    RL++A+ VR GEK +L  +          L ELE +  Q R
Sbjct: 413 ELLASCANYQGSTSPISAGRLKMALQVRKGEKEILNAV----------LSELEDFIAQHR 462

Query: 478 RLKD 481
           ++ D
Sbjct: 463 QVSD 466


>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 477

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 70/277 (25%), Positives = 127/277 (45%), Gaps = 64/277 (23%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK-----PWISVALFLIRE 145
           G G+VA +D++++E +  +P        ++  +E   L + L+     PW+S+ L ++ E
Sbjct: 42  GRGVVATQDLSEDEELFSIPRA------SILTNETTDLPANLRKELDHPWLSLILVMVHE 95

Query: 146 K-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
             K   S W  Y ++LPE  DS +FWS+EEL+ L+G+ ++   +G KE   + + +    
Sbjct: 96  YLKGTKSSWYPYFNLLPETFDSLMFWSDEELLSLKGSAVVD-KIG-KESADSTFTEQLIP 153

Query: 205 IILPNKQLFPRPITLDDFL-------------WAFGILR------------------SRA 233
           +I  +  +F      +D L             +AF + +                    A
Sbjct: 154 LIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEEEESA 213

Query: 234 FSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
            S  +G    +IPLAD++N +          +   A LF +D    ++T   V+AGE++ 
Sbjct: 214 ISLPKG----MIPLADMLNANA---------DHNNAKLFYQDDKVVMKTLHAVRAGEELF 260

Query: 294 IQYD-LNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
             +  L +S  +L   YG++   +D+ A    +EISE
Sbjct: 261 NDFGPLPRS--DLLRRYGYV---TDQYAKYDVVEISE 292


>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 791

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 75/307 (24%), Positives = 128/307 (41%), Gaps = 42/307 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN-------------PDTVAASEIGSLCSGLKPWIS 137
           G GL A ++  K+EV++ +P    IN             P      +     +GL     
Sbjct: 27  GRGLCANKEFKKDEVIMSIPYSIQINRINLNHIWPEVKLPKFNEGDDDRDDLNGL----- 81

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
           V L+L   K         Y+++LPE  D  + ++ +EL  ++GT+L +    +  ++   
Sbjct: 82  VYLYLAVNKTNPKCFHWPYINVLPETYDCPLSYTIDELNLMKGTKLYAAVEKINAFLMKV 141

Query: 198 YLKVEEEIILPNKQLFPRPI-TLDDFL----WAFGILRSRAFSRLR----GQNLVLIPLA 248
                 ++I    Q FP+   + DD      WA     SRAF  +     G+   LIP  
Sbjct: 142 VDYYNNKLI----QQFPQYFQSFDDLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFC 197

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP-VKAGEQVLIQYDLNKSNAELAL 307
           D  NH     T+     I      ++   FS +T    VK GEQ+   Y + +SN +L L
Sbjct: 198 DFSNH----CTQAKVTYISN----TQTETFSFQTNEELVKPGEQIFNNYRI-RSNEKLLL 248

Query: 308 DYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
            YGF+E  +  +   L +     D  + +  +I +   +    +F + L   +P  +++ 
Sbjct: 249 GYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF-LKLDEDIPLELMRI 307

Query: 368 LRLVALG 374
           LR+V L 
Sbjct: 308 LRIVNLS 314


>gi|116180202|ref|XP_001219950.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
 gi|88185026|gb|EAQ92494.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
          Length = 510

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 91/205 (44%), Gaps = 20/205 (9%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           +PW  YL  LPE       W+E+E + L+GT L +        +  E+  V E   +II 
Sbjct: 118 NPWTEYLKFLPETVLVPTLWTEDERLLLRGTSLEAAVDAKISALDAEFDLVREKSSDIIA 177

Query: 208 PNKQLFPR--PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
            N  L+    P++  D++    + RSR    L      ++P  D+INHS   T   Y  E
Sbjct: 178 WNDLLWMEGVPVSFTDWIRLDALYRSRCL-ELPTSGESMVPCIDMINHSA--TPSAYYDE 234

Query: 266 IKGAGLFSRDLLFSLRTP-VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
                  ++D +  L  P V +   +++ I Y   KS AE+   YGFI+  S   +  L 
Sbjct: 235 ------NTKDEVVLLPRPGVGISKDDEITISYGDAKSAAEMLFSYGFIDPASSAESLVLP 288

Query: 325 LEISESDPFFGDKLDIAETNGLGES 349
           L     +PF  D L  A+    GE+
Sbjct: 289 L-----NPFPDDPLAKAKLVAFGEA 308


>gi|159473090|ref|XP_001694672.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276484|gb|EDP02256.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 515

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 78/310 (25%), Positives = 124/310 (40%), Gaps = 62/310 (20%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEG-LGLVAQRDIAKNEVVLEVPMKF--------- 113
            E F +WL         S       P G  G+VA   IA+ E++L +P+           
Sbjct: 2   AEAFEEWLDKHGGKKHASLDLVKELPNGDRGVVATAPIAEGELLLLLPINCALYMPNDEE 61

Query: 114 WINPDTVAASEIGSLCSG---LKPWISVALFLIRE-KKKEDSPWRVYLDILP-ECTDSTV 168
           W    +     +G L      L P+++  L L+ E  +  +S W  Y+  LP  C D  +
Sbjct: 62  WAKRGSSFPEAVGYLHEHHRTLSPFLATTLALMSEVARGGESAWAAYVGTLPPSCPDCLL 121

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-------------- 214
            WS+EE  +L+GT L    LG  +   + + +    I+   + L+P              
Sbjct: 122 NWSKEEKKDLEGTAL--EELG-PDPAADAFKRHVAPILAARRDLWPLQQQGEGEGGAAAD 178

Query: 215 -RPITLDDFLWAFGILRSRAF--------------SRLRGQN--LVLIPLADLINHSP-- 255
                L  F+   G+++SRAF              S+L G    + L+P  D+INHS   
Sbjct: 179 EAAADLALFVRVAGLVQSRAFHLEAENWVSGAKEISKLEGGGTQVFLLPGIDMINHSHNP 238

Query: 256 ----------GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
                      +     A   +G      +  F +R   P+  GE+VL  Y  N S+A+L
Sbjct: 239 ARRNAHLQRLNVAQAAAAKLTEGGAPEGVEAFFVMRADKPIAEGEEVLHTYG-NLSDAQL 297

Query: 306 ALDYGFIESK 315
              YGF++S+
Sbjct: 298 LQTYGFLDSE 307


>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
 gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
          Length = 479

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 109/443 (24%), Positives = 179/443 (40%), Gaps = 65/443 (14%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWIN-PDTVAA 122
           + F  WL   K  SP   + P      L    A R I+++E +  +P    ++  ++ A 
Sbjct: 18  DDFLLWL---KRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEAR 74

Query: 123 SEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           S +G     L PW+S+ + +I E  + E S W  Y  ILP   D+ +FW++E+L ELQG+
Sbjct: 75  SVLGLDDKQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPSSFDTLMFWTDEQLSELQGS 134

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-----RPITLDDFLWAFGILRSRAFSR 236
            ++   +G K    +  L+    +I  N + FP      P+   D   A   L  R  S 
Sbjct: 135 SVVG-KIG-KAAADDTILQKVVPLIQANSRHFPPRPNMPPLNSPDSQNALLCLAHRMGSI 192

Query: 237 LRGQNLVLIPLADLINHSP--GITTEDYAYEIKG----------------AGLFSRDLLF 278
           +      +    +   H+   G  T+D     KG                A LF  +  F
Sbjct: 193 IMAYAFDIEKTDEADEHTADDGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSF 252

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE---SKSDRNAYTLTL---------- 325
            ++    + +GE++   Y      A+L   YG++    ++ D   ++L            
Sbjct: 253 VMKAIKNIYSGEEIFNDYG-ELPRADLLRRYGYVTDNYAQYDVVEFSLDAICKVAGLPDS 311

Query: 326 EISESDPFFG--DKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLES 383
           E S S+P     D LD+ E          +  L   +P   L  LR + L       +E 
Sbjct: 312 EPSPSNPRLELLDNLDMLEEGYNISRIPRNGTLEDAIPEDFLVLLRALTLP------IED 365

Query: 384 IFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL-----QGGNLDP 438
           + R         P   A+E     ++R       S + T+++EDE +L     Q G ++ 
Sbjct: 366 LNRLGARNKAPKPEFSASE---ASLLRSLVTLRQSEYPTSVQEDESILNCLEQQNGYIND 422

Query: 439 ----RLRIAVGVRAGEKRVLQQI 457
               R ++AV VR GEK +L QI
Sbjct: 423 SGLNRKKMAVQVRKGEKEILTQI 445


>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
 gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
          Length = 479

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 110/443 (24%), Positives = 180/443 (40%), Gaps = 65/443 (14%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWIN-PDTVAA 122
           + F  WL   K  SP   + P      L    A R I+++E +  +P    ++  ++ A 
Sbjct: 18  DDFLLWL---KRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEAR 74

Query: 123 SEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
           S +G     L PW+S+ + +I E  + E S W  Y  ILP   D+ +FW++E+L ELQG+
Sbjct: 75  SVLGLDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPSSFDTLMFWTDEQLSELQGS 134

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-----RPITLDDFLWAFGILRSRAFSR 236
            ++   +G K    +  L+    +I  N + FP      P+   D   A   L  R  S 
Sbjct: 135 AVVG-KIG-KAVADDTILQKVVPLIQANSRYFPPRPNMPPLNSPDSQNALLCLAHRMGSI 192

Query: 237 LRGQ--NLVLIPLADLINHSPGITTEDYAYEIKG----------------AGLFSRDLLF 278
           +     ++     AD      G  T+D     KG                A LF  +  F
Sbjct: 193 IMAYAFDIEKTDEADEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSF 252

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE---SKSDRNAYTLTL---------- 325
            ++    + +GE++   Y      A+L   YG++    ++ D   ++L            
Sbjct: 253 VMKAIKNIHSGEEIFNDYG-ELPRADLLRRYGYVTDNYAQYDVVEFSLDAICKVAGLPDG 311

Query: 326 EISESDPFFG--DKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLES 383
           E S ++P     D LD+ E          +  L   +P   L  LR + L       LE 
Sbjct: 312 EPSPTNPRLELLDNLDMLEEGYNIPRIPRNGTLEDAIPEDFLVLLRALTLP------LED 365

Query: 384 IFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL-----QGGNLDP 438
           + R         P   A+E  + R +    +S    + T+++EDE +L     Q G ++ 
Sbjct: 366 LNRLGARNKAPKPEFSASEASLLRTLVTLRQSE---YPTSVQEDESILKCLEQQNGYIND 422

Query: 439 ----RLRIAVGVRAGEKRVLQQI 457
               R ++AV VR GEK +L QI
Sbjct: 423 SILNRKKMAVQVRKGEKEILTQI 445


>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
 gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
          Length = 529

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 94/419 (22%), Positives = 165/419 (39%), Gaps = 33/419 (7%)

Query: 70  WLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-GSL 128
           WL  +K + P    +  +F +G       D+   E ++E+P    +    VA S I   L
Sbjct: 55  WLTVEKGL-PGGAAKAVSFGDGGVAKLVNDVRAGEPLIEIPQNLAVTSVDVADSPIVAGL 113

Query: 129 CSGLKPWISVALFLIREKKKED-SPWRVYLDILPEC-TDSTVFWSEEELVEL-QGTQLLS 185
            +G    + +AL+L  E+ K   S W  Y+  LP   +D  + W+  EL  L QG+ +  
Sbjct: 114 AAGRGELVGLALWLCLERHKGPLSEWAPYVATLPSAGSDHPLLWTAGELQTLLQGSPVRE 173

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFPRP----ITLDDFLWAFGILRSRAFSRLRGQN 241
             +   E   +EY  + ++I   N   FP      +T D F+ A   + +RA        
Sbjct: 174 QAVSRLESADDEYASIADQI-RSNPNDFPPDAYEFLTRDAFVDALATVLARAVWLNAANC 232

Query: 242 LVLIPLADLI----NHSPGITTEDYAYE--------IKGAGLFSRDLLFSLRTPVPVKAG 289
             ++PL DL+    +  PG++    A             AG+   D        V     
Sbjct: 233 YAMVPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDA 292

Query: 290 EQ---VLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           +Q   V+    L ++  +L L  G ++     +        +++D  +  K  I E  G+
Sbjct: 293 QQTARVVCVDPLARNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGM 352

Query: 347 GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
                   V    +P  +L Y+R   +      +  S   + I       VS  NE  + 
Sbjct: 353 SADGQTFPVFADRMPMQLLAYMRFARVQDPGELMSVSFEEDRI-------VSPMNEYEVL 405

Query: 407 RVVRDACKSALSGFHTTIEEDEQL-LQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           +++    +  L+ + ++ EE E L L+   L  R R+A  +R  EKR++       + R
Sbjct: 406 QLLMQDAREMLAEYESSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINATTTAVRRR 464


>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 61.6 bits (148), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 136/327 (41%), Gaps = 53/327 (16%)

Query: 155 VYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP 214
           +Y+  LP   D  ++W + +  ELQG +     +   ++    Y +V + +   N Q   
Sbjct: 134 LYVSALPRYVDLPLYWDDRKFKELQGCEEARRAV---QHGARFYSQVYQHLFGTNNQF-- 188

Query: 215 RPITLDDFLWAFGILRSRAFSRLRGQN--LVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
             ++ + F WA  IL SRA S   GQN    LIP  D  NH+     ++   E      F
Sbjct: 189 --VSAEAFFWAISILMSRATS---GQNQPFALIPFFDWFNHAD--NGDECVQE------F 235

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS---- 328
                F++ T    + GEQ+ I Y  + SN  L  +YGF    +  +  TL + I+    
Sbjct: 236 DPQKGFTVHTTKAYEPGEQLYINYG-SHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQP 294

Query: 329 -ESDPFFGDKLDIAETNGLGESAYFDIVLGRT--------LPPAMLQYLRLVALGGTDAF 379
             +DP F  K  + ++     S   DI   R+        L P    +L +  L  T   
Sbjct: 295 NPADPAFLQKRGLLQSATGSHST--DIPALRSLRFNHDGQLAPNAEHWLEI--LLATPEE 350

Query: 380 LLESIFR---------NTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQL 430
           L E I +         +TI   L + + H     +  +V     + L    +T+EED+  
Sbjct: 351 LSEIITQAASQSGAADSTISLALPMSLKHKVHSEVGSLV----TARLKQHSSTLEEDDAF 406

Query: 431 LQGGN--LDPRLRIAVGVRAGEKRVLQ 455
           L+  +  +   LR  + +R GEK+ L+
Sbjct: 407 LRANDQQMASWLRSCLHIRMGEKQTLE 433


>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
           [Brachypodium distachyon]
          Length = 495

 Score = 61.6 bits (148), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 45/194 (23%), Positives = 93/194 (47%), Gaps = 4/194 (2%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++  F +W+    VV   +    A+   G+ + A   + + ++V  +P +  + P T + 
Sbjct: 9   RLRAFKRWMSKHGVVCSDALCLDASEAGGVYVRALSALREGDLVATIPRRACLTPRT-SG 67

Query: 123 SEIGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVE-LQG 180
           +      + L   +++A+ ++ E+ +  +SPW  YL ++P+C    + W +EE    L G
Sbjct: 68  AAAAIEAAELGGTLALAVAVMYERARGAESPWNAYLRLIPDCEPVPLVWPDEEAERLLSG 127

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAFGILRSRAFSRLRG 239
           T+L       +E++  ++ +  E +I      + P   +L+ +  A  +L SR+F     
Sbjct: 128 TELDKIVKQDREFLCEDWKECIEPLISSGDLGVNPEDFSLEKYFAAKSLLSSRSFHIDSY 187

Query: 240 QNLVLIPLADLINH 253
               ++PLADL NH
Sbjct: 188 HGSGMVPLADLFNH 201


>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
           8797]
          Length = 496

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 128/286 (44%), Gaps = 54/286 (18%)

Query: 67  FWQWLRD--QKVVSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           F +WL+   + V+SPK  +       +G  ++A  DI K+E++ EVP    +N +    S
Sbjct: 11  FVEWLQGPAEIVLSPKIKVDDLREVNQGRCVIAIEDIEKDEILFEVPRTTMLNVENCELS 70

Query: 124 ----EI-GSLCSGLKPWISVALFLIREKK--KEDSPWRVYLDILPECTDST--VFWSEEE 174
               EI   L   +  W  + + L+ E K   E S W  YL +LP+ TD    ++W+++E
Sbjct: 71  KRYPEIKNHLVESVGQWEGLIIALLFEWKVVGEKSKWWPYLQVLPKKTDMNQLIYWADDE 130

Query: 175 LVELQGTQLLSTTLG---VKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS 231
           L EL    L+   +G    KE  +N  + +  +  L  K  +   +T ++FL    I+ S
Sbjct: 131 L-ELLKPSLILERVGADKAKEMFEN-VVDIINKSTLKEKDSYILKVTWENFLLVASIIMS 188

Query: 232 RAFSRLRGQNLV------------------------LIPLADLINHSPGITTEDYAYEIK 267
            +F     Q+ V                        +IPLAD +N +   T +  A+ I 
Sbjct: 189 YSFD---VQDYVEEKEGGTDEEEDDNESENVRSLKCMIPLADTLNSN---THKCNAHLIH 242

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           G+       L  +R+   +K GEQ+   Y  +  N+E+   YG+IE
Sbjct: 243 GSN------LLEMRSIKAIKKGEQIYNIYG-DHPNSEILRRYGYIE 281


>gi|156035929|ref|XP_001586076.1| hypothetical protein SS1G_13169 [Sclerotinia sclerotiorum 1980]
 gi|154698573|gb|EDN98311.1| hypothetical protein SS1G_13169 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 291

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/284 (27%), Positives = 119/284 (41%), Gaps = 49/284 (17%)

Query: 65  ETFWQWLRDQ-KVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           E F +W  D   ++   SP R   FP +GLGLVA ++    + ++ VP+K         +
Sbjct: 5   EEFTKWAVDHGAIIDGVSPFR---FPGKGLGLVANKNFEIGDTLVRVPIKV-----LRKS 56

Query: 123 SEIGSLCSGLKPWISVALFLIREKKKEDSP-WRVYLDILPECTDST-VFW--SEEELVEL 178
           +++ S  SGL P ++V   L         P W+  L    +   S  +FW  S  EL+  
Sbjct: 57  TDVPSQYSGLAPNVNVHALLALTLDSLLGPEWKAALPSKQDMHSSMPLFWDPSLSELLPY 116

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP-ITLDDFLWAFGILRSRAFSRL 237
               LL         VQ E +     +I    + FP P I+ D+F++ + I+ SR F  L
Sbjct: 117 SAQTLLK--------VQKEKIGSAWTVIC---KAFPEPPISYDEFMYNYSIVNSRTFYYL 165

Query: 238 ------------RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP 285
                       +   L L P AD +NHS   T       +  AG       ++L    P
Sbjct: 166 SPTIKSSKLQPPKEDRLALNPFADYMNHSSQPTVNA---TLSRAG-------YTLTASQP 215

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           +K G +V I Y  + +N  L ++YG   + S+    T   E  E
Sbjct: 216 IKEGSEVHISYG-SHNNDFLLIEYGRHGNNSEDKQNTRRAEWPE 258


>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
 gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
          Length = 504

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 106/444 (23%), Positives = 178/444 (40%), Gaps = 53/444 (11%)

Query: 56  TTN--PPTAQVETFWQWLRDQKVVSPKSPIRPATFPE--GLGLVAQRDIAKNEVVLEVP- 110
           TTN  P    V  F +W  ++        +R A   E  GLGL +   I   E ++ VP 
Sbjct: 62  TTNGRPRMETVAHFMRWAVERGCQVEN--VRVAEHAEYGGLGLESCGPIPAGECIITVPR 119

Query: 111 -MKFWIN--PDTVAASEI--GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD 165
            M F++   P      E+  G++ S  +  I +AL LI E+ +  S W+ YLD+LP+   
Sbjct: 120 SMFFYVTNEPRYRQLLELMPGAMMSE-QGNIMLALALIMERFRAKSDWKPYLDLLPDRYT 178

Query: 166 STVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPITLDDF 222
           + ++++ E++ EL  T      L + +++  +Y    +  +E +   +  F    T D F
Sbjct: 179 TPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRRFVQEKVDELRDCF----TYDVF 234

Query: 223 L-----------WAFGILRSRA------FSRLRG--QNLVLIPLADLINHS-PGITTEDY 262
                       WA   + +R        +   G    L LIPL D+ NH+ P    E  
Sbjct: 235 RLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETR 294

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT 322
                     +  L  SL   V   A   + I Y   +++AE  +  GF+  ++      
Sbjct: 295 CVAETCYNATNEQLECSLTREVSDIASVPIFIVYG-TRTDAEFLVHNGFVCPRNPHANVQ 353

Query: 323 LTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGR------TLPPAMLQYLRLVALGGT 376
               +  + P + ++  + E  G+  +  F     R      T  P   + + L  +   
Sbjct: 354 KRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAATTTTPISQELISLARVSSM 413

Query: 377 DAFLLE--SIFRNTIWGHLDLPVSHANEELICRVVR---DACKSALSGFHTTIEEDEQLL 431
            A  L+  +  + T    L    +    EL  R  R      K  L  + TTIE+DE LL
Sbjct: 414 TAKELDEYTAMKETQRQTLRTYQALLPAELCARTERWLATVMKIMLLRYPTTIEQDEALL 473

Query: 432 QGGNLDPRLRIAVGVRAGEKRVLQ 455
           +      R R+ +  R GEK++L+
Sbjct: 474 KTNRHHIR-RLLIEYRLGEKQILR 496


>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 493

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 125/287 (43%), Gaps = 48/287 (16%)

Query: 59  PPTAQV------ETFWQWLRDQK--VVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEV 109
           PP+ +V      E F  W   ++   VSP   +  A F EG  G+VA +DI ++ V++ +
Sbjct: 27  PPSTKVIGHEAKEKFLAWADSEQGFQVSPDVDLFHA-FSEGYRGVVATKDIPRDAVLVRI 85

Query: 110 PMKFWINPDTVAASE------------------IGSLCSGLKPWISVA----LFLIREKK 147
                + P+T   S+                   G       P ++ A    L L+ E+ 
Sbjct: 86  ARSCCLGPETTDESKNSWTKAMSTSAVDATKTTQGESSKPPAPRLTRACLTVLRLLHERG 145

Query: 148 -KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
             E SP+  YL +LP+     + W+E E+  LQGT   S    V     +   +  + ++
Sbjct: 146 LGESSPFHSYLSVLPQDHRLPLEWTEAEVGLLQGT---SAEPLVGAGSLDSQFEAFQSVV 202

Query: 207 LPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
             +  ++ P   T   F      +RSR F+ +   +  +IP AD+ NH P       + +
Sbjct: 203 AQHPTVWEPSVCTKAAFAKGVNWVRSRGFTVMGDPH--MIPGADMFNHDP----NKQSVQ 256

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           I   G    +  F ++T  PVKAGE+V   +  + SNA+L   YGF+
Sbjct: 257 IGTDG----EEHFVMKTVQPVKAGEEVFSSFG-HISNAQLLNSYGFV 298


>gi|402223842|gb|EJU03906.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 492

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/279 (27%), Positives = 124/279 (44%), Gaps = 36/279 (12%)

Query: 63  QVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           Q+ TF QWL    V + P+  I   T   G  + A  DI     V + P    ++     
Sbjct: 11  QLSTFLQWLSSNNVHIDPRIQIV-HTPKTGYTVQALSDIPYGVRVSKTPKSAVLSVRNCT 69

Query: 122 ASE-IGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFW------SEE 173
            S+ +      L+P I++++ L+ E+    DS +  YL  LPE  D  +FW       EE
Sbjct: 70  LSDTLWPFLKPLEPNIALSIALLSEQLLGRDSRFWGYLQSLPEEADIALFWKLKGQDGEE 129

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN-KQLFPRPITLDDFLWAFGILRSR 232
            L+ LQ T++ +    ++  +  +  +   +++ P   Q  P   T+D F  A+ ++ SR
Sbjct: 130 ALLWLQNTEVSAI---LRTSLLADLDRTYTQVVRPALAQWSP---TIDRFRKAYSLVSSR 183

Query: 233 AFSRLRGQNLVLIPLADLINHSP--GITTE-DYAYEIKGAGL---------------FSR 274
           AF       + L+P+AD  NHS   G+  E D+A   +   L               +  
Sbjct: 184 AFQVDSWHGVGLVPVADAFNHSEENGVFMETDFAVCTQCGSLGPCAHPEHTPPHSSPWDE 243

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNK-SNAELALDYGFI 312
            +   + T  P+KAGE+V   YD     NA+L + YGF+
Sbjct: 244 PMTCDMVTYAPLKAGEEVFNSYDEKGLGNAKLLVWYGFL 282


>gi|8570443|gb|AAF76470.1|AC020622_4 Contains similarity to a hypothetical protein gi|6983878 from Oryza
           Sativa BAC gb|AP001168 [Arabidopsis thaliana]
          Length = 441

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 119/273 (43%), Gaps = 31/273 (11%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDI-AKNEVVLEVPMKFWINPDTV 120
           A++E F  WL+          I+ +   +G G+ A     A +EV+L VP+   I P  V
Sbjct: 8   AKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMRV 67

Query: 121 AASEI-GSLCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEE 173
               + G  C  +     V       LFL  E+ + +S W+ YLD+LP    + +++S++
Sbjct: 68  LQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSDD 127

Query: 174 ELVELQGTQLLSTTLGVK----EYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGIL 229
           +++EL+GT L   T  V      +  + +      I LP+  +FP+         +  I 
Sbjct: 128 DILELKGTNLYHATDKVSFEHFLWANSVFWSRALNIPLPHSFVFPQSQDDTGECTSTKIQ 187

Query: 230 RSRAFSRLRGQNL---VLIPLADLINHS--PGITTEDYAYEIKGAGL-----FSRDLLFS 279
              A S   G  +    L+P  D  NH   P  T     +E+ G G      FS  LL  
Sbjct: 188 AQPAPSVGSGDTIWVEGLVPGIDFCNHDLKPVAT-----WEVDGIGSVSRVPFSMYLLSV 242

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
            + P+P K   ++ I Y  NK N EL   YGF+
Sbjct: 243 AQRPIPKK---EISISYG-NKGNEELLYLYGFV 271


>gi|449454790|ref|XP_004145137.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449471403|ref|XP_004153299.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449474044|ref|XP_004154058.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449503343|ref|XP_004161955.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 482

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 96/423 (22%), Positives = 178/423 (42%), Gaps = 70/423 (16%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVA-ASEIGSLCSGLKP 134
           P+G G+ A +D+    +  V++E+P++  +           PD +     I  + +   P
Sbjct: 97  PDGFGVYASKDVEPLRRARVIMEIPLELMLTISQKLPWMFFPDIIPLGHPIFDIINSTNP 156

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+    ++D+ W++Y D LP   ECT   +  + EE +     Q L++T
Sbjct: 157 ETDWDLRLACLLLYAFDRDDNFWQLYGDFLPSIDECTSLLL--ASEEELLELQDQNLAST 214

Query: 188 LGVKEYVQNEYLKV--EEEIILPNKQLFPRPITLDDFLWAFGILRSRAF---SRLRG--Q 240
           +  ++    E+ +      + L  K+L   P     F+WA  I +SR     +R+    Q
Sbjct: 215 IRDQQRRALEFWERNWHSGVPLKIKRLARDP---KRFIWALSIAQSRCINMETRIGALVQ 271

Query: 241 NL-VLIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           N  +LIP AD++NHS  P      + +  K      R L   +     +K G+++ + Y 
Sbjct: 272 NANMLIPYADMLNHSFQPNCF---FHWRFK-----DRMLEVMINAGQQIKKGQEMTVNYM 323

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
             + N      YGF    S  N + +    S +       L +    GL E+ Y++   G
Sbjct: 324 NGQQNNMFLQRYGF---SSPVNPWDMIEFSSNACIHLDSFLSVFNIAGLPENYYYN---G 377

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
           R         L        D  ++ +      W   D+P S + E    + +++ C+  L
Sbjct: 378 R---------LSSKEDTFVDGAVIAAARSLPSWSDGDIPPSPSRERKAVKELQEECQRML 428

Query: 418 SGFHTTIEEDEQLLQGGNLDPR-LRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQE 476
           + F TT ++D+++L   +   R L  ++  R   K  ++++          +  L+ YQE
Sbjct: 429 AAFPTTSDKDQKMLDSMSQATRTLEASIKYRLHRKLFIEKV----------IKALDVYQE 478

Query: 477 RRL 479
           R L
Sbjct: 479 RIL 481


>gi|85099007|ref|XP_960703.1| hypothetical protein NCU06658 [Neurospora crassa OR74A]
 gi|28922220|gb|EAA31467.1| predicted protein [Neurospora crassa OR74A]
 gi|28950107|emb|CAD70887.1| conserved hypothetical protein [Neurospora crassa]
          Length = 469

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 85/196 (43%), Gaps = 16/196 (8%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LP- 208
           +PW  Y+  LP+       W+E+E + L+GT L S        +  E+  V E    LP 
Sbjct: 99  NPWTEYIKFLPKTVLVPTLWTEDERLLLRGTSLESAVNAKMTAITAEFDAVREAASSLPS 158

Query: 209 -NKQLFP-----RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY 262
            N  L+P        +L  ++    + RSR    L      ++P  D+INHS    T   
Sbjct: 159 WNDVLWPYEDGNSSASLRSWILLDALYRSRVL-ELPKSGESMVPCIDMINHS----TRAS 213

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT 322
           AY  + A     +++   R    +  GE+V I Y   K  AE+   YGFI+ ++   +  
Sbjct: 214 AYYDENA---KDEVVLLPRPDSSISPGEEVTISYGDAKPAAEMLFSYGFIDPEATVESLV 270

Query: 323 LTLEISESDPFFGDKL 338
           L LE  E DP    KL
Sbjct: 271 LPLEPFEDDPLAKAKL 286


>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 487

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 132/327 (40%), Gaps = 72/327 (22%)

Query: 54  SATTNPPTA-----QVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEV 105
           S+T + P A     Q + F  WL+    V     +R A       G G+VAQ +I++ E 
Sbjct: 2   SSTAHFPDAEGFQQQSDNFMSWLQASPGVQLNPKLRLADLRATGAGRGVVAQSNISEGEE 61

Query: 106 VLEVPMKFWINPDTVAASEIGSLCS-----GLKPWISVALFLIREK-KKEDSPWRVYLDI 159
           +  VP    +   TV  SE+ +L        + PW+S+ L ++ E  + E S W  Y  +
Sbjct: 62  LFSVPRAMVL---TVQNSELRTLLGENLEEQMGPWLSLMLVMVYEYLQGEKSRWAPYFRV 118

Query: 160 LPECTDSTVFWSEEELVELQGTQLLSTT--LGVKEYVQNEYLKVEEEIILPNKQLFPRPI 217
           LP   D+ +FWS  EL ELQ + ++      G +E ++N        I+     LFP P 
Sbjct: 119 LPSRFDTLMFWSPAELQELQASTIVEKIGRSGAEESIRNSIAP----ILAKRPDLFPPPQ 174

Query: 218 TL--------DDFLWAFG-----ILRSRAFSRLRGQN----------------------- 241
            L        D  L   G     ++ + AF   + ++                       
Sbjct: 175 GLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGEANDESYMTDDEEEEQLP 234

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
             ++PLADL+N            +   A L+  +    ++   P++ GE++   Y     
Sbjct: 235 KGMVPLADLLNAD---------ADRNNARLYQEEGALVMKAIKPIQQGEEIFNDYG-EIP 284

Query: 302 NAELALDYGFIESKSDRNAYTLTLEIS 328
            A+L   YG++   +D  A    LE+S
Sbjct: 285 RADLLRRYGYV---TDNYAVYDVLELS 308


>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 791

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/309 (23%), Positives = 127/309 (41%), Gaps = 46/309 (14%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN-------------PDTVAASEIGSLCSGLKPWIS 137
           G GL A ++  ++E+++ +P    IN             P      +     +GL     
Sbjct: 27  GRGLCANKEFKQDEIIMSIPYSIQINRINLNHIWPEVKLPKFNEGDDDRDDLNGL----- 81

Query: 138 VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT-------LGV 190
           V L+L   K         Y+++LP+  D  + ++ +EL  ++GT+L           + V
Sbjct: 82  VYLYLAINKTNPKCFHWPYINVLPKTYDCPLSYTIDELNIMKGTKLYVAVEKINAFLMKV 141

Query: 191 KEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR----GQNLVLIP 246
            +Y  N+ ++   +   P   LF R        WA     SRAF  +     G+   LIP
Sbjct: 142 VDYYNNKLIQQFPQYFQPFDDLFKR------LQWAHQSFWSRAFLVIYPQPFGEVGSLIP 195

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP-VKAGEQVLIQYDLNKSNAEL 305
             D  NH     T+     I      +R   FS +T    VK GEQ+   Y + +SN +L
Sbjct: 196 FCDFSNH----CTQAKVTYISN----TRTETFSFQTNEEVVKPGEQIFNNYRI-RSNEKL 246

Query: 306 ALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAML 365
            L YGF+E  +  +   L +     D  + +  +I +   +    +F + L   +P  ++
Sbjct: 247 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF-LKLDEDIPLELM 305

Query: 366 QYLRLVALG 374
           + LR+V L 
Sbjct: 306 RILRIVNLS 314


>gi|397593323|gb|EJK55933.1| hypothetical protein THAOC_24272 [Thalassiosira oceanica]
          Length = 567

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 117/488 (23%), Positives = 182/488 (37%), Gaps = 117/488 (23%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWI---NPD--TV 120
            F  W ++  +      +       GLGL   +DI  N+V+++VP    I   +PD  TV
Sbjct: 94  AFTDWAKENNIKYAGVEVSSGGDNSGLGLYVTQDINANDVLIQVPTNMVIQAESPDYNTV 153

Query: 121 AASEI----GSLCSGLKPWISVALFL--------IREKKKEDS--PWRVYLDILPECTDS 166
              E+          L+ W ++++ L        + +K    S  PW   L+ LP   D+
Sbjct: 154 MEREVFDSNPKAYRNLQWWAALSVQLNYYDKINPVNDKAGGISIQPW---LNSLPRVYDT 210

Query: 167 TVF-WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWA 225
             F W+E  L ELQ   ++      K   + E+  V++       + F   ++ +DF+W 
Sbjct: 211 PAFNWAESSLEELQYRPMIEAVALQKRAWKKEFETVQKAA----SKDFLSKVSFEDFVWG 266

Query: 226 FGILR----SRAFSR--------------------------------------------- 236
               R    S A+S                                              
Sbjct: 267 CETARSRAFSGAYSGSAFNPIPYATVTVLVAVYVALGIGSIEQAANGAALVICGSILKDF 326

Query: 237 -----LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQ 291
                L+ Q  V+ PL D+ NH       + ++E    G       +SL      K G +
Sbjct: 327 VVPKLLKIQKYVICPLIDMANHVGTGAAGNVSFEYFADG-------YSLSALSNAKKGSE 379

Query: 292 VLIQYDLNKSNAELALDYGFIESKSDRNAYTL----TLEISESDPFFGDK---------- 337
           + I Y   +SN +L   YGF+E ++  + Y L      +I + +   G K          
Sbjct: 380 MFISYG-PRSNDQLLQYYGFVEEQNAHDIYILPPIREWDIGQLEEACGRKIGNGRLEKLD 438

Query: 338 ----LDIAETNGLGESAYFDI---VLGRT--LPPAMLQYLRLVALGGTDAFLLESIFRNT 388
               L  A  N     A  DI   VL R   + PA+ Q LR  AL  TD+   +S     
Sbjct: 439 RVGLLGKAAANSENPDAANDIGGVVLTRASGIDPAVTQALR--ALISTDSEWQDS---GE 493

Query: 389 IWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRA 448
             G+    VS  NE+    V R A +  L    TTIEED  LL+       + +AV  R 
Sbjct: 494 SIGNFAEMVSAENEQAARTVARRAMELELESKPTTIEEDVNLLKVMQDKDGVDLAVKFRL 553

Query: 449 GEKRVLQQ 456
            +K++LQ+
Sbjct: 554 EKKKLLQE 561


>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 492

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 113/475 (23%), Positives = 177/475 (37%), Gaps = 95/475 (20%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           TF QW + +        +    FP    G G VA RDI     +  +P    I+  T + 
Sbjct: 3   TFIQWFQRKGAYFDAESVGIIGFPPSEGGRGAVALRDIPDGHTLFSIPRALTISTRTCSL 62

Query: 123 SEIGSLCSGLKP-----WISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELV 176
                L +  K      W+ + L L+ E      S W  YLDILP   D+ +FW+E +L+
Sbjct: 63  PTKFGLEAWRKAQLHQGWVGLILCLMWETAAGSSSKWAGYLDILPNRFDTPMFWTEYDLL 122

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFS 235
           EL+GT ++   LG K   Q +Y +     +     LF P  +++   L  + I+ SR  S
Sbjct: 123 ELKGTSVVE-KLG-KLDAQADYNEKLIPAVKSRPDLFLPEHLSIHYTLERYHIMGSRILS 180

Query: 236 RL----------RGQNLVLIPLADLINHSPGITTE------------------------- 260
           R            G       L   ++  P    E                         
Sbjct: 181 RSFIVEKWNEDEDGNEAANTSLGSAMDVEPSCNEETPSVPMETEDNSDDNDDDDDDDDEE 240

Query: 261 -------------DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
                        +  Y+ + A LF       + T  P+K GEQ+   Y  +  NAEL  
Sbjct: 241 ASDVAMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYG-DLPNAELLR 299

Query: 308 DYGF-----------------IESKSDR--NAYTLTLEISESDPFFGDKLDIAETNGLGE 348
            YG                  +E K+D   +A + T E  + D    +++D     G GE
Sbjct: 300 RYGHVDFLSLPSGGHGNPGDVVEIKADLIISAVSSTPEAVKDDE-AKERIDWWLEEG-GE 357

Query: 349 SAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRV 408
             +  +     LPP M+ +++L+ L   D        +         P     E ++  +
Sbjct: 358 DVFI-LDYEYDLPPVMISFVKLLLLTQADWEKAREKSK---------PPKSRLEGILYDI 407

Query: 409 VRDACKSALSGFHTTIEEDEQLLQGGNLDP-RLRIAVGVRAGEKRVLQQIDEIFK 462
           +    +  L+ + TTIE D+ LL   N  P   + A+ VR GEK +L  I +  K
Sbjct: 408 LISTLEKRLAEYPTTIETDKALLT--NDTPLNNKNAIIVRLGEKEILHGILQKLK 460


>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
 gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
          Length = 692

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 62/204 (30%), Positives = 90/204 (44%), Gaps = 35/204 (17%)

Query: 128 LCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLL 184
           L   ++  I +A F++ E+ K  DS W  YL  LP  ++  S +F+ + +L  LQGT L 
Sbjct: 98  LAQRVRKSIILAFFMVHEQLKGRDSHWWPYLATLPRASELTSALFFQDSDLEWLQGTSLY 157

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNL- 242
            T    +  V+ EY  +   I+     L     T D F WA+ ++ SRAF SR+    L 
Sbjct: 158 ETHRAYRNTVKEEY-DLAISILRDEGYLAIESYTWDIFCWAYTLIASRAFTSRVLDAYLS 216

Query: 243 ------------VLIPLADLINHSP--GITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
                       +++PL D  NH P   I  +  A EI+            L+   P   
Sbjct: 217 NHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIR------------LKVVEPTFT 264

Query: 289 GEQVLIQYD-LNKSNAELALDYGF 311
           GE+V   Y  LN  N +L   YGF
Sbjct: 265 GEEVHNNYGPLN--NQQLMTTYGF 286


>gi|344301751|gb|EGW32056.1| hypothetical protein SPAPADRAFT_138237 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 483

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 138/301 (45%), Gaps = 52/301 (17%)

Query: 61  TAQVETFWQWLRDQKV-VSPK---SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWIN 116
           +A+ + F  +L++  V +SPK   + +R A   +G G++A  DI  +E +  +P    IN
Sbjct: 5   SARTDKFIAFLQENDVTISPKLQIADLRAAN--QGRGIIALEDIEIDETLFTIPRTVLIN 62

Query: 117 P--DTVAAS--EIGSLCSGLK-PWISVALFLIREKKKEDSPWRVYLDILPECT----DST 167
              +++     E+    +GL+  W ++ L L+ E K+++S W  Y ++LP+         
Sbjct: 63  SLNNSLVQDQPELADKLAGLENEWDALILVLLYEYKRKESKWTDYFNVLPDLDTFEFHEL 122

Query: 168 VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP--RPITLDDFLWA 225
           +FW++E+L +L+ + +L   +G  + V+      E  + + N+      + +T+++F   
Sbjct: 123 LFWNDEQLSDLKPSLVLD-RIGKDKTVE----MYERLVAIVNQWNLEELKGMTMEEFTKI 177

Query: 226 FGILRSRAFSRLRGQNLV------------------LIPLADLINHSPGITTEDYAYEIK 267
             I+ S +F   +G                      ++PLAD +N    +      Y   
Sbjct: 178 ATIIMSYSFDVAQGTEDEDEDEDDEEEEEEVEYIKSMVPLADTLNADTHLNNAILTYN-- 235

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE---SKSDRNAYTLT 324
                ++DL+ +     P+K GEQV   Y  +  N E+   YG++E   SK D     LT
Sbjct: 236 ----KNQDLVMTCIK--PIKKGEQVYNTYS-DHPNCEILRRYGYVETTGSKYDFGEIPLT 288

Query: 325 L 325
           L
Sbjct: 289 L 289


>gi|320580679|gb|EFW94901.1| hypothetical protein HPODL_3273 [Ogataea parapolymorpha DL-1]
          Length = 423

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 77/163 (47%), Gaps = 26/163 (15%)

Query: 218 TLDDFLWAFGILRSRAFSRL------RGQNL-VLIPLADLINHSPG----ITTEDYAYEI 266
           + + F WA+ I  +RAF         R  N+  L P+ DL+NH  G     T ED ++  
Sbjct: 180 SFEAFSWAYSIYCTRAFPNFLRKQSERSLNIGFLCPIVDLLNHKNGEKVTWTCEDNSFVF 239

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           K +                ++AGE++   Y  NKSN +L L+YGFI + ++    TLTL+
Sbjct: 240 KASA-------------KRIRAGEEIYNNYG-NKSNTDLLLNYGFILNDNESETTTLTLK 285

Query: 327 ISESDPFFGDKLDIAETNGLGESAY-FDIVLGRTLPPAMLQYL 368
           + ES    G K  +    G   +   F++ L   LP  +L+++
Sbjct: 286 VEESVIEAGTKFGLKLPEGTSANGICFNLSLVTPLPKDLLRFM 328


>gi|255720552|ref|XP_002556556.1| KLTH0H16126p [Lachancea thermotolerans]
 gi|238942522|emb|CAR30694.1| KLTH0H16126p [Lachancea thermotolerans CBS 6340]
          Length = 571

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 108/468 (23%), Positives = 191/468 (40%), Gaps = 92/468 (19%)

Query: 61  TAQVETFWQWLRDQ-KVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           T  V++  +W R    V+  +   R A   +G   VA   I++  V++ +P K  I   T
Sbjct: 3   TGAVDSCVEWARSHGSVIDDRVEFR-ADPNQGTYAVANARISEGSVLVTIPRKLLIT--T 59

Query: 120 VAASEIGSLCSGLK--PWISVALFLIREK-----KKEDSPWRVYLDILPECTD--STVFW 170
             A +   L + ++  P   + LFL + K       E S ++ ++D+LP   D  S  FW
Sbjct: 60  TLAEKHFRLKNPVESNPNALLQLFLSKLKFSSKSCDEISFFQPFMDVLPLIRDIHSPYFW 119

Query: 171 SEEELVELQGTQLL-STTLGVKEYVQ-------------------------NEYLKVEEE 204
           S EEL  L+GT LL  T   +K+ ++                         N + +V + 
Sbjct: 120 SSEELTALKGTDLLIKTERNLKKVIEEWFGLITKAKAADDEDTAFYLQSSSNPHFQVSDH 179

Query: 205 IILPNKQLFPRPITLDDFLWAFGILRSRAFSRL--RGQNL------VLIPLADLINHSPG 256
           +            +   +LW+  I+ SRAF  L    +NL       L P+ D +NH  G
Sbjct: 180 MTYSKSS---SWHSFSSYLWSAYIVSSRAFPELILENENLKNINQAFLYPIVDFLNHHSG 236

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
              +    + +    FS        +   ++ G+++   Y  +KSN EL L+YGF    +
Sbjct: 237 QKVQWQLNKDRNGVSFS--------SGNQIEKGQEIFNNYG-DKSNEELLLNYGFAIQNN 287

Query: 317 DRNAYTLTLEI--SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG 374
             ++ TLTL +   + +      L + + N    S  F + LG  LP  +++   +++  
Sbjct: 288 MNDSSTLTLRLPPGQLESLKSYDLTLRDQNLAENSVNFLLTLGSPLPMPLMKLFGILSKL 347

Query: 375 GTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
            ++ F+     R+ + G   L   +A  E      +DA K  +S   + I          
Sbjct: 348 TSEKFITA---RSVLDGSEQL---NAIIEQKLSFFKDASKLKVS-LKSPI---------- 390

Query: 435 NLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELEYYQERRLKDL 482
               R +      AG+KR+ Q+  +  K+          +Q++ LK+L
Sbjct: 391 ----RSKSTKAYMAGQKRIFQEASDFIKK----------FQKKVLKEL 424


>gi|171692069|ref|XP_001910959.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945983|emb|CAP72784.1| unnamed protein product [Podospora anserina S mat+]
          Length = 454

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/279 (26%), Positives = 115/279 (41%), Gaps = 55/279 (19%)

Query: 87  TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREK 146
           T  +G G+V+Q+D+A  E  +E P    +  D +  S                   + E 
Sbjct: 27  TVDKGYGVVSQKDLAAPEGTVETPQLLAVPHDLILNS-----------------LAVEEH 69

Query: 147 KKEDSPWRVYLDI------------------LPECTDSTVF----WSEEELVELQGTQLL 184
            KED  ++  L+                   LP C  STV     W+E+E   L+GT L 
Sbjct: 70  AKEDKEFKQLLEAVGHHTADNNVATPRQRSALPAC--STVLLPTVWTEDERPLLKGTSL- 126

Query: 185 STTLGVK-EYVQNEYLKVEE---EIILPNKQLFPR-PITLDDFLWAFGILRSRAFSRLRG 239
              +G K   + NE+  + E   +I   N  L+    ++L D++    + RSR    L  
Sbjct: 127 EVAVGAKLRALDNEFEMIREASSDIPYWNGLLWHSGAVSLKDWVHLDALYRSRCL-ELPK 185

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
               ++P  D+INHS    ++  AY  + +     + +  LR    +  G++V I Y   
Sbjct: 186 SGESMVPCIDMINHS----SDPSAYYDQNSDY---EAVLLLRPGASMSKGQEVTISYGDT 238

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKL 338
           KS AE+   YGFI+ +S   +  L L     DP    KL
Sbjct: 239 KSAAEMLFSYGFIDPESTSESLVLPLAPFPDDPLAKAKL 277


>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 984

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/351 (23%), Positives = 137/351 (39%), Gaps = 53/351 (15%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           +E F  WL+  +   P+     +    G G +A  DI   E +  +P++  +  +    S
Sbjct: 1   MEEFEGWLQANEARYPRLTFAVSEL-GGRGGIATEDILPGEELCSIPVRLVLTTEIARKS 59

Query: 124 EIGSLCS-------GLKPWIS-----VALFLIREKKKEDSPWRVYLDILPECTDSTVFWS 171
           E+G L +       G +  +S     +  +LI ++  +D+ W  YL  LP+  D      
Sbjct: 60  EVGRLVAAHLNAVQGERLRVSAGRAILCAYLIHQRAAQDAFWGPYLRSLPKHDDR----P 115

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILR 230
           +E++  L GT L       ++ ++  +  +   +   +  +FP  + T D FLW F    
Sbjct: 116 DEDIQHLAGTNLFYAMQEKQQQIRESFDLLFPALCHAHPTVFPPDLFTWDHFLWTFTACS 175

Query: 231 SRAFSRLRGQNLVLIPLADLINHS-------------PGITTEDYAYEIKGA-GLFSRDL 276
           SR+F     Q LV  P A    H+             PG+   ++ Y  K    L     
Sbjct: 176 SRSFP----QTLVQQPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTG 231

Query: 277 LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF--F 334
                T   V+ G +    Y   K N EL + YGF    ++++   + L  S +     F
Sbjct: 232 RLKFVTEDTVEKGTEAFNNYG-PKGNEELLMGYGFCIEDNEQDYVMIRLSFSPAGKMGLF 290

Query: 335 GDKLDIAETNGLGESAYFDIVLGRT------------LPPAMLQYLRLVAL 373
              L     N L  +A    +LGR             +PPA+L  LR+  +
Sbjct: 291 SPVLAAVSINPL--AARKKALLGRLKLPVLQALRRDYVPPALLDALRVSVM 339


>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
           DL-1]
          Length = 499

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/260 (27%), Positives = 112/260 (43%), Gaps = 52/260 (20%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG-------LKPWISVALFL 142
           EG GL A+ DI K+ V+  +     +N  T A   +G L  G       L  W ++ L L
Sbjct: 36  EGRGLRARNDIQKDTVLFRLARDHILNIRTAA---LGKLKPGNQEVLETLNQWEALILCL 92

Query: 143 IREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
             E    E+S W  YL +LPE  +S +FWS EEL +L+ + +L   +G +E  +  Y K+
Sbjct: 93  AYEMMLGEESRWSSYLAVLPEKFNSLMFWSSEELEKLKPSNVLQ-RIG-REQAEQMYSKL 150

Query: 202 EEEIILP--NKQLFPRPITLDDFLWAFGILRSRAFS------------------------ 235
             E  L   +K+L    +T+D F     I+ S +F                         
Sbjct: 151 VPEYCLRLGSKKLVEY-LTIDRFHVVASIIMSYSFDVDDPEDDPEDDEDEEEDFDEIEQE 209

Query: 236 --RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVL 293
             +  G    ++PLAD +N +  +   + +YE         D L    T   +K GEQ+ 
Sbjct: 210 CIKYDGYLKSMVPLADTLNSNTNLVNANLSYE--------NDALVMTATK-DIKKGEQIY 260

Query: 294 IQYDLNKSNAELALDYGFIE 313
             Y     N+E+   YG++E
Sbjct: 261 NIYG-ELPNSEILRKYGYVE 279


>gi|296085170|emb|CBI28665.3| unnamed protein product [Vitis vinifera]
          Length = 565

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 91/187 (48%), Gaps = 17/187 (9%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           A+++ F QWL+  +V      I+     +G G+    D A + + L VP+   I P  V 
Sbjct: 7   AKLQHFLQWLQLNRVELRGCEIKYCDSNKGFGIFYAND-ASDGIPLVVPLDLAITPMRVL 65

Query: 122 ASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
               +G  C  +         + + LFL  E+ +++S W+ YLD+LP    + +++ ++E
Sbjct: 66  QDPFLGPECRAMFEEGEVDDRLLMILFLTVERLRKNSSWKPYLDMLPTTFGTPLWFIDDE 125

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF------PRPITLDDFLWAFGI 228
            +EL+GT +   T   K+ +Q+ Y    ++++   K+L          +  +DFLWA  I
Sbjct: 126 FIELKGTSVHRATELQKKQLQSLYDDKVKDLV---KKLLILDGDSKGEVHFEDFLWANSI 182

Query: 229 LRSRAFS 235
             +RA +
Sbjct: 183 FWTRALN 189


>gi|403217882|emb|CCK72375.1| hypothetical protein KNAG_0J02970 [Kazachstania naganishii CBS
           8797]
          Length = 581

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 150/366 (40%), Gaps = 53/366 (14%)

Query: 57  TNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEV-VLEVPMKFWI 115
           ++  +  ++ F  W ++   V  +S I   T   G+  VA   I  ++  ++ VP    I
Sbjct: 2   SHSDSTSLKNFIDWSQENGAVIDRSIIFEVTAQTGIRAVASEKIKSSKHPLISVPKCLLI 61

Query: 116 NP-DTVAASEIG---SLCSGLKPWISVALFLIREKKKED--SPWRVYLDILPECTDSTVF 169
              D   A + G    L +G  P     L+L + K  +   S  + YLDILP   +   F
Sbjct: 62  TKEDAKKAFDWGDNRKLPTG-NPNALTQLYLSKLKFGDHAISSLKPYLDILPLHLNQPYF 120

Query: 170 WSEEELVELQGTQLLSTTLGVKEYVQNEY------LKVEEEIILPNKQLFPRPI------ 217
           W   +L  L+G  L+         + NE+        +E+E    +   F + +      
Sbjct: 121 WHRSQLDLLKGADLILAVHANLVEIWNEWKNLLTEFGIEQEAKTCDLDSFDKLVDYISSN 180

Query: 218 ------------TLDDFLWAFGILRSRAFSRL-------RGQNLVLIPLADLINHSPGIT 258
                       +   ++W+  I RSRAF  L         Q   L+P+ DL+NH  G  
Sbjct: 181 IDRLHSGETAWTSFIAYVWSSAIFRSRAFPELMLEENVENLQTAFLLPVVDLLNHRNGT- 239

Query: 259 TEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR 318
                   K    FS   +  +     + A E++   Y  ++SN EL L YGF+E  +  
Sbjct: 240 --------KVKWTFSDGKVNFISEAKEINAKEELFNNYG-DRSNEELLLAYGFVEENNVH 290

Query: 319 NAYTLTLEI-SESDPFF---GDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG 374
           +   LTL++ SE+   F   G KLD+ E+        FDI L  +LP  +LQ    +   
Sbjct: 291 DIARLTLQLDSETIANFRKCGVKLDLRESTIKKNCIQFDISLSNSLPLPLLQLFGFLCKL 350

Query: 375 GTDAFL 380
            +++FL
Sbjct: 351 KSESFL 356


>gi|336264087|ref|XP_003346822.1| hypothetical protein SMAC_05080 [Sordaria macrospora k-hell]
 gi|380090292|emb|CCC11868.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 528

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 92/411 (22%), Positives = 164/411 (39%), Gaps = 107/411 (26%)

Query: 134 PWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG--V 190
           PW  + L L+ E  +   SPW  YL +LPE  D+ +FW+ EEL ELQ + L+        
Sbjct: 125 PWTLLILILVHEYLQGSASPWYPYLSVLPEKFDTPMFWTNEELGELQASALVPKVGKEEA 184

Query: 191 KEYVQNEYLKV-------------------EEEIILPNKQLFPRPITLDDFLWAFGILRS 231
            E ++++ +KV                   EEE++    ++    +      +AF + + 
Sbjct: 185 DEMIRSKIVKVVKDNESAFYPASYSGSKLSEEELLALGHRMGSAIMA-----YAFDLAKE 239

Query: 232 RAF-----------SRLRGQN--LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
                          ++ GQN  + ++P+AD++N          A  +  A +   +   
Sbjct: 240 DDDDEDEEGDGWVEDKIAGQNDTMGMVPMADMLN----------ADAVFNAHINHEEGSL 289

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISES------D 331
           +  +   +K GE++L  Y    S+AEL   YG++     R +   L  ++ E       D
Sbjct: 290 TATSLREIKEGEEILNYYG-PLSSAELLRRYGYVTPNHARYDVVELGWDLIEKGLKKMVD 348

Query: 332 PFFGDKLDI--------------------------------AETNGLGESAYFDIVLGRT 359
              G K D+                                +E    GE+ +  +     
Sbjct: 349 GLQGKKKDVQWEKVDELLEDEKEEGEWEDSFVLERQSEDPDSEGQVHGEAEF--VGWPEE 406

Query: 360 LPPAMLQYLRLV--ALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSAL 417
           L   +  YL+ V   +G  D  + E++    +            +E++   V  A +   
Sbjct: 407 LEEQVRTYLKAVKKVVGSGDRAVAEALGDKKV----------RVKEILLGAVASALEERE 456

Query: 418 SGFHTTIEEDEQLLQ---GGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERE 465
             + T++EEDE++LQ   GG    R  +AV VRAGEKR++++     K++E
Sbjct: 457 GQYATSLEEDEKVLQGIDGGRPTTRREMAVWVRAGEKRIIREAVAWIKKQE 507


>gi|225430662|ref|XP_002268533.1| PREDICTED: uncharacterized protein LOC100267311 [Vitis vinifera]
          Length = 561

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 91/187 (48%), Gaps = 17/187 (9%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           A+++ F QWL+  +V      I+     +G G+    D A + + L VP+   I P  V 
Sbjct: 7   AKLQHFLQWLQLNRVELRGCEIKYCDSNKGFGIFYAND-ASDGIPLVVPLDLAITPMRVL 65

Query: 122 ASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
               +G  C  +         + + LFL  E+ +++S W+ YLD+LP    + +++ ++E
Sbjct: 66  QDPFLGPECRAMFEEGEVDDRLLMILFLTVERLRKNSSWKPYLDMLPTTFGTPLWFIDDE 125

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF------PRPITLDDFLWAFGI 228
            +EL+GT +   T   K+ +Q+ Y    ++++   K+L          +  +DFLWA  I
Sbjct: 126 FIELKGTSVHRATELQKKQLQSLYDDKVKDLV---KKLLILDGDSKGEVHFEDFLWANSI 182

Query: 229 LRSRAFS 235
             +RA +
Sbjct: 183 FWTRALN 189


>gi|347835472|emb|CCD50044.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 562

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/337 (23%), Positives = 143/337 (42%), Gaps = 47/337 (13%)

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LP 208
           ++PW  Y+  LPE       W+E+E   L GT L ++ +     +  ++  + E+ I +P
Sbjct: 194 NNPWSEYIRFLPEKIPVPTMWTEDEKTLLNGTSLENSLVNKMVALSRDFENLREKTIEIP 253

Query: 209 ------NKQLFP-RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTED 261
                  K+  P +P+ L D++      RSR      G+  V++P+ D++NHS    T +
Sbjct: 254 WCHSCWWKEDGPLKPLLLSDWIRVDAWYRSRCLEIETGEE-VMVPVLDMVNHS---FTPN 309

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
             +E    G    + +  L   + +  G ++ I Y + KS+AE   +YGFI+++      
Sbjct: 310 AHWEHTSNG----NAILVLVPDILLDEGTEITISYGV-KSDAENLFNYGFIDAE-----V 359

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ--YLRLVALGGTDAF 379
            LT  I E +P   D L +A+    G+     I  G +         ++ L  L   D  
Sbjct: 360 PLTSLILEVEPIATDPLRVAKVAAFGKRPSVQI-FGHSNGETSWDCPFIYLACLNEEDGL 418

Query: 380 LLESIFR-------NTIWGHLDLPVS----------HANEELI----CRVVRDACKSALS 418
             +++ +          W + D+  S          H  E+++      ++RD  +  L 
Sbjct: 419 EFKTVQKVDGSQSLKVFWQNADVTESTDQFERLISGHEREDILRFRALNLIRDRIELQLE 478

Query: 419 GFHTTIEEDEQLLQGGNLDPRLRI-AVGVRAGEKRVL 454
               + E  E L+ G  +DP  +  A+ +R  E  VL
Sbjct: 479 RLQASEEIVETLVNGEMVDPNTQANALELRQIETDVL 515


>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
          Length = 823

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/261 (26%), Positives = 108/261 (41%), Gaps = 36/261 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINP--------DTVAASEIGSLCSGLKPWIS----V 138
           G GL + ++  + + +L +P    +N           V   E+  L  G K        V
Sbjct: 27  GNGLCSSKEFHEGDTLLSIPYHLQLNTIELHNVFESMVPGFEVPRLGEGAKNRDDENSVV 86

Query: 139 ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
            L+L   K  E      Y++ LP      + +SE EL  L+GT+LL T    K +++   
Sbjct: 87  YLYLAMNKTNEKCFHFPYINTLPTTFSCPLSYSENELKMLKGTKLLVTVEKTKTFLK--- 143

Query: 199 LKVEEEIILPNKQLFPRPITLDDF----LWAFGILRSRAFSRLR----GQNLVLIPLADL 250
            K+ +       Q   R    DDF    +WA  +  SRAF  +     G    LIP AD 
Sbjct: 144 -KLSDYYETLTHQYPTRFQQFDDFYQRLVWAHQVFWSRAFLVIYPDPIGDVASLIPFADF 202

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDL-LFSLRTPVPV-KAGEQVLIQYDLNKSNAELALD 308
            NH+          E K   + +R    FSL+T   V   GEQ+   Y + + N ++ L 
Sbjct: 203 SNHNT---------ETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNYRI-RPNEKMLLG 252

Query: 309 YGFIESKSDRNAYTLTLEISE 329
           YGF+ S++  +   L +   E
Sbjct: 253 YGFVISENPYDEVLLRINFKE 273


>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 345

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/178 (28%), Positives = 85/178 (47%), Gaps = 14/178 (7%)

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLG 347
           AGEQ+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+ 
Sbjct: 55  AGEQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIP 113

Query: 348 ESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLESIFRNT------IWGHLDLPVS 398
            S+ F   L  T PP    +L +LR+  +  T+  L E +  +         G+ + PVS
Sbjct: 114 TSSVF--ALHFTEPPISAQLLAFLRVFCM--TEEELKEHLIGDHAIDKIFTLGNSEFPVS 169

Query: 399 HANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
             NE  +   +       L  + TTIEED+  L+  +L     +A+ +R GEK +L++
Sbjct: 170 WDNEVKLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIKLRLGEKEILEK 227


>gi|336472467|gb|EGO60627.1| hypothetical protein NEUTE1DRAFT_75928 [Neurospora tetrasperma FGSC
           2508]
 gi|350294307|gb|EGZ75392.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 469

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 85/196 (43%), Gaps = 16/196 (8%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LP- 208
           +PW  Y+  LP+       W+E+E + L+GT L S        +  E+  V E    LP 
Sbjct: 99  NPWTEYIKFLPKTVLVPTLWTEDERLLLRGTSLESAVNAKMTAITAEFDAVREAASSLPI 158

Query: 209 -NKQLFP-----RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY 262
            N  L+P        +L  ++    + RSR    L      ++P  D+INHS    T   
Sbjct: 159 WNDILWPYEDGNSSASLRRWILLDALYRSRVL-ELPKSGESMVPCIDMINHS----TRAS 213

Query: 263 AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT 322
           AY  + A     +++   R    +  GE+V I Y   K  AE+   YGFI+ ++   +  
Sbjct: 214 AYYDENA---KDEVVLLPRPDSSISPGEEVTISYGDAKPAAEMLFSYGFIDPEATVESLV 270

Query: 323 LTLEISESDPFFGDKL 338
           L LE  E DP    KL
Sbjct: 271 LPLEPFEDDPLAKAKL 286


>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
 gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
          Length = 499

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 83/307 (27%), Positives = 117/307 (38%), Gaps = 68/307 (22%)

Query: 56  TTNPPTAQVETFWQWLRDQ--KVVSPKSPIRPA-TFPEGLGLVAQRDIAKNEVVLEVPMK 112
           T +      E F  WL +     +SPK  I       +G  ++A  DI  +E++ E+P +
Sbjct: 2   TVDAFLKNTENFHSWLTNSVGYKLSPKIKIADGRDTNQGRFILATEDIKTDELLFEIPRE 61

Query: 113 FWINPDTVAASEIGSLCSGLKPW---------------ISVALFLIREKKKEDSPWRVYL 157
             +N  T       SL S    W               I   LF I+ KK   S W  Y 
Sbjct: 62  SILNVLT------SSLVSEYPAWENILLDGDVGHWEGLIICMLFEIKVKKNM-SKWAPYF 114

Query: 158 DILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK----Q 211
           D+LPE TD  S ++W+ EEL  L+ + +L        +  +E  KV E I    K     
Sbjct: 115 DVLPESTDLNSLMYWTAEELEALKPSLVLDRIGNDGAHQMHE--KVMELIRTFEKDHSVD 172

Query: 212 LFPRPITLDDFLWAFGILRSRAFS-------------------------RLRGQNLVLIP 246
           L    IT +DFL+   I+ S +F                          R  G    +IP
Sbjct: 173 LSFGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEVEEDVEQTVRNEGSLKSMIP 232

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
           LAD +N           Y+              +R    +KAGEQV   Y  N  NAE+ 
Sbjct: 233 LADTLNSDTNKCNAHLIYDEDS---------LKMRAISNIKAGEQVYNIYG-NHPNAEIL 282

Query: 307 LDYGFIE 313
             YG++E
Sbjct: 283 RRYGYVE 289


>gi|302805649|ref|XP_002984575.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
 gi|300147557|gb|EFJ14220.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
          Length = 530

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 133/328 (40%), Gaps = 71/328 (21%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNE--------------VVLE 108
           ++E F +W ++  +      I+  +  EG GL  Q D A+ +              V++ 
Sbjct: 4   RLERFSRWSQEHGIQFRGCAIKRVSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVLVV 63

Query: 109 VPMKFWINPDTVAASEI------GSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILP 161
            P+   + P T+    +        L + +   + V +FLI E+ +   S W  YL++LP
Sbjct: 64  TPLDLALTPVTIVKDPVLGNVYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEMLP 123

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL-KVEEEIILPNKQLF---PRPI 217
               + +++ +EEL+EL GT L   T   +  + + Y+  +  ++ L +  LF    R +
Sbjct: 124 SGFGTPLWFEDEELMELDGTTLFEATKAQQRCLPSVYIGTLCCQLFLVSLYLFRPDDREL 183

Query: 218 TLDDFLWAFGILRSRAF----------------SRLRGQNLV------------------ 243
              +FLWA  I  +RA                 ++  G  LV                  
Sbjct: 184 EFQEFLWANCIFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISCSSKDVST 243

Query: 244 -----LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS--LRTPVPVKAGEQVLIQY 296
                L+P  D  NH+   +     +EI G+   +  +  S  L   V    G +VLI Y
Sbjct: 244 IWIEGLVPGIDFCNHTRRASG---LWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINY 300

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLT 324
             +K N EL   YGF+E + + N Y + 
Sbjct: 301 G-DKGNEELLFLYGFVE-EDNSNDYVMV 326


>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 363

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 141/367 (38%), Gaps = 68/367 (18%)

Query: 104 EVVLEVPMKFWINP-DTVAASEIGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILP 161
           + ++E+P    +   D  A  E+  L  G      +AL+L+ E++K E+S W  +L+ LP
Sbjct: 1   QALVELPGNLSVTAVDVAAHEEVAGLAEGRGELTGLALWLMAERQKGEESRWAPFLECLP 60

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD 221
           E T S V W EE                    VQ+E LK                     
Sbjct: 61  EATLSPVLWPEE--------------------VQDELLK--------------------- 79

Query: 222 FLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR 281
                    S      R +   L    D+I     I T D A    G       L  +L 
Sbjct: 80  --------NSPTLKECRARRAALQQEWDVIAQR--IATGD-ARRFSGGDELK--LWITLG 126

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIA 341
           +P      +++L+     + N ELA+  G +E  +  +  T+ + + ++D  F  K  I 
Sbjct: 127 SPGWGGTSDKLLMAIYDGRPNGELAMATGRVEDDNASDCLTVRVGLVQADRLFSVKKQIL 186

Query: 342 ETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHAN 401
           E+ G      F I   R   P  L     +A     A L +  F        D+ ++  N
Sbjct: 187 ESLGFDIVQEFPIFRDRM--PTQLLAYLRLARLTDPALLAKVSFEE------DIILNPVN 238

Query: 402 EELICRVVRDACKSALS---GFHT-TIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQI 457
           E  + +++   C+  L+   G H  + EED +LLQ   L  + R+A  +R  EK +LQ  
Sbjct: 239 EYEVLQLLLGECRDRLTSYAGMHMGSAEEDVKLLQRPGLTAQERLAARLRKAEKAILQGT 298

Query: 458 DEIFKER 464
            +  + R
Sbjct: 299 LDAVRRR 305


>gi|424513104|emb|CCO66688.1| predicted protein [Bathycoccus prasinos]
          Length = 514

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 82/334 (24%), Positives = 140/334 (41%), Gaps = 53/334 (15%)

Query: 148 KEDSPWRVY-LDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           K    W+ Y  + LP+  DS    +E+EL ELQ   +++   G  +  + +  ++ +E +
Sbjct: 199 KNSEEWKTYKRETLPKSYDSLYLANEKELEELQDVSVMNMAKGSAKMYEAQLEQLLKEPL 258

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFS---RLRGQN----LVLIPLADLINHSP---- 255
             NK+     I L+D  WA  +  +RA S    + G+       ++P ADL NH      
Sbjct: 259 F-NKEGVKDMIDLEDLRWARSVAHTRAMSGKLNVAGEGSFPCAFVVPGADLTNHRTVPNS 317

Query: 256 --GITTEDYAYEIKGAGLFS------RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
             G++ +   YE+K     S        L   L      K G ++ I Y     NA LAL
Sbjct: 318 IYGVSEDGLRYELKWRAKNSLEEDKKEGLPPPLEEEKEPKEGLEMFICYGARHPNALLAL 377

Query: 308 DYGFIE--SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAML 365
            YGF++  + +DR      +      PF   K  + E    G+                 
Sbjct: 378 HYGFVDDTNPNDRIPMECVMPGMRKAPFKVVKKAVQELKEKGDDR--------------- 422

Query: 366 QYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEEL----ICRVVRDACKSALSGFH 421
                 A  G+    L ++ + +  G   +P   A++E     I + ++ A  +ALS F 
Sbjct: 423 -----AAWAGSQ---LLAVSKRSPEG---VPDEIADQEKVHPEIVQQMKKATVAALSQFP 471

Query: 422 TTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQ 455
           TT+E+DE+++       R+++A+  R  +KR L 
Sbjct: 472 TTLEDDEKIVFDEIESSRMQVAISYRIAQKRHLH 505


>gi|219125216|ref|XP_002182882.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405676|gb|EEC45618.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 488

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 90/196 (45%), Gaps = 36/196 (18%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVF---WSEEELVELQGTQLLSTTLGVKE 192
            +VA++L     +    +R YL  LP  +DS      WSE EL  L+G+ LL      +E
Sbjct: 110 CAVAMYLASRPAE----YREYLASLP--SDSNFLPRQWSESELKALRGSPLLIRVRKARE 163

Query: 193 YVQNEYLKVEE---EIILPNK----QLFPRPITLDDFLWAFGILRSRAFSRLRG------ 239
            +Q +Y  + E   EI   +       FP+   L++F  A  ++ SRAFS + G      
Sbjct: 164 GIQKDYNLIREAWKEIHSSHSPSTTANFPK---LEEFSSAMAVVSSRAFSGMAGFETKGG 220

Query: 240 --QNLVLIPLADLINHSPG-ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
              +  +IPL DL NH  G   T++ +Y  K   +        ++    +  G+ + I Y
Sbjct: 221 AVNDTTMIPLLDLCNHHRGRCVTKNVSYRFKDGTVM-------VKAVTDIAIGDTLKITY 273

Query: 297 DLNKSNAELALDYGFI 312
              + NA+L L+YGF 
Sbjct: 274 GA-QGNAQLFLNYGFC 288


>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
          Length = 475

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 62/247 (25%), Positives = 97/247 (39%), Gaps = 27/247 (10%)

Query: 88  FPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLK-------PWISV 138
           FP   G GL A RD+ + E+VL VP    +  +TV   E   LC  +          I +
Sbjct: 47  FPHSGGRGLGAVRDLRRGEIVLRVPKSALMTRETVM--EDKKLCDAVNRHSSLSSAQILI 104

Query: 139 ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
              L    K + S W  YL  LP   D    + E E   LQ  + +  T       ++E+
Sbjct: 105 VCLLYEMGKGKTSRWHPYLMHLPHTYDVLAMFGEFEKHALQVDEAMWVTEKAMLKAKSEW 164

Query: 199 LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGI 257
              E   ++ +    P+  T   ++WA   + SR       +   L P+ DL N+ +PGI
Sbjct: 165 --KEAHSLMQDLMFKPQFFTFKAWVWAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPGI 222

Query: 258 ------------TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
                         + +++ +   G       +        K G+QVL+ Y    +N EL
Sbjct: 223 EPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYG-TYTNLEL 281

Query: 306 ALDYGFI 312
              YGF+
Sbjct: 282 LEHYGFL 288


>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
           atroviride IMI 206040]
          Length = 956

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 109/268 (40%), Gaps = 55/268 (20%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN----------PDTVAASEIGSLCSG---LKPWIS 137
           G G+VA +DI  + V+  VP    +N          PD     +            PW +
Sbjct: 528 GRGIVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWST 587

Query: 138 VALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGT--------------- 181
           + + LI E  K D S W+ YLD+LP   ++ +FWS+ E+ ELQ +               
Sbjct: 588 LIIVLIYEYFKGDQSSWKPYLDVLPASFETPMFWSDAEVDELQASATRSKIGKTNAEEMF 647

Query: 182 --QLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR-PITLDDFLWAF--------GILR 230
             ++L    G  +  Q    K +EE+I    QL  R   T+  + + F            
Sbjct: 648 HAKILPVIRGNPDIFQTSQAKSDEELI----QLAHRMGSTIMSYAFDFQNEDEEEEDDSE 703

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                R     + ++P+AD++N        D  Y    A +   D   ++ T   +KAGE
Sbjct: 704 EWVEDREAKSTMGMVPMADILN-------ADAEY---NAHVNYGDDALTVATLRTIKAGE 753

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDR 318
           ++L  Y  +  N+EL   YG++  K  R
Sbjct: 754 EILNYYGPH-PNSELLRRYGYVTPKHSR 780



 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 34/55 (61%)

Query: 402 EELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
           +E+   V+  A  +  S + TTI EDE +L G NL  R + AV VR GEKR+LQ+
Sbjct: 876 DEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVTVRLGEKRLLQE 930


>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
          Length = 637

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/238 (24%), Positives = 108/238 (45%), Gaps = 30/238 (12%)

Query: 85  PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS------- 137
           PA+F + +G+VA  D+  + V++ +P    I+PD    S + ++ +              
Sbjct: 37  PASFGDVIGVVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNEA 96

Query: 138 ----VALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
               +  ++  EKKK E S +  Y+  + + +++ + WS E+L +++   +L     +K+
Sbjct: 97  EFNILTFYMFNEKKKGEQSFYYPYIQAI-QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQ 155

Query: 193 YVQNEYLKVEEEIILPNKQ-LF--PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLAD 249
                + K   ++I  N Q +F  PR     DF WA   + SR F     ++  L+P+AD
Sbjct: 156 DFLGLWSKA--KLIFDNAQDIFGAPRITDKSDFFWAIECVMSRCFG-WALKSTCLVPIAD 212

Query: 250 LINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
            +NHS    T    +     G FS+           V+A  Q   QY + ++N  L++
Sbjct: 213 FLNHSNKACTHYMVHSALEKGSFSKS---------EVQANFQK--QYVIKRNNMNLSI 259


>gi|147777505|emb|CAN60498.1| hypothetical protein VITISV_027869 [Vitis vinifera]
          Length = 2077

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 63/256 (24%), Positives = 110/256 (42%), Gaps = 33/256 (12%)

Query: 131  GLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGV 190
            G+     + L+ ++EK   +S +  Y + LPE  ++ + +  + ++ L GT LL   +  
Sbjct: 1451 GISSETMLLLWSMKEKHNSNSKFNTYFNALPEAFNTGLSFEFDAIMVLAGTLLLEEIIEA 1510

Query: 191  KEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAF------SRLRGQNLV 243
            K+++  +Y ++   +   +  +FP    T + FLWA  +  S          +LR     
Sbjct: 1511 KKHLNAQYEELVPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLR---TC 1567

Query: 244  LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
            LIP+A  +NHS       Y + +    + S+          P   GEQ  + Y  N S++
Sbjct: 1568 LIPIAGFLNHSL------YPHIMHYGKVDSKTNSLKFCVSKPCNMGEQCYLSYG-NFSSS 1620

Query: 304  ELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI--------- 354
             L   YGFI  + D    T+ LEI       GD  +  E + + +SA   +         
Sbjct: 1621 HLVTFYGFIP-QGDNLYDTIPLEIDNPQ---GDCPE--EXHPMSDSATHMVRGTWLSNNH 1674

Query: 355  -VLGRTLPPAMLQYLR 369
             +    LPP +L +LR
Sbjct: 1675 EIFHYGLPPPLLDHLR 1690


>gi|198432683|ref|XP_002129380.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 239

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 95/209 (45%), Gaps = 17/209 (8%)

Query: 61  TAQVETFWQWLRDQK-VVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           +A ++ F QWL ++K  +S K  I         GL A  DI+  E++  VP    +    
Sbjct: 25  SACIDAFLQWLAEEKFALSKKIKISERNSCHRFGLSASEDISNGELLFSVPRDSLLWEKN 84

Query: 120 VAASEIGSLCSGLKP---WISVALFLIREKKKEDSPWRVYLDILP--ECTDSTVFWSEEE 174
            + S+  + C  L     W+ + L L+ E  K  S W+ Y+D LP   C D  + W + +
Sbjct: 85  CSISKRLNACDNLNGGSGWVKIILCLMYEHTKPSSRWKPYMDFLPPVSCMDQPLHW-DPD 143

Query: 175 LVE--LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQL--FPRPITLDDFLWAFGILR 230
           + E  L+G+ L    +     ++NEY +     I  N  +    +  TL+ +     ++ 
Sbjct: 144 IREKMLEGSGLNEMVVNDVAMMENEYKEKALPFIKSNSDIGFSEKTHTLELYKHMASVVM 203

Query: 231 SRAFSRLRGQN------LVLIPLADLINH 253
           + +F+     +       +++P+AD++NH
Sbjct: 204 AYSFTEPTVDDDDERTPPIMVPVADVLNH 232


>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
          Length = 447

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 102/450 (22%), Positives = 176/450 (39%), Gaps = 100/450 (22%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLC------SGLKPWISVALFLI 143
           G GLVA  DI     +  +P    ++  T +  E +G           +  W  + + + 
Sbjct: 33  GRGLVAVADIKAQTSLFTIPRDIVLSTRTSSFKEKVGQDVYKQLENDNIGSWTPLIMAMC 92

Query: 144 RE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
            E  +   S W  Y  ILP+   S +FWS+EEL  L+GT ++   +G+ E ++NE+ +V 
Sbjct: 93  WEYNQGGSSKWDAYFKILPKQFTSLMFWSKEELSLLKGTTVVD-KIGL-EDIENEFERV- 149

Query: 203 EEIILPNKQLFPR--PITLDDFLWAFGILRSRAFS------------------------- 235
            +I+  N+ +F      TLD F     ++ SR+F+                         
Sbjct: 150 RDIVKQNENVFGDIANYTLDLFKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDEDEEID 209

Query: 236 -RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
            R    ++ ++P+AD++N           YE     + S            +KAG+Q+  
Sbjct: 210 LRTSVDDVAMVPMADILNSRTDSVNAHTEYEENCLRMISLQ---------DIKAGDQIFN 260

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
            Y+ +  NA+L   YG ++             +S+   F G+K D+ E          DI
Sbjct: 261 TYN-DPPNADLIRRYGHVDYSP----------LSQDPDFMGNKNDVVELPA-------DI 302

Query: 355 VLGRTLPPAMLQYLR-----LVALGGTDAFLL--------------------ESIFRNTI 389
           +L   LP A   +       L+   G D+F L                    E+ F+ T 
Sbjct: 303 LLELALPDAKESHKERRVEFLLDECGEDSFELTHDDLVPELLKICVLLFTESEAEFK-TR 361

Query: 390 WGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRI-AVGVRA 448
                LP +    +     +  A K  +  + +T+E+D   L   +  P     A+ V  
Sbjct: 362 EKSRKLPKASGFTKGKAEFLIKAIKQRMEQYGSTLEDDISKLDNKDSLPENNFKALVVTV 421

Query: 449 GEKRVLQQIDEIFKERELELDELEYYQERR 478
           GE+R+L +  E       EL   +Y Q+++
Sbjct: 422 GERRILNKAIE-------ELQTTDYAQKKQ 444


>gi|50557274|ref|XP_506045.1| YALI0F30327p [Yarrowia lipolytica]
 gi|49651915|emb|CAG78858.1| YALI0F30327p [Yarrowia lipolytica CLIB122]
          Length = 430

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 127/293 (43%), Gaps = 43/293 (14%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA-SEIGS 127
            W+++ KV    + I  +   +G GL   +D     +VL VP    IN + V+A +E   
Sbjct: 2   NWIKEGKVTLTNTEIFESP-GKGKGLRFTKDNVSG-IVLRVPNDLLINAEHVSAWTEENK 59

Query: 128 LCSG-----LKP---WIS-----VALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSE 172
             +G     +K    W+      + LF+     +  S    Y+  LP+ T  D    W E
Sbjct: 60  YLTGEHLEFMKASLEWLKSEKDVIVLFMALLASQSSSGMSDYVSSLPKSTEMDQPWSWPE 119

Query: 173 EELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS 231
            E+ + L+GT LL   +  K +VQ +YL +        K+   + ++   F  +   + S
Sbjct: 120 TEIFDSLKGTSLLMACVHKKMHVQAKYLAI------VGKKGGEKLVSEQQFYLSEQWVVS 173

Query: 232 RAFSRLRGQN-----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPV 286
           R+             L ++P+ D +NHSP        +E+      S +++  +   V +
Sbjct: 174 RSLEIPESPGSETLALTMVPVLDYVNHSPKANCR---FEVD-----SGEVVLIVNEDVLI 225

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIE-----SKSDRNAYTLTLEISESDPFF 334
           KAG++V I Y  +KS AE    YGFI+     +KS      L L + E DP +
Sbjct: 226 KAGDEVFINYGPDKSAAEFLFCYGFIDAAHGVTKSITLETPLMLSMDEIDPEY 278


>gi|26344391|dbj|BAC35846.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 78/297 (26%), Positives = 129/297 (43%), Gaps = 44/297 (14%)

Query: 42  RAFAATCSLHSASATTNPPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDI 100
           RA  A   L + S+   P    V  F +W R   + +SPK  +       G G+VA+  +
Sbjct: 24  RAARAPLPLPAGSSGGEPEGDAVAGFLRWCRRVGLELSPKVTVSRQGTVAGYGMVARESV 83

Query: 101 AKNEVVLEVPMKFWINPDTVAAS-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRV 155
               ++  VP    ++P T + S     E G+L S L  W+ + L L+ E +   SPW  
Sbjct: 84  RAGTLLFAVPRSALLSPHTCSISGLLERERGALQS-LSGWVPLLLALLHELQAPASPWSP 142

Query: 156 YLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP- 208
           Y  + PE    +  +FW EEE +  L+GT       GV E V+ + + +  E   I+LP 
Sbjct: 143 YFALWPELGRLEHPMFWPEEERLRLLKGT-------GVPEAVEKDLVNIRSEYYSIVLPF 195

Query: 209 ---NKQLF-PRPITLDDFLWAFGILRSRAFS--------RLRGQNLVLIPLADLINHSPG 256
              +  LF P   +L+ +     ++ + +F              + +++P AD++NH   
Sbjct: 196 MEAHSDLFSPSVRSLELYQQLVALVMAYSFQEPLEEDDDEKEPNSPLMVPAADILNH--- 252

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
           I   +   E      +S D L  + T  P+  G ++   Y    +N +L   YGF E
Sbjct: 253 IANHNANLE------YSADYLRMVAT-QPILEGHEIFNTYG-QMANWQLIHMYGFAE 301


>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
          Length = 318

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 110/237 (46%), Gaps = 43/237 (18%)

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
           L ++P+AD++N       E  A+   G    S   + +LR   P+KAGE++L  Y  +  
Sbjct: 71  LGMVPMADILN----ADAEFNAHVNHGEDDLS---VVALR---PIKAGEEILNYYGPH-P 119

Query: 302 NAELALDYGFIESKSDR------------NAYTLTLEISES---------DPF-FGDKLD 339
           N+EL   YG++  K  R            +  T  L +++          DP  F D   
Sbjct: 120 NSELLRRYGYVTPKHSRYDVVEIPWDLVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFV 179

Query: 340 IAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSH 399
           +   +G  +S       GR   PA +Q +    L      +L++I +  + G L +P   
Sbjct: 180 LERDSGEPDSE------GRLTTPAKVQEVS-AELEEQLKAVLKAIKK--VRGDL-IPDKR 229

Query: 400 ANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ 456
             +E+   VV  A +  L+ + TT EEDE LL  GNL  R R+AV VR GEKR+L++
Sbjct: 230 KRDEVYQHVVAAALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKE 286


>gi|154294851|ref|XP_001547864.1| hypothetical protein BC1G_13548 [Botryotinia fuckeliana B05.10]
          Length = 372

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/320 (23%), Positives = 135/320 (42%), Gaps = 46/320 (14%)

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LP 208
           ++PW  Y+  LPE       W+E+E   L GT L ++ +     +  ++  + E+ I +P
Sbjct: 52  NNPWSEYIRFLPEKIPVPTMWTEDEKTLLNGTSLENSLVNKMVALSRDFENLREKTIEIP 111

Query: 209 ------NKQLFP-RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTED 261
                  K+  P +P+ L D++      RSR      G+  V++P+ D++NHS    T +
Sbjct: 112 WCHSCWWKEDGPLKPLLLSDWIRVDAWYRSRCLEIETGEE-VMVPVLDMVNHS---FTPN 167

Query: 262 YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
             +E    G    + +  L   + +  G ++ I Y + KS+AE   +YGFI+++      
Sbjct: 168 AHWEHTSNG----NAILVLVPDILLDEGTEITISYGV-KSDAENLFNYGFIDAE-----V 217

Query: 322 TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQ--YLRLVALGGTDAF 379
            LT  I E +P   D L +A+    G+     I  G +         ++ L  L   D  
Sbjct: 218 PLTSLILEVEPIATDPLRVAKVAAFGKRPSVQI-FGHSNGETSWDCPFIYLACLNEEDGL 276

Query: 380 LLESIFR-------NTIWGHLDLPVS----------HANEELI----CRVVRDACKSALS 418
             +++ +          W + D+  S          H  E+++      ++RD  +  L 
Sbjct: 277 EFKTVQKVDGSQSLKVFWQNADVTESTDQFERLISGHEREDILRFRALNLIRDRIELQLE 336

Query: 419 GFHTTIEEDEQLLQGGNLDP 438
               + E  E L+ G  +DP
Sbjct: 337 RLQASEEIVETLVNGEMVDP 356


>gi|325530255|sp|E1BI64.1|SETD6_BOVIN RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
          Length = 450

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/303 (24%), Positives = 124/303 (40%), Gaps = 48/303 (15%)

Query: 52  SASATTNPPTAQVETFWQWLRDQKV---VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLE 108
           +  A ++   A V +F  W   Q+V   +SPK  +       G G+VA+  +   E++  
Sbjct: 11  AGPAGSDDDPAPVASFLSWC--QRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFA 68

Query: 109 VPMKFWINPDTVAAS-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPEC 163
           VP    ++  T + S     E G+L S    W+ + L L+ E +   SPW  Y  + PE 
Sbjct: 69  VPRAALLSQHTCSISGVLERERGALQSQ-SGWVPLLLALLHEMQAPASPWSPYFALWPEL 127

Query: 164 T--DSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF 213
                 +FW EEE    LQGT       GV E V+ + + +  E   I+LP    +  LF
Sbjct: 128 GRLQHPMFWPEEERRRLLQGT-------GVPEAVEKDLVNIRSEYYSIVLPFMDAHPDLF 180

Query: 214 -PRPITLDDFLWAFGILRSRAFSRLRG--------QNLVLIPLADLINHSPGITTEDYAY 264
            PR  +L+ +     ++ + +F              + +++P AD++NH           
Sbjct: 181 SPRVRSLELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLAN-------- 232

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
               A L        +    P+  G ++   Y    +N +L   YGF E   D    T  
Sbjct: 233 --HNANLEYSPTCLRMVAIQPIPKGHEIFNTYG-QMANWQLIHMYGFAEPYPDNTNDTAD 289

Query: 325 LEI 327
           +++
Sbjct: 290 IQM 292


>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 107/441 (24%), Positives = 166/441 (37%), Gaps = 74/441 (16%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE 124
           TF  W  D +       +R    P  G  L A  DIA  E ++ +P K  +  D+   SE
Sbjct: 56  TFATWF-DHRGGKIHERVRMTYRPNFGWSLEADGDIADGERLVSLPPKLMLRCDSDDVSE 114

Query: 125 IGSLCSGLKP---WIS-VALFLIREK-KKEDSPWRVYLDILPECTD-STVFWSEEELVEL 178
                    P   W S V L L+RE+     S +  Y+ +LP   + S  F+  + +  L
Sbjct: 115 PLKNVVDRVPNEFWSSKVGLVLLRERVAGAHSAFAPYITLLPAVHEGSPTFFPPDAVRAL 174

Query: 179 Q----------GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGI 228
           +            + L T  G    V +    V+E    P +Q     I  +   WA   
Sbjct: 175 EYAPIVQQINKRARFLGTFAGNALTVDDGESYVDE--AHPGRQRVEMTIDANALGWATAC 232

Query: 229 LRSRAFSRLRGQNLVLIPLADLINHS--PGITTE--DYAYEIKGAGLFSRDLLFSLRTPV 284
             SRAF         ++P+ D+ NHS  P ++    +      G  L +R  L S     
Sbjct: 233 ASSRAFKVGANSAPAMLPVIDICNHSFNPSVSVRAIEEGDNAGGVELIARRALTS----- 287

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETN 344
               GE + + Y  N SN EL LDYGFI   +  +   L  ++          +++A   
Sbjct: 288 ----GEPIELSYG-NLSNDELLLDYGFIVKDNPFDCVKLRWDLK--------LIELAREI 334

Query: 345 GLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIF------------------- 385
           G   +A  D V    + P     L  + L G D  +  S+F                   
Sbjct: 335 GGLAAAPIDTVA--KVAPWQATALERIGLVGDDPNVELSVFGAGQVMDKKALAGLRVLYS 392

Query: 386 ---------RNTIWGHLDLPVSHANEEL-ICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
                     +  +G +D  V   + E+   R        AL  F TT+E+DE+L     
Sbjct: 393 KSSAEASRAADAPFGEIDADVVSKDTEIKALRTCMSLLALALGNFSTTLEKDEELHDAAT 452

Query: 436 LDPRLRIAVGVRAGEKRVLQQ 456
             P++R+A+  R  +K+VL +
Sbjct: 453 -SPQVRLAIAFRMEKKKVLAK 472


>gi|326512906|dbj|BAK03360.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 149/353 (42%), Gaps = 36/353 (10%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           + + L L++E+ K DS W  Y+  LPE     +F+  E++  LQ   LL        ++ 
Sbjct: 134 MRLGLRLLQERTKFDSFWWPYIANLPETFTVPIFFPGEDIKNLQYAPLLHQINKRCRFLL 193

Query: 196 NEYLKVEEEI-ILP--NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLIN 252
               +V++++  +P  +     + +      WA     SRAF RL G+  +L+PL D+ N
Sbjct: 194 EFEKEVKQKLGTVPSGDHPFCGQDVHSSSLGWAMSAASSRAF-RLHGEIPMLLPLVDMCN 252

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           HS        A  ++   + S D+   +     +     V + Y    ++  L LDYGF+
Sbjct: 253 HS----FSPNARIVQDGDVESPDMSVKVIAETQIDQNAAVTLNYGCYPNDFYL-LDYGFV 307

Query: 313 ESKSDRNAYTL----------TLEISESDPFFGDKLD-----IAETNGLGESAYFDIVLG 357
            + +  +   L          ++    S+P F          +++ N  GE A   + LG
Sbjct: 308 VTSNPYDQVELSYDGNLLDAASMAAGVSNPNFSTPAKWQQEILSQLNLHGEGAILKVSLG 367

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL----PVSHANEELICRVVRDAC 413
               P ++    L AL    A   E++ ++ +   + L    P+    E    R V   C
Sbjct: 368 G---PDIVDGRLLAALRVLLAADPEAVGKHDLKTLMSLSAKAPLGPTVEASAFRTVLALC 424

Query: 414 KSALSGFHTTIEEDEQLLQGGNLDPRL--RIAVGVRAGEKRVLQQIDEIFKER 464
             AL  FHT I +D+ +L+G   +P L   +AV  R  +K +L  I +    R
Sbjct: 425 AIALQHFHTKIMDDQAILKG---EPPLTTELAVQFRLQKKLMLVDIMQNLSRR 474


>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 357

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/221 (29%), Positives = 93/221 (42%), Gaps = 42/221 (19%)

Query: 131 GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTV-----FWSEEELVELQGTQLL 184
            + P +    FL++E  K +DS W  Y+  LP      V     FW + ++  L+GT   
Sbjct: 108 SVPPHVLGRFFLVKEYLKGKDSFWWPYIATLPPPEQVAVWALPPFWPDHDIAYLEGTNAH 167

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFL--WAFGILRSRAF-------- 234
                ++E V+ E+ +  +   L  ++ FP        L  WAF I  SR+F        
Sbjct: 168 VAIQEIQENVKREFKQARK---LLKEEDFPDLPAYTQLLYKWAFCIFTSRSFRPSLVLSD 224

Query: 235 ------SRLRGQNL------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRT 282
                 S L  Q +      VL PL D+ NHSP   T  Y ++        R +      
Sbjct: 225 ATKRRLSALLPQGVQLDDFSVLQPLLDIANHSP---TARYTWDTTSVPDTCRLICHD--- 278

Query: 283 PVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAY 321
             P + G QV   Y L K+N+EL L YGFI  E+ S  N Y
Sbjct: 279 --PYQPGTQVYNNYGL-KTNSELLLAYGFILPETPSLHNDY 316


>gi|449016030|dbj|BAM79432.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Cyanidioschyzon merolae
           strain 10D]
          Length = 458

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 117/267 (43%), Gaps = 36/267 (13%)

Query: 67  FWQWLRDQ---KVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           F+ +LR+    KV   + P+  A+  +  GLVA   I   EV+  +P +  I        
Sbjct: 136 FYDYLREAHFPKVALAEVPVDGASSLKMRGLVATAAIRAGEVICRIPRRLAIC------- 188

Query: 124 EIGSLCSGLKPWISVALFLIREKKKEDS--PWRVYLDILPE---CTDSTVFWSEEELVEL 178
            +GS   G  P +  AL L+R     ++   ++ Y D+LP    C  +T F+++EEL ++
Sbjct: 189 -LGS--EGENPGLP-ALHLLRMMTDGEAVHKYKAYFDVLPRPEMCQMTTDFYNDEELGQI 244

Query: 179 QGTQLLSTTLGVKEYVQN----EYLKVEEEIILPN--KQLFPRPITLDDFLWAFGILRSR 232
             T  +  T   ++ +++    E+L+   + + P    Q          +LWA  ++ SR
Sbjct: 245 AHTPTVEETRRRRQQLRDTFLQEFLRTGADYLHPQVAAQNLDHMPEFQRYLWAVHLVVSR 304

Query: 233 AFSRLRGQ--NLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           A +   G      LIPL D+IN        D  +E++          F L     V+  E
Sbjct: 305 ALAVRTGDEAQRYLIPLLDMIN-----CRMDSKHELRYRIATDE---FVLIAGESVRRSE 356

Query: 291 QVLIQYDLN-KSNAELALDYGFIESKS 316
           ++ I Y     SN  L  DYGFI  ++
Sbjct: 357 EIRIPYGGGFVSNDRLIQDYGFIVERN 383


>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
           carolinensis]
          Length = 311

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 92/201 (45%), Gaps = 22/201 (10%)

Query: 287 KAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           KAGEQ+ I Y   +SNAE  +  GF    +  +   + L +S+SD  +  K ++    G+
Sbjct: 21  KAGEQIYIFYG-TRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGI 79

Query: 347 GESAYFDIVLGRTLPP---AMLQYLRLVALGGTDAFLLE------SIFRNTIWGHLDLPV 397
             S+ F   L  T PP    +L +LR+  +  T+  L E      +I R    G+ + PV
Sbjct: 80  PTSSVF--ALHATEPPISAQLLAFLRVFCM--TEDELKEHLIGEHAIDRIFTLGNSEFPV 135

Query: 398 SHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQ- 456
           S  NE  +   +       L  + TT+ ED+  L   +L     +A+ +R GEK +L++ 
Sbjct: 136 SWDNEVKLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIKLRLGEKEILEKA 195

Query: 457 IDEIFKERELELDELEYYQER 477
           I      R       EYY +R
Sbjct: 196 IKSAAANR-------EYYSKR 209


>gi|426197159|gb|EKV47086.1| hypothetical protein AGABI2DRAFT_203917 [Agaricus bisporus var.
           bisporus H97]
          Length = 491

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 113/477 (23%), Positives = 177/477 (37%), Gaps = 100/477 (20%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           TF QW + +        +    FP    G G VA RDI     +  +P    I+  T + 
Sbjct: 3   TFIQWFQRKGAYFDAESVGIIGFPPSEGGRGAVALRDIPNGHTLFSIPRTLTISTRTCSL 62

Query: 123 SEIGSLCSGLKP-----WISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELV 176
                L +  K      W+ + L L+ E      S W  YLDILP   D+ +FW+E +L+
Sbjct: 63  PTKFGLEAWRKAQLHQGWVGLILCLMWETAAGSSSKWAGYLDILPNRFDTPMFWTEYDLL 122

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-----LFPRPITLDDFLWAFGILRS 231
           EL+GT ++   LG K   Q +Y     E +LP  +       P  ++    L  + I+ S
Sbjct: 123 ELKGTSVVE-KLG-KVDAQADY----NEKLLPAVKSRPDLFLPEHLSTHYTLERYHIMGS 176

Query: 232 RAFSRL-------------RGQNLVLIPLADL----INHSPGITTE-------------- 260
           R  SR                 N  L    D+       +PG+  E              
Sbjct: 177 RILSRSFIVEKWNEDEDENEAANTSLGSAMDVEPSRNEETPGVPMETEDNSDDNDDDDDD 236

Query: 261 ----------------DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
                           +  Y+ + A LF       + T  P+++GEQ+   Y  +  NAE
Sbjct: 237 DEEASDVAMVPMADILNARYQTENAKLFHEKDELKMVTTKPIRSGEQIWNTYG-DLPNAE 295

Query: 305 LALDYGFIE----------SKSDRNAYTLTLEISE--------SDPFFGDKLDIAETNGL 346
           L   YG ++          +  D       L IS          D    +++D     G 
Sbjct: 296 LLRRYGHVDFLSLPSEGHGNPGDVVEIKADLIISAVSSIPEAVKDDEAKERIDWWLEEG- 354

Query: 347 GESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
           GE  +  +     LPP M+ +++L+ L   D        +         P     E ++ 
Sbjct: 355 GEDIFI-LDYEYDLPPVMISFVKLLLLPQADWEKAREKSK---------PPKSKLEGILY 404

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDP-RLRIAVGVRAGEKRVLQQIDEIFK 462
            ++    +  L+ + TTIE D+ LL   N  P   + A+ VR GEK +L  I +  K
Sbjct: 405 DILISTLEKRLAEYPTTIETDKALLT--NDTPLNKKNAIIVRLGEKEILHGILQKLK 459


>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 476

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 102/466 (21%), Positives = 196/466 (42%), Gaps = 96/466 (20%)

Query: 65  ETFWQWLRDQKV-VSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWIN-PDTVA 121
           + F  WLR     +SPK  +      + G G+VA +DIA++E++  +P    ++  +++ 
Sbjct: 9   QAFLAWLRQSGAEISPKIKLEDLRNKDAGRGVVASQDIAEHELLFRIPRASILSVENSIL 68

Query: 122 ASEI-GSLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           ++EI  +  S L PW+S+ L ++ E      S W  Y  +LP   ++ +FW+E+EL ELQ
Sbjct: 69  STEIPAATLSLLGPWLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDELAELQ 128

Query: 180 GTQLLSTTLGVKEYVQNEYL-----------------------KVEEEIILPNKQLFPRP 216
            + ++      KE     +L                       K +E   L N +L  + 
Sbjct: 129 ASAVVGKV--GKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKM 186

Query: 217 ITLDDFLWAFGILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDYAYE 265
            +L    +AF +  +     +  +              ++PLAD++N             
Sbjct: 187 GSL-IMAYAFDVEPATPTKEVDEEGFAEEEEDAALPKGMVPLADMLNADADRCN------ 239

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD-LNKSNAELALDYGFIESKSDRNAYTLT 324
              A LF       ++   P++AGE++   Y  L +S+  L   YG++   +D  A    
Sbjct: 240 ---ARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSD--LLRRYGYV---TDNYAQYDV 291

Query: 325 LEI---------------SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLR 369
           +EI                E+   + D+ ++ +T G   ++     L  +L P ++  + 
Sbjct: 292 VEIPTDLVSEILVHEGVWQENRLEYLDEQEVLDT-GYDIASSTPFTLQESLSPELVVLVE 350

Query: 370 LVALGGTDAFLLESIFRNTIWGHLDLP--VSHANEELICRVVRDACKSALSGFHTTIEED 427
            + L   D   L+S       G L  P  ++     ++ +++    ++ ++ + TT+E+D
Sbjct: 351 SMLLSNEDFERLKS------KGKLPKPEKITGQGAGMLHKIL----QARIAQYTTTLEQD 400

Query: 428 EQLLQGGNLDP---------RLRIAVGVRAGEKRVLQQIDEIFKER 464
            Q+ +   + P         R  +A  VR GEK++L Q +E   E+
Sbjct: 401 LQMHE--EVPPVGTTTIEQRRCAMAKAVRIGEKKLLIQAEEALAEK 444


>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
 gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
          Length = 491

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 91/190 (47%), Gaps = 4/190 (2%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +W+R   VV   +     + P G+ + A   +   ++V  +P    + P T  A+   
Sbjct: 13  FKRWMRAHGVVCSDALSLDVSDPLGVHVRAVTPLRDGDLVATIPRGACLTPRTTGAAAAI 72

Query: 127 SLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFW-SEEELVELQGTQLL 184
              + L   +++A+ ++ E+ +  DSPW  YL +LP+     + W ++E    L GT+L 
Sbjct: 73  EA-AELGGCLALAVAVMYERARGTDSPWDAYLQLLPDRESVPLVWPADEAECLLAGTELD 131

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
                 +E++  ++ +  E ++L  +  + P   +L+ +  A  ++ SR+F         
Sbjct: 132 KIVKQDREFLCEDWKECIEPLLLSGELDVDPDDFSLEKYFSAKTLVSSRSFQIDSYHGFG 191

Query: 244 LIPLADLINH 253
           ++PLADL NH
Sbjct: 192 MVPLADLFNH 201


>gi|429850390|gb|ELA25672.1| set domain-containing protein, partial [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 443

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/310 (24%), Positives = 129/310 (41%), Gaps = 36/310 (11%)

Query: 55  ATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFW 114
           AT+  P   + T W    D  ++  ++   P     GLGL+  +++++ E   ++P    
Sbjct: 2   ATSKLPVDTLPT-WAMFNDVDLIDVEAREIPGC---GLGLLVNKELSREEETFDIPTLLR 57

Query: 115 INPDTVAASEIGSLCSGLKPWISVAL------FLIREKKKE------DSPWRVYLDILPE 162
           I  + V ++E     + +       L       ++ ++ K        +PW  Y+  LP 
Sbjct: 58  IPHELVLSAEAVESYAKVDKNFRQLLDACGHKMVLSDRPKTTMKGGVSTPWTEYVKFLPP 117

Query: 163 CTDSTVFWSEEELVELQGTQL--------LSTTLGVKEYVQNEYLKVEE--EIILPNKQL 212
               T  W+E+E   L GT L         S T      + +E+ ++ E  E +    +L
Sbjct: 118 QVPVTTLWTEQEREMLVGTSLEDHVLTSSQSATAAKIVTLTDEFDELRETSEALPFWNEL 177

Query: 213 F--PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAG 270
           F     ++L D+       RSR    L      ++P+ DL NHS     +  AY  + + 
Sbjct: 178 FWESDKVSLIDWARVDAWFRSRCL-ELPKSGEAMVPVLDLANHS----AQANAYYEENS- 231

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISES 330
               +++  LR    V +G+++ I Y   KS AE+   YGFI+  S  +  TL L   E 
Sbjct: 232 --KDEVVLLLRPGCRVLSGDEMTISYGDAKSGAEMLFSYGFIDPASAADRITLPLAPLED 289

Query: 331 DPFFGDKLDI 340
           DP    KL I
Sbjct: 290 DPLGKAKLHI 299


>gi|302679960|ref|XP_003029662.1| hypothetical protein SCHCODRAFT_59023 [Schizophyllum commune H4-8]
 gi|300103352|gb|EFI94759.1| hypothetical protein SCHCODRAFT_59023 [Schizophyllum commune H4-8]
          Length = 522

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 79/183 (43%), Gaps = 17/183 (9%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
            F QW         K  +    FPE   G G+VA +DI  +  +  +P    ++  T   
Sbjct: 5   AFEQWFVANGGFVDKEYVGLTEFPEEEGGRGMVALKDIPADHTLFSIPRSIVLSTRT--- 61

Query: 123 SEIGSLCSGL--------KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
           S + SL            K W  + L ++ E  + D  W+ YLD LP    + +FW+EEE
Sbjct: 62  SPLPSLFGADAWKERQLDKGWGGLILCMMWESAQPDRKWKGYLDSLPTEFSTPMFWTEEE 121

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDF-LWAFGILRSRA 233
           + EL GT ++   +G +E  Q EY       I     LFP  +    F L AF I  SR 
Sbjct: 122 IAELTGTAVVD-HIGKQEADQ-EYTNKVLPAIQSRPDLFPPSLLSTHFSLTAFHINGSRV 179

Query: 234 FSR 236
            SR
Sbjct: 180 LSR 182


>gi|261189299|ref|XP_002621061.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239591846|gb|EEQ74427.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
          Length = 470

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 91/198 (45%), Gaps = 15/198 (7%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL-LSTTLGVKEYVQNEYLKVE 202
           +E  K  + W  Y+  LP       +++ EE   L GT L L+  L +   ++ E+ ++ 
Sbjct: 100 KEHIKISNAWTEYIKFLPASYSLPTWYTIEERELLHGTSLELALDLKLAS-LEKEFEQLR 158

Query: 203 E---EIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           E   +I   N+  +      +  +D+     + RSRA     G+   ++P  D+ NH+ G
Sbjct: 159 EATMDIPWCNRDWWNEDTGRLAFEDWKLVDAMYRSRAL-EFPGKGHSMVPCVDMANHTSG 217

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK- 315
           I T    YE    G    + +F L     ++ G++V I Y   K  AE+   YGFIES  
Sbjct: 218 IQT-GALYETDANG----NAVFQLGWGQSLEVGDEVTITYGDEKGAAEMIFSYGFIESNL 272

Query: 316 SDRNAYTLTLEISESDPF 333
           ++ +   L L+I  SDP 
Sbjct: 273 TNAHQLFLDLDIPASDPL 290


>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
           distachyon]
          Length = 480

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 91/382 (23%), Positives = 146/382 (38%), Gaps = 59/382 (15%)

Query: 86  ATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS--EIGSLCSGLKPWIS---- 137
           A FP+  G G  A RD+ + E+VL VP    +  D V A   EI S  +   P +S    
Sbjct: 34  ADFPDAGGRGFAAARDLRRGELVLRVPRAALLTSDRVMADDPEIASCIAARHPRLSSVQR 93

Query: 138 -VALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
            +   L    K + S W +YL  LP        +++ E+  LQ    +         +++
Sbjct: 94  LIVCLLAEVGKGKSSSWYLYLSQLPSYYTVLATFNDFEIEALQVDDAIWIAQKSLSAIRS 153

Query: 197 EYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP- 255
           E+   +   ++   +  P+ +    +LWAF  + SR           L P+ DL N++  
Sbjct: 154 EW--EDATPLMQGLKFKPKLLIFKTWLWAFATVSSRTLHVAWDDAGCLCPVGDLFNYAAP 211

Query: 256 --GITTE--------------DYAYEIK--------GAGLFSRDLLFSLRTPVPVKAGEQ 291
              I++E              +   E+K          G +     + L        GEQ
Sbjct: 212 DDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDGGYEDSEAYCLYARKCYTKGEQ 271

Query: 292 VLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGES 349
           VL+ Y    +N EL   YGF+  E+ +++    L L++     +  D L I  +   G  
Sbjct: 272 VLLGYG-TYTNLELLEHYGFLLAENPNEKTYIQLDLDLYSVGTWPTDSLYIHPS---GNP 327

Query: 350 AYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVV 409
           ++           A+L  LRL              F + I+    L V   NE  I + +
Sbjct: 328 SF-----------ALLCLLRLWMTPANH----RKAFSHQIYSGSMLSVE--NELEIMKWL 370

Query: 410 RDACKSALSGFHTTIEEDEQLL 431
              C   L    TT+E DE LL
Sbjct: 371 GSKCVETLQKLPTTVESDESLL 392


>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
          Length = 527

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 73/262 (27%), Positives = 113/262 (43%), Gaps = 45/262 (17%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F  W   + V    S + P   P  G+G+VA  DI  N+ +L VP +     DTV    I
Sbjct: 7   FMPWASSKGVKC--SNVEPRIMPGRGIGIVAVYDIRANQTILSVPTRAVRTIDTV-PKHI 63

Query: 126 GSLCSGLKPWISVALFLIREKKKEDS-PWRVYLDILPECTD----STVFWSEE--ELVEL 178
                G    +SV   L  E   +DS  + ++  +LP   D      + W  E   L+  
Sbjct: 64  KDALHG----VSVHGILAAEIALDDSDDFAIWRTVLPTREDLEGGMPMMWPSELQALLPK 119

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF---- 234
           +   LL          QN   + E +I+L   + FP  +T D+++ ++ ++ +R F    
Sbjct: 120 RAKDLLDN--------QNTTFRRECDIVL---KAFPT-LTRDEYMLSWVLINTRTFYNSM 167

Query: 235 ----SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
               S      LV +P+ DL NH      ED   + +G  L    L +S++T    K GE
Sbjct: 168 PKMKSYAHSDRLVCMPVLDLFNH------ED---QSQGCKLVYSALGYSVQTDRAYKQGE 218

Query: 291 QVLIQYDLNKSNAELALDYGFI 312
           +V + Y  + SN  L  +YGFI
Sbjct: 219 EVFVSYGPH-SNDFLLTEYGFI 239


>gi|189237481|ref|XP_001810520.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270006984|gb|EFA03432.1| hypothetical protein TcasGA2_TC013422 [Tribolium castaneum]
          Length = 413

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 92/401 (22%), Positives = 164/401 (40%), Gaps = 55/401 (13%)

Query: 87  TFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC---SGLKPWISVALFL 142
            FP+ G G+   R++ +++V++ VP +  I+  T+  S    L    S L     +  FL
Sbjct: 45  NFPDTGRGVATPRNLKESDVLITVPYELMISYTTLQKSNFLHLFTPESRLSIVDLLTAFL 104

Query: 143 IREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           + E+ KE+S WR Y+  LP           ++ VEL    L       +  ++  + ++ 
Sbjct: 105 VIERDKENSFWRDYIKSLPPQPPWIPALLSQDRVELLPADLRLAAKKSRRLLEESWSRLR 164

Query: 203 EEIILPNKQLFPRPITLDDFLWAFGILRSRA-FSRLR--------GQNLV-------LIP 246
           + I    ++     I L  F+W + ++ +RA +   R        G +++       L P
Sbjct: 165 KSI----RREASCVIDLHSFIWGYVLVNTRAVYVNPRIVRELCDCGSDILSDEPCMALCP 220

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
             D+ NHS    TE  A  +   G F    ++ L T V  +  EQV I Y  +  N +L 
Sbjct: 221 FLDMFNHSHEAKTE--ATLMNDQGKF----VYQLTTLVGTRKHEQVFISYG-DHDNVKLL 273

Query: 307 LDYGFIESKSDRNAYTLTLE--ISESDPFFGD-KLDIAETNGLGESAYFDIVLGRTLPPA 363
           ++YGF    +  ++  +  E      +P   D +     ++ L +  Y    L       
Sbjct: 274 IEYGFFIPGNSNDSIPIQSEEVFRVLEPNLNDFQYKFIRSHNLDKCLY----LTEAGASF 329

Query: 364 MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTT 423
            L+    VA G      L S+  N      D P     EE  C    D  +  L      
Sbjct: 330 NLKAFLFVAFGTIGTKNLTSVIYND-----DYP-----EEFTC----DLVQLCLKLLRFQ 375

Query: 424 IEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
           + + E +LQ  +LDP +   +G R   K +++++ +I  ++
Sbjct: 376 LRQVETILQNSDLDPHIVDFLGYR---KSLIEKLCDIVNKQ 413


>gi|307190528|gb|EFN74525.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 210

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 69/131 (52%), Gaps = 12/131 (9%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           +  F  WL +         I  A FP   LGL A+ D A+N+++LE+P     +  T AA
Sbjct: 81  IGQFMNWLTENGAHVDGVSI--AEFPGYDLGLKAETDFAENQLMLEIPRAIIFSTYT-AA 137

Query: 123 SEIGSLCSG----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVEL 178
            E+  L +       P +++A+ L+ E+ KE+S W  YLD+LP   ++ ++    +++EL
Sbjct: 138 PELTVLQNDPLVQHMPQVALAIALLIERHKENSKWECYLDMLPSGYNTVLYMKANDMIEL 197

Query: 179 QGTQLLSTTLG 189
           +G    S TLG
Sbjct: 198 KG----SPTLG 204


>gi|58262900|ref|XP_568860.1| phospholipid metabolism-related protein [Cryptococcus neoformans
           var. neoformans JEC21]
 gi|57223510|gb|AAW41553.1| phospholipid metabolism-related protein, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 533

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 77/310 (24%), Positives = 126/310 (40%), Gaps = 60/310 (19%)

Query: 59  PPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           PP ++     +WL +  V  SP   I+P +   G  +VA+RDI   E++L +P    ++ 
Sbjct: 10  PPPSRA-CLLEWLAENLVYTSPSLTIQPMSSSSGYRIVAERDIEIGELLLSLPKHSILSH 68

Query: 118 DTVAASEIGSLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDST--VFWSEEE 174
            T + S I  L S     ++++L L+ E +   +S +  YL  LP    +   +FW   E
Sbjct: 69  QTASLSAITHLSSASHTILNLSLCLLHEIRLSVNSKFYGYLQSLPRDMGAGLPLFWQTGE 128

Query: 175 LVELQ----GTQLLSTTLGVKEYVQNE------------YLKVEEEIILPNKQLFPRPIT 218
            VE++    G Q L +T   KE  ++E            YL+    +        P P  
Sbjct: 129 GVEVEDGERGLQWLKSTEAEKELRKSERQGLSLSDVYAFYLRTSHLLPPTPTDPLPSPFL 188

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP----GITTEDYAYEIKG------ 268
              F+ A+ ++ +RAF         L P ADL+NHS      + ++D+   I G      
Sbjct: 189 A--FVHAYTLISTRAFLIDLYHLTALCPFADLLNHSAVPHTCLASDDFVCYICGSLNVCE 246

Query: 269 --------------------------AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
                                     A L + +    +R   PV+ GE+V   Y  +  +
Sbjct: 247 HDLSRPDGLDENGTPRRLAHLSQMEIARLKNENDNIEMRLERPVRKGEEVFNTYG-DVGD 305

Query: 303 AELALDYGFI 312
             L  +YGFI
Sbjct: 306 GRLLAEYGFI 315


>gi|354495008|ref|XP_003509624.1| PREDICTED: N-lysine methyltransferase SETD6-like [Cricetulus
           griseus]
          Length = 492

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/290 (26%), Positives = 125/290 (43%), Gaps = 45/290 (15%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F +W     + +SPK  +       G G+VA+  +   E++  VP    ++P T + 
Sbjct: 60  VAGFLRWCAGVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRSALLSPHTCSI 119

Query: 123 S-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEEL 175
           S     E G+L S L  W+ + L L+ E +   SPW  Y  + PE    +  +FW EEE 
Sbjct: 120 SGLLERERGALQS-LSGWVPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEER 178

Query: 176 VE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDFLWAF 226
              LQGT       GV E V+ + + +  E   I+LP    +  LF P   +L+ +    
Sbjct: 179 RRLLQGT-------GVPEAVEKDLVNITSEYHSIVLPFMEAHSDLFSPTVRSLELYRQLV 231

Query: 227 GILRSRAFSR---------LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL 277
            ++ + +F               + +++P ADL+NH   I   +   E      +S D L
Sbjct: 232 ALVMAYSFQEPLEEEDDDEKEPNSPLMVPAADLLNH---IANHNANLE------YSADYL 282

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
             + T  P+  G ++   Y    +N +L   YGF E   D    T  +++
Sbjct: 283 RMVAT-QPIPKGHEIFNTYG-QMANWQLIHMYGFAEPYPDNTDDTADIQM 330


>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
          Length = 464

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 69/313 (22%), Positives = 126/313 (40%), Gaps = 60/313 (19%)

Query: 86  ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG-------LKPWIS- 137
           A   +G G++A+RDI   +  + VP    +  +    SE   L +        L P ++ 
Sbjct: 162 ANAEQGNGVIARRDIPAGQTFINVPEALMMTAEKARKSETFQLITSGALDSTELSPAMAK 221

Query: 138 -----VALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTL--- 188
                + +FLI E+++  +S W  Y+D+LP+     ++++E EL  L+ +  L       
Sbjct: 222 LDNFLLRMFLIVERRRGGNSYWSPYIDLLPQRFRLPLYFTEAELELLKPSPALQEAFVQL 281

Query: 189 --GVKEYVQ-NEYLKV----------------EEEIILPNKQLFPRPI-----TLDDFLW 224
              V++Y    +YL +                 ++I+   ++    P+     T D F W
Sbjct: 282 RNVVRQYAAWKQYLMMLELARAAELPSGSGDAHQKILDQRRRAQAMPVRYNELTYDLFCW 341

Query: 225 AFGILRSRAFSRLRGQ---------NLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           A   + +R    + G+         +L LIP  D+ NH+ G  +  Y  + +        
Sbjct: 342 ASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGGASSFYDTQTRS------- 394

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFG 335
                    P+  GE VL+ Y    S A    +  F+ +    + Y + L + + DP F 
Sbjct: 395 --LECVAVAPIAKGEPVLLHYGDRSSMAYFG-NSEFVPADHPTDQYLILLAVGKQDPLFK 451

Query: 336 DKLDIAETNGLGE 348
            K  I +  G+ E
Sbjct: 452 SKSTILQALGVKE 464


>gi|448092000|ref|XP_004197467.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
 gi|448096594|ref|XP_004198498.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
 gi|359378889|emb|CCE85148.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
 gi|359379920|emb|CCE84117.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
          Length = 595

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 143/371 (38%), Gaps = 73/371 (19%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++E F  W ++            +    GL   A  DI+ +++  EVP    + PD   +
Sbjct: 4   KLERFLTWAKNNDAFIDSRINFVSNTDLGLLAKANEDISASKI-FEVPSTICLGPDLARS 62

Query: 123 SEIGSLCSGL-----KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           +    L   L         ++ +  ++  K  +  +  Y+DILP+      +W+ +E   
Sbjct: 63  TLKNYLRDDLYYDSGDELTTLLISKLKFDKSCEHSFGPYIDILPDKLSLPFYWNHQERSL 122

Query: 178 LQGTQL----------------------------LSTTLGVK-------EYVQNEY--LK 200
           ++ T L                            LS   G+K       EY+  +Y   +
Sbjct: 123 VEDTDLKVILDRNFQKLVEEWHSLVESLIDKEKHLSFEAGLKADLNFYEEYITGKYDEYR 182

Query: 201 VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF---------SRLRGQNLVLIPLADLI 251
           + E +   NK++     +   ++W+  IL SR F         S+       LIPL D++
Sbjct: 183 LYEYL---NKKI-QSWTSFSAYVWSRSILMSRGFPYLLVAEDNSKPNLTKACLIPLFDIL 238

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH              G G    +++F L   V  K GEQ+   Y  NKSN EL L YGF
Sbjct: 239 NHKSNSPIRWTPVMESGTG----NVIFQLERGV--KKGEQLFNNYG-NKSNCELLLSYGF 291

Query: 312 IESKSDRNAYTLTLEISESD----------PFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
            E K+  ++ ++TL+I + +           FF  K  I + N L +   F I     LP
Sbjct: 292 AEEKNPHDSASITLKIDDGNIFKKALAHGVKFFDGKNKIKDINLLTKGVNFQISNECLLP 351

Query: 362 PAMLQYLRLVA 372
             ++ +   V 
Sbjct: 352 SNLIDFFAFVV 362


>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
 gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
          Length = 481

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 88/209 (42%), Gaps = 40/209 (19%)

Query: 132 LKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTV-----FWSEEELVELQGTQLLS 185
           L P +    FLI+E  K  DS W  Y+  LP+    +      FW EE++  L GT    
Sbjct: 99  LPPHVIGRFFLIKEYLKGRDSFWAPYIATLPQPEHVSAWALPAFWPEEDIAYLAGTNAHV 158

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFP--RPITLDDFLWAFGILRSRAF------SRL 237
               ++  V++E+ +  + +       FP  +  T   + WAF I  SR+F      S  
Sbjct: 159 AIAEIQANVKSEFKQARKAL---KAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEP 215

Query: 238 RGQNL--------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
             Q +              +L PL D+ NHS    T  YA+++       + +      P
Sbjct: 216 AKQQMAELLPPGCQLDDFSILQPLFDIANHS---MTARYAWDVASDPASCQLVCHDAYQP 272

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFI 312
                GEQV   Y L K+N+EL L YGFI
Sbjct: 273 -----GEQVYNNYGL-KTNSELLLAYGFI 295


>gi|134108228|ref|XP_777065.1| hypothetical protein CNBB2970 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50259750|gb|EAL22418.1| hypothetical protein CNBB2970 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 533

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 77/310 (24%), Positives = 126/310 (40%), Gaps = 60/310 (19%)

Query: 59  PPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           PP ++     +WL +  V  SP   I+P +   G  +VA+RDI   E++L +P    ++ 
Sbjct: 10  PPPSRA-CLLEWLAENLVYTSPSLTIQPMSSSSGYRIVAERDIEIGELLLSLPKHSILSH 68

Query: 118 DTVAASEIGSLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDST--VFWSEEE 174
            T + S I  L S     ++++L L+ E +   +S +  YL  LP    +   +FW   E
Sbjct: 69  QTASLSAITHLSSASHTILNLSLCLLHEIRLSVNSKFYGYLQSLPRDMGAGLPLFWQTGE 128

Query: 175 LVELQ----GTQLLSTTLGVKEYVQNE------------YLKVEEEIILPNKQLFPRPIT 218
            VE++    G Q L +T   KE  ++E            YL+    +        P P  
Sbjct: 129 GVEVEDGERGLQWLKSTEAEKELRKSERQGLSLSDVYAFYLRTSHLLPPTPTDPLPSPFL 188

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP----GITTEDYAYEIKGA-GLFS 273
              F+ A+ ++ +RAF         L P ADL+NHS      + ++D+   I G+  +  
Sbjct: 189 A--FVHAYTLISTRAFLIDLYHLTALCPFADLLNHSAVPHTCLASDDFVCYICGSLNVCE 246

Query: 274 RDLL-------------------------------FSLRTPVPVKAGEQVLIQYDLNKSN 302
            DL                                  +R   PV+ GE+V   Y  +  +
Sbjct: 247 HDLSRPDGLDENGTPRRLAHLSQMEITRLKNENDNIEMRLERPVRKGEEVFNTYG-DVGD 305

Query: 303 AELALDYGFI 312
             L  +YGFI
Sbjct: 306 GRLLAEYGFI 315


>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 486

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 78/160 (48%), Gaps = 13/160 (8%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           Q + F  WL ++  V+    I  A       G G+VAQ DI +NE +  +P    +   +
Sbjct: 16  QSDDFISWLSEKPGVNINPNIAVADLRSQGAGRGVVAQTDIPENEELFTIPRDLVL---S 72

Query: 120 VAASEIGSLCSG----LKPWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEE 174
              S++  L S     L PW+S+ L ++ E    D S W  Y  +LP   D+ +FW+  E
Sbjct: 73  TQNSKLKDLLSQDLEELGPWLSLMLVMMYEYLLGDQSTWAAYFKVLPRKFDTLMFWTPSE 132

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP 214
           L+ELQG+ ++   +G ++      L++   I+  +  LFP
Sbjct: 133 LLELQGSAVID-KIG-RQGADESILEMIAPIVRAHPSLFP 170


>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
           FGSC 2508]
 gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 503

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 73/269 (27%), Positives = 112/269 (41%), Gaps = 53/269 (19%)

Query: 83  IRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALF 141
           + P   P  G+G+VA + + ++E +L VP       +TV      S+   L   IS+   
Sbjct: 50  VEPRRIPGRGMGIVATKPLEEDETILTVPSTTLRTYETVHP----SISRALPKDISIHGL 105

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWS-----------------------------E 172
           L  +   + SP   Y      C      W                              E
Sbjct: 106 LAADLAFDPSPSVKYRQWNTVCPSREDIWDYLPLTWDHQLREFLPRKALELLLKQEAKFE 165

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSR 232
           ++   +Q ++LLS+  G KE   N+ L+ EEE    NK      IT DD+L+A+ ++ +R
Sbjct: 166 KDWSMVQQSKLLSSGPGQKE---NKSLE-EEECGTLNKG-----ITRDDYLYAWLLVNTR 216

Query: 233 AFSRL--------RGQNLVLIPLADLINH-SPGITTEDYAYEIKGAGLFSRDLLFSLRTP 283
            F           +   +VL P+ADL NH S   +TE+     K   +      F++ T 
Sbjct: 217 TFYHETPKTKKFNKDDRMVLQPVADLFNHTSYDPSTEEKEGNKKTCSVAFSPTAFTITTT 276

Query: 284 VPVKAGEQVLIQYDLNKSNAELALDYGFI 312
            P  AGE+V I Y  N SN  L ++YGF+
Sbjct: 277 RPYAAGEEVYICYG-NHSNDFLLIEYGFL 304


>gi|406602781|emb|CCH45655.1| Ribosomal N-lysine methyltransferase 1 [Wickerhamomyces ciferrii]
          Length = 586

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 84/319 (26%), Positives = 133/319 (41%), Gaps = 55/319 (17%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIA--KNEVVLEVPMKFWINPDT 119
           +Q++T  +W +DQ  +   S     T   G+ +   +DI     +  +++P    I+ + 
Sbjct: 5   SQIQTLLKWGQDQGAIIDPSIQFKYTKDHGISVFVTQDIKTPSEKPQIKIPSDMIIH-NK 63

Query: 120 VAASEIGSLCSGLKPWISVALFLIREKKKEDS---PWRV------YLDILP--ECTDSTV 168
           +A    G         ++ +L L+  K K D+    W +      ++D+LP  + T S  
Sbjct: 64  LALDTFGESFISKSDNVNSSLKLLVSKLKFDNTNDSWELQQKFKPFIDLLPLGKETGSVF 123

Query: 169 FWSEEELVELQGTQLL-STTLGVKEYVQNEYLKVEEEIILPNKQ----LFPRPITLDD-- 221
           +WS EEL  L  T L  S     K  ++  Y  VE     P  Q    LF    +LD   
Sbjct: 124 YWSSEELRTLGKTNLAGSLEAKTKSLLKEWYSTVENLEHTPELQQDLKLFHDFDSLDHDT 183

Query: 222 ----------------FLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGITT 259
                           +LW+  I  SRAF      S  +    +L+P+ DL+NH      
Sbjct: 184 LVSKLLDTKSWTSFGAYLWSCIIFTSRAFPNEIINSNCKPGQAILLPIIDLLNHDNSTNV 243

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSD 317
           E +++E  G   F     F+L    P   G+++   Y   KSN EL L YGF   ++K D
Sbjct: 244 E-WSFEETGDEGF-----FTLLNKDPHTKGDEIFNNYGA-KSNEELLLGYGFTLEDNKHD 296

Query: 318 RNAYTLTL---EISESDPF 333
             A  + L   EI E++ F
Sbjct: 297 TIALRIKLPLGEILEAENF 315


>gi|356556146|ref|XP_003546388.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 483

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 102/427 (23%), Positives = 175/427 (40%), Gaps = 78/427 (18%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +D+    +  V++E+P++  +           PD +     I  + +   P
Sbjct: 98  PDGFGVYASKDVEPLRRARVIMEIPLELMLTISKKLPWMFFPDIIPLDHPIFDIINSTNP 157

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     E + W++Y D LP   ECT   +  SEEEL+ELQ   L ST 
Sbjct: 158 ETDWDLRLACLLLYAFDCEGNFWQLYGDFLPSADECTSLLLA-SEEELLELQDPDLASTI 216

Query: 188 LGVK----EYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR------L 237
              +    E+ +N +     ++ L  K+L   P     F WA GI +SR  +       L
Sbjct: 217 RKQQQRSLEFWENNW---HSDVPLKIKRLARDP---QRFAWAVGIAQSRCINMQTRIGAL 270

Query: 238 RGQNLVLIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQ 295
             +  +LIP AD++NHS  P      + +  K      R L   +     ++ G+++ + 
Sbjct: 271 NQEANMLIPYADMLNHSFEPNCF---FHWRFK-----DRMLEVLINAGQQIRKGDEMTVN 322

Query: 296 YDLNKSNAELALDYGFIESKS--DRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFD 353
           Y   + N      YGF    +  D+  ++    I   D F    L I   +GL E  Y +
Sbjct: 323 YMSAQKNDMFMQRYGFSSPVNPWDKIKFSGNARI-HLDSF----LSIFNISGLPEEYYHN 377

Query: 354 IVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDAC 413
             L +                  D  ++ +      W   D+P   + E    + ++D C
Sbjct: 378 NCLSK------------AGDSFVDGAVIAAARTLPTWSDGDVPPIPSMERKAAKELQDEC 425

Query: 414 KSALSGFHTTIEEDEQLLQG-GNLDPRLRIAVGVRAGEKRVLQQIDEIFKERELELDELE 472
           +  L+ F TT ++D++LL    +    L  A+  R   K  ++++             LE
Sbjct: 426 QKMLAEFATTSKQDQKLLDSMADATRTLEAAIKYRLHRKLFMEKV----------FQALE 475

Query: 473 YYQERRL 479
            YQE+ L
Sbjct: 476 MYQEQIL 482


>gi|308806960|ref|XP_003080791.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
 gi|116059252|emb|CAL54959.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
          Length = 527

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 57/243 (23%), Positives = 104/243 (42%), Gaps = 22/243 (9%)

Query: 84  RPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA----ASEIGSLCSGLK---PWI 136
           R  T  E  G+ A+  ++ +  +  +P    + P+T      A ++G   + +K    W+
Sbjct: 106 RANTVDENRGVRARVGLSTSRTIARIPFDACLTPETCGMPDVARDVGEALTRMKTDASWL 165

Query: 137 -SVALFLIREKKKED-SPWRVYLDILPECTDSTV-FWSEEELVELQGTQLLSTTLGVKEY 193
            ++A  L  E+   D S +  Y  +LP C  + V  W + E   L GT+       +++ 
Sbjct: 166 CALACALCVERHLGDASTFAPYDKVLPRCEKNVVSMWDDAERELLAGTE-------IEQS 218

Query: 194 VQNEYLKVEEEIILPNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADL 250
           +++E    + E      ++F +     + DDF  A  ++ SRAF+    +N  L+P+AD 
Sbjct: 219 MRDELSAAKREWDCVVSKVFEKHGVKCSFDDFHAARTVVSSRAFTMTPSRN-GLVPIADA 277

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
            NH  G    +        G     L   +     V+ G+++   Y     NA+L   YG
Sbjct: 278 FNHRTGKHDVNVGDGESYTGTKGDSLCVKITKTEGVREGDEIFNTYGF-LGNAKLLNSYG 336

Query: 311 FIE 313
           F +
Sbjct: 337 FTQ 339


>gi|340923760|gb|EGS18663.1| hypothetical protein CTHT_0052690 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 505

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 87/208 (41%), Gaps = 27/208 (12%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY-LKVEEEIILP- 208
           +PW  YL  LPE       W+E+E + L+GT L          +  E+ L  E+   LP 
Sbjct: 122 NPWAQYLRFLPETVLVPTLWNEDERLLLRGTSLEPAIDAKLSALDAEFGLVREKSCDLPC 181

Query: 209 ---------NKQL--------FPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADL 250
                    + QL        F  P  +  D++    + RSR    L G    ++P  D+
Sbjct: 182 WNELLWQQDDGQLAEGAEIGAFSSPTPSFTDWIRLDALYRSRCL-ELPGSGEAMVPCIDM 240

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
           INHS   +    AY  + A     +++  LR  V + AG +V I Y   K  AE+  +YG
Sbjct: 241 INHS---SVSPSAYYEENA---QDEVVLLLRPGVSLPAGAEVTISYGDVKPAAEMLFNYG 294

Query: 311 FIESKSDRNAYTLTLEISESDPFFGDKL 338
           FI+ +S      L L     DP    KL
Sbjct: 295 FIDPQSASERLVLPLRPFSDDPLAKAKL 322


>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
           heterostrophus C5]
          Length = 476

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 98/455 (21%), Positives = 187/455 (41%), Gaps = 76/455 (16%)

Query: 66  TFWQWLRDQKV-VSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWIN-PDTVAA 122
            F  WL+     ++PK  +      + G G+ A++DIA++E++  +P    ++  +++ +
Sbjct: 10  AFLDWLKHTGAQINPKIQLEDLRAKDAGRGVAAKQDIAEHELLFSIPRSSILSVENSILS 69

Query: 123 SEIG-SLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
           +EI  +  + L PW+S+ L ++ E      S W  Y  +LP   D+ +FW+E+EL ELQ 
Sbjct: 70  TEIPPTTFALLGPWLSLILVMLYEYHNGSASNWAPYFAVLPTDFDTLMFWTEDELTELQA 129

Query: 181 TQLLST--TLGVKEYVQNEYLKVEEEII------------LPNKQLFPRPITL----DDF 222
           + +++     G  E    + L V EE              L  +   P  + L       
Sbjct: 130 SAVVNKIGKEGANEVFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSL 189

Query: 223 LWAFGILRSRAFSRLRGQNLV-------------LIPLADLINHSPGITTEDYAYEIKGA 269
           + A+      A S                     ++PLAD++N           YE  G 
Sbjct: 190 IMAYAFDVEPAISDKEVDEEGFAEEEEDAALPKGMVPLADMLNADADRCNARLFYEKDG- 248

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYD-LNKSNAELALDYGFI-ESKSDRNAYTLTLEI 327
                     ++   P++AGE++   Y  L +S+  L   YG+I E+ +  +   +  ++
Sbjct: 249 --------LEMKALKPIQAGEEIFNDYGPLPRSD--LLRRYGYITENYAQYDVVEIPADL 298

Query: 328 -----------SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGT 376
                       E    + D+ +I +T G   +A     L  +L P ++  +  + L   
Sbjct: 299 VSQALAHDGLWHEKRIEYLDEQEIVDT-GYDIAASVPFSLEESLSPELVILVETMLLPSE 357

Query: 377 DAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG--- 433
           +   L+S  R        LP +         ++    ++ ++ + TT+E+D Q+      
Sbjct: 358 EFERLQSKGR--------LPKAEKMTGKAAEILYKIVQARIAQYPTTLEQDLQVSSETQP 409

Query: 434 ----GNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
                  + R+ +A  VR GEK++L Q +E   E+
Sbjct: 410 AQTMSRKERRIAMARAVRIGEKQLLVQTEERLAEK 444


>gi|365989204|ref|XP_003671432.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
 gi|343770205|emb|CCD26189.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
          Length = 589

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 97/214 (45%), Gaps = 40/214 (18%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLL-----STTLGVKEYVQNEYLKVEEEI 205
           S W  Y++IL    +   FW+ ++L +L+GT L           ++EY+  E L++ +  
Sbjct: 104 SQWDKYVEILSLDLNQPYFWTVDQLQQLKGTDLYIKIQQDFATIIQEYI--ELLQILKVD 161

Query: 206 ILPNKQLFPR----------PITLD---------DFLWAFGILRSRAFSRLRGQN----- 241
           IL  ++L             P  LD          +LW+  I +SRAF +L   N     
Sbjct: 162 ILDQEKLQTATISHYINSHLPTLLDGKLPWNHFVSYLWSHCIFKSRAFPQLLLNNAGSDV 221

Query: 242 -----LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
                  L P+ DL+NH   +  +  +         ++ L F  +  + V  G+Q+   Y
Sbjct: 222 GNINLAFLFPIVDLLNHKNDVVVKWESSNDINNKNDNKVLTFITQETLHV--GDQIFNNY 279

Query: 297 DLNKSNAELALDYGFIESKSDRNAYT-LTLEISE 329
             NKSN EL L YGFI+  ++   Y+ LTL+++E
Sbjct: 280 G-NKSNEELLLGYGFIQENNNNYDYSELTLKLNE 312


>gi|322795851|gb|EFZ18530.1| hypothetical protein SINV_15998 [Solenopsis invicta]
          Length = 246

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 66/118 (55%), Gaps = 6/118 (5%)

Query: 86  ATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG----LKPWISVAL 140
           A FP   LGL A+ D  +N+++LE+P     +  T AASE+  L +       P +++A+
Sbjct: 103 AEFPGFDLGLKAETDFTENQLILEIPRALIFSTYT-AASELIVLQNDPLVQHMPQVALAV 161

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
            L+ EK KE+S W+ YL++LP   ++ ++    +++EL+G+  L   L     +  +Y
Sbjct: 162 ALLIEKYKENSTWKPYLEMLPSNYNTVLYMKTNDMIELKGSPTLEAALKQCRNIARQY 219


>gi|255937283|ref|XP_002559668.1| Pc13g12520 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584288|emb|CAP92321.1| Pc13g12520 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 482

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 80/190 (42%), Gaps = 13/190 (6%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLKVEEEIIL 207
           +PW  Y+  LP       F++ EE   L+GT L   L       E    +  +  E I  
Sbjct: 129 NPWTEYVRFLPPSFPLPTFYTAEEQELLRGTSLTEALDAKFASLEREFEQLRQATEGIAW 188

Query: 208 PNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
             +  +      +T+DD+ +     RSR    L G  L ++P  D+ NH  G   +   Y
Sbjct: 189 CQRSWWDEKTGALTIDDWKYVDAAYRSRMLD-LPGSGLSMVPCIDMANHGSGDGVKAL-Y 246

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE-SKSDRNAYTL 323
           +        R+ +  LR    ++ GE+V I Y   K  +E+   YGF+E   +D     L
Sbjct: 247 DADS----ERNAVLQLRWVKALQPGEEVTISYGDGKPVSEMMFSYGFLEGGTTDAREIFL 302

Query: 324 TLEISESDPF 333
            L+I E DP 
Sbjct: 303 NLDIPEDDPL 312


>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
 gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
          Length = 418

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 105/230 (45%), Gaps = 22/230 (9%)

Query: 128 LCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVF-WSEEELVELQGTQLLST 186
           L S +KPW  +AL ++ E+ K           L + + +T F W + EL  L+ + L   
Sbjct: 159 LPSTVKPWTKLALIVLMERYKGQYGHHTSRVFLNQRSSTTRFRWEDTELSYLRASPLYGK 218

Query: 187 TLGVKEYVQNEYLKVE--------EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR 238
                E +  E+ +V+        E+ +    QLF + ++L+D    +  + SR+ +   
Sbjct: 219 ARERLEMITTEFGQVQNDFCTCVLEQALDVWPQLFGK-VSLEDLKHVYATVFSRSLAIGE 277

Query: 239 GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
              LV+IP+ D  NH+       +A ++   GL +  ++ + R        +Q+ I Y  
Sbjct: 278 DSTLVMIPMLDFFNHN----ATSFA-KLSFNGLLNYAVVTADRD---YAENDQIWINYG- 328

Query: 299 NKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGE 348
           + SNAELALDYGF   ++  +   L  +  E +    D+L     N LG+
Sbjct: 329 DLSNAELALDYGFTVPENPYDETELLTQFPEMNTIIKDQLG---GNTLGQ 375


>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
           gallopavo]
          Length = 241

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 90/210 (42%), Gaps = 27/210 (12%)

Query: 69  QWLRDQKVVSPKSPIRPATF-PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+D+      S +RPA F   G GL+  R +   E+V+ +P K  +  +TV  S +G 
Sbjct: 37  KWLKDRGFGD--SSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVLNSCLGE 94

Query: 128 LCSGLKP----WISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVEL---- 178
                KP     I++  FLI EK   E S W+ YLD+LP+ T S     E+++++L    
Sbjct: 95  YIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK-TYSCPVCLEQDVIQLFPEP 153

Query: 179 --QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR 236
             +  Q   TT+         +    + +   N         L+   WA+  + +R    
Sbjct: 154 LRKQAQEQRTTVHELYMSSKAFFSSLQSLFAENTATIFNHSALE---WAWCTINTRTIYM 210

Query: 237 LRGQN---------LVLIPLADLINHSPGI 257
              Q            L P  DL+NHSP +
Sbjct: 211 KHSQRECFSLEPDVYALAPYLDLLNHSPNV 240


>gi|312377430|gb|EFR24262.1| hypothetical protein AND_11269 [Anopheles darlingi]
          Length = 222

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 39/114 (34%), Positives = 64/114 (56%), Gaps = 7/114 (6%)

Query: 91  GLGLVAQRDIAKNEVVLEVP--MKFWINPDTVAASEIGSLCSGL----KPWISVALFLIR 144
           GLGL +   I   E V+EVP  + F++N D     ++  L  G     +  I +AL LI 
Sbjct: 86  GLGLESIGPIGAGESVIEVPKALFFYVN-DEPRYRQVLDLMPGAMMKEQGNIMLALALIM 144

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           E+ ++DS WR YLD+LPE   + +F++ E+++EL+ T  L   L + + +  +Y
Sbjct: 145 ERFRKDSFWRPYLDVLPERYTTPLFYTPEDMIELRETAALEPALKLCKNIARQY 198


>gi|444320495|ref|XP_004180904.1| hypothetical protein TBLA_0E03310 [Tetrapisispora blattae CBS 6284]
 gi|387513947|emb|CCH61385.1| hypothetical protein TBLA_0E03310 [Tetrapisispora blattae CBS 6284]
          Length = 496

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 123/294 (41%), Gaps = 50/294 (17%)

Query: 58  NPPTAQVETFWQWLRDQKVVSPKSPIRPATFP---EGLGLVAQRDIAKNEVVLEVPMKFW 114
           N    Q + F  WL D+  +     +         +G  LVA +DI K+EV+ E+P +  
Sbjct: 2   NSFDGQTQAFVNWLVDEAKIKISDKVEVQDLRAQGQGRALVAVQDILKDEVLFEIPRECL 61

Query: 115 INPDTVAASEIG-----SLCSGLKPWISVALFLIREK--KKEDSPWRVYLDILPEC--TD 165
           +N  T +  +       +L   L  W  + L ++ EK  KKE S W  Y  ILPE    +
Sbjct: 62  LNAKTSSLVKDHPELKVNLEQELGHWEGLILCILYEKLVKKESSHWFPYFSILPEAEEIN 121

Query: 166 STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWA 225
           S ++W++E+L  L+ + L+ T +G KE  +N +  +   I     +    P++ D F+  
Sbjct: 122 SLMYWNDEQLENLKPS-LVLTRIG-KEAAKNMFDSIAIFITTHGLRDEISPLSWDLFVHV 179

Query: 226 FGILRSRAF----------SRLRGQN----------------LVLIPLADLINHSPGITT 259
             I+ S +F           +  GQ+                  ++PLAD++N +     
Sbjct: 180 ASIIMSYSFDVELPEKQSNDQEDGQDENEEYENECIARDGYMKSMVPLADILNSNTNRCN 239

Query: 260 EDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
               Y      +F+          +P K G+   I  D    NAE+   YG++E
Sbjct: 240 AHLTYSSTTLKMFA-------IADIP-KNGQIFNIYGD--HPNAEILRRYGYVE 283


>gi|426243560|ref|XP_004015620.1| PREDICTED: N-lysine methyltransferase SETD6 [Ovis aries]
          Length = 450

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 74/303 (24%), Positives = 124/303 (40%), Gaps = 48/303 (15%)

Query: 52  SASATTNPPTAQVETFWQWLRDQKV---VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLE 108
           +  A ++   A V +F  W   Q+V   +SPK  +       G G+VA+  +   E++  
Sbjct: 11  AGPAGSDEDPAPVASFLSWC--QRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFA 68

Query: 109 VPMKFWINPDTVAAS-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPEC 163
           VP    ++  T + S     E G+L S    W+ + L L+ E +   S WR Y  + PE 
Sbjct: 69  VPRAALLSQHTCSISGVLERERGALQSQ-SGWVPLLLALLHEMQAPASLWRPYFALWPEL 127

Query: 164 T--DSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF 213
              +  +FW EEE    LQGT       GV E V+ +   +  E   I+LP    +  LF
Sbjct: 128 GRLEHPMFWPEEERRRLLQGT-------GVPEAVEKDLANIRSEYYSIVLPFMDAHADLF 180

Query: 214 -PRPITLDDFLWAFGILRSRAFSRLRG--------QNLVLIPLADLINHSPGITTEDYAY 264
            PR  +L+ +     ++ + +F              + +++P AD++NH           
Sbjct: 181 SPRVRSLELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLAN-------- 232

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLT 324
               A L        +    P+  G ++   Y    +N +L   YGF E   D    T  
Sbjct: 233 --HNANLEYSPTCLRMVAIQPIPKGHEIFNTYG-QMANWQLIHMYGFAEPYPDNTNDTAD 289

Query: 325 LEI 327
           +++
Sbjct: 290 IQM 292


>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
 gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
          Length = 986

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 72/281 (25%), Positives = 117/281 (41%), Gaps = 41/281 (14%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKN--EVVLEVPMKFWINPDTVAAS 123
           T W+W+      +PK+ +  A +P  +      + A++  + +  +P+   + P    A 
Sbjct: 20  TLWEWVTRHGGSAPKARLSDA-YPRTVIAAENVNGAQDGGDTIFSIPITCLMTPAAAFAD 78

Query: 124 -------EIGSLCSGLKPWISVALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEEL 175
                  E+ +    ++    +  FL  E+++   S W  Y+  LP    + + WS  E 
Sbjct: 79  VTYGKVFELFAAHQSVEDRTVLVFFLAIERQRGMTSHWGPYIRELPSIFSNPLNWSRAET 138

Query: 176 VELQGTQLLSTT-------LGVKEYVQNEYLKV-EEEIILP--NKQLFPRPITL------ 219
           + L GT+L   T       L + E     ++ +   ++IL    K +    I+L      
Sbjct: 139 LRLAGTRLGGATKFHDCALLQLTEVCVPAFIAILRAQLILSANTKAIASGAISLAQDALS 198

Query: 220 -DDFLWAFGILRSRAFSR-LRGQ-NLVLIPLADLINHSPGITTEDYAYEIKGAGL-FSRD 275
            D   W+   + SRAFS  L GQ  + L+PL D+++HSP    E    +  G  L  S D
Sbjct: 199 PDRLAWSHSCVSSRAFSLFLNGQRTIALVPLGDMLDHSPDAQIEWRTDDTAGQFLIISHD 258

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
                R P    AG  +   Y   KSN EL L YGF    S
Sbjct: 259 -----RLP----AGSIMFNNYGA-KSNEELILGYGFFMKSS 289


>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
 gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
          Length = 504

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 117/282 (41%), Gaps = 55/282 (19%)

Query: 70  WLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
           W +D+ ++   + + P   P  G+G+VA + + ++E +L VP       +TV      S+
Sbjct: 39  WAKDRGIIL--NGVEPRRIPGRGMGIVATKPLEEDETILTVPSTTLRTYETVHP----SI 92

Query: 129 CSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS----------------- 171
              L   IS+   L  +   + SP   Y      C      W                  
Sbjct: 93  SRALPKDISIHGLLAADLAFDPSPSAKYRQWNAVCPSRKDMWDYLPLTWDHRLRQFLPRK 152

Query: 172 ------------EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITL 219
                       E++   +Q ++LLS+  G KE   N+ L+ EE+    NK      IT 
Sbjct: 153 ALELLLKQEAKFEKDWSMVQQSKLLSSGPGKKE---NKSLE-EEDCGTLNKS-----ITR 203

Query: 220 DDFLWAFGILRSRAFSRL--------RGQNLVLIPLADLINH-SPGITTEDYAYEIKGAG 270
           DD+L+A+ ++ +R F           +   +VL P+ADL NH S   + E      K   
Sbjct: 204 DDYLYAWLLVNTRTFYHETPKTKKFNKDDRMVLQPVADLFNHTSYDPSAEKKEGNKKTCS 263

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           +      F++ T  P  AGE+V I Y  N SN  L ++YGF+
Sbjct: 264 VAFSPTAFTITTTRPYAAGEEVYICYG-NHSNDFLLIEYGFL 304


>gi|148237199|ref|NP_001085404.1| N-lysine methyltransferase setd6 [Xenopus laevis]
 gi|82184826|sp|Q6INM2.1|SETD6_XENLA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|48734800|gb|AAH72257.1| MGC82362 protein [Xenopus laevis]
          Length = 455

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 122/279 (43%), Gaps = 47/279 (16%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-----GSLCSG 131
           ++PK  I         G++A+ DIA  E++  VP    ++ +T    E+      SL S 
Sbjct: 36  LNPKVYISTEGTVSQYGMLAREDIADGELLFTVPRSAILSQNTTRIQELLEKEQESLQST 95

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTL 188
              W+ + + L+ E     S W  Y  + PE    D  +FWSEEE  + LQGT       
Sbjct: 96  -SGWVPLLISLLYEATDSSSLWAPYFGLWPELDPPDMPMFWSEEEQTKLLQGT------- 147

Query: 189 GVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQ 240
           GV E ++N+   +EEE   I+LP    N + F P   TLD +      + + +F     +
Sbjct: 148 GVLEAIRNDLKNIEEEYNSIVLPFITRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPLEE 207

Query: 241 NL------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKA 288
           N             +++P+ADL+NH   +   +   E      F+ + L  + T   V A
Sbjct: 208 NDEEDEDEKDILPPMMVPVADLLNH---VAHHNAHLE------FTPECL-RMVTTKSVHA 257

Query: 289 GEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           G+++   Y    +N +L   YGF E     +  T  +++
Sbjct: 258 GQELFNTYG-EMANWQLLHMYGFAEPHPQNSNETADIQM 295


>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
 gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
           Y34]
 gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
           P131]
          Length = 390

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 107/263 (40%), Gaps = 43/263 (16%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           E    W   Q +V   S +RP   P  G+G++A R +  +E VL VP        TV   
Sbjct: 5   EELINWAEAQGIVL--SGVRPDRIPGRGIGIIASRKVKADETVLRVPTTALRTKFTVP-- 60

Query: 124 EIGSLCSGLKPWISVALFLIREKKKEDS----PWRVYLDILPECTDSTVFW--SEEELVE 177
               +C  L   ++V   L  +   EDS     W   L    +       W  + + L+ 
Sbjct: 61  --NEICKALPKNMTVHGLLAADLALEDSGKYDEWNAVLPTPEDVRTMPFTWPPALQALLP 118

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF--- 234
            + T+LL+         Q E    + E +  +     R  T D + +A+ +  +R F   
Sbjct: 119 TRATELLAK--------QEEKFAKDWEAVTSSP--LSRKPTRDSYRYAWLLANTRTFYFV 168

Query: 235 ----SRLRGQN-LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
                RL  ++ +VL P+ADL NH+             G  +   D  F++R      AG
Sbjct: 169 CPRTERLGKEDRMVLQPVADLFNHADA-----------GCAVAFNDEDFTIRADRDYDAG 217

Query: 290 EQVLIQYDLNKSNAELALDYGFI 312
           E+VLI Y  N SN  L  +YGF+
Sbjct: 218 EEVLICYG-NHSNDFLLAEYGFV 239


>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
           occidentalis]
          Length = 381

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/255 (23%), Positives = 104/255 (40%), Gaps = 34/255 (13%)

Query: 79  PKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS 137
           P S +R A  P  G G+V   +I   +V++++P    I PD V   E+      L     
Sbjct: 35  PTSVLRLACTPASGRGIVCLSNIEAGDVIIDLPSTLLITPDLV-RKELNMSKENLSAEEI 93

Query: 138 VALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           + +F++ E+   E S W+ Y++ +P+  D              G Q   +    +   Q 
Sbjct: 94  LTIFVLSERSLGEKSKWKPYIESIPDVFD--------------GLQCRKSVRLPRRLAQA 139

Query: 197 -EYLKVEEEIILPNKQLFPR----PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLI 251
            +    E   +    ++F R     +  + F WA+  + +R    + G    L P  DL+
Sbjct: 140 IDRWNAERRNVFSRLRMFFRGRGIDLNFETFSWAWSAVNTRCIY-VEGHGSTLAPFLDLL 198

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH    + E                 F +R+ V  +AG +V I Y  +  N  L L+YGF
Sbjct: 199 NHHWKASIETSFVNNH----------FIIRSNVGYEAGSEVFIGYG-SHDNRTLFLNYGF 247

Query: 312 IESKSDRNAYTLTLE 326
           +  ++  +  T+ LE
Sbjct: 248 VLDENPNDCITVELE 262


>gi|189193345|ref|XP_001933011.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187978575|gb|EDU45201.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 642

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 106/255 (41%), Gaps = 33/255 (12%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL--------CSGLKP-WISVALF 141
           G  + A R ++ + VV + P+K  ++   +   E G L        C G  P  I   L 
Sbjct: 42  GFHMRALRPLS-SPVVTKCPLKLTLSCLNLDPGEQGVLHIESPLRQCRGRIPDHILTYLL 100

Query: 142 LIREKKK-EDSPWRVYLDILPECTDST--VFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           LI ++ K + SPW  Y+  LP   D T  +++ E ++  L GT L+      K  +Q ++
Sbjct: 101 LIEQRDKGQASPWHAYIACLPNSRDMTTPLWFDEGDMAFLAGTSLVPAAKERKAELQQQW 160

Query: 199 ---LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL-----RGQNLVLIPLADL 250
              + V EE+ +P      + I  +  LWA  I  SRAF        R    +L P+ D+
Sbjct: 161 EGAIAVMEELSIP----LAKGIDTESLLWAATIFTSRAFISTHILPERETVPILFPVVDI 216

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
           +NHS     E   ++ +    F+   L            EQ L      K N EL L YG
Sbjct: 217 LNHSVSAKVE---WDFQPGQSFALKCLDGDSF-----QPEQELFNNYAPKQNDELLLGYG 268

Query: 311 FIESKSDRNAYTLTL 325
           F    +    + L L
Sbjct: 269 FCLENNPIEQFALKL 283


>gi|336262426|ref|XP_003345997.1| hypothetical protein SMAC_06551 [Sordaria macrospora k-hell]
 gi|380089590|emb|CCC12472.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 482

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 18/197 (9%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LP- 208
           +PW  Y+  LP+       W+E+E + L+GT L S        +  E+  V E    LP 
Sbjct: 112 NPWTEYIKFLPKTVLVPTLWTEDERLLLRGTSLESAVNAKMTALTAEFDAVREAASSLPT 171

Query: 209 -NKQLFP-----RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDY 262
            N  L+P      P +L +++    + RSR    L      ++P  D+INHS   T    
Sbjct: 172 WNDVLWPFENGNSPASLRNWILLDALYRSRVL-ELPKSGESMVPCIDMINHS---TCASA 227

Query: 263 AYEIKGAGLFSRDLLFSLRTP-VPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAY 321
            Y+       ++D +  L  P   + +G++V I Y   K  AE+   YGFI+ ++   + 
Sbjct: 228 YYDEN-----TKDEVILLPRPDRTISSGKEVTISYGDAKPAAEMLFSYGFIDPETTVESL 282

Query: 322 TLTLEISESDPFFGDKL 338
            L LE    DP    KL
Sbjct: 283 VLPLEPFGDDPLEKAKL 299


>gi|42561607|ref|NP_171694.3| SET domain-containing protein [Arabidopsis thaliana]
 gi|332189232|gb|AEE27353.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 572

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 131/324 (40%), Gaps = 85/324 (26%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDI-AKNEVVLEVPMKFWINPDTV 120
           A++E F  WL+          I+ +   +G G+ A     A +EV+L VP+   I P  V
Sbjct: 8   AKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMRV 67

Query: 121 AASEI-GSLCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEE 173
               + G  C  +     V       LFL  E+ + +S W+ YLD+LP    + +++S++
Sbjct: 68  LQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSDD 127

Query: 174 ELVELQGTQLL-STTLGVKEYVQNEYLKVEEEIILPNKQLF-----PRPITLDDFLWAFG 227
           +++EL+GT L  +T L  K+ +   + KVE   +L  K L         ++ + FLWA  
Sbjct: 128 DILELKGTNLYHATELQKKKLLSLYHDKVE---VLVTKLLILDGDSESKVSFEHFLWANS 184

Query: 228 ILRSRAF----------------------------------SRLRGQNLV---------- 243
           +  SRA                                   +  +G++L           
Sbjct: 185 VFWSRALNIPLPHSFVFPQSQDDTGECTSTSESPETAPVNSNEEKGKSLTSAQPAPSVGS 244

Query: 244 --------LIPLADLINH--SPGITTEDYAYEIKGAGL-----FSRDLLFSLRTPVPVKA 288
                   L+P  D  NH   P  T     +E+ G G      FS  LL   + P+P K 
Sbjct: 245 GDTIWVEGLVPGIDFCNHDLKPVAT-----WEVDGIGSVSRVPFSMYLLSVAQRPIPKK- 298

Query: 289 GEQVLIQYDLNKSNAELALDYGFI 312
             ++ I Y  NK N EL   YGF+
Sbjct: 299 --EISISYG-NKGNEELLYLYGFV 319


>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
 gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
          Length = 485

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 56/96 (58%), Gaps = 2/96 (2%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWIN-PDTVAASEIGSLCSGLKPWISVALFLIREK-KK 148
           G G+ A RDI ++E +  +P    ++  ++ A + +G     L PW+S+ + +I E  + 
Sbjct: 48  GRGICASRDITEDEELFVIPEDLILSVQNSEARTVLGLDDKQLGPWLSLIIAMIYEYYQG 107

Query: 149 EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
           E S W  Y  +LP   D+ +FW++E+L ELQG+ ++
Sbjct: 108 EQSKWYPYFGVLPSSFDTLMFWTDEQLSELQGSAVV 143


>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
          Length = 712

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 84/184 (45%), Gaps = 12/184 (6%)

Query: 59  PPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           PP +    F  W R+Q V +    I  A+F    G  A RD+A  E +L +P    I  D
Sbjct: 154 PPGSPGADFEAWAREQGVAAA---ISLASFGSLRGCAASRDVAPGEPLLSIPADVLIYED 210

Query: 119 TVAASEIGSLCSGLKPWISV----ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
           +V  +++G +   + P +++     LF + ++  EDS W  Y   LP+   + + + +  
Sbjct: 211 SVKGTDLGRMLHAI-PDLTIDNLLVLFTMIDRHDEDSRWARYWHSLPDKYYTGISFPQPL 269

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEE--EIILPNKQLFPRP--ITLDDFLWAFGILR 230
           +  L GT         ++++Q++Y       +++L     F +P     D +LWA  +  
Sbjct: 270 VDSLAGTAAHLELARAQQHLQDQYAATRPLFDMLLTAYPQFLKPEWFDYDCYLWAAELWY 329

Query: 231 SRAF 234
           S AF
Sbjct: 330 SYAF 333


>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
 gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
          Length = 692

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 86/202 (42%), Gaps = 31/202 (15%)

Query: 128 LCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLL 184
           L   ++  I +A F++ E+ K  DS W  YL  LP  ++  S +F+ + +L  LQGT L 
Sbjct: 98  LSQRVRKSIILAFFMVHEQLKGRDSHWWPYLATLPRASEFTSALFYQDNDLEWLQGTNLY 157

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNL- 242
            T    +  VQ EY      I+     L       D F WA+ ++ SRAF SR+      
Sbjct: 158 QTHQAYRNAVQEEY-DSAISILRDEGFLAVESYRWDIFCWAYTLIASRAFTSRVLDAYFS 216

Query: 243 ------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                       +++PL D  NH P    E +  E    G         L+   P  +GE
Sbjct: 217 NHPTLKQDEEFQIMLPLVDSSNHKPLAKIE-WRAEATEIG---------LKVIEPTSSGE 266

Query: 291 QVLIQYD-LNKSNAELALDYGF 311
           +V   Y  LN  N +L   YGF
Sbjct: 267 EVHNNYGPLN--NQQLMTTYGF 286


>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 232

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 103/215 (47%), Gaps = 18/215 (8%)

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           LIP+ D+ NH  G  T D+         ++      LR     K GEQV I Y   ++N+
Sbjct: 19  LIPMWDMCNHENGRITTDFNATSDHCECYA------LRN---FKKGEQVFISYG-PRTNS 68

Query: 304 ELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG-RTLPP 362
           +  +  GF+   + ++ + L L IS++D    +++++    GL     F +  G   +  
Sbjct: 69  DFFVHSGFVYMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPGTEPISD 128

Query: 363 AMLQYLRLVALGGTD-AFLLES--IFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
            +L +LR+ ++   + A  L S  +F      H+D  +    EE + + +    +  ++ 
Sbjct: 129 TLLAFLRVFSMRKAELAHWLRSDKVFD---LKHMDCALETVVEENVRKFLLTRLQLLIAN 185

Query: 420 FHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           + TT++ED +LL+   L    ++AV +R  EKR+L
Sbjct: 186 YPTTLKEDLELLE-TTLPQIKKMAVQLRVTEKRIL 219


>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 102/414 (24%), Positives = 157/414 (37%), Gaps = 85/414 (20%)

Query: 86  ATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC-----SGLKPWISV 138
           + FP   G GL A RD+++ E++L VP    +   ++   E  S+      S   P I  
Sbjct: 41  SHFPHAGGRGLAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILT 100

Query: 139 ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
              L    K + S W  YL  LP   D+   +S+ E   LQ    +  T       +   
Sbjct: 101 ICLLAEMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVT-------ERAI 153

Query: 199 LKVEEEI-----ILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH 253
           LK E E      ++   +L P+      +LWA   + SR           L P+ D  N+
Sbjct: 154 LKAELEWKKAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDAGCLCPVGDFYNY 213

Query: 254 -SPG---------------ITTEDYAYEIKGA------------------GLFSRDL-LF 278
            +PG                + +D ++  K A                  G +  DL  +
Sbjct: 214 AAPGEEPCGWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKEDLAAY 273

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGD 336
                   K GEQVL+ Y    +N EL   YGF+  E+ +D+    L  E+  S  +  D
Sbjct: 274 CFYARKNYKKGEQVLLSYG-TYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKD 332

Query: 337 KLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHL--- 393
            L I +    G+ ++           A+L  LRL A   +         R ++ GHL   
Sbjct: 333 SLYIHQN---GKPSF-----------ALLSALRLWATPASQ--------RRSV-GHLVYS 369

Query: 394 DLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGN--LDPRLRIAVG 445
              +S  NE  +   +  +C   L    T++EED  LL   +   DP L + VG
Sbjct: 370 GTQLSSENEIFVMEWIAKSCHVVLENLPTSVEEDSLLLCALDKMQDPDLPMEVG 423


>gi|145500874|ref|XP_001436420.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403559|emb|CAK69023.1| unnamed protein product [Paramecium tetraurelia]
          Length = 720

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 108/247 (43%), Gaps = 40/247 (16%)

Query: 85  PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS-------------- 130
           PA+F +  G+VA  D+  N   + +P    I+PD   ++ + ++ +              
Sbjct: 37  PASFGDVTGVVASEDLPSNTAFICIPQALIISPDKCKSTNLNTVYNSHPEMFDKDETNDA 96

Query: 131 -----GLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
                G+K       ++  EKKK E S +  Y+  + +  ++ + WS E+L +++   +L
Sbjct: 97  EFNMLGIKLICIQVFYMFNEKKKGELSFYYPYISAV-QANNTLLTWSNEDLKKIEDPIIL 155

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLF--PRPITLDDFLWAFGILRSRAFSRLRGQNL 242
                +K+ V   + K  ++I   N+ +F  PR     DF WA   + SR F     ++ 
Sbjct: 156 EEFANIKQDVLGLWGKA-KQIFDNNEDVFGIPRLTDKKDFYWAVECVMSRCFG-WSLKST 213

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI--QYDLNK 300
            +IP+AD +NHS    T    +  KG           L   +P+K  +Q     QY L +
Sbjct: 214 CIIPIADFLNHSNRACTHYMVH--KG-----------LEKGIPLKQKDQAHFQQQYILKR 260

Query: 301 SNAELAL 307
           +   L++
Sbjct: 261 NKINLSI 267


>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
          Length = 476

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 99/458 (21%), Positives = 190/458 (41%), Gaps = 82/458 (17%)

Query: 66  TFWQWLRDQKV-VSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWIN-PDTVAA 122
            F  WL+     ++PK  +      + G G+VA++DIA++E++  +P    +   +++ +
Sbjct: 10  AFLDWLKHTGAQINPKIQLEDLRAKDAGRGVVAKQDIAEHELLFSIPRSSILGVENSILS 69

Query: 123 SEIG-SLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
           +EI  +  + L PW+S+ L ++ E      S W  Y  +LP   D+ +FW+E+EL ELQ 
Sbjct: 70  TEIPPATFAHLGPWLSLILIMLYEYHNGSASNWAPYFAVLPTDFDTLMFWTEDELAELQA 129

Query: 181 TQLLST--TLGVKEYVQNEYLKVEEE---IILPNKQLFPR-------PITLDDFLWAFGI 228
           + +++     G  E    + L V EE   +I    +           P  L+       +
Sbjct: 130 SAVVNKIGKEGANEVFIEQLLPVIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSL 189

Query: 229 LRSRAFS---RLRGQNLV----------------LIPLADLINHSPGITTEDYAYEIKGA 269
           + + AF     +  + +                 ++PLAD++N           YE  G 
Sbjct: 190 IMAYAFDVEPAISDKEVDEEGFAEEEEDAALPKGMVPLADMLNADGDRCNARLFYEKDG- 248

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYD-LNKSNAELALDYGFIESKSDRNAYTLTLEI- 327
                     ++   P++AG+++   Y  L +S+  L   YG+I   +D  A    +EI 
Sbjct: 249 --------LEMKALKPIQAGDEIFNDYGPLPRSD--LLRRYGYI---TDNYAQYDVVEIP 295

Query: 328 --------------SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
                          E    + D+ +I +T G   +A     L  +L P ++  +  + L
Sbjct: 296 VDLVSQTLAHDGLWHEERIEYLDEQEIVDT-GYDIAASIPFSLEESLSPELVILVETMLL 354

Query: 374 GGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
              +   L+S  R        LP +        + +    ++ ++ + TT+E+D Q+   
Sbjct: 355 PREEFERLQSKGR--------LPKAEKMTGKAAKFLYKIVQARIAQYPTTLEQDLQISSE 406

Query: 434 -------GNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
                     + R+ +A  VR GEK++L Q +E   ++
Sbjct: 407 TQPVQTMSRKERRVAMARAVRIGEKKLLVQTEERLADK 444


>gi|15228958|ref|NP_191216.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|7594535|emb|CAB88060.1| putative protein [Arabidopsis thaliana]
 gi|332646015|gb|AEE79536.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 531

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 97/221 (43%), Gaps = 30/221 (13%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           T ++  F +W++   V   ++        +G+ + A  D+ + +VV  +     +   T 
Sbjct: 3   TRRLRAFKRWMQANGVDCSEALNLVDDENDGVSVRAFCDLKEGDVVANISKTACLTIKTS 62

Query: 121 AASEI-------GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEE 173
            A E+       GSL       +SVAL   R    E+SPW  YL ILP   D  + WS E
Sbjct: 63  GAREMIESADLDGSL------GLSVALMYER-SLGEESPWAGYLQILPIQEDLPLVWSLE 115

Query: 174 ELVE-LQGTQLLSTTLGVKEYVQNEYLKVEE---EIILPNKQLFPRPITLDDF-----LW 224
           +L   L GT+L        + V+ +++ + E   E ILP     P+ +  D F     L 
Sbjct: 116 DLDSLLSGTEL-------HKLVKEDHVLIYEDWKENILPLTSSLPQNVDSDSFGIKEYLA 168

Query: 225 AFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
           A  ++ SR+F         ++PLADL NH  G     + +E
Sbjct: 169 AKSLIASRSFEIDDYHGSGMVPLADLFNHKTGAEDVHFTHE 209


>gi|297816978|ref|XP_002876372.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322210|gb|EFH52631.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 518

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 97/215 (45%), Gaps = 18/215 (8%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           T ++  F +W++   V    +        +G+ + A  D+ + +VV  +     +   T 
Sbjct: 3   TRRLRAFKRWMKANGVDCSDALNLVDDQNDGVSVRAFCDLKEGDVVANISKTACLTIKTS 62

Query: 121 AASEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-L 178
            A E+    + L   + +++ L+ E+   E+SPW  YL ILP   D  + WS ++L   L
Sbjct: 63  GAREMIE-SADLDGSLGLSVALMYERSLGEESPWAGYLQILPVQEDLPLVWSLQDLDSLL 121

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEE---EIILPNKQLFPRPITLDDF-----LWAFGILR 230
            GT+L        + V+ +++ + E   E ILP     P+ +  D F     L A  ++ 
Sbjct: 122 SGTEL-------HKVVKEDHVLIYEDWKENILPLTSSLPQNVDSDSFGIKEYLAAKSLIA 174

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
           SR+F         ++PLADL NH  G     + +E
Sbjct: 175 SRSFQIDDYHGSGMVPLADLFNHKTGAEDVHFTHE 209


>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 471

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/247 (27%), Positives = 103/247 (41%), Gaps = 45/247 (18%)

Query: 217 ITLDDFLWAFGILRSRAFSRLRG--QNLVLIPLADLINHSPGITTEDYAYEIKGA--GLF 272
           + ++   WA   + SRAF R RG  Q   ++PL D+ NH+     E    E  G   GLF
Sbjct: 154 VDINALGWAMAAVSSRAF-RTRGPTQPAAMLPLIDMANHTFSPNAEVLPLEGGGGAVGLF 212

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI----- 327
           +R           +  GE +L+ Y    SN  L +DYGFI   +  ++  L  ++     
Sbjct: 213 ARRA---------ITEGEPLLLSYG-QLSNDFLFMDYGFIVEDNPYDSVQLRFDVNLLQA 262

Query: 328 ----SESDPFFGDKLDIA----------ETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
               +      G  LD+A          E   +G +A  ++ +G   P A L   RL+A 
Sbjct: 263 GALVANVSDALGAPLDLAPRTWQLQLLAELGLVGPAANTELNIGGGGPGAELLDGRLLAA 322

Query: 374 GGTDAFLLESIFRNTIWGH-------LDLPVSHANEELICRVVRDACKSALSGFHTTIEE 426
               A ++ +     + G        +D P+   NE    R V      ALS F TT+++
Sbjct: 323 ----ARIMVARADGEVSGRGVERLCAVDRPLGRDNELAALRTVGGVLAFALSNFATTLDQ 378

Query: 427 DEQLLQG 433
           D+ LL G
Sbjct: 379 DKTLLAG 385


>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
          Length = 478

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/262 (27%), Positives = 110/262 (41%), Gaps = 48/262 (18%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F  W   + V    S + P   P  G+G+VA  DI  N+ +L VP +     DTV    I
Sbjct: 7   FMPWASSKGVKC--SNVEPRIMPGRGIGIVAVCDIRANQTILSVPTRAVRTIDTV-PKHI 63

Query: 126 GSLCSGLKPWISVALFLIREKKKEDS-PWRVYLDILPECTD----STVFWSEE--ELVEL 178
                G    +SV   L  E   +DS  + ++  +LP   D      + W  E   L+  
Sbjct: 64  KDALHG----VSVHGILAAEIALDDSDDFAIWRTVLPTREDLEGGMPMMWPSELQALLPK 119

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR-- 236
           +   LL          QN   + E +I+L   + FP  +T D+++ ++ ++ +R F    
Sbjct: 120 RAKDLLDN--------QNTTFRRECDIVL---KAFPT-LTRDEYMLSWVLINTRTFYNSM 167

Query: 237 ------LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                      LV +P+ADL NH             +G  L    L +S++T    K GE
Sbjct: 168 PKMKIYAHSDRLVCMPVADLFNHD------------QGCKLVYSALGYSVQTDRVYKQGE 215

Query: 291 QVLIQYDLNKSNAELALDYGFI 312
           +V + Y  + SN  L  +YGFI
Sbjct: 216 EVYVSYGPH-SNDFLLTEYGFI 236


>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 510

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 98/394 (24%), Positives = 153/394 (38%), Gaps = 75/394 (19%)

Query: 88  FPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDT-----VAASEIGSLCSGLKPWISVAL 140
           FP+  G GL A RD+ K E+VL VP    +  D+     +  S I +  S L P  ++ +
Sbjct: 49  FPDAGGRGLGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNH-SALSPTQTLTV 107

Query: 141 FLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
            L+ E  K + S W  YL  LP   +    +SE E   LQ    + T    K   + E  
Sbjct: 108 CLLYEMSKGQSSFWYPYLMHLPRSYEILATFSEFEKQALQVDDAIWT--AEKAISKAELD 165

Query: 200 KVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPG-- 256
           + E   ++   +L P+ +TL  ++WA   + SR       +   L P+ D  N+ +PG  
Sbjct: 166 RKEAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIPWDEAGCLCPVGDFFNYAAPGEE 225

Query: 257 -----------------------------ITTEDYAYEIKGA--GLFSRD-LLFSLRTPV 284
                                          +E +  ++K    G F  D   +      
Sbjct: 226 SSSPENDESWKPASCLEDASLSSERSTSNFCSETFDVQLKSLTDGGFDEDKAAYCFYARQ 285

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAE 342
             K G QVL+ Y    +N EL   YGF+  E+ +D+    L L +  S+ +  + + I +
Sbjct: 286 NYKKGAQVLLSYG-TYTNLELLEHYGFLLNENPNDKVFIPLELSMQSSNTWPKESMYIHQ 344

Query: 343 TNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSH--- 399
               G+ ++           ++L  LRL A             R    GHL    S    
Sbjct: 345 D---GKPSF-----------SLLCALRLWATPSN---------RRRSMGHLAYSGSQLSV 381

Query: 400 ANEELICRVVRDACKSALSGFHTTIEEDEQLLQG 433
            NE  I + +   C + L    TT+EED  LL  
Sbjct: 382 ENEVSILKWISRKCHAVLKKLPTTVEEDSLLLSA 415


>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
          Length = 305

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 2/87 (2%)

Query: 99  DIAKNEVVLEVPMKFWINPDTVAASEI-GSLCSGLKPWISVALFLIREKKKED-SPWRVY 156
           DI + EV+  +P    ++    + S I   L   L PW S+ + +I E  + D SPW+ Y
Sbjct: 3   DIQEGEVLFTIPRSAVLSATNSSLSSILPQLFEHLDPWASLIVTMIYEYLRGDASPWKPY 62

Query: 157 LDILPECTDSTVFWSEEELVELQGTQL 183
            D+LP   D+ +FWS++EL ELQ + +
Sbjct: 63  FDVLPAHFDTLMFWSDDELAELQASAV 89


>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 316

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 84/170 (49%), Gaps = 25/170 (14%)

Query: 149 EDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           + S W  Y+  LP  +   + +FW   EL  ++ + +    +  +E    E+  V+    
Sbjct: 38  KKSGWAPYVRSLPRNDQMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAVK---- 93

Query: 207 LPNKQLFPR---PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYA 263
            P+ + FP     I L+DF+ A  ++ SRA+   RG  + LIP AD +NH  G++     
Sbjct: 94  -PSLECFPHLFGEIKLEDFMHASALVSSRAWQTSRG--VSLIPFADFLNHD-GVSDSILL 149

Query: 264 YEIKGAGLFSRDL--LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           Y+ +      +D+  + S R       GEQV+++Y    SNA LAL++GF
Sbjct: 150 YDGQ------KDIAEVISDRN---YAVGEQVMVRYG-KYSNAMLALNFGF 189


>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
 gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
          Length = 496

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 74/158 (46%), Gaps = 11/158 (6%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPE---GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           E F QW  +      +  +  A FP    G G VA +D+ +  V+  +P    ++  T  
Sbjct: 8   EAFIQWFLNHNGTMDREAMGIADFPAAEGGRGAVALKDLPEGHVLFTIPRALTLSTRTSR 67

Query: 122 ASEIGSLCSGLK-----PWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEEL 175
             E+  L    +      W  + L ++ E  + ++S W  YLDILP   D+ +FW+EE+L
Sbjct: 68  LPELFGLEEWKRLKLHQGWAGLMLCMMWEAAQGKESRWAGYLDILPAAFDTPMFWNEEDL 127

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF 213
            EL GT ++   LG KE  + +Y    +  I    +LF
Sbjct: 128 SELAGTSIVG-KLG-KEDAERDYDSKIKPAIAKRPELF 163



 Score = 46.2 bits (108), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 65/249 (26%), Positives = 102/249 (40%), Gaps = 68/249 (27%)

Query: 241 NLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLN 299
           ++ ++P+AD++N   G          + A LF    +  + T  P++ GEQ+   Y DL 
Sbjct: 250 DIAMVPMADMLNARFGT---------ENAKLFHEKEVLKMTTTKPIRKGEQIWNTYGDL- 299

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVL--- 356
             NAEL   YG ++         L L     D   G+  D+ E   LG     D+V    
Sbjct: 300 -PNAELLRRYGHVD--------LLKL----PDGREGNPGDVVE---LGADIVVDVVKETN 343

Query: 357 GRTLPPAMLQYLRL-VALGGTDAFLLESIFRNTIWGHLDLPVS----------------- 398
           GR+   A+ + +   +  GG D F  +       W  LDLP +                 
Sbjct: 344 GRSTSDALAERIDWWLEEGGDDVFEFD-------W-DLDLPPAALSLVRLLLLPDNDWKK 395

Query: 399 ----------HANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRA 448
                      A+ +++  V+  A +   + + T I++DE  LQG  L    R A  VR 
Sbjct: 396 AQSKGKPPKPKADSDILL-VLEKAIEKRQAQYPTAIQDDETRLQG-ELSLNARHATVVRL 453

Query: 449 GEKRVLQQI 457
           GEKR+LQ +
Sbjct: 454 GEKRILQAV 462


>gi|300124011|emb|CBK25282.2| unnamed protein product [Blastocystis hominis]
          Length = 366

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 87/185 (47%), Gaps = 20/185 (10%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPE-CTDSTVFWSEEELVELQGTQLLSTTLGVKEYV 194
           +  A FL+ + + EDS ++ Y D LPE  ++  V W+  E+ +L G+     ++ ++  V
Sbjct: 52  VYFANFLLEDMENEDSFYKPYYDTLPEDISNIPVIWTNSEINQLHGSYF---SICIRSRV 108

Query: 195 QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRG--QNLVLIPLADLIN 252
              Y   ++  +      F R    D +L    ++ SR F        N +L+PLAD++N
Sbjct: 109 VEIYRDYQK--MCDVNSFFCR-YPFDQYLRVRLLIGSRNFGSFFNSLNNGILVPLADMLN 165

Query: 253 HS-PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           H+ P  TT +Y  + K          F + + + ++ G QV+  Y   + N  L   YGF
Sbjct: 166 HTRPRQTTWEYDDKEKA---------FVITSLLNLRQGAQVMDSYG-RRDNRRLLFSYGF 215

Query: 312 IESKS 316
           +E  +
Sbjct: 216 VEDDN 220


>gi|297848312|ref|XP_002892037.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337879|gb|EFH68296.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 548

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 100/217 (46%), Gaps = 15/217 (6%)

Query: 54  SATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKF 113
           +A +    A++E F  WL+          I+ +   +G G+ A  + + ++V+L VP+  
Sbjct: 2   AAISEEEEAKLERFLDWLQVNGAELGGCNIKYSDSRKGFGIFASTETS-DDVLLVVPLDL 60

Query: 114 WINPDTVAASEI-GSLCSGL------KPWISVALFLIREKKKEDSPWRVYLDILPECTDS 166
            I P  V    + G  C  +           + LFL  E+ + +S W+ YLD+LP    +
Sbjct: 61  AITPMRVLQDPLLGPECQKMFQDGQVDDRFLIILFLTFERLRINSSWKPYLDMLPTRFGN 120

Query: 167 TVFWSEEELVELQGTQLL-STTLGVKEYVQNEYLKVEEEIILPNKQLF----PRPITLDD 221
            +++S+++++EL GT L  +T L  K  +   + KV  E+++ N  +        ++ + 
Sbjct: 121 PLWFSDDDILELNGTNLYHATELQKKNLLSLYHDKV--EVLVKNLLILDGDSQSKVSFEH 178

Query: 222 FLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGIT 258
           FLWA  +  SRA +     + V     D I   P  +
Sbjct: 179 FLWANSVFWSRALNIPLPHSFVFPQSQDDIGECPSTS 215


>gi|428174289|gb|EKX43186.1| hypothetical protein GUITHDRAFT_110913 [Guillardia theta CCMP2712]
          Length = 437

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/219 (26%), Positives = 95/219 (43%), Gaps = 33/219 (15%)

Query: 137 SVALFLIREKKKEDSPWRVYLDILPECTDSTVFWS-------EEELVELQGTQLLSTTLG 189
           ++A+FL+ E + + S WR YL  LP+     +F+S       +E+L E Q  +  +    
Sbjct: 167 ALAVFLLLESQNKSSFWRPYLCSLPKHVPLPMFYSKERRQQLKEQLPEDQRVKFDALVEA 226

Query: 190 VKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRL--------RGQ 240
            ++ V   Y+++   + L    LF P   + + F WA  I+ SR + +         RG+
Sbjct: 227 RRDVVDLHYMQLLPVLFLKYPTLFSPEVFSYEKFAWAISIIMSRTWGKTYFDSALGPRGR 286

Query: 241 NL---VLIPLADLINH-SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
           N+    L P AD+ NH S G+       + +G          +L     +  GEQ  I Y
Sbjct: 287 NITVHTLAPAADMPNHDSSGLEANR---DPRGR--------MTLNAQKNLSVGEQFFISY 335

Query: 297 DLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFG 335
             +K +AE    YGF+   +        +  SE DP  G
Sbjct: 336 G-SKCDAEFLAHYGFVPFNNSHKRCKGRV-YSEGDPNMG 372


>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
 gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
          Length = 429

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 72/289 (24%), Positives = 114/289 (39%), Gaps = 45/289 (15%)

Query: 88  FPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC----SGLKPWISVALFL 142
           FP+ G G+VA   +   E  L++P    I+  T   S + S      + L P   + LF+
Sbjct: 52  FPDTGRGVVALEKLVGGETFLKLPATLLISTRTALQSRLHSFIIRHHAKLTPIDVLTLFV 111

Query: 143 IREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK- 200
           + +K   E S W  ++D LP    + VF   +    L          G+  ++Q  +LK 
Sbjct: 112 LDQKLLGEASRWWPFVDSLPRTFTTPVFLRRKVFESLPKDLREEVQTGIT-FIQRTFLKL 170

Query: 201 -------VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF--------SRLRGQNLVLI 245
                  VEEE   P  Q      T ++F+WA+  + +R          S     +  L 
Sbjct: 171 KVLLGGHVEEE---PEVQCLSTGFTWNNFVWAWTAVNTRCIFAQGSNSSSLWEDDHCALA 227

Query: 246 PLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAEL 305
           P  D +NH        +   I+ A +      F + +     A EQV I Y  + SN  L
Sbjct: 228 PFLDCLNH-------HWKASIETAMVGEN---FEILSHKSHDANEQVFISYGPH-SNRRL 276

Query: 306 ALDYGFIESKSDRNAYTLTLE--------ISESDPFFGDKLDIAETNGL 346
            LDYGF+   +  +   +T +           + P F  KLD  E+  +
Sbjct: 277 FLDYGFVLPDNPNDVVVVTKDHLVKLYSLHENTMPHFQSKLDFLESKNV 325


>gi|359480927|ref|XP_002267602.2| PREDICTED: N-lysine methyltransferase setd6-like [Vitis vinifera]
 gi|296084825|emb|CBI27707.3| unnamed protein product [Vitis vinifera]
          Length = 500

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/307 (21%), Positives = 121/307 (39%), Gaps = 48/307 (15%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++  F +W++ Q +    +     +  +G+ + A+ D+ + +VV  +P    +   T  A
Sbjct: 14  RLRAFKRWMKSQGIDCSDALEITHSLDQGISVYAKCDLEEGDVVATIPKDSCLTVKTSGA 73

Query: 123 SEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVEL-QG 180
            +I     G +  + ++  L+ E      SPW  YL +LP      + WS +E+  L  G
Sbjct: 74  KDIIEE-YGFEGPLGLSFALMYEISLGHASPWAPYLHLLPNSESLPLVWSLDEVDSLLSG 132

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILP-----NKQLFPRPITLDDFLWAFGILRSRAFS 235
           T++       K  + +++    +E ILP     + +L P    ++ ++ A  ++ SR+F 
Sbjct: 133 TEIHKIVKEDKALIYDDW----KENILPLMDSTSLKLSPDFFGVEQYIAAKSLVASRSFE 188

Query: 236 RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA------GLFSRDLLFSLRTPVP---- 285
                   ++PLADL NH  G     +  E+                  S   P+P    
Sbjct: 189 VDDYHGFGMVPLADLFNHKTGAENVHFTTELSHGDSDNDTDNNDEGNNMSDNKPLPQNSF 248

Query: 286 -------------------------VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
                                    VKAG++V   Y  +  NA L   YGF E  +  + 
Sbjct: 249 DDGNLEDPSYLGNDSMILETIIVKSVKAGDEVFNTYG-SMGNAGLLHRYGFTEPDNPNDI 307

Query: 321 YTLTLEI 327
             + LE+
Sbjct: 308 VNIDLEL 314


>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
 gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
          Length = 487

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 71/145 (48%), Gaps = 17/145 (11%)

Query: 54  SATTNPPTA-----QVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEV 105
           S+T + P A     Q ++F  WL+    V     +R A       G G+VAQ +I + E 
Sbjct: 2   SSTAHFPDAEGFQQQSDSFMSWLQASPGVQLNPKLRLADLRATGAGRGVVAQSNIVEGEE 61

Query: 106 VLEVPMKFWINPDTVAASEIGSLCS-----GLKPWISVALFLIREK-KKEDSPWRVYLDI 159
           +  +P    +   TV  SE+ +L +      + PW+S+ L ++ E  + E S W  Y  +
Sbjct: 62  LFSIPRTMVL---TVQNSELRTLLAENLEEQMGPWLSLMLVMVYEYLQGEKSRWAPYFRV 118

Query: 160 LPECTDSTVFWSEEELVELQGTQLL 184
           LP   D+ +FWS  EL ELQ + ++
Sbjct: 119 LPSRFDTLMFWSPAELQELQASTIV 143


>gi|295665983|ref|XP_002793542.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226277836|gb|EEH33402.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 527

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 85/198 (42%), Gaps = 13/198 (6%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           +E  +  S W  Y+  LPE      F++ EE   L GT L   L   +   E   N+  +
Sbjct: 124 KEHIRMSSAWTEYVKFLPEPLPLPTFYTIEERELLHGTSLELALDAKIATLEKEFNDLRE 183

Query: 201 VEEEIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
              +I   N+  +      +T +D+     + RSR+     G   V++P  D+ NH+   
Sbjct: 184 ATMDIKWCNRYWWHEETGRLTFEDWKLVDALYRSRSLE-FPGIGHVMVPCLDMTNHASND 242

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-S 316
            T +  YE+ G G    + +  LR    +  G+++ I Y   K  AE+   YGF+E+  +
Sbjct: 243 LT-NALYEVDGGG----NAVLQLRWGRSLGKGDEITITYGDEKGAAEMIFSYGFLENGLA 297

Query: 317 DRNAYTLTLEISESDPFF 334
                 L L+I   DP  
Sbjct: 298 KARQLFLDLDIPCDDPLL 315


>gi|357521293|ref|XP_003630935.1| SET domain-containing protein [Medicago truncatula]
 gi|355524957|gb|AET05411.1| SET domain-containing protein [Medicago truncatula]
          Length = 482

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 97/398 (24%), Positives = 166/398 (41%), Gaps = 40/398 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVP--MKFWINPDTVAASEIGSLCSGL--KPW-ISVALFLIR 144
            GLGLVA+ DI     ++ +P  +    N D  + S +  L S +  + W + + L L+ 
Sbjct: 68  NGLGLVAKEDIPIGTDLIALPQHIPLHFNHDDNSHSLLLQLTSHVPDELWSMKLGLKLLL 127

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E+ K  S W  Y+  LP+     +F+  E++  LQ   +L        ++ +   KV+  
Sbjct: 128 ERAKLGSFWWPYISNLPQTYTLPIFFPGEDIKNLQYAPILHQVNKRCRFLLDFEQKVKHA 187

Query: 205 II--LPNKQLFP-RPITLDDFLWAFGILRSRAFSRLRGQNL---------VLIPLADLIN 252
           ++   P+K  F  + +      WA   + SRAF +L G            +++PL D+ N
Sbjct: 188 LVGLTPDKHPFGGQEVDASSLGWAMSAVSSRAF-KLHGNKQSNGINFDIPMMLPLIDMCN 246

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF- 311
           HS          +  G    S  +   +     +K  + +L+ Y    SN    LDYGF 
Sbjct: 247 HSFNPNARIVQEQETG----STKMWVKVVAEKAIKEDDPLLLCYGC-LSNDLFLLDYGFV 301

Query: 312 --------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLGESAYFDIVLG 357
                   IE K D    +A ++   +S    S P    +L +++ N  GE+    + LG
Sbjct: 302 IQSNPYDCIELKYDGALLDAASMAAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLG 361

Query: 358 RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG-HLDLPVSHANEELICRVVRDACKSA 416
                       L  L  +D   ++    NT+   + + P+  AN+  + R +   C  A
Sbjct: 362 GQEIIEGRLLAALRVLLASDMASVQKHDLNTLKSLNAEAPLGVANDLAVFRTLIALCVIA 421

Query: 417 LSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           L  F T + +DE LL+ G       +A+  R  +K V+
Sbjct: 422 LGHFPTKLMDDESLLKQGA-SGSTELAIQFRIQKKSVI 458


>gi|317137154|ref|XP_001727528.2| SET domain protein [Aspergillus oryzae RIB40]
          Length = 496

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 82/192 (42%), Gaps = 15/192 (7%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIIL 207
           +PW  Y+  LP       F++ EE   L+GT L          ++NE+    +  E I  
Sbjct: 131 NPWTEYIQYLPSSITLPTFYTVEERELLRGTSLKLAVDAKIVSLENEFELLRQSTENISC 190

Query: 208 PNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGITTEDYA 263
             K  +       TLDD+ +   + RSR    L      ++P  D+ NH S  I    Y 
Sbjct: 191 CRKHWWDENTGRFTLDDWKYVDAMYRSRMVD-LPSSGHAMVPCIDMANHASEDIVKALYE 249

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAYT 322
            + +G      + +  LR+   + + E+V I Y  +K  +E+   YGF+ES +       
Sbjct: 250 EDTEG------NAVLQLRSGRKLHSDEEVTISYGDDKPASEMIFSYGFLESERGGAKQIF 303

Query: 323 LTLEISESDPFF 334
           L L+I E DP  
Sbjct: 304 LNLDIPEDDPLI 315


>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 339

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 116/286 (40%), Gaps = 56/286 (19%)

Query: 67  FWQWLRDQKVVSPKS--PIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE 124
           F  W +++KV  P S   + PAT   G  LVA R+I   EVV+EVP       D V  +E
Sbjct: 42  FVAWCQEEKVRFPSSHLAVLPAT---GRALVASRNIKMGEVVVEVP------DDAVLMAE 92

Query: 125 IGSLCSGLK----------------PWISVALFLIREKKKEDSPWRVYLDILPE-CTDST 167
              L   L+                  + +A+   R +  E S W  YL +LP+  T   
Sbjct: 93  NCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWERWRGPE-SRWAPYLALLPDDMTHMP 151

Query: 168 VFWSEEELVELQGTQLLSTTLGVKEY--------------VQNEYLKVEEEIILPNKQLF 213
           ++W   E  EL+GT      LG  ++              V   ++    E+ LP  +  
Sbjct: 152 LYWKRREFRELRGTAAYDKMLGRAQHPSDAPTQVPLLWSEVVGPFIAEHPELGLPGGE-- 209

Query: 214 PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFS 273
            R   L  + WA   + S +F     +   ++P+ DL+NH     T D    +       
Sbjct: 210 -RGYEL--YRWATAAVASYSFILGDDKYQAMVPVWDLLNH----ITGDVNVRLHHCSKRH 262

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
              + ++R    + AG +++  Y    SNAEL   YGF+E  +  N
Sbjct: 263 VLQMIAMRD---IVAGSELVNNYG-ELSNAELLRGYGFVERANRYN 304


>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
 gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
          Length = 484

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/278 (25%), Positives = 118/278 (42%), Gaps = 42/278 (15%)

Query: 65  ETFWQWLRDQKVV--SPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
           E F  WL +   V  SPK  I     F +G  +VA+  I   EV+ E+P    +N  T  
Sbjct: 9   EQFSAWLSEVAGVRMSPKIAISDLRNFNQGRCVVAREAIEGQEVLFEIPRSAILNVGTSR 68

Query: 122 ASEI-----GSLCSGLKPWISVALFLIREKK--KEDSPWRVYLDILP--ECTDSTVFWSE 172
            +E      G L   +  W  + L ++ E K     S W  Y  +LP  +  +S +FW++
Sbjct: 69  LTEKYPGIRGRLLEEIGHWEGLVLCMLYEMKVLNIQSRWWAYFQVLPRPDEVNSLMFWND 128

Query: 173 EELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSR 232
           E+L  L+ + L+   +GV E  Q  Y +V + +     +     ++  DF++   ++ S 
Sbjct: 129 EQLEGLKPS-LIVERVGVAEAKQM-YERVLQYVEESGVEELG-SVSWSDFVYVASVIMSY 185

Query: 233 AFS-----------------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           +F                  +  G    +IPLAD +N +      +  Y+I+   + +  
Sbjct: 186 SFDVELVDADPNEEQELSTVKNDGYMKSMIPLADTLNANTSKCNANLVYDIESLKMCATK 245

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
                    P+  GEQV   Y  +  N+EL   YG++E
Sbjct: 246 ---------PIGMGEQVYNIYG-DHPNSELLRRYGYVE 273


>gi|406868331|gb|EKD21368.1| SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 480

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 84/188 (44%), Gaps = 14/188 (7%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKE---YVQNEYLKVEEEIIL 207
           +PW  Y+ +LPE       WSEEE V L GT L  T +  K      + E L+ +   I 
Sbjct: 118 NPWTEYVRMLPESIPVPTMWSEEERVMLTGTSL-ETAVSAKCASLISEIEDLRGKTAEIA 176

Query: 208 PNKQLFPRPITLDDFLWAF--GILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
             ++ +    +L    W+      RSR+   +      +IP  D++NH+     E  +Y 
Sbjct: 177 WCQKCWWEEESLRYENWSLLDAWYRSRSL-EVPNAGESMIPCVDMVNHA----AEANSYY 231

Query: 266 IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
            + +     ++   LR    ++A  +V I Y  +KS AE+   YGFI+ +      TL +
Sbjct: 232 ERTS---DNNIALLLRPDTQLEAESEVTISYGSSKSEAEMLFSYGFIDEQGTSKGLTLNI 288

Query: 326 EISESDPF 333
           + S  DP 
Sbjct: 289 DPSPDDPL 296


>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
 gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 70/125 (56%), Gaps = 5/125 (4%)

Query: 65  ETFWQWLRDQKV-VSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWIN-PDTVA 121
           + F  WLR     +SPK  +      + G G+VA ++IA++E++  +P    ++  +++ 
Sbjct: 9   QAFLAWLRKSGAEISPKIKLEDLRNKDAGRGVVASQEIAEHELLFRIPRTSILSVENSIL 68

Query: 122 ASEI-GSLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
           ++EI  +  S L PW+S+ L ++ E      S W  Y  +LP   ++ +FW+E+EL ELQ
Sbjct: 69  STEIPAATLSLLGPWLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDELAELQ 128

Query: 180 GTQLL 184
            + ++
Sbjct: 129 ASAVV 133


>gi|21594028|gb|AAM65946.1| unknown [Arabidopsis thaliana]
          Length = 471

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/355 (23%), Positives = 141/355 (39%), Gaps = 50/355 (14%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV- 194
           + + L L++E+   DS W  Y+  LPE     +F+  E++  LQ   LL     +  ++ 
Sbjct: 108 MKLGLRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKICRFLL 167

Query: 195 --QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL---------- 242
             + E  +  E++   +     + +      W    + +RAF RL G             
Sbjct: 168 EFEQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAF-RLHGNKKLQGGSSDDVP 226

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
           +++PL D+ NHS     +  A  I+       + L  +     VK  + +L+ Y    SN
Sbjct: 227 MMLPLIDMCNHS----FKPNARIIQEQNGADSNTLVKVVAETEVKENDPLLLNYGC-LSN 281

Query: 303 AELALDYGF---------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLG 347
               LDYGF         IE K D    +A ++   +S    S P       +++ N  G
Sbjct: 282 DFFLLDYGFVIESNPYDTIELKYDEQLMDAASMAAGVSSPKFSSPAPWQHQLLSQLNLAG 341

Query: 348 ESAYFDIVLGRTLP--------PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSH 399
           E     + +G   P          +L    LV +   D+  L+S+           P   
Sbjct: 342 EMPNLKVTIGGPEPVEGRLLAALRILLCGELVEVEKHDSDTLKSLSAVA-------PFGI 394

Query: 400 ANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           ANE  + R V   C  ALS F T I EDE +++ G +     +++  R  +K V+
Sbjct: 395 ANEIAVFRTVIALCVIALSHFPTKIMEDEAIIKQG-VSATAELSIKYRIQKKSVI 448


>gi|444314545|ref|XP_004177930.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
 gi|387510969|emb|CCH58411.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
          Length = 550

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 105/239 (43%), Gaps = 26/239 (10%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREK--- 146
           +G+  +A  ++   + ++EVP         +     G           + + L + K   
Sbjct: 33  KGISCIASEELGGQQPIIEVPSDIIFCSKQIQEFLPGIEIDSQDSNTCLKIILCKLKFDT 92

Query: 147 KKEDSPWRV--YLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           K  +SP R   Y++ LPE  DS + W+E+EL  LQ T L +      + V +E+ K  E+
Sbjct: 93  KDSNSPHRFEPYINALPEIIDSPLNWNEDELKLLQNTNLGNCLKERFQNVYDEWFKFLEK 152

Query: 205 IILPNKQLFPRP-----ITLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINH 253
               N Q F            +FLWA  I+ SR+F            +++L+P+ DL+NH
Sbjct: 153 --YQNYQEFETQSETSWYNFSNFLWAHLIITSRSFPEYIINPNCPRDSVMLLPVLDLLNH 210

Query: 254 SPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           S      +Y+ +++  G    + ++       ++ G+++   Y   K N EL   YGF+
Sbjct: 211 S------NYS-KVEWDGNKGGNFIYKKLDLQEIEIGDEIYNNYG-GKGNEELLNGYGFV 261


>gi|303272869|ref|XP_003055796.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463770|gb|EEH61048.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 677

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/290 (24%), Positives = 112/290 (38%), Gaps = 75/290 (25%)

Query: 101 AKNEVVLEVPMKFWINPDTVAASEIGS-------LCSG-LKPWISVALFLIREKKK-EDS 151
           AK  V+  VPM+  ++ D  A  +  +       L SG +   ++V L LI E+++ E S
Sbjct: 174 AKPVVLASVPMRLAVSADAAANHDTPAGAVFREMLESGEIDERLAVMLLLIVERRRGEGS 233

Query: 152 PWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQN------------- 196
           P + Y+D LP+  +  + +F+   E+  L+GT L       +  + +             
Sbjct: 234 PIKAYVDALPKPGEHKTPLFYDANEMKALRGTNLHRAVSAQRRRLASVLKDSAIPAGKKL 293

Query: 197 -----EYLKVEEEIILPNKQLFP----------------------RPITLDDFLWAFGIL 229
                 ++  EE          P                      +PIT+D+FLWA+   
Sbjct: 294 MRAVASHVPPEEREKKTGGGWLPWWLGGGNGKRDLRRRRATKPSAKPITMDEFLWAYATF 353

Query: 230 RSRAFS-------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD------- 275
            SRA +          G    ++P  D  NHS       +A     A    R+       
Sbjct: 354 WSRALALPIGPDPEASGAVEAIVPGIDFANHSCARPNARWAV----ANASGREGATAGEP 409

Query: 276 --LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
              L  L  P P   GE+VLI Y  +K N EL   +GF E ++  +A  L
Sbjct: 410 TVTLECLSVPGP---GEEVLISYG-DKPNEELLFVHGFAERENPHDALVL 455


>gi|255078794|ref|XP_002502977.1| set domain protein [Micromonas sp. RCC299]
 gi|226518243|gb|ACO64235.1| set domain protein [Micromonas sp. RCC299]
          Length = 536

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 85/177 (48%), Gaps = 25/177 (14%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWI----NPDTVAASEIGSLCSGLKPW-ISVALFLIRE 145
           G GL A RDI   E VL +P++  I    +     A E+ S      PW + +A  L++E
Sbjct: 106 GRGLEASRDIENGEPVLRLPLEMGICDYQDGHPAEAWEVMSNA----PWGVRLACRLLQE 161

Query: 146 KKK-EDSPWRVYLDILPECT-DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           + K EDS +  Y+ ++PE    S + W+++E+  LQ    ++    +++ V   + K+  
Sbjct: 162 RAKGEDSDYAPYIALIPESVPGSPLMWTDDEVASLQYPPAVAEAREMRDAVATWFRKLSA 221

Query: 204 E--IILPNKQLFPRPITLDDFLWAFGILRSRAF---SRLRGQNL--VLIPLADLINH 253
           E  + L    L       D F  A  ++ SR +   S   G+     L+PLADL+NH
Sbjct: 222 EAPVALAGADL-------DAFKSAVSVVHSRTYGVASSASGEGYFRALLPLADLLNH 271


>gi|326432892|gb|EGD78462.1| hypothetical protein PTSG_09157 [Salpingoeca sp. ATCC 50818]
          Length = 467

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 73/161 (45%), Gaps = 10/161 (6%)

Query: 61  TAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           T+   T   WLR +  +  K  +  +    G GL A       E +L++P K  ++  T 
Sbjct: 53  TSDFTTLEAWLRAEGAIIQKVEVGVSGAGMGNGLFATETTEVGETLLQIPSKCMLSEVTA 112

Query: 121 AA-SEIGSLCSG-----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEE 174
           A+ S+IGS  +        P +++A+ L+ E     SP+R Y+  LP      + WSE +
Sbjct: 113 ASCSKIGSFVTSDPMLSAMPNLALAVHLLAELNDPASPYRAYIATLPSSFPLPMTWSERD 172

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYL----KVEEEIILPNKQ 211
           +  L+G  L +  L +   V  +Y     ++ E  + P KQ
Sbjct: 173 IARLRGLPLFARVLRLYRNVARQYCYLYTRIVEGALRPAKQ 213


>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
           24927]
          Length = 611

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 90/199 (45%), Gaps = 29/199 (14%)

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           KE S +  Y+ +LP+  D+ ++++++E+  L GT L +  + +++ +  E  +  ++ + 
Sbjct: 123 KEQSQFWPYIRLLPKTFDTPLYFNDDEMERLAGTNLGAGDVLLRKQLWMEEWEAGKQFLE 182

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNL-----------------VLIPLAD 249
                  R  T D FL A  I  SR+F S+L G  +                 VLIPL D
Sbjct: 183 GVGAERAREYTWDLFLRAATIYTSRSFPSKLVGITMDSSIEENTMLSDDNGFPVLIPLVD 242

Query: 250 LINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDY 309
           ++NH P        +E            FSL TP  +  G QV   Y   K N EL + Y
Sbjct: 243 ILNHKPNTKI---IWEPTQTS-------FSLITPETISEGSQVFNNYG-PKGNEELLMGY 291

Query: 310 GFIESKSDRNAYTLTLEIS 328
           GF+  ++  ++  +   IS
Sbjct: 292 GFVIPENPGDSLAMKFTIS 310


>gi|85090666|ref|XP_958526.1| hypothetical protein NCU09827 [Neurospora crassa OR74A]
 gi|28919896|gb|EAA29290.1| predicted protein [Neurospora crassa OR74A]
          Length = 532

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 95/409 (23%), Positives = 162/409 (39%), Gaps = 101/409 (24%)

Query: 135 WISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKE-- 192
           W  + L L+ E  +  S W  YL ILP   D+ +FW+E EL ELQ + L++  +G +E  
Sbjct: 126 WTLLILILMHEYLQGSSNWSPYLSILPTQFDTPMFWTEAELSELQASALVAK-VGKEEAD 184

Query: 193 -YVQNEYLKV---EEEIILPNKQLFPRPITLDD--------------FLWAFGILRSRAF 234
             ++ + +KV    EE+  P     P+   L++                +AF +      
Sbjct: 185 KMIRTKIVKVVQEHEEVFYPADT--PKTQRLEEGELLKLGHRMGSAIMAYAFDLANDDED 242

Query: 235 --------------SRLRGQN--LVLIPLADL----------INHSPGITTEDYAYEIKG 268
                          ++ G N  + ++P+AD+          INH     T     EIK 
Sbjct: 243 EDEEEEEEEDGWVEDKIAGMNDSMGMVPMADMLNADAVFNAHINHGEACLTATSLREIKE 302

Query: 269 AG--------LFSRDLL--FSLRTPVP------------VKAGEQVLIQYDLNKSNAELA 306
                     L S +LL  +   TP              V+ G + +++  +     E+ 
Sbjct: 303 GEEILNYYGPLSSAELLRRYGYVTPNHARYDVVEVGWGLVEGGLKKMVEEGIKGRKKEVD 362

Query: 307 LDYGFIESKSDRNAY-----TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
            D      +  +        +  LE    DP        +E    GE+ +    L + L 
Sbjct: 363 WDKVEELVEEVKEEEGEWEDSFVLERQSEDP-------DSEGQVHGEAEFMG--LPQELE 413

Query: 362 PAMLQYLRLV--ALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
             +  YL+ V   +G  D  + E++   T+            +E++   VR A +     
Sbjct: 414 EQVKTYLKAVKKVVGTGDRAVAEALGNKTV-----------RKEILLGGVRKALEEREGQ 462

Query: 420 FHTTIEEDEQLL---QGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKERE 465
           + T++EEDE++L   +GG    R  +AV VRAGEKR+L++     K++E
Sbjct: 463 YATSLEEDEKVLAGMEGGRPTTRKEMAVWVRAGEKRILKEAVAWVKKQE 511


>gi|224101881|ref|XP_002312458.1| SET domain-containing protein [Populus trichocarpa]
 gi|222852278|gb|EEE89825.1| SET domain-containing protein [Populus trichocarpa]
          Length = 362

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 86/347 (24%), Positives = 144/347 (41%), Gaps = 37/347 (10%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           + + L L++E+ + +S W  Y+  LP      +F+  E++  LQ   LL        ++ 
Sbjct: 1   MKLGLKLLQERARVNSFWWPYISNLPRTYSVPIFFQGEDIKNLQYAPLLYQVNRRCRFLL 60

Query: 196 N--EYLKVEEEIILPNKQLFPRP-ITLDDFLWAFGILRSRAFSRLRGQNL---------V 243
           +  + +K   E +  N   F R  +      W+   + SRAF RL G+ L         +
Sbjct: 61  DFEQEVKRAVENLKQNDHPFDRQDVDASSLGWSMSAVSSRAF-RLYGKKLPDGTCIDAPM 119

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           ++PL D+ NH+     +    +  G    S  +L    TP+  K  + +L+ Y    +N 
Sbjct: 120 MLPLIDMCNHAFNPNAQIIQEQDAG----SAKMLIKAETPI--KQNDAILLNYGC-LNND 172

Query: 304 ELALDYGF---------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLGE 348
              LDYGF         IE K D    +A ++   +S    S P    +  +++ N  GE
Sbjct: 173 LFLLDYGFVIPSNPYDCIELKYDGAFLDAASVAAGVSSPKFSSPAPWQQQFLSQLNLDGE 232

Query: 349 SAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG-HLDLPVSHANEELICR 407
           +A   + LG            L  L  +D  ++     +T+     D P+  ANE    R
Sbjct: 233 AANLKVTLGGQELVDGRLLAALRVLLASDMEMVRKHDMDTLKSLSADAPLGIANEVAAYR 292

Query: 408 VVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
            +   C  AL  F T I EDE  L+ G +     +A+  R  +K V+
Sbjct: 293 TIIALCVIALEHFPTKIMEDESSLRQG-VSASTELAIQFRIQKKSVI 338


>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 232

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 104/214 (48%), Gaps = 16/214 (7%)

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           LIP+ D+ NH  G  T D+           R   ++LR     K GEQV I Y   ++N+
Sbjct: 19  LIPMWDMCNHENGRITTDF------NATSDRCECYALRD---FKKGEQVFISYG-PRTNS 68

Query: 304 ELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG-RTLPP 362
           +  +  GF+   ++++ + L L IS++D    +++++     L     F +  G   +  
Sbjct: 69  DFFVHSGFVCMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPGTEPISD 128

Query: 363 AMLQYLRLVALGGTDAFLLESIFRNTIWG--HLDLPVSHANEELICRVVRDACKSALSGF 420
            +L +LR+ ++    A L   +  + ++   H+D  +    EE + + +    +  ++ +
Sbjct: 129 TLLAFLRVFSM--RKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANY 186

Query: 421 HTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
            TT++ED +LL+   L    ++AV +R  EKR+L
Sbjct: 187 PTTLKEDLELLE-TTLPQMKKMAVQLRVTEKRIL 219


>gi|430810895|emb|CCJ31573.1| unnamed protein product, partial [Pneumocystis jirovecii]
          Length = 433

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 20/126 (15%)

Query: 149 EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ----NEYLKVEEE 204
           + S W  YL I+P+  D   FW  E+   L+GT++        EY+     +E  K  ++
Sbjct: 8   QKSLWYGYLQIIPDKVDIPKFWGHEK-NWLKGTEI--------EYIGGLDISELYKTYQD 58

Query: 205 IILP-----NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITT 259
           +I+P      K L     T D+FL A  I+RSRAF   +   L L+P AD+ NH+  IT 
Sbjct: 59  LIIPFIKKNEKILNQNTFTYDNFLKAISIIRSRAFEIDKFHGLGLVPFADIFNHT--ITK 116

Query: 260 EDYAYE 265
           E   ++
Sbjct: 117 EHVHFQ 122


>gi|303270905|ref|XP_003054814.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462788|gb|EEH60066.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/273 (23%), Positives = 110/273 (40%), Gaps = 39/273 (14%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-ISVALFLIREKKKE 149
           G GL A RD+A  +VV+ VP+   +N    A    G+      PW +++A  ++ E++  
Sbjct: 78  GRGLFAARDVAAGDVVMSVPLAAALNDGVAAPPYDGA------PWSVTLAAAILAERRIG 131

Query: 150 D-SPWRVYLDILPECTDSTVFWSEEELVE-----------LQGTQLLS--------TTLG 189
           D S W  Y+  LP  TD   F ++E L +              +QL +            
Sbjct: 132 DASRWAPYVRSLP--TDVVGFANDEGLFDDASCGASSSSSSSSSQLAAEARALASHDAAA 189

Query: 190 VKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF------SRLRG--QN 241
             E  +   L       L +    P P T  ++ WA  ++ SR F      + + G    
Sbjct: 190 ADELDRYRSLLTRSHAALTSAGRAPAPPTFAEWRWAMSMVHSRTFRLEEPAAGVAGFETR 249

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
            V++P  DL+NH          ++ +   L      F +     V+AGE+VL+ Y   + 
Sbjct: 250 RVMVPYVDLLNHDSRADVWQCEWDCEW-DLGGGGGTFVVTATRDVRAGEEVLVSYG-ERC 307

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFF 334
           +    L +GF+ + +  N   L     E+  ++
Sbjct: 308 DRHFFLFFGFLPAPNPHNTVALFANAREAAAWY 340


>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
           NZE10]
          Length = 445

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 66/272 (24%), Positives = 114/272 (41%), Gaps = 38/272 (13%)

Query: 66  TFWQWLRDQKV-VSPK---SPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA 121
            F  WLRD    +S K     +R      G G+VA  D+ ++E +  VP    +  +T  
Sbjct: 13  AFVNWLRDNGASISAKITLDDLRQQNA--GRGIVAVEDLDEDEELFSVPRSTMLTTETSR 70

Query: 122 ASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQG 180
             E   L     PW+S+ + +  E      S W+ Y D+LP   D+ +FWS+ EL  L+G
Sbjct: 71  NGE-AVLQEVDDPWLSLIVVMALEYLDGSQSRWKPYFDVLPVSFDNLMFWSDRELRHLEG 129

Query: 181 TQLLST--TLGVKEYVQNEYLKVEEEI----ILPNKQLFPRPITLDDFLWAFGILRSRAF 234
           + ++            + + + V E I       N++L      +   + A+G     + 
Sbjct: 130 STVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMAYGFDLETSS 189

Query: 235 SRLR------------GQNL--VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
            + +            G+ L   ++PLAD++N            +   A LF  D    +
Sbjct: 190 DQAKNDGEEWEEDSDAGETLPKGMVPLADMLNADA---------DRNNAKLFYEDDKVVM 240

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           +T  PVKAGE++   +  +   A+L   YG++
Sbjct: 241 KTIKPVKAGEELYNDFG-SLPRADLLRRYGYL 271


>gi|58261130|ref|XP_567975.1| nucleus protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|134115865|ref|XP_773415.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256040|gb|EAL18768.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57230057|gb|AAW46458.1| nucleus protein, putative [Cryptococcus neoformans var. neoformans
           JEC21]
          Length = 495

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/261 (27%), Positives = 110/261 (42%), Gaps = 49/261 (18%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINP------DTVAASEIGSLCSGLKPWISVALFLIR 144
           G G VA +DI +   +  VP    ++       D + ASE   L  G   W  + L ++ 
Sbjct: 42  GYGAVAVKDIEEGTPLFHVPDDLILSAYTSDLKDHLDASEWDQLNKG---WAQLILVMMW 98

Query: 145 EK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           E  K   S W  YL  +P   ++ +FW+E +  +L GT  ++  +G +E  + EY  V  
Sbjct: 99  ETIKGSKSRWAGYLANMPVLFETPMFWTERQREQLSGTD-IADRIG-REDAEAEYTSVLA 156

Query: 204 EIILPNKQLFPRP---ITLDDF---------------LWAFGILRSRAFSRLRGQN---- 241
             I  +  LFP     IT+D F               L  FG  RS + SR  G +    
Sbjct: 157 PFIKAHPDLFPVDSPHITMDAFHIQGSRILSRSFTVPLHRFG--RSHSQSRSDGNSEKES 214

Query: 242 --------LVLIPLADLINHSPGITTED-YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                   +V+IP AD++N + G      Y  E    G    D    +++   VK  EQ+
Sbjct: 215 DDEDEEEMVVMIPFADMLNAAWGKDNAHLYVDEDTIEGF---DEGVVMKSTQLVKQSEQI 271

Query: 293 LIQYDLNKSNAELALDYGFIE 313
              YD +  N+EL   YG ++
Sbjct: 272 YNTYD-SPPNSELLRKYGHVD 291


>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
           prasinos]
          Length = 866

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 67/244 (27%), Positives = 111/244 (45%), Gaps = 33/244 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS------------V 138
           G G     D+ + +V+LE+P+    + ++   SE+ +         +            +
Sbjct: 24  GRGNAVTEDVRRGDVLLEIPLSRCFSLESAQKSEMLTKAMAKAAAAAAGTRFTPTHDQYM 83

Query: 139 ALFLIREKK--KEDSPWRVYLDILPECTDSTVFWSEEELVE--LQGTQLLSTTLGVKEYV 194
           A+F++ E+   K+ S +   L I P+  D  +FWSEEE     L GT   + TL + E V
Sbjct: 84  AMFILLEQNLGKQSSHYEHILSI-PKAYDLPLFWSEEERQRSLLFGTTTYAETLALDEEV 142

Query: 195 QNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLR-----GQNLVLIPLA 248
             +Y  ++  +    +  F  + IT+D F W    L SR    LR      +  VLIP  
Sbjct: 143 IQDYELLKHHL---GEDFFREQNITMDRFKWVRATLWSRQCDLLRPAPETTRLRVLIPEF 199

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK-SNAELAL 307
           D+ NHS  +     ++++     +SR L+ +  T   V  GEQ  I Y   + S+++L L
Sbjct: 200 DMFNHSSKVPL-GSSHKLN----YSRGLVTAFAT-ANVPKGEQAYISYGSGEASSSKLLL 253

Query: 308 DYGF 311
            YGF
Sbjct: 254 WYGF 257


>gi|391869684|gb|EIT78879.1| hypothetical protein Ao3042_04699 [Aspergillus oryzae 3.042]
          Length = 496

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 82/192 (42%), Gaps = 15/192 (7%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIIL 207
           +PW  Y+  LP       F++ EE   L+GT L          ++NE+    +  E I  
Sbjct: 131 NPWTEYIQYLPSSITLPTFYTVEERELLRGTSLKLAVDAKIVSLENEFELLRQSTENISW 190

Query: 208 PNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGITTEDYA 263
             K  +       TLDD+ +   + RSR    L      ++P  D+ NH S  I    Y 
Sbjct: 191 CRKHWWDENTGRFTLDDWKYVDAMYRSRMVD-LPSSGHAMVPCIDMANHASEDIVKALYE 249

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAYT 322
            + +G      + +  LR+   + + E+V I Y  +K  +E+   YGF+ES +       
Sbjct: 250 EDTEG------NAVLQLRSGRKLHSDEEVTISYGDDKPASEMIFSYGFLESERGGAKQIF 303

Query: 323 LTLEISESDPFF 334
           L L+I E DP  
Sbjct: 304 LNLDIPEDDPLI 315


>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
 gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
          Length = 523

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 74/326 (22%), Positives = 130/326 (39%), Gaps = 74/326 (22%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNE--------------VVLE 108
           ++E F +W ++  +      I+  +  EG GL  Q D A+ +              V++ 
Sbjct: 4   RLERFSRWSQEHGIQFRGCAIKRGSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVLVV 63

Query: 109 VPMKFWINPDTVAASEI------GSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILP 161
            P+   + P T+    +        L + +   + V +FLI E+ +   S W  YL++LP
Sbjct: 64  TPLDLALTPVTIVKDPVLGNVYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEMLP 123

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL--PNKQLFPRPITL 219
               + +++ +EEL+EL GT L   T   + +  + ++     + L  P+     R +  
Sbjct: 124 SGFGTPLWFEDEELMELDGTTLFEAT-KAQVFFPSTFVSTCMSLYLFRPDD----RELEF 178

Query: 220 DDFLWAFGILRSRAF----------------SRLRGQNLV-------------------- 243
            +FLWA  I  +RA                 ++  G  LV                    
Sbjct: 179 QEFLWANCIFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISCSAKDVSTIW 238

Query: 244 ---LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR--TPVPVKAGEQVLIQYDL 298
              L+P  D  NH+   +     +EI G+   +  +  S+     V    G +VLI Y  
Sbjct: 239 IEGLVPGIDFCNHTRRASG---LWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINYG- 294

Query: 299 NKSNAELALDYGFIESKSDRNAYTLT 324
           +K N EL   YGF+E + + N Y + 
Sbjct: 295 DKGNEELLFLYGFVE-EDNSNDYVMV 319


>gi|358371988|dbj|GAA88594.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 497

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 79/176 (44%), Gaps = 20/176 (11%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN-----EYLKVEEEI 205
           SPW  Y+  LP       F+SEEEL  L+G+ L    L V   + +     E+L+   E 
Sbjct: 131 SPWSEYMKYLPSSIPLPTFYSEEELELLRGSSL---RLAVHAKIASLEKEFEHLRQSTEG 187

Query: 206 ILPNKQLF----PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP-GITTE 260
           +   K+ +       +T +D+ +   + RSR    L      ++P  D+ NH+P G    
Sbjct: 188 LDWCKRYWWDDDTGKLTFNDWKYVDALYRSRMVD-LPQHGHAMVPCIDMANHAPEGTVKA 246

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            Y  +  G      + +  LR    ++A E+V I Y   KS +E+   YGF++  +
Sbjct: 247 LYDEDADG------NAVLQLRDGRSLRADEEVTISYGDEKSASEMIFSYGFLDEHT 296


>gi|452825745|gb|EME32740.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 2 [Galdieria
           sulphuraria]
          Length = 331

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 65/242 (26%), Positives = 109/242 (45%), Gaps = 33/242 (13%)

Query: 27  LLTSLTKKPSFRL--KSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIR 84
           +L   T +PS RL   SR F   CS           + +++ F+ WL  + V  PK  I+
Sbjct: 47  ILNVPTLRPSSRLCKPSRLFLV-CS-----------SGRLDLFYHWLTRENVYMPK--IK 92

Query: 85  PATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP--WISVALF 141
                +GL G+VA   I  +E  L+VP    +    V   E  ++   + P  W     +
Sbjct: 93  LDQNKDGLRGVVAVEGIECDESFLKVPRDLSLQ---VTEHEECTMSEFVDPELWSQENWY 149

Query: 142 LIREKKKED-------SPWRVYLDILPECTDS-TVFWSEEELVELQGTQLLSTTLGVKEY 193
           +    K          S W+ Y+DILP   ++  V+WS  EL +LQ   L+      + Y
Sbjct: 150 VKLSLKLLKEKYLGKLSLWKPYIDILPHALNTGLVYWSSSELAQLQYRPLIEEVKINQYY 209

Query: 194 VQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS--RLRGQNLVLIPLADLI 251
            +  Y +V E +  P + ++ +    + F WA  +++SRAF    +  +   L+P+ D++
Sbjct: 210 REALYTRVFESLSSPVR-VWLQNEKENVFFWALDMVQSRAFGIPDVGNKTYALLPMMDML 268

Query: 252 NH 253
           NH
Sbjct: 269 NH 270


>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 499

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 94/217 (43%), Gaps = 44/217 (20%)

Query: 131 GLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDST-----VFWSEEELVELQGTQLL 184
            L P +    FLI+E  K E+S W  Y+  LP+           FW E+++  L+GT   
Sbjct: 102 SLPPHVIGRFFLIKEYLKGENSFWWPYIATLPQPEQVNSWTLPAFWPEDDIQFLEGTNAH 161

Query: 185 STTLGVKEYVQNEY---LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF--SRLRG 239
                ++  ++ EY    KV +E   PN + + + +    + WAF I  SR+F  S +  
Sbjct: 162 VAIGEIQANIKREYKQARKVLKEENFPNWKEYSQML----YKWAFSIFTSRSFRPSLILS 217

Query: 240 QNL------------------VLIPLADLINHSPGIT-TEDYAYEIKGAGLFSRDLLFSL 280
           Q++                  +L PL D+ NHS   T T D   +     L  +D     
Sbjct: 218 QSVKDYVSTLLPSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQD----- 272

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
                 + G+QV   Y   K+N+EL L YGFI  ++D
Sbjct: 273 ----SYRPGDQVFNNYGF-KTNSELLLAYGFILPETD 304


>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
          Length = 502

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 80/371 (21%), Positives = 143/371 (38%), Gaps = 66/371 (17%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           V+ F +W     +      +R +    G GL A     ++  +L VP K  ++ D    S
Sbjct: 80  VQRFLEWADRMGIEREGITVRCSDGAMGFGLEATHSFKQDAELLRVPRKAMLSWDQARKS 139

Query: 124 EIGSLC------SGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
            +   C            +++AL +  +K   DS W  YLD LP+   + +++S  EL +
Sbjct: 140 AMLKKCFEQDMIVKTMDNVALALMVCCQKLSPDSSWLPYLDALPQTFSTPLYFSALELRK 199

Query: 178 LQGTQLLSTTLGVKEYVQNEYL----------------------KVEEEIILPNKQLFPR 215
           L  +     +L +   V  +++                       V  E +  N      
Sbjct: 200 LSPSPAYEESLIMYRNVARQFVYFLAAVQRSERSRSAKKDKNHAAVGMEPLFLNAPFTVS 259

Query: 216 PITLDDFLWAFGILRSR------AFSR-LRGQNL---VLIPLADLINH------SPGITT 259
             T D + WA   + +R       +++   GQ +    LIPL D+ NH      +   +T
Sbjct: 260 NFTFDLYRWAVACVTTRINFIPSQYAKDSNGQPVAVPCLIPLLDMANHEFDHPLTVHFST 319

Query: 260 E-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR 318
           E DYA               S++     KAG++V I Y + ++N +  L  GF+    ++
Sbjct: 320 EGDYA---------------SIKATKDYKAGDEVTIFYGI-RTNRQFFLHNGFVPDGENK 363

Query: 319 N-AYTLTLEISESDPFFGDKLDIAETNGL-GESAYFDIVLG---RTLPPAMLQYLRLVAL 373
           N  Y L +     D     +L +    G   ES  F   +    R +P ++L + R+  +
Sbjct: 364 NDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVFEVNASERPVPLSLLDFARVFLV 423

Query: 374 GGTDAFLLESI 384
              D+  L+ +
Sbjct: 424 ENPDSVSLDEV 434


>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 227

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 84/197 (42%), Gaps = 33/197 (16%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSG---LKPWISVAL--FLIRE-K 146
           G  A++DI    V L VP+   +       S++G+L      +K   +VAL  FLI E  
Sbjct: 33  GFAAEQDIQVGPVFLGVPLGMMMTTIGARKSKLGALLKDDPIMKSMENVALSMFLILELC 92

Query: 147 KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEI- 205
               S W  Y+ ILP   ++ +++S +EL  L G+ +L   L +   +  +Y    +   
Sbjct: 93  AGSASFWHPYISILPRSFNTVLYFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKIFR 152

Query: 206 ------ILPNKQLFPRPITLDDFLWAFGILRSR---------------AFSRLRGQNL-V 243
                  LP K  F    T D + WA   + +R               A +R  G  +  
Sbjct: 153 THPLAKSLPYKDCF----TYDLYRWAVSAVMTRQNAVPRAVVCGGADDACARGSGSGVAA 208

Query: 244 LIPLADLINHSPGITTE 260
           L+PL DL NHS G  +E
Sbjct: 209 LVPLFDLCNHSDGKVSE 225


>gi|384249279|gb|EIE22761.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 438

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 66/140 (47%), Gaps = 24/140 (17%)

Query: 214 PRPITLDDFLWAFGILRSRAFSRLR----------GQNLVLIPLADLINHSPGITTEDYA 263
           P     D F+ A G+++SRAF   +          G+ L LIP  D+INHS  +   + A
Sbjct: 45  PASCGYDAFVHAAGMVQSRAFHMKKENWITGENEDGEELYLIPGIDMINHSSRLQERNTA 104

Query: 264 YEIKGAGLFSR---DL--------LFSLRTPVPVKAGEQVLIQY-DLNKSNAELALDYGF 311
            E    G+  R   DL        LF ++    V AGEQ+L  Y DL  S+A+L   YGF
Sbjct: 105 LEQSSDGVTFRRKPDLPPEEYNGGLFVMKAGRKVAAGEQILHTYGDL--SDAQLLQTYGF 162

Query: 312 IESKSDRNAYTLTLEISESD 331
           +E  +  N +   + +  SD
Sbjct: 163 VEDPARPNRHNRNVRLPTSD 182


>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
           2508]
 gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 533

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 102/423 (24%), Positives = 166/423 (39%), Gaps = 112/423 (26%)

Query: 135 WISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKE- 192
           W  + L L+ E  +   S W  YL ILP   D+ +FW+E EL ELQ + L++  +G  E 
Sbjct: 128 WTLLILILMHEYLQGSSSNWSPYLSILPHQFDTPMFWTEAELAELQASALVAK-VGKDEA 186

Query: 193 --YVQNEYLKVEEEIILPNKQLF-----PRPITLDD--------------FLWAFGILRS 231
              ++ + +KV +E    N+ +F     P+   LD+                +AF + + 
Sbjct: 187 DKMIRTKIVKVVQE----NEDVFYPAGTPKTQRLDEGELLKLGHRMGSAIMAYAFDLAKE 242

Query: 232 RAF-------------SRLRGQN--LVLIPLADL----------INHSPGITTEDYAYEI 266
                            ++ G N  + ++P+AD+          INH     T     EI
Sbjct: 243 EDDDEDEEEEEDGWVEDKIGGMNDTMGMVPMADMLNADAVFNAHINHGEACLTATSLREI 302

Query: 267 KGAG--------LFSRDLLFSLRTPVPVKAGEQV------LIQYDLNK-----SNAELAL 307
           K           L S +LL       P  A   V      L++  L K        +  +
Sbjct: 303 KEGEEILNYYGPLSSAELLRRYGYVTPNHARYDVVEVGWGLVEGGLKKMVEGIKGRKKEV 362

Query: 308 DYGFIESKSDRNAY-------TLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTL 360
           D+  +E   +           +  LE    DP        +E    GE+ +    L   L
Sbjct: 363 DWAKVEELVEEQKEEEGEWEDSFVLERQSEDPD-------SEGQVHGEAEFMG--LPEEL 413

Query: 361 PPAMLQYLRLV--ALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALS 418
              +  YL+ V   +G  D  + E++   T+            +E++   VR A +    
Sbjct: 414 EEQVKTYLKAVKKVVGTGDRAVAEALGNKTV-----------RKEILLGGVRKALEEREG 462

Query: 419 GFHTTIEEDEQLL---QGGNLDPRLRIAVGVRAGEKRVL--------QQIDEIFKERELE 467
            + T++EEDE++L   +GG    R  +AV VRAGEKR++        +Q  E+ KE+  E
Sbjct: 463 QYATSLEEDEKVLAGIEGGRPTTRKEMAVWVRAGEKRIIREAVAWVKKQEKELIKEKGQE 522

Query: 468 LDE 470
            DE
Sbjct: 523 KDE 525


>gi|405119695|gb|AFR94467.1| nuclear protein [Cryptococcus neoformans var. grubii H99]
          Length = 495

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 71/259 (27%), Positives = 112/259 (43%), Gaps = 45/259 (17%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINP------DTVAASEIGSLCSGLKPWISVALFLIR 144
           G G VA +DI +   +  +P    ++P      D + ASE   L  G   W  + L L+ 
Sbjct: 42  GYGAVAVKDIEEGTPLFHIPDDLILSPYTSDLKDHLDASEWDQLNKG---WAQLILVLMW 98

Query: 145 EK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           E  K   S W  YL  +P   ++ +FW+E++  +L GT  ++  +G +E  + EY  V  
Sbjct: 99  ETIKGSKSRWAGYLANMPVTFETPMFWTEQQREQLAGTD-IADRIG-REDAEAEYTSVLA 156

Query: 204 EIILPNKQLFP---RPITLDDFLWAFGILRSRAFS-------RLRGQN------------ 241
             I  +  LFP     IT+D F      + SR+F+       R + Q             
Sbjct: 157 PFIKAHPDLFPIDSPHITIDAFHIQGSRILSRSFTVPLHRFGRSQSQTRSDSNSEMEGDG 216

Query: 242 ------LVLIPLADLINHSPGITTED-YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
                 +V+IP AD++N + G      Y  E    G     ++ S++    VK  EQ+  
Sbjct: 217 EEEEEVVVMIPFADMLNAAWGKDNAHLYIDEDTNEGFNEGVVMKSIQL---VKQSEQIYN 273

Query: 295 QYDLNKSNAELALDYGFIE 313
            YD +  N+EL   YG I+
Sbjct: 274 TYD-SPPNSELLRKYGHID 291


>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
          Length = 509

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 67/297 (22%), Positives = 115/297 (38%), Gaps = 55/297 (18%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
              F +WLR +      S +   T    G  L A   I + + +++VP    +  D +  
Sbjct: 30  CSVFLRWLRSKSGTHISSVLSLGTSSAFGRSLFASEPIQEGDCIMQVPYHVQLTLDKLPQ 89

Query: 123 SEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILP--ECTDSTVFWSEEELVELQ 179
                L   +     +A  LI E+    +S W  Y+  LP  +   + V W   EL  +Q
Sbjct: 90  KFNTLLDHAVGDTSKLAALLIMEQHLGNESGWAPYIKSLPTKDQMHNMVLWDLNELHAVQ 149

Query: 180 GTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR---PITLDDFLWAFGILRSRAFSR 236
            + +    +  KE  + E+L ++     P    FP     + L DF+ A  +        
Sbjct: 150 NSSIYDEAIEHKEQAKKEFLALK-----PALDHFPHLFGEVKLGDFMHASAL-------- 196

Query: 237 LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPV-------PVKAG 289
                       D +NH                G+F   L++  +  V           G
Sbjct: 197 ------------DFLNHD---------------GVFGSVLIYDEQKDVCEIIADRNYAVG 229

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGL 346
           EQV+I+Y    SNA LAL++GF  +++  +   + +++   DP +  KLDI + + L
Sbjct: 230 EQVMIRYG-KYSNATLALNFGFTLARNIYDQALIRIDMPVQDPLYKKKLDIWQKHRL 285


>gi|79316289|ref|NP_001030933.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|63003834|gb|AAY25446.1| At1g01920 [Arabidopsis thaliana]
 gi|332189233|gb|AEE27354.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 547

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 91/188 (48%), Gaps = 17/188 (9%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDI-AKNEVVLEVPMKFWINPDTV 120
           A++E F  WL+          I+ +   +G G+ A     A +EV+L VP+   I P  V
Sbjct: 8   AKLERFLDWLQVNGGELRGCNIKYSDSLKGFGIFASTSTQASDEVLLVVPLDLAITPMRV 67

Query: 121 AASEI-GSLCSGLKPWISV------ALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEE 173
               + G  C  +     V       LFL  E+ + +S W+ YLD+LP    + +++S++
Sbjct: 68  LQDPLLGPECQKMFEQGQVDDRFLMILFLTLERLRINSSWKPYLDMLPTRFGNPLWFSDD 127

Query: 174 ELVELQGTQLL-STTLGVKEYVQNEYLKVEEEIILPNKQLF-----PRPITLDDFLWAFG 227
           +++EL+GT L  +T L  K+ +   + KVE   +L  K L         ++ + FLWA  
Sbjct: 128 DILELKGTNLYHATELQKKKLLSLYHDKVE---VLVTKLLILDGDSESKVSFEHFLWANS 184

Query: 228 ILRSRAFS 235
           +  SRA +
Sbjct: 185 VFWSRALN 192


>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
          Length = 541

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 28/235 (11%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIRE-KKKE 149
           G  ++   DI  +  ++  P    IN     ++      +      ++ +FL  E  K +
Sbjct: 40  GSRVITCTDIPSHSQLISCPHTLTINYTKARSAFSADFITNTTQHAALCMFLCLEWLKGK 99

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
           +S W  YL +LP   D+ +++S+E+L  LQG  L +T +  ++ +  E  +     IL  
Sbjct: 100 ESFWWPYLCVLPREFDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFEAAVS-ILQR 158

Query: 210 KQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNLVLI----------PLADLINHSPG-- 256
           +       T + +LWA  I  SR+F  +L   + +++          PL D +NH P   
Sbjct: 159 EGYDTEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPILFPLIDSLNHYPATI 218

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           IT +              D    + + V V AG +V   Y   K+N EL + YGF
Sbjct: 219 ITWQ------------PSDTSLRIISGVGVSAGAEVYNNYG-PKANEELLMGYGF 260


>gi|428165190|gb|EKX34191.1| hypothetical protein GUITHDRAFT_147375 [Guillardia theta CCMP2712]
          Length = 681

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 134/343 (39%), Gaps = 69/343 (20%)

Query: 168 VFWSEEELVELQGTQLLSTTLGVKE-------------YVQNEYLKVEEEIILPNKQLFP 214
           V W++EEL ELQ  +L++     K              Y        E+E  + +++ F 
Sbjct: 168 VHWTDEELAELQNPRLVAAASDSKRSHAGLTIEASEWRYFDRMQAAGEQETSMLDQETFQ 227

Query: 215 R------------PITLDDFLWAFGILRSRAF--SRLRG-QNLVLIPLADLINHSPGITT 259
           R            P +L +  WA    +SR+F  S   G +   L PLAD++NH P    
Sbjct: 228 RDLHRLLCDRMTAPPSLAELRWAMDCAQSRSFGVSTTEGVKCFCLCPLADMLNHDPS--- 284

Query: 260 EDYAYEIKGAGLFSRD---LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
                      LF  D     F++RT      GE+V I Y    SN +L   YGF+    
Sbjct: 285 --------SPALFDFDPATSCFAIRTSRAWSEGEEVTISYG-ELSNEDLLQFYGFV---L 332

Query: 317 DRNAYTL-TLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP---PAMLQYLRLVA 372
           D N +   + ++ E       KL  +E + L        +    LP   P   + LRL  
Sbjct: 333 DDNMHEFESFDVEELGLLVPRKLIGSEADRLDRLFLLSSLSSAGLPVLSPDHSRLLRLSR 392

Query: 373 LGGTDAFL----LESIFRNTIWGHLD---------LPVSHA---NEELICRVVRDACKSA 416
            G +   +    + S+ R T+  H           L   H    NE    R++R AC+  
Sbjct: 393 RGISPPLMAVARVLSLSRETLAAHEGRLEELAGRLLRGKHVCLDNELEAWRLLRRACERR 452

Query: 417 LSGFHTTIEEDEQLLQ---GGNLDPRLRIAVGVRAGEKRVLQQ 456
           L    TT EED + L+         ++++++  R   KR L++
Sbjct: 453 LEAMPTTEEEDVRRLEETKRAKGSEQMKLSLRYRMSRKRFLRR 495


>gi|156717956|ref|NP_001096520.1| N-lysine methyltransferase setd6 [Xenopus (Silurana) tropicalis]
 gi|325530258|sp|A4QNG5.1|SETD6_XENTR RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|140832737|gb|AAI35641.1| LOC100125156 protein [Xenopus (Silurana) tropicalis]
          Length = 454

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 68/273 (24%), Positives = 119/273 (43%), Gaps = 37/273 (13%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT--- 119
           V  F  W +   + ++PK  I         G++A+ D++  E++  +P    ++ +T   
Sbjct: 22  VSCFLAWCKKVGLELNPKVYISTEGTVSQYGMLAREDLSDGELLFSIPRSAILSQNTTRI 81

Query: 120 --VAASEIGSL--CSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEE 173
             +   E  SL  CSG   W+ + + L+ E     S W  Y  + PE    D  +FWSEE
Sbjct: 82  RDLIEKEQDSLQSCSG---WVPLLISLLYEATDSSSHWAPYFGLWPELDPPDMPMFWSEE 138

Query: 174 ELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRS 231
           E  + LQGT +L       + ++ EY  +    I  N + F P   TLD +      + +
Sbjct: 139 EQTKLLQGTGILEAVHKDLKNIEKEYNSIVLPFIRRNPEKFCPMKHTLDLYKRLVAFVMA 198

Query: 232 RAFSRLRGQNL-----------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
            +F   + ++            +++P+ADL+NH   +   +   E      F+ + L  +
Sbjct: 199 YSFQEPQEEDEEEDIEKDILPPMMVPVADLLNH---VAQHNAHLE------FTPECLRMI 249

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
            T   V AG+++   Y    +N +L   YGF E
Sbjct: 250 TTK-SVCAGQELFNTYG-QMANWQLLHMYGFAE 280


>gi|395839524|ref|XP_003792639.1| PREDICTED: N-lysine methyltransferase SETD6 [Otolemur garnettii]
          Length = 448

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 66/261 (25%), Positives = 114/261 (43%), Gaps = 43/261 (16%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-----EIGSLCSGLKPWISVALFLIRE 145
           G G+VA   +   E++  VP    ++  T + S     E  +L S    W+ + L L+ E
Sbjct: 50  GYGMVALESVQPGELLFAVPRAALLSQHTCSISGLLEQERVALQSQ-SGWVPLLLALLHE 108

Query: 146 KKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVE 202
            +   SPWR Y  + PE    +  +FW EEE    LQGT       GV E V+ +   + 
Sbjct: 109 VQAPASPWRPYFALWPELGRLEHPMFWPEEERHRLLQGT-------GVPEAVEKDLTNIR 161

Query: 203 EE---IILP----NKQLF-PRPITLDDFLWAFGILRSRAFSRLRG--------QNLVLIP 246
            E   I+LP    + +LF PR  +L+ +     ++ + +F              + +++P
Sbjct: 162 SEYCSIVLPFMEAHPELFSPRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPLMVP 221

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
            AD++NH   +   +   E      +S + L  + T  P+  G ++   Y    +N +L 
Sbjct: 222 AADILNH---LANHNANLE------YSANYLRMVATQ-PIPKGHEIFNTYG-QMANWQLI 270

Query: 307 LDYGFIESKSDRNAYTLTLEI 327
             YGF+E   D    T  +++
Sbjct: 271 HMYGFVEPYPDNTDDTADIQM 291


>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 104/446 (23%), Positives = 177/446 (39%), Gaps = 104/446 (23%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWI-------NPDTVAASEIGSLCSGLKPW-ISVALFLIR 144
           G+ A  DIA  + ++ +P +  +       NP     S      +G + W + VAL L+ 
Sbjct: 108 GVKATSDIASEDDLVRLPREATMLVVEGQENPHEEYISNELWAKAGDERWALRVALVLLY 167

Query: 145 EKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGT--------QLLSTTLG---VKE 192
           EK     S +  Y++ LP+  ++   W+EEE+ ELQ +        Q L        ++E
Sbjct: 168 EKSLGSRSKFYEYIEQLPKSFENLGTWTEEEVRELQYSVGEKFAKEQRLENEKACELIQE 227

Query: 193 YVQNEYLKV--EEEIILP-------------------NKQLFPRPITLD----------- 220
           Y ++  LK    EE+I                      ++L PR +++            
Sbjct: 228 YARDGGLKTIEREEVIWALDVVRSRVFSGKIADQEALQRKLLPRALSVGTVFASFLTAQT 287

Query: 221 ---DFLWAFGILRSRAFSRLRGQNL------VLIPLADLINHSPGITTEDYAYEIKGAGL 271
               +L  F +L    F   +  ++      VL+PL D  NH   + TE   +E   +  
Sbjct: 288 TELKWLCVFALLALVVFDSTKENDVKTDTAYVLMPLIDAFNHQTMLKTE---FEFTNSE- 343

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYT---LTLEIS 328
                 F+L++P   K GE+VLI Y L   N EL L YGF++ ++  + Y    L   ++
Sbjct: 344 ------FALKSPKSYKKGEEVLISYGL-MPNDELLLRYGFVDDQNVADTYQFEGLLPYLT 396

Query: 329 ESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFR-N 387
           ++DP   + L         E     I   R +P   L+ L L       AF+ +     N
Sbjct: 397 QNDPTLKENL---------EKNPKRIESLRFIPEGSLEDLTLEDSAWRGAFVSDGTAEPN 447

Query: 388 TIWGHLDLPVSHA------------------NEELICRVVRDACKSALSGFHTTIEEDEQ 429
             W    L  + +                   E+     ++ AC+  L    TT  +D+ 
Sbjct: 448 LSWALRCLYATESEWINAGGSCDGFRLGGGQTEKKAALALKMACEYRLQEMETTYNQDQG 507

Query: 430 LL-QGGNLDPRLRIAVGVRAGEKRVL 454
           LL + G  + ++R A+  R  +K++L
Sbjct: 508 LLKEKGGENWKMRQAILYRMRKKKIL 533


>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 607

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/188 (22%), Positives = 87/188 (46%), Gaps = 4/188 (2%)

Query: 69  QWLRDQKVVSPKSPIR-PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           QW  + K+   +  IR       G  + A++ +   +++  +P +  ++      +++  
Sbjct: 9   QWFGENKIAYDEEKIRIEHDTNNGFRVFAKQTLEVGDILCAIPKEAILSIKNCGVADVLE 68

Query: 128 LCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLST 186
              GL   + + + L+ E+   E SPW  Y+  LP   +  +FW +++   L GT +   
Sbjct: 69  E-QGLGGQLGLVIALMFERSLGEKSPWYGYIQSLPLRENIPLFWEKDQQACLDGTAVAHL 127

Query: 187 TLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQNLVLI 245
              + + ++ +Y +    ++  N ++F    +T DDF  A  ++ SRAF         ++
Sbjct: 128 LEPMPKDLKADYKEHVVPLVKENSKVFNAAKMTFDDFTAATSLVTSRAFRVDVYHEEAMV 187

Query: 246 PLADLINH 253
           PLADL NH
Sbjct: 188 PLADLFNH 195


>gi|224130176|ref|XP_002320771.1| predicted protein [Populus trichocarpa]
 gi|222861544|gb|EEE99086.1| predicted protein [Populus trichocarpa]
          Length = 551

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 80/163 (49%), Gaps = 25/163 (15%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSLCSGL------KPWISVALFL 142
           +G G+ +  D++   V+L VP+   I P  V     IG  C  +           + LFL
Sbjct: 20  KGFGVFSSNDVSDG-VLLVVPLDLAITPMRVLQDPLIGPECRSMFEEGEVDDRFLMILFL 78

Query: 143 IREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           + E+ + +S W+ YLD+LP+   + ++++++EL+EL+GT L   T    E  +   L + 
Sbjct: 79  MLERLRNNSSWKPYLDMLPKTFGNPLWFTDDELLELKGTTLYRAT----ELQRKRLLSLY 134

Query: 203 EEIILPNKQLFPRPITLD----------DFLWAFGILRSRAFS 235
           E+ +   K L  + + LD          DFLWA  +  +RA +
Sbjct: 135 EDKV---KGLVQKLLILDGDLESEVCFEDFLWANSVFWTRALN 174


>gi|403366800|gb|EJY83208.1| hypothetical protein OXYTRI_19172 [Oxytricha trifallax]
          Length = 869

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 87/219 (39%), Gaps = 21/219 (9%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGL----GLVAQRDIAKNEVVLEVPMKFWINPD 118
           + + F+ W     +  PK    P  F  G     G++A  DI  NE++L+VP K  ++  
Sbjct: 42  KFKNFYDWAFANGIEFPKLQY-PVMFGSGNSQYPGMMATEDIGPNEIMLKVPAKLLLSTK 100

Query: 119 TVAASEI-----------GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDST 167
                +I           G   S  +  +  A  L    K E S W+   D+ P  TD  
Sbjct: 101 ACFRGDINQIFLENPELFGKHVSDGEDNVLNAFILYELGKGEKSFWKPMFDVWPRDTDIL 160

Query: 168 VFWSEEELVELQGTQL-LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP--ITLDDFLW 224
             W EE+L  LQ   L     +  ++ +   + +   +I++   Q FP    I+ D + W
Sbjct: 161 FNWEEEDLDWLQDPTLKQDAQIAYQQLIT--FWRRMFDILILYPQYFPNEDMISFDKYKW 218

Query: 225 AFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYA 263
            + +  +R F         +IP A+ INH       DY 
Sbjct: 219 VYMLTTNRCFGSNWPGVCSMIPFAEFINHENVDIQYDYV 257


>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
 gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
          Length = 540

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 121/285 (42%), Gaps = 46/285 (16%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATF-PEGLG--LVAQRDIAKNEVVLEVPMKFWINPDT 119
           Q E+F  WL     V+  S I+      EG G  ++A +DI  +E++ E+P    +N  T
Sbjct: 56  QTESFLSWLTTDGKVTVSSKIKIEDLRSEGQGRCIIASKDIDTDELLFEIPRSSILNVTT 115

Query: 120 ----VAASEIGSLCSGLKPWISVALFLIREKK--KEDSPWRVYLDILP--ECTDSTVFWS 171
               V    I      L  W S+ + ++ E K  + +S W  Y ++LP  E  ++ ++W+
Sbjct: 116 SQLCVDFPHITGKLMELSQWDSLIICMMYEMKVLQHESRWSSYFNVLPSSESLNTLMYWN 175

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR---PITLDDFLWAFGI 228
           ++EL  L  + L+   +G K   +  Y ++ + I   N+ +       I+ ++FL+   I
Sbjct: 176 DKELSFLTPS-LVVNRVG-KGDAETMYRRILDTINEFNEDILTEKLGSISWEEFLYIPSI 233

Query: 229 LRSRAFS--------------------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKG 268
           + + +F                             +IPLAD +N        +  Y+   
Sbjct: 234 IMAYSFDVEIKNDDDENEGDEEFDEKEEEPELLKSMIPLADTLNADTHKCNANLTYD--- 290

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
                +D L  L    P+K GEQV   Y     N+EL   YG++E
Sbjct: 291 -----KDSLKMLAIK-PIKKGEQVYNTYG-ELPNSELLRKYGYVE 328


>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
 gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
          Length = 442

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 91/214 (42%), Gaps = 40/214 (18%)

Query: 127 SLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTV-----FWSEEELVELQG 180
           S    + P +    FLI+E  K +DS W  YL  LP            FW E+++  L+ 
Sbjct: 100 SFTQSVPPHVLGRFFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVLPAFWPEDDIAYLEC 159

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFL--WAFGILRSRAF---- 234
           T        ++  V+ E+ +  +  IL N+  FP        +  WAF I  SR+F    
Sbjct: 160 TNAHVAIQEIQANVKGEFKQARK--ILKNEN-FPDVAAYTSLMYKWAFTIFTSRSFRPSL 216

Query: 235 ----------SRLRGQNL------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
                     S L  Q++      +L PL D+ NHSP   T  Y+++       +  L+ 
Sbjct: 217 ILSDTTKRHISTLLPQSVELDDFSILQPLLDIANHSP---TAVYSWDTTSPAD-ACTLVC 272

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
             R P     G QV   Y L K+N+EL L YGFI
Sbjct: 273 GDRYP----PGAQVFNNYGL-KTNSELLLGYGFI 301


>gi|307195796|gb|EFN77610.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 246

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 68/137 (49%), Gaps = 8/137 (5%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F  WL +         I  A FP   LGL A+ D  +N+++LE+P    I     AA E+
Sbjct: 85  FMNWLTENGARVDGVSI--AEFPGYDLGLKAEADFTENQLILEIPRAL-IFSTYNAAPEL 141

Query: 126 GSLCSG----LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
             L +       P +++A+ L+ E+ KE+S W+ YLD+LP    + ++    +++EL+G+
Sbjct: 142 VILQNDPLVQHMPQVALAIALLIERHKENSKWKPYLDMLPTSYTTVLYMKATDMIELKGS 201

Query: 182 QLLSTTLGVKEYVQNEY 198
             L   L     +  +Y
Sbjct: 202 PTLEAALKQCRNIARQY 218


>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 281

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 77/149 (51%), Gaps = 8/149 (5%)

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETN 344
           P KAGE ++I +   ++N+ L L+YGF++  +  +  ++   ++  DP + +K  +A+ N
Sbjct: 56  PYKAGEPIII-WCGPQTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRN 114

Query: 345 GLGESAYFDIVLG--RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANE 402
           G      F++ +G  +     ML YLRL  +   D   ++SI  +   G    PVS   E
Sbjct: 115 GKHAIQNFNVYVGKEKQTVAEMLPYLRLGYISDPDE--MQSILSSE--GD-TCPVSPCTE 169

Query: 403 ELICRVVRDACKSALSGFHTTIEEDEQLL 431
             +   +    +S L+G+ TT+ EDE ++
Sbjct: 170 RAVLDQLGGYLESRLAGYPTTLNEDEAMV 198


>gi|363747293|ref|XP_003643967.1| PREDICTED: N-lysine methyltransferase SETD6-like [Gallus gallus]
          Length = 447

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 114/282 (40%), Gaps = 52/282 (18%)

Query: 60  PTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           PTA    F  W     V +SPK  I       G GL+A  D+   E++  VP    ++  
Sbjct: 17  PTA---GFVAWCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALLSQH 73

Query: 119 TVA-------ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVF 169
           T A       A E     SG   W+ + L L+ E     S WR Y  +  + +  D  +F
Sbjct: 74  TCAIRALLHDAQESLQSQSG---WVPLLLALLHEYTTGTSHWRPYFSLWQDFSSLDHPMF 130

Query: 170 WSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLD 220
           W EEE V  LQGT       G+ E V  +   ++ E   IILP    +  +F P   TL+
Sbjct: 131 WPEEERVRLLQGT-------GIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLE 183

Query: 221 DFLWAFGILRSRAFSR--------LRGQN-LVLIPLADLINHSPGITTEDYAYEIKGAGL 271
            +      + + +F           +G N  +++P+AD++NH               A L
Sbjct: 184 LYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPMMVPVADILNHVAN----------HNASL 233

Query: 272 FSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
                   + T  P+  G+++   Y    +N +L   YGF E
Sbjct: 234 KYAPTCLRMVTTQPISKGQEIFNTYG-QMANWQLLHMYGFAE 274


>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 467

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 103/237 (43%), Gaps = 41/237 (17%)

Query: 126 GSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTV-------FWSEEELVE 177
           G   +   P +   LFLI+E  K++ S W  Y+  LP+             FW ++E   
Sbjct: 81  GEFLARAAPHVVGRLFLIKEYLKRDKSFWWPYIRALPQPGQGNKSQWALAPFWDDDEAEL 140

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ--LFPRPITLDDFLWAFGILRSRAF- 234
           L+GT +      ++  V+ +  + +E + L       F + +T + + WA+ I  SR+F 
Sbjct: 141 LEGTNVEVGIDKIRNDVRRDLQEAQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFR 200

Query: 235 -----------SRLRGQNL----VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDL--L 277
                      S  RG  +    VL+PL D+ NH       D   EI+      R    L
Sbjct: 201 PSLVLSDEQRRSLPRGVTMDDFSVLLPLFDIGNH-------DMTTEIRWDLDDDRQTCEL 253

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDP 332
              RT +P   G+QV   Y + K+NAEL L YGF+   +K   N YT   + + + P
Sbjct: 254 RVGRTHMP---GQQVFNNYSM-KTNAELLLGYGFMLPATKELHNDYTHVRKRTAAAP 306


>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 468

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 97/224 (43%), Gaps = 29/224 (12%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLG-----------LVAQRDI--AKNEVVLEVPMKF 113
           F  W   + VV  +  +    F +  G            V  RD   A  E+V  +P K 
Sbjct: 18  FLSWCDAEGVVHDRDALEIRVFVDRSGGDSSESIRCHYAVHARDDLPADVEIVSAIPKKA 77

Query: 114 WINPDTVA-ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDST--VFW 170
            ++  T + A E+     G    ++VA+ + R    E S WR Y  +LP   + T  +FW
Sbjct: 78  CLSARTCSVAKELRDARLGGGLALNVAVMVERALGSE-SRWRDYFAVLPSRGERTLPMFW 136

Query: 171 SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII----LPNKQLFPR-PITLDDFLWA 225
           +E  L  L+GT L +    V+E  +N     +EE++    + + + F R  +T + +L A
Sbjct: 137 TEARLEALKGTDLATH---VREDAENLRADYDEEVVNGLCVAHPEKFRREELTFERYLEA 193

Query: 226 FGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
             +  SRAF         L+P AD+ NH     T+D    + GA
Sbjct: 194 ASLSASRAFYIGEECGEALVPWADMFNHR----TDDETVRVLGA 233


>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
 gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
          Length = 429

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 77/330 (23%), Positives = 135/330 (40%), Gaps = 74/330 (22%)

Query: 56  TTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFW 114
           T +PP  Q   F QW   + V    + + PA  P  GLG++A RDI +NEV++EVP    
Sbjct: 3   TYSPPGEQHTAFMQWAIAEGVKV--NGVEPARIPGRGLGMIATRDIQENEVMVEVPRSAM 60

Query: 115 INPDTVAASEIGSLCSGLKPWISVALFLIR---EKKKEDSPWRVYLDILPECTDST-VFW 170
           ++ D++ A  + ++ +G+     +A FL     +  K    W+     LP+  +   + W
Sbjct: 61  LSIDSIPADFV-NMFNGISTHGLLAAFLTHGDPKTLKRYDLWKATWPTLPDFEEGMPILW 119

Query: 171 SEE-----------------ELVELQGTQLLSTTLGV-----KEYVQNEYLKVEEEIILP 208
           S+E                    +L    L  +  G+     K+ +  EY    + ++  
Sbjct: 120 SKELGGSGLKTPISTIATTSHQHDLSPITLPPSVSGLWNTFHKKPLAEEYTTKHQNLLAQ 179

Query: 209 NK-----------QLFPRPITLDDFLWAFGILRSRAFSRLR-GQN--------LVLIPLA 248
            +            +FP     + F + + I+ +R+F  L  GQ+        + ++P A
Sbjct: 180 QEARLREAWRDVIAVFPD-TDWEKFSYHWLIVNTRSFYYLMPGQDPPEDTNDAMAMVPFA 238

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
           D  NH     T+D   E+   G       ++ R     K GE++ + Y  + +      D
Sbjct: 239 DYFNH-----TDDAECEVHFDG-----KSYTFRATRLYKKGEEIYMSYGPHPN------D 282

Query: 309 YGFIESKSDRNAYTLTLEISESDPFFGDKL 338
           + F+E       Y   LE +ESD  F D +
Sbjct: 283 FLFVE-------YGFYLETNESDAIFLDDI 305


>gi|403306046|ref|XP_003943557.1| PREDICTED: N-lysine methyltransferase SETD6 [Saimiri boliviensis
           boliviensis]
          Length = 449

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 71/275 (25%), Positives = 118/275 (42%), Gaps = 44/275 (16%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F  W R   + +SPK  +       G G+VA+  +   E++  VP    ++P T + 
Sbjct: 22  VAGFLSWCRRVGLELSPKVEVSRQGTVAGYGMVARESVQAGELLFVVPRAAILSPHTCSI 81

Query: 123 S-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEEL 175
                 E G+L S    W+ + L L+ E +   S WR Y  + PE    +  +FW EEE 
Sbjct: 82  GGLLERERGALQSQ-SGWVPLLLALLHELQAAASHWRPYFALWPELGHLEHPMFWPEEER 140

Query: 176 VE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFP-----RPITLDDFLWAF 226
              LQGT       GV E V+ +   +  E   I+LP  +  P     R  +L+ +L   
Sbjct: 141 RRLLQGT-------GVPEAVEKDLDSIRSEYHSIVLPFMEAHPDLFSLRVHSLELYLQLV 193

Query: 227 GILRSRAFSRLRG--------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
            ++ + +F              + +++P AD++NH   +   +   E      +S D L 
Sbjct: 194 ALVMAYSFQEPLEEEEDEKEPNSPIMVPAADILNH---LANHNANLE------YSADCLR 244

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
            + T  P+  G ++   Y    +N +L   YGF+E
Sbjct: 245 MVATQ-PIPKGHEIFNTYG-QMANWQLIHMYGFVE 277


>gi|159480366|ref|XP_001698255.1| hypothetical protein CHLREDRAFT_193195 [Chlamydomonas reinhardtii]
 gi|158273753|gb|EDO99540.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 463

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 72/169 (42%), Gaps = 16/169 (9%)

Query: 150 DSP-WRVYLDILPECTDSTVF--WSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           DSP W  Y+  LP   + T    +  +   ELQ   L+        +    + K  + + 
Sbjct: 106 DSPLWAAYIATLPPADEVTCLINYGPDVAKELQIKDLVEEAKSQYNWAMGVHRKYFDGVQ 165

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLVL-IPLADLINHSPGITTEDYAY 264
               QL     +  D +WA  ++R+R FS  + G+ L L +P ADL NHS         +
Sbjct: 166 GELAQL-KLAASARDTMWAMSMVRTRTFSENVNGEGLTLMVPYADLANHS---------F 215

Query: 265 EIKGAGLFSRD-LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
                   +RD   F LR   P+  GE+  I Y   K N E+  DYGF+
Sbjct: 216 RPNATFCMARDNKRFELRLLGPLAPGEEAAISYGETKPNPEVMRDYGFV 264


>gi|353236313|emb|CCA68310.1| related to SET7-Regulatory protein [Piriformospora indica DSM
           11827]
          Length = 493

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 60/126 (47%), Gaps = 13/126 (10%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPE-----GLGLVAQRDIAKNEVVLEVPMKFWINPD 118
            + F +W RD         + PA   +     G G VA  DI K+ V+  VP    ++  
Sbjct: 6   TQEFLKWFRDSGAT-----LHPAVGIKDFEGVGRGAVALHDIQKDTVLFTVPRSILLSTR 60

Query: 119 TVAASEI--GSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEEL 175
           T    +I      S LK W  + L ++ E  + +DSPW  YL  LP   D+ +FW++EEL
Sbjct: 61  TAPLRDILGDEDWSTLKGWEGLILSMMYEDSRVKDSPWSGYLQDLPTKFDTLMFWTDEEL 120

Query: 176 VELQGT 181
            +LQ +
Sbjct: 121 EQLQAS 126


>gi|340503061|gb|EGR29686.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 286

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 82/162 (50%), Gaps = 16/162 (9%)

Query: 49  SLHSASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLE 108
            L S  + ++P   Q     +WL D K    K  I+   +    G+ +++ I K+E +L 
Sbjct: 109 QLSSLISESDPSYNQNYKLLEWLNDGKCDIWK--IKMVYYNNYRGIHSKQKINKDETILF 166

Query: 109 VPMKFWINPDTVAASEIGSLCSG--------LKPWISV-ALFLIREKKKEDSPWRVYLDI 159
           +P K+ I   T+   +  ++C          L P  S+ +++++ EKK  +S W+ YLDI
Sbjct: 167 IPQKYMI---TLELCKQNTICKQIEQRNIKLLSPKHSILSIYILSEKKNPNSFWKPYLDI 223

Query: 160 LP-ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
           LP E T   + ++EEE+  L+G+ L+   +  K   QN+ +K
Sbjct: 224 LPCEFTTFPILYTEEEIQWLKGS-LIINQIYEKNKAQNKTIK 264


>gi|413951744|gb|AFW84393.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
          Length = 206

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 89/188 (47%), Gaps = 4/188 (2%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIG 126
           F +W+R   VV   +     + P G+ + A   +   ++V  +P    + P T  A+   
Sbjct: 15  FKRWMRAHGVVCSDALSLDVSDPLGVHVRAATPLRDGDLVATIPRGACLTPRTTGAAAAI 74

Query: 127 SLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFW-SEEELVELQGTQLL 184
              + L   +++ + ++ E+ +  DSPW  YL +LP+C    + W + E    L GT+L 
Sbjct: 75  EA-AELGGCLALTVAVMYERAQGADSPWDAYLQLLPDCESVPLVWPAGEAECLLAGTELD 133

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNK-QLFPRPITLDDFLWAFGILRSRAFSRLRGQNLV 243
                 KE++  ++ +  E ++L  +  + P   +L+ +L A  ++ SR+F         
Sbjct: 134 KIVKQDKEFLCEDWKECIEPLMLSGELDVDPDDFSLEKYLSAKTLVSSRSFQIDSYHGSG 193

Query: 244 LIPLADLI 251
           ++PLADL 
Sbjct: 194 MVPLADLC 201


>gi|358380690|gb|EHK18367.1| hypothetical protein TRIVIDRAFT_47382 [Trichoderma virens Gv29-8]
          Length = 479

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 94/224 (41%), Gaps = 40/224 (17%)

Query: 128 LCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDST-----VFWSEEELVELQGT 181
             S  +P +   LFLI+E  + ++S W  Y+  LP+  D        FW EEE   L+GT
Sbjct: 79  FLSKTRPHVVGRLFLIKELLRGQESFWYPYIQALPQPEDFDDWALPPFWPEEEAELLEGT 138

Query: 182 QLLSTTLGVKEYVQNEYLKVEEEIILPNKQL---FPRPITLDDFLWAFGILRSRAF---- 234
            +      ++E +  E+      +I   K     F   +T + + WA+ I  SR+F    
Sbjct: 139 NVEIGLEKIREDLGREFRDARNLLIASQKDAEDDFSDHLTRELYQWAYCIFSSRSFRPSL 198

Query: 235 --SRLRGQNL----------VLIPLADLINHSPGITTE---------DYAYEIKGAGLFS 273
             S  + Q+L          VL+PL D+ NH   +                 ++G+G   
Sbjct: 199 VLSEEQQQSLPDGVSVNDFSVLLPLFDIGNHDMTVHVRWDLAAGDEAAAGAGVRGSGA-- 256

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
                 L+     K G+Q+   Y   K+NAEL L YGF+   +D
Sbjct: 257 ---AVQLKVGREHKPGQQIFNNYS-PKTNAELLLGYGFMLPATD 296


>gi|321251886|ref|XP_003192213.1| phospholipid metabolism-related protein [Cryptococcus gattii WM276]
 gi|317458681|gb|ADV20426.1| Phospholipid metabolism-related protein, putative [Cryptococcus
           gattii WM276]
          Length = 533

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 102/235 (43%), Gaps = 27/235 (11%)

Query: 59  PPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           PP ++     +WL    V  SP   I+  +   G  +VA+RD+   E++L +P    ++ 
Sbjct: 10  PPPSRA-CLLEWLAANLVYTSPSLTIQHMSASSGYRIVAERDMEVGELLLSLPKLSILSQ 68

Query: 118 DTVAASEIGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDST--VFWSEEE 174
            T + S +  L S     ++++L L+ E +   DS +  YL  LP    +   +FW   +
Sbjct: 69  QTASLSALTHLASTPHTILNLSLCLLHEIRLFTDSKFYGYLQSLPRDMGAGLPLFWRIGK 128

Query: 175 LVELQ----GTQLLSTTLGVKEYVQNE------------YLKVEEEIILPNKQLFPRPIT 218
             E++    G Q L  T   KE  ++E            YL      +LP     P P  
Sbjct: 129 GTEVEDGERGLQWLKGTEAEKELCKSEKEGLSLPDVYAFYLHTSH--LLPPTSTNPLPSP 186

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP----GITTEDYAYEIKGA 269
              F+ A+ ++ +RAF         L P ADL+NHS      + ++D+   I G+
Sbjct: 187 FLAFVHAYMLISTRAFRIDLYHLTALCPFADLLNHSAIPHTCLASDDFVCYICGS 241


>gi|307107214|gb|EFN55457.1| hypothetical protein CHLNCDRAFT_52262 [Chlorella variabilis]
          Length = 478

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 97/238 (40%), Gaps = 24/238 (10%)

Query: 10  SLSTSPPTFVSSSKTLKLLTSLTKKPSF-RLKSRAFAATCSLHSASATTNPPTAQVETFW 68
           S +  P    +++   + L S     +  R +    AA  +  S S +T+   A  + + 
Sbjct: 9   SFTRGPRNAAAAAPDFRQLHSCRPAAALPRRRHGTAAAAATPQSGSGSTSKEAAVTKDYL 68

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSL 128
            W  +  ++SPK  +  A F    G  A  DIA +EV + VP         V A      
Sbjct: 69  SWATEAGILSPK--LMQAYFGGLRGGQALSDIAADEVFVTVPR----GAALVVAPNERCP 122

Query: 129 CSGL--------KPW---ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           C            PW   ++V L   R K +  S W  Y++ LP   D+ V W E +L E
Sbjct: 123 CPDFVDPGFYKEAPWFVKMAVLLLWERRKGRGSSVWG-YIEQLPSSIDTPVRWEEADLAE 181

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS 235
           LQ    +      +   + +Y +     + P +  +    + D+FLWA   +RSRAFS
Sbjct: 182 LQYQPAIKEIKQQQTAWRQQYNRF-CAAVRPGQGNY----SWDNFLWAAENVRSRAFS 234


>gi|341883062|gb|EGT38997.1| CBN-SET-29 protein [Caenorhabditis brenneri]
          Length = 414

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 57/239 (23%), Positives = 101/239 (42%), Gaps = 39/239 (16%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-----ASEIGSLCSGLKPWISVALFLIRE 145
           G G+ A       + ++ +P    IN   V        ++  +   +KP   + +F   E
Sbjct: 28  GNGIYATTGFRTGKPIITLPEHDMINSALVVDLPFYKKKLAKINEKMKPMEILTMFFSFE 87

Query: 146 KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT--LGVKEYVQNEYLKVEE 203
              E S W  YL +LP+  D+  F         +G      T  L +++Y  ++  ++  
Sbjct: 88  DF-EQSAWSPYLKVLPKTFDTPAF---------KGIDYDVNTLPLSIRKYWIDQKKEI-S 136

Query: 204 EIILPNKQLFPRPITLDDFLWAFGILRSRAF----------SRLRGQNLVLIPLADLINH 253
           EI    + LFP  +T D  LWA+ ++ +R                G  + +IP  D++NH
Sbjct: 137 EISEKLRHLFPE-LTHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDMLNH 195

Query: 254 SPGITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
            P    E Y    +G  L   R+  + ++    ++ GEQ+ + Y  +  NA L ++YGF
Sbjct: 196 DP----EKY----QGVALHEKRNGRYVVQAKRQIQEGEQIFVCYGAH-DNARLLVEYGF 245


>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
 gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
          Length = 593

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 73/311 (23%), Positives = 125/311 (40%), Gaps = 36/311 (11%)

Query: 76  VVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW 135
              P  P+R        GL A   +A  +VVL VP    I+ +T   S++G + S L   
Sbjct: 185 AAQPPPPLR--------GLRADTAVAPGDVVLHVPADLLISYETAKKSDLGKVLSALPLD 236

Query: 136 I---SVAL-FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVK 191
           +   S+AL +   E+ + ++P   +   LP    + +  S+E++  L+GT L    +  +
Sbjct: 237 LSDDSIALIWTCVERHEPEAPHAPFWAALPHSFSTALSASQEDVALLEGTPLHGDAVRAR 296

Query: 192 EYVQNEYLKVE---EEIILPNKQLF-PRPITLDDFLWA------FGILRSRAFSRLRGQN 241
           +++   +         ++      F P   + + +LWA      +GI    A   +R   
Sbjct: 297 QHLSEAFESSSPAFRSLLGAYPDYFKPEWFSWESYLWAAELWYSYGIQVQFASGDIR--- 353

Query: 242 LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS 301
             L P   L+NH P        + +  + +        +R   P +AG Q+ + Y    S
Sbjct: 354 TCLAPYLGLMNHHP------LPHVVHFSKVDPETGCLRVRAFRPCEAGNQLFLSYG-PYS 406

Query: 302 NAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           NA+L L YGF    +  +   L L++    P      D       G S    +  G  L 
Sbjct: 407 NAKLLLFYGFAVRDNPADEVELVLQV----PPGAAATDRRALLAAGLSLEHRLRAGGRLA 462

Query: 362 PAMLQYLRLVA 372
           P +L   RL+A
Sbjct: 463 PPLLSCARLLA 473


>gi|238489143|ref|XP_002375809.1| SET domain protein [Aspergillus flavus NRRL3357]
 gi|220698197|gb|EED54537.1| SET domain protein [Aspergillus flavus NRRL3357]
          Length = 496

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 82/192 (42%), Gaps = 15/192 (7%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIIL 207
           +PW  Y+  LP       F++ EE   L+GT L          ++NE+    +  E I  
Sbjct: 131 NPWTEYIQYLPSSITLPTFYTVEERELLRGTSLKLAVDAKIVSLENEFELLRQSTENISW 190

Query: 208 PNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGITTEDYA 263
             K  +       TLDD+ +   + RSR    L      ++P  D+ NH S  I    Y 
Sbjct: 191 CRKHWWDENTGRFTLDDWKYVDAMYRSRMVD-LPSSGHAMVPCIDMANHASEDIVKALYE 249

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES-KSDRNAYT 322
            + +G      + +  LR+   + + E+V I Y  +K  +E+   YGF+E+ +       
Sbjct: 250 EDTEG------NAVLQLRSGRKLHSDEEVTISYGDDKPASEMIFSYGFLENERGGAKQIF 303

Query: 323 LTLEISESDPFF 334
           L L+I E DP  
Sbjct: 304 LNLDIPEDDPLI 315


>gi|213403926|ref|XP_002172735.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212000782|gb|EEB06442.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 495

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 73/144 (50%), Gaps = 13/144 (9%)

Query: 123 SEIGSLCSGLKPWISVALF-----LIREKKKEDSPWRVYLDIL--PECTDSTVFWSEEEL 175
           S I + CS   P+ ++AL      L+ EK  E S WR ++D L   E  D  ++W  E L
Sbjct: 84  STIDTDCSS-SPFDALALVRLTVALLLEKYNERSFWRPFVDSLLNAETPDLPIYWPPETL 142

Query: 176 VELQGTQLLSTTLGVKEYVQNEYLKVEE---EIILPNKQLFPRPITLDDFLWAFGILRSR 232
            +L+GT + S  +   +    +++ V +   E +L  K L P P +  ++L AF ++++R
Sbjct: 143 EQLEGTCVHSLVVNCSKLTAFQFISVVQPFFETVLVPKGL-PCP-SWSEYLHAFVLVQTR 200

Query: 233 AFSRLRGQNLVLIPLADLINHSPG 256
            F       L L+P  D++NH  G
Sbjct: 201 CFFIDSTHQLALVPFCDILNHRSG 224


>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
          Length = 435

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 66/155 (42%), Gaps = 6/155 (3%)

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRT 359
           + N+ L ++YG ++  +  +   L++ I   DP +  K D     GL     F +    +
Sbjct: 258 QPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLSTQQTFQLQAAAS 317

Query: 360 LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSG 419
           LP  +L YLRLV          E+      W     PV+  NE  +   +    +   S 
Sbjct: 318 LPAQLLPYLRLV------HSTREADVEGVKWEEEAGPVAPENELTVLNQLITHLRLRQSR 371

Query: 420 FHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           + TTIEEDE ++      PR  +A  +   EK +L
Sbjct: 372 YRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406


>gi|449453201|ref|XP_004144347.1| PREDICTED: N-lysine methyltransferase setd6-like [Cucumis sativus]
          Length = 500

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 100/216 (46%), Gaps = 6/216 (2%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++  F +W+  Q +    +     T   G+ + A  D+ + +VV  VP    +   T +A
Sbjct: 5   RLRVFKRWMTSQGIQCSDALQFTDTPDNGISVKALYDLREGDVVANVPKLACLTVKTTSA 64

Query: 123 SEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEELVE-LQG 180
           S I     GL  ++ +++ L+ E+   E+S W  YL +LP+     + WS +++ + L G
Sbjct: 65  SSIIEEV-GLGGYLGLSVALMYERSLGENSNWAGYLQLLPDKECVPLLWSLQDVDQFLCG 123

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRG 239
           T+L  T    K  +  ++ +    +++    +F P    ++ +  A  ++ SR+F     
Sbjct: 124 TELHKTVKEDKTLMYEDWKENILPLMMSAPLMFSPEFFGIEQYFSARSLISSRSFDIDDF 183

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
               ++PLADL NH      ED  + +  + + S D
Sbjct: 184 HGFGMVPLADLFNHK--TNAEDVHFTLVSSDVESDD 217


>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
          Length = 482

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 97/431 (22%), Positives = 168/431 (38%), Gaps = 80/431 (18%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDT------------VAASEIGS------LCSGL 132
           G G++A RDI    V+  +P    ++ +T             AASE G           L
Sbjct: 39  GRGIIALRDIPAETVLFTIPRASILSVETSALRAQLPSLFAAAASEGGGTADDDDAPPPL 98

Query: 133 KPWISVALFLIREKKKEDSP-------WRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
             W ++ L L+ E  + D+        WR YLD+LP    + +FWS  EL  LQ +  ++
Sbjct: 99  DAWSALILVLLYEHLQRDADATGAACRWRPYLDVLPAAFATPMFWSPAELGALQASPAVA 158

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD--------------FLWAFGILRS 231
                +E   N +  +    +  +  +F     L D                +AF + + 
Sbjct: 159 KV--GRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLDKE 216

Query: 232 RAFSRLR---------GQNLV-LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR 281
                           G+ L+ ++P+AD++N        D  + +    +   D   ++ 
Sbjct: 217 EDEDEDGEDGWVEDRDGKALMGMVPMADILN-------ADAEFNVH---VNHGDDDLTVT 266

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISESDPFFGDKLDI 340
              P++AGE++L  Y  +  N+EL   YG++  +  R +   +  E+ E+      +L++
Sbjct: 267 ALRPIRAGEEILNYYGPHP-NSELLRRYGYVTERHARYDVVEIPWELIEAAAV--SQLNL 323

Query: 341 AETNGLGESAYFD-------IVLGRTLPPAMLQYL-------RLVALGGTDAFLLESIFR 386
            +        + D        VL R  P    + L       +   L G     L    R
Sbjct: 324 PQEAWAKVRGHLDEDEMEDSFVLERDCPEVTAEGLFPDAAPAQFTGLPGDLQEQLTEFLR 383

Query: 387 NTIWGHLDL-PVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVG 445
                   L P     EEL   +   A  +  + + TT+ +DEQLL    +  R R+A+ 
Sbjct: 384 AVRAVDPALVPDKRKREELQAAIAARALAALEAQYPTTLAQDEQLLATAAMGDRERMALI 443

Query: 446 VRAGEKRVLQQ 456
           VR GEKR+L++
Sbjct: 444 VRVGEKRLLRE 454


>gi|167523178|ref|XP_001745926.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775727|gb|EDQ89350.1| predicted protein [Monosiga brevicollis MX1]
          Length = 469

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 76/309 (24%), Positives = 124/309 (40%), Gaps = 46/309 (14%)

Query: 54  SATTNPPTAQVETFWQWLRDQKVV-SPKSPIRPATFPEG---------LGLVAQRDIAKN 103
           + T    TA VE F QW +DQ +V  P+  +       G          GL A   + + 
Sbjct: 31  ATTGGETTAVVEGFMQWSQDQGLVWHPEVEVTSTHAASGGDDGYLGPQRGLKAAAALGEG 90

Query: 104 EVVLEVPMKFWINPD-TVAASEIGSLCSGLKPWISVALF------LIREKKKEDSPWRVY 156
            VVL VP++  I  +  +   E G L   L     + +F      L+        PW  Y
Sbjct: 91  TVVLSVPLRSLITAEHALLDVETGRLWDALYELSDLDIFTGFLAHLLYNPHATQDPWAPY 150

Query: 157 LDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-- 214
           L ILP      +  S +E+       +++  L  + Y  +  +    +I+     + P  
Sbjct: 151 LAILPSTDPGLLELSADEIRPYTAMPIVA-ALQARNYSLSMSMT---DIVKMAATMAPTG 206

Query: 215 --------RPITLDDFLWAFGILRSRAFS-RLRG------QNLVLIPLADLIN--HSPGI 257
                   + +T   F  A  +LRSR+    +R         + L+P ADLIN    P +
Sbjct: 207 GDAVAWIAQNMTWSQFAHAHFLLRSRSHRVAIRDAMAEWQHAMALVPGADLINTDMDPAL 266

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD---LNKSNAELALDYGFIES 314
                    +G G+ +    F   T   + AGE++L QY     +++N +L LDYGF+ +
Sbjct: 267 LNVVCRTTDEGEGVNAN---FVCETTRDLDAGEELLAQYTSRAASRTNTKLLLDYGFVAT 323

Query: 315 KSDRNAYTL 323
            +  +A  L
Sbjct: 324 NNPDDALIL 332


>gi|409042840|gb|EKM52323.1| hypothetical protein PHACADRAFT_126051 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 435

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 75/288 (26%), Positives = 117/288 (40%), Gaps = 48/288 (16%)

Query: 63  QVETFWQWLRDQK-VVSPKSPIRPAT---FPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
             E F QWL +++  + P   ++P     F  GL +VA  +I  +  ++  P    I P+
Sbjct: 8   NTEAFRQWLHERRGYIHPDVYLKPGMCLHFSFGLNVVAMSNIPADVQIVSCPFSLAITPE 67

Query: 119 -----------------TVAA-SEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDIL 160
                            T+A  SE   +CS    +I +   L  +    +   R Y++IL
Sbjct: 68  VARRALAVVFRVDRSSPTLAGWSERQLVCS----YIVMHHILDGQSAPCELAHRPYINIL 123

Query: 161 PECTD---STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI 217
           P  TD   +++ ++ EEL   +GT L   T   ++  + E+      I + N  L  R  
Sbjct: 124 P-STDKLRTSLHFTREELEFFRGTNLYGATFDRRKAWEEEWEMCRSRIAISNSYLAHR-F 181

Query: 218 TLDDFLWAFGILRSRAFS-----------RLRGQNLVLIPLADLINHSPGITTEDYAYEI 266
           T + +L A   L SRAF               G   VL+P  D +NH+            
Sbjct: 182 TWEHYLTASTYLSSRAFPSTLLSDNPTLVSTPGSYPVLLPGVDSLNHARAQPVSWVVSNP 241

Query: 267 KGAGLFSR---DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           + +   S     +  +L T  P  AG ++L  Y   K NAEL L YGF
Sbjct: 242 QPSAPLSDSEPSISLALHTLTP--AGAELLNNYG-PKPNAELILGYGF 286


>gi|322700433|gb|EFY92188.1| hypothetical protein MAC_01789 [Metarhizium acridum CQMa 102]
          Length = 469

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 67/254 (26%), Positives = 106/254 (41%), Gaps = 54/254 (21%)

Query: 121 AASEIGSLC---------SGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTV-- 168
           A SE G+ C         +   P +   LFLI+E  K++ S W  Y+  LP+        
Sbjct: 67  AVSEQGTCCPDAFHKEFLAKAAPHVVGRLFLIKEYLKRDKSFWHPYIQALPQPGQGNKSQ 126

Query: 169 -----FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL----PNKQLFPRPITL 219
                FW ++E   L+GT +      ++  V+ +  +  E + L     +   F   +T 
Sbjct: 127 WALAPFWDDDEAELLEGTNVEVGIDKIRNDVKRDLKEARELLRLHGDGADGGAFNEALTT 186

Query: 220 DDFLWAFGILRSRAFS----------RLRGQNL------VLIPLADLINHSPGITTE--- 260
           + + WA+ I  SR+F           R+  + +      VL+PL D+ NH   +TTE   
Sbjct: 187 ELYQWAYCIFSSRSFRPSLVLSDKQRRMLPEGVDTDDFSVLLPLFDIGNHD--MTTEVRW 244

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI--ESKSDR 318
           D   E +            LR       G+QV   Y + K+NAEL L YGF+   +K   
Sbjct: 245 DLDNERQNC---------ELRVGKTHMPGQQVFNNYSM-KTNAELLLGYGFMLPATKELH 294

Query: 319 NAYTLTLEISESDP 332
           N YT   + + + P
Sbjct: 295 NDYTHVRKRTAAAP 308


>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3-like [Glycine max]
          Length = 449

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 97/419 (23%), Positives = 166/419 (39%), Gaps = 72/419 (17%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKF----------WINPDTVAASEIGSLCSGLKPWISVAL 140
           GLGLV + +I +     ++P++F           I      A E+ ++  GLK       
Sbjct: 44  GLGLVTKEEIPRGS---DLPLRFTSLERDPLLLIIRLYXHLAEELWAMKLGLK------- 93

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
            L++E+ K  S W  Y+  LPE     +F+  E++  L    LL        ++ +   +
Sbjct: 94  -LLQERAKVGSFWWPYITNLPETYTVPIFFPGEDIKNLHYASLLHQVNKRCRFLLDFERE 152

Query: 201 VEEEII--LPNKQLFP-RPITLDDFLWAFGILRSRAFSRLRGQN---------LVLIPLA 248
           V+  ++   P+K  F  + +      WA   + SRAF RL G+           +++PL 
Sbjct: 153 VKRALVSLTPDKHPFGGQEVDASSLGWAMSAVSSRAF-RLYGEKDPNGIRIDVPMMLPLI 211

Query: 249 DLINHS----PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           D+ NHS      I  E    + K            +     +K  + +L+ Y    +N  
Sbjct: 212 DMCNHSFNPNARIVQEQDTSDSK----------MKVVAETAIKEDDPLLLCYGC-LNNDL 260

Query: 305 LALDYGFIESKSDRNAYTLTLE----------ISESDPFFG-----DKLDIAETNGLGES 349
             LDYGF+   +  +   L  +             S P F       +L +++ N  GE+
Sbjct: 261 FLLDYGFVMHSNPYDCIELKYDGALLDAASTAAGVSSPNFSTPAPWQELILSQLNLAGET 320

Query: 350 AYFDIVLG--RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDL--PVSHANEELI 405
               + LG   T+   +L  LR++     +       +  +I   LD   P+  ANE  +
Sbjct: 321 PDLKVSLGGQETVEGRLLAALRVILSTNVETM---QKYDLSILQSLDAEAPLGVANEIAV 377

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFKER 464
            R +   C  AL  F T I +DE LL+ G       +A+  R  +K V+  + +    R
Sbjct: 378 FRTLIALCVIALGHFPTKIMDDESLLKQGA-SGSTELAIQYRIQKKSVIIDVMKNLSRR 435


>gi|226293110|gb|EEH48530.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 535

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 87/206 (42%), Gaps = 13/206 (6%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           +E  +  S W  Y+  LPE      F++ EE   L GT L   L   +   E   N+  +
Sbjct: 124 KEHIRMSSAWTEYVKFLPEPLPLPTFYTIEERELLHGTSLELALDAKIASLEKEFNDLRE 183

Query: 201 VEEEIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
              +I   N+  +      +T +D+     + RSR+     G   V++P  D+ NH+   
Sbjct: 184 ATMDIKWCNRYWWHEETGRLTFEDWKLVDALYRSRSLE-FPGIGHVMVPCLDMTNHASND 242

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK-S 316
            T +  YE+ G G    + +  L     +  G++V I Y   K  AE+   YGF+E+  +
Sbjct: 243 LT-NALYEVDGGG----NAVLQLHWGRTLGKGDEVTITYGDEKGAAEMIFSYGFLENGLT 297

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAE 342
                 L L+I   DP    K  + +
Sbjct: 298 KARQLFLDLDIPCDDPLLPPKKAVCD 323


>gi|383863099|ref|XP_003707020.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
           rotundata]
          Length = 222

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 72/137 (52%), Gaps = 8/137 (5%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           F  WL++       + I  A FP   LGL A+ D  +NE++L++P     N    AA E+
Sbjct: 61  FMNWLKENSADIEGASI--AEFPGYDLGLKAEHDFQENELILKIPRHLIFNTLN-AAPEL 117

Query: 126 GSLCSGL----KPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
            +L +       P +++A+ L+ E+ KE+S W+ YLD+LP    + ++ +  +++EL+G+
Sbjct: 118 TALKNDPLIHDMPQVALAIALLIERYKENSKWKPYLDVLPTTYTTVMYMTAADMIELKGS 177

Query: 182 QLLSTTLGVKEYVQNEY 198
             L   L     +  +Y
Sbjct: 178 PTLVPALKQCRNIARQY 194


>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
 gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
          Length = 467

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 74/268 (27%), Positives = 107/268 (39%), Gaps = 42/268 (15%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS---EIGSLCSGLKPWISVALFLIREKK 147
           G G+ A RD+   E++  +P    +   T AA    E   L  GL   ++VAL   R K 
Sbjct: 31  GAGVRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG--LTVALMYERSKG 88

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE--EEI 205
           K  S W  YL  LP        WSEEE+  L    LL T L  K   +++ L  E  EE 
Sbjct: 89  K-GSKWYRYLKTLPRQESVPFLWSEEEIDGL----LLGTELH-KALKEDKLLMKEDWEEN 142

Query: 206 ILPNKQLFP-----RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTE 260
           I P  +  P     +  T + +L A  ++ SR+F         ++PLADL NH      E
Sbjct: 143 IAPLTKEDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHGYGMVPLADLFNHKT--DAE 200

Query: 261 DYAYEIKGA-------------GLFSRDL--------LFSLRTPVPVKAGEQVLIQYDLN 299
           D  + +  +             GL + D         +  +     V AG ++   Y   
Sbjct: 201 DVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAAGSEIFNTYG-Q 259

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEI 327
             NA L   YGF E  +  +   L ++ 
Sbjct: 260 LGNAALLHRYGFTEPNNPHDIVNLDMDC 287


>gi|308812165|ref|XP_003083390.1| ADR237Cp (ISS) [Ostreococcus tauri]
 gi|116055270|emb|CAL57666.1| ADR237Cp (ISS) [Ostreococcus tauri]
          Length = 392

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 112/267 (41%), Gaps = 49/267 (18%)

Query: 222 FLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR 281
           F WA   +RSRA     G++  L P+ D++NHS       +     G  +    +L +LR
Sbjct: 133 FAWARACVRSRAVGLEHGESF-LAPVLDMVNHSHTAANVKWDPSDDGDAV----VLRALR 187

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIA 341
           T   +++GE++L QY    + + L L  GF+   +  +   L   +  +  +F +     
Sbjct: 188 T---IESGEELLTQYACEPAESFL-LHMGFVGGMNPYDRVELWTSLEAAADWFAETF--- 240

Query: 342 ETNGLGESAYFDIVLGRTLPPAMLQYLR------LVAL-GGTDAFLLESI------FRNT 388
             +  GE    D   GR +   +    R      +V L GG DA   ES        ++T
Sbjct: 241 -KSERGE-VVLDKAAGRQVAMELEAQQRARERASIVGLAGGADAVADESAEIRLREVQST 298

Query: 389 IWG------------------HLD---LPVSHANEELICRVVRDACKSALSGFHTTIEED 427
           + G                  H     +P  +A+  ++   ++  C+  L    TTIEED
Sbjct: 299 LMGPSVGWKTDYDEALWLMFRHFTKALMPDENADH-IVLHAIKKRCEEMLMSMRTTIEED 357

Query: 428 EQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           E+L    +  PRLR+AV  R  +K +L
Sbjct: 358 EKLFSAEDTPPRLRLAVEFRLYKKCLL 384


>gi|380472668|emb|CCF46664.1| SET domain-containing protein RMS1, partial [Colletotrichum
           higginsianum]
          Length = 289

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 66/244 (27%), Positives = 104/244 (42%), Gaps = 49/244 (20%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-ASEIGSLCSG------------LKPWIS 137
           G G++A +DIA   V+  +P K  IN +T     +I  + +G            L  W S
Sbjct: 41  GRGIIATKDIAPETVLFTIPRKSIINTETSELPKKIPQVFTGNDGDDEDMENEPLDSWGS 100

Query: 138 VALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLG---VKEY 193
           + L +I E  + D SPW+ Y ++LP+  D+ +FW   +L  L+G+ +LS  +G     E 
Sbjct: 101 LILVMIYEYLQGDASPWKPYFEVLPDKFDTLMFWESSDLEWLRGSAVLS-KIGKDEADEM 159

Query: 194 VQNEYLKV---EEEIILPNKQLFPRPITLDDFLWAFG-ILRSRAF--------------- 234
            ++  L V      I  P     P    L       G I+ + AF               
Sbjct: 160 FRSRILSVIAANPTIFFPQGVAQPSETELLQLAHRMGSIIMAYAFDLENEEEPEEENEEW 219

Query: 235 --SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
              R     L ++P+AD++N       E  A+   G    S   + +LR   P+KAGE++
Sbjct: 220 VEDRDGKTMLGMVPMADILN----ADAEFNAHVNHGEDDLS---VVALR---PIKAGEEI 269

Query: 293 LIQY 296
           L  Y
Sbjct: 270 LNYY 273


>gi|239614757|gb|EEQ91744.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 494

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 54/198 (27%), Positives = 89/198 (44%), Gaps = 15/198 (7%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL-LSTTLGVKEYVQNEYLKVE 202
           RE  K  + W  Y+  LP       +++ EE   L GT L L+  L +   ++ E+ ++ 
Sbjct: 124 REHIKISNAWTEYIKFLPASYSLPTWYTIEERELLHGTSLELALDLKLAS-LEKEFEQLR 182

Query: 203 E---EIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           E   +I   ++  +      +  +D+     + RSRA     G+   ++P  D+ NH+ G
Sbjct: 183 EATMDIPWCSRDWWNEDTGRLAFEDWKLVDAMYRSRAL-EFPGKGHSMVPCVDMANHTSG 241

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK- 315
           I T    YE    G    + +  L     +  G++V I Y   K  AE+   YGFIES  
Sbjct: 242 IQT-GALYETDANG----NAVLQLGWGQSLDVGDEVTITYGDEKGAAEMIFSYGFIESNL 296

Query: 316 SDRNAYTLTLEISESDPF 333
           ++ +   L L+I  SDP 
Sbjct: 297 TNAHQLFLDLDIPASDPL 314


>gi|440639458|gb|ELR09377.1| hypothetical protein GMDG_03941 [Geomyces destructans 20631-21]
          Length = 485

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 78/358 (21%), Positives = 132/358 (36%), Gaps = 43/358 (12%)

Query: 59  PPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAK------NEVVLEVPMK 112
           PP   V +   W +   +      +       G G+V+   I        N  VL VP  
Sbjct: 3   PPALPVSSLRTWAKFNGIEFEDISVEKNKEYGGYGVVSTTKIDSAPEQEGNLTVLNVPKD 62

Query: 113 FWINPDTVAA--------SEIGSLCSGLKPWISVALFLIREKKKEDS----------PWR 154
             ++ +T+A          +I     G      V LFL+ +  +  S          PW 
Sbjct: 63  LILSAETIAEHAKVDKHFGQILEAVGGSTLRGDVMLFLLMQVTRASSDPSIKFSVRGPWT 122

Query: 155 VYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII-LP---NK 210
            Y+ +LPE       W ++++  L GT L          +  E+  + E    +P   N 
Sbjct: 123 EYVKMLPEYISLPTAWHDDQINLLNGTSLEKAVAAKVSALVREFETLRENTTEIPWCHNA 182

Query: 211 QLFPRPITLDDFLWAFGILRSRAFS-RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGA 269
                 +   D++      RSR+    L G+   ++P  D+ NHS    +  Y   I   
Sbjct: 183 WWETEHLEFKDWILIDSWYRSRSLELPLSGE--AMVPFLDMANHSANANSH-YQQGIDD- 238

Query: 270 GLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
                ++L  ++    ++ GE++ I Y   KS AE+   YGFI+  S  ++  L +  S 
Sbjct: 239 -----EVLLQVKPGQHIEKGEELTIDYGSAKSAAEMLFSYGFIDDLSSVHSLVLHISPSP 293

Query: 330 SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRN 387
            DP    K+ I         +  D  L  T P     ++ L+ + G D    + +  N
Sbjct: 294 DDPLGKAKVMIYGKPPTLHISATDDSLELTCP-----FIYLMCVHGDDGLDFKVLLEN 346


>gi|327357292|gb|EGE86149.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 494

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 54/198 (27%), Positives = 89/198 (44%), Gaps = 15/198 (7%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL-LSTTLGVKEYVQNEYLKVE 202
           RE  K  + W  Y+  LP       +++ EE   L GT L L+  L +   ++ E+ ++ 
Sbjct: 124 REHIKISNAWTEYIKFLPASYSLPTWYTIEERELLHGTSLELALDLKLAS-LEKEFEQLR 182

Query: 203 E---EIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           E   +I   ++  +      +  +D+     + RSRA     G+   ++P  D+ NH+ G
Sbjct: 183 EATMDIPWCSRDWWNEDTGRLAFEDWKLVDAMYRSRAL-EFPGKGHSMVPCVDMANHTSG 241

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESK- 315
           I T    YE    G    + +  L     +  G++V I Y   K  AE+   YGFIES  
Sbjct: 242 IQT-GALYETDANG----NAVLQLGWGQSLDVGDEVTITYGDEKGAAEMIFSYGFIESNL 296

Query: 316 SDRNAYTLTLEISESDPF 333
           ++ +   L L+I  SDP 
Sbjct: 297 TNAHQLFLDLDIPASDPL 314


>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
          Length = 903

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 73/297 (24%), Positives = 119/297 (40%), Gaps = 50/297 (16%)

Query: 65  ETFWQWLRDQKVVS-PKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           +   QWL +  + S  ++ +  A F   G G++A   I     VL +P    IN   VA 
Sbjct: 388 DNLLQWLHNAGMTSIAENHLSIADFEHTGRGVLANERIEAGVEVLHLPQHLLINIH-VAL 446

Query: 123 SE---IGSLCSGLKPWIS----VALFLIREK--KKEDSPWRVYLDILPECTDSTVFWSEE 173
            E   IG + S L+        + L+++ EK      S W  + + LP   +S + +   
Sbjct: 447 DESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWAPFFETLPATYNSPLLFHVT 506

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR-----PITLDDFLWAFGI 228
           EL+EL+GT+L+  T  +K+      L+V  E + P  + +P        T ++ LW   +
Sbjct: 507 ELLELEGTRLIDETFEIKDG-----LRVLHESLGPLAEAYPALFPTDAFTYENLLWVRAM 561

Query: 229 LRSRAFSRLRGQNLV------------------LIPLADLINHSPGITTEDYAYEIKGAG 270
           + SRA                            LIP  D+INH          Y+     
Sbjct: 562 IDSRAMKLPVPAAAAAVAAAAPEDATETPFVANLIPFVDMINHEEHSHISVRRYDTSAKA 621

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           L        L T     AG Q+ + Y    S  +L L YG +   ++ N  T+T+++
Sbjct: 622 LV-------LTTLGACAAGTQLSLHYSTLPSWQQL-LYYGML--STELNPLTVTVDV 668


>gi|308498155|ref|XP_003111264.1| CRE-SET-29 protein [Caenorhabditis remanei]
 gi|308240812|gb|EFO84764.1| CRE-SET-29 protein [Caenorhabditis remanei]
          Length = 401

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 58/238 (24%), Positives = 106/238 (44%), Gaps = 41/238 (17%)

Query: 87  TFPEGLGLVA--QRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIR 144
           +F  GL ++   + D+  + +VL++P             ++ ++   LKP   + +F   
Sbjct: 36  SFRSGLPIITLPEYDMINSALVLDLPF---------YRKKMANVNEKLKPMEILTMFFCF 86

Query: 145 EKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEE 204
           E   E S W  YL ILP+  D+  F       +     + +  L +++Y  ++  ++ E 
Sbjct: 87  EDF-EQSAWSPYLKILPKEFDTPAF-------KRIDYDVNTLPLSIRKYWIDQKKEISE- 137

Query: 205 IILPNKQLFPRPITLDDFLWAFGILRSRAF----------SRLRGQNLVLIPLADLINHS 254
           I    ++LFP  +T D  LWA+ ++ +R                G  + +IP  D++NH 
Sbjct: 138 ISEKLRRLFPE-LTHDKILWAWHVVNTRCIFVENEEHDNVDNTDGDTIAVIPYVDMLNHD 196

Query: 255 PGITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           P    E Y    +G  L   R+  + ++    +  GEQV + Y  +  NA L ++YGF
Sbjct: 197 P----EKY----QGVALHEKRNGRYVVQARRQILEGEQVFVCYGAH-DNARLLVEYGF 245


>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
           206040]
          Length = 481

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 96/213 (45%), Gaps = 32/213 (15%)

Query: 130 SGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDST-----VFWSEEELVELQGTQL 183
           S  +P +   L LI+E  + E+S W  Y+  LP+  D        FW EEE   L+GT +
Sbjct: 80  SRARPHVIGRLLLIKELLRGEESFWWPYIQALPQPEDVDDWALPPFWPEEEAELLEGTNV 139

Query: 184 LSTTLGVKEYVQNEYLKVEEEIILPNKQL---FPRPITLDDFLWAFGILRSRAF------ 234
                 +++ ++ E+ + +  ++   K     F   +T + + WA+ I  SR+F      
Sbjct: 140 EVGLDKIRDDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVM 199

Query: 235 SRLRGQNL----------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPV 284
           +  + Q L          VL+PL D+ NH   +   D  +E+  A   +      LR   
Sbjct: 200 TEAQQQALPEDVSVDDFSVLLPLFDIGNHDMAV---DVRWELDAA---NSGAACQLRVGR 253

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
             + G+Q+   Y   K+NAEL L YGF+   +D
Sbjct: 254 EHQPGQQIFNNYS-PKTNAELLLGYGFMLPVTD 285


>gi|134079652|emb|CAK97078.1| unnamed protein product [Aspergillus niger]
          Length = 443

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 114/284 (40%), Gaps = 46/284 (16%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           P  Q E F +W + Q +V   + + PA FP  GLG++A R I K+ ++++VP    + P 
Sbjct: 7   PGEQHELFTEWAKTQGIVI--NGVSPARFPGRGLGMIATRKIEKDSILVKVPHSAMLTPS 64

Query: 119 TVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDST-VFWSE----- 172
            + ++      +        A +L         PWR     + + T S  + WS      
Sbjct: 65  KLPSTFTSRFPADTPTHTLYAAYLTNASPSHLKPWRNTWPTMEDFTSSMPILWSSTSPLT 124

Query: 173 --------EELVELQGTQLLST-TLGVKEYV----QNEYLKVEEEIILPNKQLFPRPITL 219
                   ++L+    +   ST T G +++         LK +E  +     +  R    
Sbjct: 125 PNSKTSKIQDLLPPSISNTWSTITPGKRKHKSDTRHQNLLKAQETRLRKAWDIVVRVFPE 184

Query: 220 DD---FLWAFGILRSRAFSRL---------RGQNLVLIPLADLINHSPGITTEDYAYEIK 267
            D   F + + I+ +R+F  L         R   + L+P AD  NHS      D A  +K
Sbjct: 185 TDKELFTYHWVIVNTRSFFYLLPGAEMPEDRNDAMALVPFADYFNHS------DVACNVK 238

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
             G    + +F  R       GE++ + Y  + SN  L  +YGF
Sbjct: 239 FDG---EEYVF--RAAKEYNEGEEIYMSYGPH-SNDFLFTEYGF 276


>gi|356574815|ref|XP_003555540.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like [Glycine
           max]
          Length = 506

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 97/201 (48%), Gaps = 12/201 (5%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++  F +W++ + +    +     T  EG+ + A   + + +VV ++P +  +   T  A
Sbjct: 5   RLRAFKRWMKSKGLEWSDALEFVDTPEEGVEVRALCQLKEGDVVAKMPKEACLTTKTSGA 64

Query: 123 SEIGSLCSGLKPWISVALFLIREKKKE-DSPWRVYLDILPECTDSTVFWSEEELVEL-QG 180
            +I    +GL   + +A  ++ E+  + DSP+  YL +LP      + W+ +E+ EL  G
Sbjct: 65  RKIIEE-AGLDGHLGLAFAIMYERSLDGDSPFAGYLQLLPHQECVPIVWTLDEVNELLCG 123

Query: 181 TQLLSTTLGVKEYVQNEYLKVEEEIILP-----NKQLFPRPITLDDFLWAFGILRSRAFS 235
           T+L  T    K  + +++    +E ILP       +L P+   ++ +  A  ++ SR+F 
Sbjct: 124 TELHQTVQEDKALIYDDW----KENILPLLDLAPLKLNPKFFGVEQYFAAKSLISSRSFE 179

Query: 236 RLRGQNLVLIPLADLINHSPG 256
                   ++PLADL NH  G
Sbjct: 180 IDDYHGFGMVPLADLFNHKTG 200


>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 471

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 136/332 (40%), Gaps = 43/332 (12%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV- 194
           + + L L++E+   DS W  Y+  LPE     +F+  E++  LQ   LL        ++ 
Sbjct: 108 MKLGLRLLQERANVDSFWWPYISNLPETFTVPIFFPGEDIKNLQYAPLLYQVNKRCRFLL 167

Query: 195 --QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL---------- 242
             + E  +  E++   +     + +      W    + +RAF RL G             
Sbjct: 168 EFEQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAF-RLHGNKKLQGGSSDDVP 226

Query: 243 VLIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
           +++PL D+ NHS  P +       E  GA     + L  +     +K  + +L+ Y    
Sbjct: 227 MMLPLIDMCNHSFKPNVKI---IQEQNGA---ESNTLVKVVAETELKENDPLLLNYGC-L 279

Query: 301 SNAELALDYGF---------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNG 345
           SN    LDYGF         IE K D    +A ++   +S    S P       +++ N 
Sbjct: 280 SNDFFLLDYGFVIESNPYDTIELKYDEQLMDAASMAAGVSSPKFSSPAPWQHQLLSQLNL 339

Query: 346 LGESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFLLESIFRNTIWGHLDL-PVSHANE 402
            GE     + +G   P    +L  +R++  G  +   +E    +T+     + P+  ANE
Sbjct: 340 AGEMPNLKVTIGGPEPVEGRLLAAIRILLCG--EMVEVEKHDLDTLKSLSAIAPLGIANE 397

Query: 403 ELICRVVRDACKSALSGFHTTIEEDEQLLQGG 434
             + R V   C  ALS F T I EDE +++ G
Sbjct: 398 IAVFRTVIALCVIALSHFPTKIMEDEAIIKKG 429


>gi|317033156|ref|XP_001394952.2| ribosomal N-lysine methyltransferase [Aspergillus niger CBS 513.88]
          Length = 415

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 114/284 (40%), Gaps = 46/284 (16%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           P  Q E F +W + Q +V   + + PA FP  GLG++A R I K+ ++++VP    + P 
Sbjct: 7   PGEQHELFTEWAKTQGIVI--NGVSPARFPGRGLGMIATRKIEKDSILVKVPHSAMLTPS 64

Query: 119 TVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDST-VFWSE----- 172
            + ++      +        A +L         PWR     + + T S  + WS      
Sbjct: 65  KLPSTFTSRFPADTPTHTLYAAYLTNASPSHLKPWRNTWPTMEDFTSSMPILWSSTSPLT 124

Query: 173 --------EELVELQGTQLLST-TLGVKEYV----QNEYLKVEEEIILPNKQLFPRPITL 219
                   ++L+    +   ST T G +++         LK +E  +     +  R    
Sbjct: 125 PNSKTSKIQDLLPPSISNTWSTITPGKRKHKSDTRHQNLLKAQETRLRKAWDIVVRVFPE 184

Query: 220 DD---FLWAFGILRSRAFSRL---------RGQNLVLIPLADLINHSPGITTEDYAYEIK 267
            D   F + + I+ +R+F  L         R   + L+P AD  NHS      D A  +K
Sbjct: 185 TDKELFTYHWVIVNTRSFFYLLPGAEMPEDRNDAMALVPFADYFNHS------DVACNVK 238

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
             G    + +F  R       GE++ + Y  + SN  L  +YGF
Sbjct: 239 FDG---EEYVF--RAAKEYNEGEEIYMSYGPH-SNDFLFTEYGF 276


>gi|449465799|ref|XP_004150615.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449523996|ref|XP_004169009.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 483

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 99/409 (24%), Positives = 167/409 (40%), Gaps = 60/409 (14%)

Query: 91  GLGLVAQRDIAK-NEVVL---EVPMKFWINPDTVAASEIGSLCSGL------KPW-ISVA 139
           GLGL+A   I K +E+++    +P++F  +P+   + E  S+   L      + W + + 
Sbjct: 68  GLGLLASHHIPKGSELIILPHNLPLRFE-SPEAGDSDEADSVLVNLARQVPEELWSMKLG 126

Query: 140 LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV---QN 196
           L L++E+ K  S W  Y+  LPE     +F+  +++  LQ   LL        ++   + 
Sbjct: 127 LKLLKERAKVGSFWWAYIGNLPEVFTVPIFFPGDDIKNLQYAPLLYQVNKRCRFLLDFEK 186

Query: 197 EYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL---------VLIPL 247
           E  +  + I   N     + +      WA   + SRAF RL  +NL         +++PL
Sbjct: 187 EVKRTLDSIKPENHPFGGQTVDASSLGWAMAAVSSRAF-RLYSKNLTDSTPTSVPMMLPL 245

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKS---NAE 304
            D+ NHS         +      +  +D    L+  V  +   +      LN     N  
Sbjct: 246 IDMCNHS---------FNSNARIIQEQDASMKLKVKVVAETEIEENAPLTLNYGCLDNDL 296

Query: 305 LALDYGF---------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLGES 349
             LDYGF         IE K D     A ++   IS    S P    +L + + N  GE+
Sbjct: 297 FLLDYGFVVPSNQYDYIELKYDEALLEAASIVAGISSENFSSPAPWQRLILTKLNLHGEA 356

Query: 350 AYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLE----SIFRNTIWGHLDLPVSHANEELI 405
           A   + +G +          L  L   D  +++    S+ ++      + P+  ANE   
Sbjct: 357 ALLKVSIGGSEIVDGRLLAALRVLLSVDEEMVQKHDLSVLKSL---SAEAPLGIANEVAA 413

Query: 406 CRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
            R V   C  AL  F T I +DE LL+    +   ++A+  R  +K V+
Sbjct: 414 LRTVIALCVIALGHFPTKIMDDETLLKKCESETS-KLAIQFRLQKKSVI 461


>gi|126305181|ref|XP_001376097.1| PREDICTED: n-lysine methyltransferase SETD6-like [Monodelphis
           domestica]
          Length = 453

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 115/273 (42%), Gaps = 46/273 (16%)

Query: 67  FWQWLR--DQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA--- 121
           F  W R    ++ S  S  R  T   G G+VA  D+ + E++  VP    ++  T A   
Sbjct: 27  FLNWCRRVGLELSSKVSVTREGTVA-GYGMVALEDVQRGELLFVVPRAVLLSQKTTAIRD 85

Query: 122 --ASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPE--CTDSTVFWSEEELVE 177
               E G+L S    W+ + L L+ E   EDSPW  Y  + P+       +FWSE EL +
Sbjct: 86  LLEKEHGALQSQ-SGWVPLLLALLYEYLAEDSPWSCYFSLWPDLGSLQHPMFWSEGELRQ 144

Query: 178 -LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLFP---------RPITLD 220
            LQGT       GV E VQ +   + +E   I+ P    + ++FP         R +   
Sbjct: 145 LLQGT-------GVPEAVQRDLANISQEYDAIVQPFLEAHPEIFPPQARSLELYRRLVAM 197

Query: 221 DFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSL 280
              ++F                +++P AD++NH   +   +   E      +S + L  +
Sbjct: 198 VMAYSFQEPLEEEEDEKEPNPPMMVPAADILNH---VANHNANLE------YSPEYLRMV 248

Query: 281 RTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
            T  P+  G+++   Y    +N +L   YGF E
Sbjct: 249 ATQ-PILKGQEIFNTYG-QMANWQLVHMYGFAE 279


>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 513

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 67/135 (49%), Gaps = 9/135 (6%)

Query: 57  TNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWI 115
           TN P   V  F QWL+ Q+    ++ +  A  P  G G+VA RD+ K   +  +P    +
Sbjct: 3   TNTPKGNV--FIQWLKSQRAAVDETAMGIADIPGYGRGIVALRDLPKEHTLFTIPRNLVL 60

Query: 116 NPDTVAASEIGSLCS----GL-KPWISVALFLI-REKKKEDSPWRVYLDILPECTDSTVF 169
           +  T +  +     +    GL + WI + L ++  E +   S W  +  ILP+  D+ +F
Sbjct: 61  SMRTSSLPQKFGQAAWKKFGLHQGWIGLILCMLWEEAQGVSSKWHGFFPILPDGFDTPMF 120

Query: 170 WSEEELVELQGTQLL 184
           W + +L EL+GT ++
Sbjct: 121 WDDADLEELKGTAVV 135


>gi|448535330|ref|XP_003870959.1| Rkm1 protein [Candida orthopsilosis Co 90-125]
 gi|380355315|emb|CCG24832.1| Rkm1 protein [Candida orthopsilosis]
          Length = 565

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 90/224 (40%), Gaps = 61/224 (27%)

Query: 149 EDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQL----------------------- 183
           + S ++ YLD+LP  +  DS   WS ++   L+GT L                       
Sbjct: 84  QGSFYKPYLDLLPSLQAIDSPYTWSAQDKAYLKGTNLGNSLRENIASLVEGWWSAVNIIP 143

Query: 184 ------LSTTLGVKEYVQN------EYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS 231
                  S  + +K Y +N      +Y     E+ + N   FP      ++LWA  IL+S
Sbjct: 144 KEVPKPESHFINLKFYYENKFYTDDDYYNYFNEVDINNWTSFP------NYLWASLILKS 197

Query: 232 RAFS------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP 285
           RAF        L     +L+P+ DL+NH+P    E     + G  LF  D          
Sbjct: 198 RAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKAKAEWCC--VDGEFLFQSD---------G 246

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           V  GE++   Y   K N EL L YGF    +  ++  L ++I E
Sbjct: 247 VSKGEEIFNNYG-QKGNEELLLAYGFAIENNTADSAALKIKIPE 289


>gi|342877200|gb|EGU78693.1| hypothetical protein FOXB_10798 [Fusarium oxysporum Fo5176]
          Length = 456

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 60/214 (28%), Positives = 92/214 (42%), Gaps = 38/214 (17%)

Query: 148 KEDSPWRVYLDILPECTDSTV-----FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE 202
           K +S W  Y+  LP+  D        FW +E+    +GT +      ++  V+ E+    
Sbjct: 98  KGESFWAPYIQALPQPDDHDSWSLPPFWPDEDAELFEGTNIEVGVTSIRANVKREFKTAH 157

Query: 203 EEIILPNKQL-FPRPITLDDFLWAFGILRSRAF-----------SRL-RGQNL----VLI 245
           + +   + +L   +  TL  + WA+ I  SR+F            RL  G  L    VL+
Sbjct: 158 DLLAAESWELELLKQFTLPLYQWAYSIFSSRSFRPSLVLGPEDQQRLPEGVKLDDFSVLM 217

Query: 246 PLADLINHSPGITTEDYAYE-IKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           PL D+ NH      E    E I G          SL+     +AGEQV   Y + K+NAE
Sbjct: 218 PLFDVGNHDMTTKVEWVRNERINGC---------SLKVEKAYQAGEQVFNNYSM-KTNAE 267

Query: 305 LALDYGFI--ESKSDRNAYTLTLEISESDPFFGD 336
           L L YGF+  E++   N Y   + + +  P  G+
Sbjct: 268 LLLGYGFMLPETEELHNDY---VHVRKRQPAHGE 298


>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
          Length = 457

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 100/445 (22%), Positives = 178/445 (40%), Gaps = 84/445 (18%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           +WL+++      S +RPA F E G GL+  + +   ++++ +P K  +   TV +S +G 
Sbjct: 37  KWLKERGF--EDSNLRPAEFWETGRGLMTTKALQAGDLIISLPEKCLLTTGTVLSSCLGG 94

Query: 128 LCSGLKPWIS----VALFLI-------------------REKKKEDSPWRVYLDILPECT 164
                KP +S    +  FLI                    +   + SPW+ YLD+LP+  
Sbjct: 95  HIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKHAGQKSPWKPYLDVLPKA- 153

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP--RPITLDD- 221
                ++    +E     LL   L  K   Q   +   +E+   ++  F   +P+  +D 
Sbjct: 154 -----YTCPACLEPDIINLLPKPLQKKAQEQKMLI---QELFQSSRAFFSSLQPLFAEDT 205

Query: 222 --------FLWAFGILRSR----------AFSRLRGQNLVLIPLADLINHSPGITTEDYA 263
                     WA+  + +R           FS L      L P  DL+NHSP +  +   
Sbjct: 206 GNIFNFSALQWAWCTVNTRTIYMKHPHRECFS-LEPDVYALAPYLDLLNHSPNVQVK--- 261

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR----- 318
                AG   +   + + T    K  ++VLI Y  +  N  L L+YGF+ + +       
Sbjct: 262 -----AGFNEQTRSYEIWTDSQCKKYQEVLICYGPH-DNQRLLLEYGFVATDNPHSSVYV 315

Query: 319 NAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPP-AMLQYLRLVALGGTD 377
           +A TL    S  D     K+ I + +   E    ++  G   P   +L  L++++L   D
Sbjct: 316 SADTLLKYFSSLDKQREAKVSILKDHDFLE----NLTFGWEGPSWRLLTALKVLSLAA-D 370

Query: 378 AFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG-NL 436
            F   + +R  + G +    S  NE+         C+  +      + +  QL +   NL
Sbjct: 371 EF---ACWRRILLGDV---TSARNEQQALGTAAKICQFLIEETQHVLLQISQLKRDKENL 424

Query: 437 DPRLRIAVGVRAGEKRVLQQIDEIF 461
              L +   +R+ + ++LQ+  EI 
Sbjct: 425 KTHLALVEMLRSEDLKILQKSAEIL 449


>gi|18395523|ref|NP_564222.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|9743350|gb|AAF97974.1|AC000103_24 F21J9.27 [Arabidopsis thaliana]
 gi|332192432|gb|AEE30553.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 476

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 84/355 (23%), Positives = 140/355 (39%), Gaps = 50/355 (14%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV- 194
           + + L L++E+   DS W  Y+  LPE     +F+  E++  LQ   LL        ++ 
Sbjct: 113 MKLGLRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKRCRFLL 172

Query: 195 --QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL---------- 242
             + E  +  E++   +     + +      W    + +RAF RL G             
Sbjct: 173 EFEQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAF-RLHGNKKLQGGSSDDVP 231

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
           +++PL D+ NHS     +  A  I+       + L  +     VK  + +L+ Y    SN
Sbjct: 232 MMLPLIDMCNHS----FKPNARIIQEQNGADSNTLVKVVAETEVKENDPLLLNYGC-LSN 286

Query: 303 AELALDYGF---------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLG 347
               LDYGF         IE K D    +A ++   +S    S P       +++ N  G
Sbjct: 287 DFFLLDYGFVIESNPYDTIELKYDEQLMDAASMAAGVSSPKFSSPAPWQHQLLSQLNLAG 346

Query: 348 ESAYFDIVLGRTLP--------PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSH 399
           E     + +G   P          +L    LV +   D+  L+S+           P   
Sbjct: 347 EMPNLKVTIGGPEPVEGRLLAALRILLCGELVEVEKHDSDTLKSLSAVA-------PFGI 399

Query: 400 ANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           ANE  + R V   C  ALS F T I EDE +++ G +     +++  R  +K V+
Sbjct: 400 ANEIAVFRTVIALCVIALSHFPTKIMEDEAIIKQG-VSATAELSIKYRIQKKSVI 453


>gi|413942691|gb|AFW75340.1| hypothetical protein ZEAMMB73_065275 [Zea mays]
          Length = 210

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 69/130 (53%), Gaps = 11/130 (8%)

Query: 105 VVLEVPMKFWINPDTVAASE-IGSLCSGL------KPWISVALFLIREKKKEDSPWRVYL 157
           V + VP+   I P  V     +G  C  L         + V LFL+ E+++  S W+ YL
Sbjct: 77  VAMVVPLDLAITPMRVLQDPLVGPRCRALLEDGVVDDRLLVMLFLMAERRRPGSLWKPYL 136

Query: 158 DILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY-LKVE---EEIILPNKQLF 213
           D+LP    S+++++EEEL EL+GT L   TL  ++ +Q+ +  KV+   EE++  ++   
Sbjct: 137 DMLPSTFGSSLWFTEEELAELEGTTLHRATLIQRKSLQSSFDEKVKGLVEELLHVDESAS 196

Query: 214 PRPITLDDFL 223
              +  +DFL
Sbjct: 197 SVEVLFEDFL 206


>gi|301097023|ref|XP_002897607.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106825|gb|EEY64877.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 673

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/297 (21%), Positives = 121/297 (40%), Gaps = 57/297 (19%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           ++E F QW+           +R  +   G G  A + +A  +V L+VP++  +N  +   
Sbjct: 386 KLERFQQWIARHHFPVNSLELRYVSEAVGYGTFAVKRLAIGDVYLKVPVQVVMNVWSAVK 445

Query: 123 SEIGSLCSGLKPWISVALFLIREKK----------------------KEDSPWRVYLDIL 160
           S           W+S  +  +++++                         S W+ YL++L
Sbjct: 446 SR----------WVSQTMQELQKQRVSVDREEMLLLLHLLEEKFGPNHRHSFWKPYLEML 495

Query: 161 PECTDST---VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI 217
           P+  DS    +F+ E EL  L+GT LL+  +  ++ V   Y+ +   +         + +
Sbjct: 496 PDLEDSLNSPLFYEEVELKTLEGTDLLTLVVNYRKRVARSYVTIASYLKQSGHDETLQWL 555

Query: 218 TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADL-------INHSPGITTEDYAYEIKGAG 270
           T   F WA  +L SR+     GQ   L+PL D+       +NH P  T  D       +G
Sbjct: 556 TERRFRWANAVLDSRSIW-WSGQRH-LVPLLDMVNCQELNVNHKPHHTILD------SSG 607

Query: 271 LFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
             +       ++    +AG++V+  Y   ++N    L +GF+   +  +     LE+
Sbjct: 608 RHA-----VTKSSWEFQAGQEVVENY--AQTNYIYLLYHGFVLDSNSHDCAHFHLEM 657



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 84/190 (44%), Gaps = 26/190 (13%)

Query: 138 VALFLIREKKKEDSPWRVYLDILP-----ECTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
            A  ++ + K   S W  Y  +LP         S +F +EE++  LQ  +++ T    ++
Sbjct: 18  TAFLMLEQAKGHASRWAPYFQVLPSFISKNTVPSPLFSNEEDVDALQDERMIQTARTERQ 77

Query: 193 YVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLIN 252
             +  Y + +    L    L  + ++L  +LW   ++ SRAFS +RGQ  VL+P  D+ N
Sbjct: 78  RAKKAYGRFKR---LFRSFLDDKTMSLSRYLWTRFLVNSRAFS-IRGQR-VLVPFGDIFN 132

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLF--------SLRTPVPVKAGEQVLIQYDLNKSNAE 304
             P    +D A +      F   LLF        ++R      +G+ +   Y  N SN  
Sbjct: 133 GEP----DDEARQQDNGQRF---LLFHDLQPQGMTIRADRDTLSGQHLFEDYGDN-SNYV 184

Query: 305 LALDYGFIES 314
             L +GF+ S
Sbjct: 185 YFLHHGFLMS 194


>gi|255723006|ref|XP_002546437.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240130954|gb|EER30516.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 578

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 123/295 (41%), Gaps = 56/295 (18%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIA--KNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP 134
           +SP    +  T    +G +    IA   N   + +P K  I     A +E G      + 
Sbjct: 21  LSPDVQFKEIT-TNNIGAIYDGKIAPSDNGYPISIPFKLIITTQN-AITEFGKYLKSTED 78

Query: 135 WISVAL---FLIREKKKEDSPWRVYLDILPE--CTDSTVFWSEEELVELQGTQL---LST 186
             S A+   +L  E+   DS +  YL +LP     DS   WS ++   L+GT L   L  
Sbjct: 79  KNSNAILKFYLCHERINADSFYHPYLKLLPSLAAIDSPYTWSAQDKTYLKGTNLGNSLKE 138

Query: 187 TLG--VKEY------VQNEYLKVEEEII------------------LPNKQLFPRPITLD 220
            LG  V+E+      +++E  K E+  I                    N++      +  
Sbjct: 139 NLGSLVEEWWEVINLLKDEVSKPEQHFINMKFYYDFKFYTDDDLDKYLNEEDINNWTSFP 198

Query: 221 DFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR 274
           ++LWA  IL+SR+F            + +L+P+ DL+NH+P        +++        
Sbjct: 199 NYLWASLILKSRSFPAYLIDKSCNKNDAMLLPVVDLLNHNPQAKVN---WDV-------S 248

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISE 329
           D  F  ++   ++ G+++   Y L K N EL L YGF    + R++  L +++ E
Sbjct: 249 DGFFRFKSE-SIQPGKEIFNNYGL-KGNEELLLAYGFCIEGNPRDSVALKIKMPE 301


>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
          Length = 761

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/190 (27%), Positives = 90/190 (47%), Gaps = 19/190 (10%)

Query: 85  PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWI-------- 136
           PA F E  G+VA +DI  N  ++ VP    I+ +    S + S+     P +        
Sbjct: 37  PAAFGELTGVVATKDIPANTAIICVPQPLIISQEKCKLSSL-SIVYDKHPELFDENETSD 95

Query: 137 ----SVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVK 191
                +  +L  EKKK E S +  Y+  + +  ++ + WS+EEL  ++   +L     V+
Sbjct: 96  AEFNILIFYLFNEKKKGEKSFYHPYVQAI-QSNNTLIDWSKEELNYIEDPIILDEFAIVR 154

Query: 192 EYVQNEYLKVEEEIILPNKQLF--PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLAD 249
           E +++ + +  +EI     Q+F   RP   +DF WA   + SR F     ++  +IP+AD
Sbjct: 155 EDLKDLWNQA-KEIFNEFVQVFGETRPTDKEDFYWAAQSVMSRCFG-WSLKSTSMIPIAD 212

Query: 250 LINHSPGITT 259
            +NHS    T
Sbjct: 213 FLNHSNKACT 222


>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
 gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
          Length = 588

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 78/290 (26%), Positives = 115/290 (39%), Gaps = 43/290 (14%)

Query: 70  WLRDQKVVSPKSP------IRPATFP-EGLGLVAQRDIAKNEVVLEVPMK--FWINPDTV 120
           WLR     +P +       + PA FP  G G  A   I   ++   +P++  F +     
Sbjct: 157 WLRAHGGGAPTASSKTSISVAPARFPGTGRGAAATTHIPAGDIAAAIPVERLFTVRHALE 216

Query: 121 AASEIGS---LCSGLKPWISVALFLIREKK-KEDSPWRVYLDILP----------ECTDS 166
                G    + + L      AL+LI E+   E SPW   +  LP           C   
Sbjct: 217 MPGPRGDAYRMFAALGEDTIAALWLIAERALGEASPWHAVIASLPWPEGGEGSASPCGGC 276

Query: 167 T-VFWSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-RPITLDDFL 223
           T V W  E     L GT LL+  +   E +  ++  +   +      +FP    TLD+F 
Sbjct: 277 TPVSWPREACDALLGGTPLLADAIAASEKLARQHAALFPALSEHMADVFPASAYTLDNFR 336

Query: 224 WAFGILRSRAF----SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
            A     S       S        L P+A L NH+       + + ++    +SR    +
Sbjct: 337 RAHEAWNSYGMTVQASPGEPAATCLPPVAMLCNHA------LWPHVVR----YSRLRDGT 386

Query: 280 LRTPVP--VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           LR PV   V AGE+V + Y   KSNAEL L YGF    +  +   L+LE+
Sbjct: 387 LRLPVARSVHAGEEVFVSYG-AKSNAELLLFYGFALPGNPYDDVPLSLEL 435


>gi|157818191|ref|NP_001099637.1| N-lysine methyltransferase SETD6 [Rattus norvegicus]
 gi|325530256|sp|D3ZSK5.1|SETD6_RAT RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
 gi|149032380|gb|EDL87271.1| similar to hypothetical protein FLJ21148 (predicted) [Rattus
           norvegicus]
          Length = 474

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/306 (25%), Positives = 131/306 (42%), Gaps = 55/306 (17%)

Query: 44  FAATCSLHSASATT-----------NPPTAQVETFWQWL-RDQKVVSPKSPIRPATFPEG 91
            AA C +  A+A              P +  V  F +W  R    +SPK  +       G
Sbjct: 16  LAAPCPVPRAAARVPLPLPAGPSGHEPESDAVAGFLRWCTRVGLELSPKVLVSRQGTVAG 75

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-----GSLCSGLKPWISVALFLIREK 146
            G+VA+  +   E++  VP    ++P T + S++     G+L S L  W+ + L L+ E 
Sbjct: 76  YGMVARESVQPGELLFAVPRSALLSPHTCSISDLLERERGALQS-LSGWVPLLLALLHEL 134

Query: 147 KKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEE 203
           +   SPW  Y  + PE    +  +FW EEE +  L+GT       GV E V+ + + +  
Sbjct: 135 QAPASPWSPYFALWPELGRLEHPMFWPEEERLRLLKGT-------GVPEAVEKDLVNIRS 187

Query: 204 E---IILP----NKQLF-PRPITLDDFLWAFGILRSRAFSRLRG--------QNLVLIPL 247
           E   I+LP    +  LF P   +L+ +     ++ + +F              + +++P 
Sbjct: 188 EYYSIVLPFMEAHSDLFSPTVRSLELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPA 247

Query: 248 ADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
           AD++NH   I   +   E      +S + L  + T  P+  G ++   Y    +N +L  
Sbjct: 248 ADILNH---IANHNANLE------YSAEYLRMVAT-QPILKGHEIFNTYG-QMANWQLIH 296

Query: 308 DYGFIE 313
            YGF E
Sbjct: 297 MYGFAE 302


>gi|351697762|gb|EHB00681.1| SET domain-containing protein 6 [Heterocephalus glaber]
          Length = 486

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/263 (23%), Positives = 104/263 (39%), Gaps = 47/263 (17%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC------- 129
           +SPK  +       G G+VA+  +   E++  VP    ++P T +   IG L        
Sbjct: 73  LSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAALLSPHTCS---IGGLLERERDVL 129

Query: 130 SGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLST 186
                W+ + + L+ E +   SPW  Y  + PE    +  +FW EEE    LQGT     
Sbjct: 130 QSQSGWVPLLMALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGT----- 184

Query: 187 TLGVKEYVQNEYLKVEEE---IILPNKQLFP-------------RPITLDDFLWAFGILR 230
             GV E V  + + +  E   I+LP  +  P             R +      ++F    
Sbjct: 185 --GVPEAVDKDLVNIRGEYYAIVLPFMEAHPDLFGPSVRSLELYRQLVALVMAYSFQEPL 242

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                     + +++P AD++NH   +   +   E      +S D L  + T   +  G 
Sbjct: 243 EEEEEEKEPNSPLMVPAADILNH---LANHNSNLE------YSADYLRMVAT-RSIPKGH 292

Query: 291 QVLIQYDLNKSNAELALDYGFIE 313
           ++   Y    +N +L   YGF+E
Sbjct: 293 EIFNTYG-QMANWQLIHMYGFVE 314


>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 692

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 90/213 (42%), Gaps = 33/213 (15%)

Query: 117 PDTVAASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTD--STVFWSEE 173
           P     S+I  L   ++  I +A F+  E+ K  DS W  YL  LP  ++  S +F+ + 
Sbjct: 89  PHDFHCSDI--LSERVRKSIILAFFVAHEQLKGRDSHWWPYLATLPRASELTSALFYQDS 146

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRA 233
           +L  LQGT L  T    +  V+ EY      I+     L     + D F WA+ ++ SRA
Sbjct: 147 DLDWLQGTNLYQTHQAYRNTVKEEY-DSAISILRDEGCLAVESYSWDIFCWAYTLIASRA 205

Query: 234 F-SRLRGQNL-------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
           F SR+    L             +++PL D  NH P    E +  E    G         
Sbjct: 206 FTSRVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIE-WRAEATEIG--------- 255

Query: 280 LRTPVPVKAGEQVLIQYD-LNKSNAELALDYGF 311
           L+   P   GE++   Y  LN  N +L   YGF
Sbjct: 256 LKVIEPTFTGEEIHNNYGPLN--NQQLMTTYGF 286


>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 523

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 92/414 (22%), Positives = 166/414 (40%), Gaps = 42/414 (10%)

Query: 69  QWLRDQKVVSPKSPIRPATFPEGLG------LVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           QW+ +Q +   K  +      EG        +VA  D+   E+ L +P    +  + V  
Sbjct: 95  QWMEEQGLPECKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLG 154

Query: 123 SE-IGSL--CSGLKPWISVALFLIREKKK-EDSPWRVYLDILPE-------CTDSTVFWS 171
            E I  L   + L     +AL+L+ EKK+ ++S W  Y+  L            S + WS
Sbjct: 155 DETIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWS 214

Query: 172 EEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILR 230
            EEL E   G+ +    L     ++ EY +++    +        P  L    ++F I +
Sbjct: 215 REELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFK 274

Query: 231 SRAFSRLRG-----QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVP 285
            +AF  ++      Q + L     L+   P +    Y    K A L + D    L     
Sbjct: 275 -QAFVAVQSCVVHLQGVSLARRFALVPLGPPLLA--YKSNCK-AMLKAVDDNVVLEVDRA 330

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNG 345
            KAG+ + + +   + N++L L+YGF++  +  +   +   +   DP +  K  I + N 
Sbjct: 331 YKAGDPIAV-WCGPQPNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNN 389

Query: 346 LGESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEE 403
                 F I  G+ +     ML Y+RL  L   +     S  +  +      PVS  NE 
Sbjct: 390 RLTIQTFQIYKGKEMEAVLDMLPYMRLAHLADPEEMETVSFAQGPV-----CPVSACNER 444

Query: 404 LICRVVRDACKSALSGFHTTIEEDEQLLQGGNL--DPRLRIAVGVRAGEKRVLQ 455
            +   +    +  L+G+ ++        +GG+   + + R+A  +   EK +L+
Sbjct: 445 AVLEQLEQYFEKRLAGYKSS-----HATEGGDAKKNAKKRVAEKLLCIEKSILR 493


>gi|358388734|gb|EHK26327.1| hypothetical protein TRIVIDRAFT_35595 [Trichoderma virens Gv29-8]
          Length = 482

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 66/281 (23%), Positives = 113/281 (40%), Gaps = 40/281 (14%)

Query: 90  EGLGLVAQRDI-------AKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVA--- 139
           +GLGLVA+ DI       + +  ++ +P    ++ + V   E   +    K  + VA   
Sbjct: 32  KGLGLVAKHDITEAGRDASSSPAIVRIPRDLVLSAEAV--EEYAKVDQNFKQLLEVAGRQ 89

Query: 140 -------LFLIRE-----------KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGT 181
                  L+L+ +           +    +PW  Y+  LP        W+ EE   L+GT
Sbjct: 90  STRGDILLYLLTQIIQSKGTSPSTRPFASTPWTEYIKFLPRYIPIPTMWTNEERELLKGT 149

Query: 182 QLLSTTLGVKEYVQNEYLKV-EEEIILPN-KQLFPRPITLDDFLWAFGILRSRAFSRLRG 239
            L +        + +EY  + E+    P    LF   + ++D++ A    RSR     R 
Sbjct: 150 SLEAAVSAKLSALSHEYDGICEQASNFPFWNDLFWGTVKVEDWILADAWYRSRCLELPRA 209

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN 299
            +  ++P  D+ NHS    T    Y+    G    D++    +   + +G ++ I Y   
Sbjct: 210 GH-AMVPGLDMANHS---QTHSAYYDESSNG----DVVLLPSSGSSILSGGEITISYGQA 261

Query: 300 KSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDI 340
           KS AE+   YGFI+ +S      L +     DP    KL +
Sbjct: 262 KSAAEMLFSYGFIDQESSIKELKLQVAPLPDDPLGRAKLRV 302


>gi|261328372|emb|CBH11349.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 586

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 48/101 (47%), Gaps = 10/101 (9%)

Query: 216 PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           P T+  F+WA+  L SR FS    +   +IP  D  NHS             GA +F   
Sbjct: 254 PFTMQQFIWAYNTLMSRGFS-YDPEVWAVIPWVDYFNHSLTNNATMRFDRCMGAYIF--- 309

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
                 T  PV  G+QV +QY  + ++AEL L YGFI + S
Sbjct: 310 -----ETTAPVSKGDQVFLQYG-SYTDAELVLWYGFITTPS 344


>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
 gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
          Length = 689

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 83/201 (41%), Gaps = 28/201 (13%)

Query: 128 LCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTD--STVFWSEEELVELQGTQLL 184
           L   ++  I +A F++ E+ K  DS W  YL  LP  ++  S +F+ + +L  LQGT L 
Sbjct: 94  LSQRVRKSIILAFFMVHEQLKGRDSHWWPYLATLPRASELTSALFYQDNDLEWLQGTNLY 153

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNL- 242
            T    +  V+ EY      I+     L       D F WA+ ++ SRAF SR+      
Sbjct: 154 QTHQAYRNAVKEEY-DSAISILRDEGFLAVESYRWDIFCWAYTLIASRAFTSRVLDAYFS 212

Query: 243 ------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                       +++PL D  NH P    E +  E    G         L+   P  +GE
Sbjct: 213 NHPTLKQDEEFQIMLPLVDSSNHKPLAKIE-WRAEATEIG---------LKVIEPTFSGE 262

Query: 291 QVLIQYDLNKSNAELALDYGF 311
           +V   Y    +   +   YGF
Sbjct: 263 EVHNNYGSLNNQQSVMTTYGF 283


>gi|298711967|emb|CBJ32909.1| protein N-methyltransferase [Ectocarpus siliculosus]
          Length = 318

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 86/192 (44%), Gaps = 48/192 (25%)

Query: 67  FWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKF------------ 113
           F  W  ++  +  K+P    +F +GL GLVA ++IAKN+ ++ VP +             
Sbjct: 99  FLHWAGEE--IGVKAPKLKGSFVDGLRGLVATQNIAKNDEMVVVPSESALVTTTDQLCPF 156

Query: 114 --WINPDTVAASEIGSLCSGLKPW-ISVALFLIREKKK----EDSPWRVYLDILPECTDS 166
             W++ D  A+S          PW + +AL L+REK+K    E  PW   +  LPE   +
Sbjct: 157 PEWVSKDFWASS----------PWQVRLALLLLREKQKGAASELEPW---ISRLPESFGT 203

Query: 167 TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP---ITLDDFL 223
            V WS  EL ELQ   L +     +E  +  Y          +  +   P   +  ++  
Sbjct: 204 PVSWSAAELSELQYPHLEAVAREQREEWKGYY----------SSLMASSPGCGVAEEELS 253

Query: 224 WAFGILRSRAFS 235
           WA G+  SRAFS
Sbjct: 254 WAIGVAYSRAFS 265


>gi|209489216|gb|ACI49001.1| hypothetical protein Cbre_JD01.008 [Caenorhabditis brenneri]
          Length = 333

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 80/176 (45%), Gaps = 33/176 (18%)

Query: 149 EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT--LGVKEYVQNEYLKVEEEII 206
           E S W  YL +LP+  D+  F         +G      T  L +++Y  ++  ++  EI 
Sbjct: 9   EQSAWSPYLKVLPKTFDTPAF---------KGIDYDVNTLPLSIRKYWIDQKKEI-SEIS 58

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAF----------SRLRGQNLVLIPLADLINHSPG 256
              ++LFP  +T D  LWA+ ++ +R                G  + +IP  D++NH P 
Sbjct: 59  EKLRRLFPE-LTHDKVLWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDMLNHDP- 116

Query: 257 ITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
              E Y    +G  L   R+  + ++    ++ GEQ+ + Y  +  NA L ++YGF
Sbjct: 117 ---EKY----QGVALHEKRNGRYVVQAKRQIQEGEQIFVCYGAH-DNARLLVEYGF 164


>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 622

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/185 (28%), Positives = 87/185 (47%), Gaps = 26/185 (14%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATF---PEGLGLVAQRDIAKNEVVLEV 109
           ++ +T P   + E F +WLR  + V  +  I    F    EG GL+A+ +I K   +  +
Sbjct: 71  SNNSTGPADRRYEKFLEWLRKARGVELEETISAHEFRETGEGWGLIAETEIRKGTTLFSI 130

Query: 110 PMKFWINPD------TVAASEIGSLCSGL------KPWISVALFLIREKKK------EDS 151
           P      P       +V  S + ++ S        + W+ + L L+ E+ +      +  
Sbjct: 131 PRP---GPSANSPLLSVQTSRLMAVLSAQERARISRNWLPLLLVLLWERVQAILNPLDPL 187

Query: 152 PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ 211
            W  Y D LPE  D+ +FW+++EL EL G+ +L   +G KE V+ +Y  V + +I     
Sbjct: 188 RWAPYFDTLPEEFDTLMFWNQDELAELTGSTILD-KIG-KEEVEKDYETVIKPMIESRAD 245

Query: 212 LFPRP 216
           LFP P
Sbjct: 246 LFPVP 250


>gi|340923530|gb|EGS18433.1| hypothetical protein CTHT_0050300 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 463

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 96/381 (25%), Positives = 164/381 (43%), Gaps = 74/381 (19%)

Query: 130 SGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTL 188
           S    W  + L L+ E  + E S W+ YLD+LP   D+ +FW+ +EL ELQ + L+   +
Sbjct: 81  SAQDSWTLLILVLMYEHLQGESSRWKPYLDVLPSSFDTPMFWAPDELAELQASALVG-KV 139

Query: 189 GVKEYVQNEYLKVEEEIILPNKQLFPRPIT--LDD--------------FLWAFGILRSR 232
           G +E  +    KV   +I  ++ +F  P T  LDD                +AF +    
Sbjct: 140 GKEEADRMIRTKV-VSVIREHENVFFPPGTRRLDDEELFALAHRMGSTIMAYAFDLENDE 198

Query: 233 AFSRLRGQN-----------LVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR 281
                   +           L ++P+AD++N          A+   G    +     +LR
Sbjct: 199 DGEDNDEDDEWVEDREDKTMLGMVPMADMLNADAVFN----AHINHGKDTLT---ATALR 251

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR-NAYTLTLEISE----------- 329
           T   ++AGE++L  Y    S+AEL   YG++     R +   L   + E           
Sbjct: 252 T---IRAGEEILNFYG-PLSSAELLRRYGYVTQNHSRYDVVELPWALVEQEVRARVAGNM 307

Query: 330 SDPFFGDKLDIAETNGLGESAYFDIVLGRTL--PPAMLQYLRLVA-------LGGTDAFL 380
           +D  +     + E++   E A+   VL R+   P A  Q             LG     L
Sbjct: 308 ADADWAKVRRLVESDDEFEEAF---VLERSSEDPDATGQVNGEAVFEGLPDELGRQFRGL 364

Query: 381 LESIFRNTIWGHLDLPVSHANEE-----LICRVVRDACKSALSGFHTTIEEDEQLLQGGN 435
           L++I +    G++++ V    ++     L  + V  A ++    + TT+EED++LL G  
Sbjct: 365 LKAIKKV---GNVEMAVQALEDKEMRKGLYLQCVLSALRAREQQYATTLEEDDKLLDGA- 420

Query: 436 LDPRLRIAVGVRAGEKRVLQQ 456
            + R R+A+ VR GEK++L++
Sbjct: 421 CEGRRRMAIWVRRGEKQLLRE 441


>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1033

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/229 (23%), Positives = 102/229 (44%), Gaps = 34/229 (14%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWI-NPDTVAASEIGSLCSGLKPWISVALFLIREKKK-ED 150
           G+V  ++I + E +  +P+   + + +T+ A E+ S       W   A  L+REK K   
Sbjct: 562 GVVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDS-------WDRAAARLLREKAKGSS 614

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY--LKVEEEIILP 208
           S W  Y++ILP+     +   + EL E+Q   +L   + V++ ++  +  L V++     
Sbjct: 615 SAWASYINILPQNMTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLLSVDD----- 669

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN-----LVLIPLADLINHSPGITTEDYA 263
                      +++ WA  ++ SRAF+     +      V++P  D+INH      +  +
Sbjct: 670 -----LAGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMS 724

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
             I G  +        +     +K GE++   +   ++N  L L YGF+
Sbjct: 725 QPIWGGKV-------EIVARRDIKKGEELFASFG-PRANDNLFLYYGFV 765


>gi|340505923|gb|EGR32186.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 417

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 89/184 (48%), Gaps = 28/184 (15%)

Query: 90  EGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS----------- 137
           EGL G++A  +I  N V+  +P+K  I    +  SE+  +        S           
Sbjct: 148 EGLKGVIATENIPANTVICCIPIKLIITTKKIKESELKQIYKENPDIFSEKRNYDGESDM 207

Query: 138 VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN 196
           VA FLI EK K EDS +  Y   + E + +   W+ +E+ E +  +++S      +Y QN
Sbjct: 208 VAAFLIYEKLKGEDSFYYPYFQTV-EKSYTIYDWTIDEIQETENDEIISE---FNQYCQN 263

Query: 197 --EYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQNL---VLIPLADL 250
             E++ +  E +L   Q F P+ ++ + ++W++ +L +R F    G  L    L+P  D+
Sbjct: 264 MDEWI-ILHETLLKYPQFFDPKKLSKEIYIWSYELLMTRMF----GHGLPCSFLVPFGDM 318

Query: 251 INHS 254
            NH+
Sbjct: 319 FNHN 322


>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 719

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 69/261 (26%), Positives = 112/261 (42%), Gaps = 29/261 (11%)

Query: 65  ETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           + F +WLR             +    G+ G VA+RDIA  + ++ +P    ++     A 
Sbjct: 21  QEFLRWLRSHGAAIDCVEWPSSETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKAD 80

Query: 124 EIGSLCSGLKPWI-----SVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
                   L   +      +AL++++E  K+E S +  YL +LP   +    W+ E L+ 
Sbjct: 81  PKYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCN-LRNWNRESLLL 139

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLD-------DFLWAFGILR 230
           LQ  +L+  T         + L +  E I      +P   T D       DF W    ++
Sbjct: 140 LQDHKLVRRTAARSR----QLLALYRETIEFLSSSYPELYTADRYTFELFDFAWR--TIQ 193

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
           +RAF + R ++  L+P AD +NH  G     Y +++ G G F   L  S     P  +  
Sbjct: 194 ARAFGK-RLKSSALVPFADCLNH--GNVQTKYDFDVGGNGTFR--LFPSGNNRYPRNS-- 246

Query: 291 QVLIQYDLNKSNAELALDYGF 311
           +VL  Y   ++N  L LDYGF
Sbjct: 247 EVLNSYG-RRANDNLLLDYGF 266


>gi|356530082|ref|XP_003533613.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 482

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 100/425 (23%), Positives = 170/425 (40%), Gaps = 74/425 (17%)

Query: 89  PEGLGLVAQRDIA---KNEVVLEVPMKFWIN----------PDTVAASE-IGSLCSGLKP 134
           P+G G+ A +D+    +  VV+E+P++  +           PD +     I  + +   P
Sbjct: 97  PDGFGVYASKDVEPLRRARVVMEIPLELMLTISKKLPWMFFPDIIPLDHPIFDIINSTNP 156

Query: 135 ---W-ISVALFLIREKKKEDSPWRVYLDILP---ECTDSTVFWSEEELVELQGTQLLSTT 187
              W + +A  L+     E + W++Y D LP   ECT   +  SEEEL+ELQ + L ST 
Sbjct: 157 ETDWDLRLACLLLYAFDCEGNFWQLYGDFLPSADECTSLLLA-SEEELLELQDSDLASTI 215

Query: 188 LGVK----EYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR------L 237
              +    E+ ++ +       I   K+L   P     F WA GI +SR  +       L
Sbjct: 216 RKQQQRALEFWESNWHSAAPLKI---KRLAHDP---QRFAWAVGIAQSRCINMQTRIGAL 269

Query: 238 RGQNLVLIPLADLINHS--PGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQ 295
             +  +LIP AD++NHS  P      + +  K      R L   +     ++ G+++ + 
Sbjct: 270 NQEANMLIPYADMLNHSFEPNCF---FHWRFK-----DRMLEVLINAGQRIRKGDEMTVN 321

Query: 296 YDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIV 355
           Y   + N      YGF    S  N +        +       + I   +GL E  Y +  
Sbjct: 322 YMSTQKNDMFMQRYGF---SSPVNPWDEIKFSGNARIHLDSFVSIFHISGLPEEYYHNNC 378

Query: 356 LGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKS 415
           L                    D  ++ +      W   D+P   + E    + ++D C+ 
Sbjct: 379 LSN------------AGDSFVDGAVIAAARTLPSWSDGDVPPIPSVERKAAKELQDECQK 426

Query: 416 ALSGFHTTIEEDEQLLQGGNLDPR-LRIAVGVRAGEKRVLQQIDEIFKERELELDELEYY 474
            L+ F TT ++D++LL       R L  A+  R   K  ++++          +  LE Y
Sbjct: 427 MLAAFATTSKQDQKLLGSMTDATRTLEAAIKYRLHRKLFMEKV----------IQALEMY 476

Query: 475 QERRL 479
           QE+ L
Sbjct: 477 QEQIL 481


>gi|323448473|gb|EGB04371.1| hypothetical protein AURANDRAFT_67276 [Aureococcus anophagefferens]
          Length = 476

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 92/235 (39%), Gaps = 27/235 (11%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWI----NPDTVAASEIGSLCSGLKPWISVALFLIREKKK 148
           G+VA RD A  + V  VP    +      D    +E+  +     P + VAL L+  ++ 
Sbjct: 137 GVVATRDFAVGDTVFSVPKAECVLSEGRADASPVAEVWRVLPAPAPHVRVALLLLYLERV 196

Query: 149 EDSPWRVYLDILPECTDSTV-----FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           E   W   LD+LP   D +       W+  E+ E Q  QL+   +G +    +       
Sbjct: 197 ERKAWAPLLDVLPSPADLSAPGPMRLWAPAEVAETQDPQLVE-RVGAQVAADDAAYDDVV 255

Query: 204 EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQ------NLVLIPLADLINH-SPG 256
                    F  P T D F  A   + SRA      +      + +L+PLADL NH  P 
Sbjct: 256 VPGW-TAADFGDPPTRDAFKIAVATITSRAKGERTAKAQRERSSYMLVPLADLCNHRDPA 314

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
            +    A    G         F +     V+AGE++ + Y    + A LA  +GF
Sbjct: 315 GSNAAEALAPWG--------HFVVAASRRVRAGEEIRVSYGALPNRALLA-QFGF 360


>gi|159474448|ref|XP_001695337.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158275820|gb|EDP01595.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 360

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/233 (25%), Positives = 102/233 (43%), Gaps = 37/233 (15%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSP 152
           GL   + + K EV++ +P    ++   VAA+E              +  L++E     S 
Sbjct: 57  GLYTTQPVKKGEVIVSIPQHIVLSVKNVAAAE-------------ASPQLLKEIHSPCSR 103

Query: 153 WRVYLDIL--PECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLKV---EEE 204
            R YLD L  P+   +   W EE +  L    +   L  +   K + +N +L     E E
Sbjct: 104 LRPYLDTLPGPDGVLTAYNWPEEYIKYLADPAMEEQLKNSF--KLHARNTWLGHNDDEME 161

Query: 205 IILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
           + +P + +  + ITL ++     +L SR FS +R   L L+P+ DL+NH           
Sbjct: 162 VTIP-EAIGRKNITLKEWEHVVSLLSSRTFS-IRKGALSLVPVLDLVNH----------- 208

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSD 317
           +++            L     + AGEQV I Y  +  N EL + YGF+++ ++
Sbjct: 209 DVRDINQLGNSSTVDLVAGKDLAAGEQVTITYG-SMRNDELLMYYGFVDTVTE 260


>gi|432119027|gb|ELK38252.1| SET domain-containing protein 4 [Myotis davidii]
          Length = 339

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 98/234 (41%), Gaps = 43/234 (18%)

Query: 102 KNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP----WISVALFLIREKKKED-SPWRVY 156
           + +V++ +P    +  DTV  S +G+  +  +P     +++  FL+ EK   D SPW+ Y
Sbjct: 32  EGQVIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVAEKHAGDRSPWKPY 91

Query: 157 LDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP-- 214
           L++LP+     V       +E +   LL   L  K   Q   ++   E+   ++  F   
Sbjct: 92  LEVLPKAYTCPV------CLEPEVVALLPRPLEAKAREQRTRVR---ELFTSSRGRFSSL 142

Query: 215 RPI---------TLDDFLWAFGILRSRAFSRLRGQ---------NLVLIPLADLINHSPG 256
           +P+         +   F WA+  + +RA    RG+            L P  DL+N+SP 
Sbjct: 143 QPLLSEAAASVFSYRAFRWAWCTVNTRAVYMERGRRQGLSAEPDTCALAPYLDLLNNSPA 202

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
           +  +        A        + +RT    +  E+V I Y  + S   L L+YG
Sbjct: 203 VQVK--------AAFNEETRCYEIRTGSGCRRHEEVFICYGPHDSR-RLLLEYG 247


>gi|412991339|emb|CCO16184.1| predicted protein [Bathycoccus prasinos]
          Length = 519

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/264 (22%), Positives = 113/264 (42%), Gaps = 47/264 (17%)

Query: 83  IRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-ISVALF 141
           +RP+T  +G GL A R + K+E VL +P++  I  +     E         PW + +A  
Sbjct: 83  VRPSTSGKGRGLEATRLVEKDECVLTLPLRSGIVDEAKGHPEHTREVIEKAPWGVRLACR 142

Query: 142 LIREKKK-EDSPWRVYLDILPECTDST-VFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
           L++E+KK  +S +  YL+++PE  +++ + ++ EE+  +    +      +++ V+  Y 
Sbjct: 143 LLQERKKGAESAYAAYLELIPENVETSPLHYASEEVSRICYPPMEKEIEEMRKAVKKWY- 201

Query: 200 KVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN-----LVLIPLADLINHS 254
              +++     +      + ++F  A  ++ SR +    G         L+PLADL+NH 
Sbjct: 202 ---DDLNAGEGKEALAGASEEEFKCAVAVVHSRTYGVSSGDTGEGYFRALLPLADLLNHG 258

Query: 255 PG----------------------ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                                   IT E+   EI            +      ++ GE+ 
Sbjct: 259 GDEYIDETRSSTSTVSTETVAWSEITDEEDESEI------------AFTAQKTLEPGEEA 306

Query: 293 LIQYDLNKSNAELALDYGFIESKS 316
           L+ Y   +SN    L YGF+  K+
Sbjct: 307 LMSYG-ERSNDHFLLYYGFVPRKN 329


>gi|255073657|ref|XP_002500503.1| predicted protein [Micromonas sp. RCC299]
 gi|226515766|gb|ACO61761.1| predicted protein [Micromonas sp. RCC299]
          Length = 580

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/303 (21%), Positives = 118/303 (38%), Gaps = 70/303 (23%)

Query: 89  PEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKK 148
           P G GLVA   +A  +V++ VP +     + +   E G    G    +++ L  + +   
Sbjct: 89  PTGRGLVATAPVAPGDVLITVPWR-----EAIHVLEDG-YSDGDDLRLALELLHVLDDGD 142

Query: 149 EDS-----------PWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNE 197
           ED             WR Y  +LP  T +  FW  + + ELQ  + +  TL  +      
Sbjct: 143 EDGYIDGEYDARVKTWRKYRPMLPVSTGAAAFWCVDNIRELQFPEAVEKTLAQRAEFTAG 202

Query: 198 YLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL--RGQNLVLIPLADLINHSP 255
            ++   +             T +  +WA   + SR+FS +  +G+   L+P  DL NH P
Sbjct: 203 AMRHSND-----------SRTKERIMWALQQVHSRSFSVMTPQGRARCLVPYVDLFNHRP 251

Query: 256 ---------------------------GITTEDYA----YEIKGAG-----LFSRDLLFS 279
                                      GI   ++A    +++ G        +    +F 
Sbjct: 252 ESPKEARMTDELLQRALRKAFTKGSEEGINDGEWAGAEPWQVVGPDPEDLENWEDAAVFQ 311

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDR---NAYTLTLEISESDPFFGD 336
           +R+    + GE+V I Y  ++++AEL   YGF    +D      Y    ++ + D F  +
Sbjct: 312 MRSIWAYEPGEEVFITYG-HETSAELLTSYGFFPEPNDGEFIRVYENVQDLIDDDRFVDE 370

Query: 337 KLD 339
           +L+
Sbjct: 371 ELN 373


>gi|143584415|sp|Q5ZK17.2|SETD6_CHICK RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
          Length = 447

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/280 (26%), Positives = 114/280 (40%), Gaps = 48/280 (17%)

Query: 60  PTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           PTA    F  W     V +SPK  I       G GL+A  D+   E++  VP    ++  
Sbjct: 17  PTA---GFVAWCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALLSQH 73

Query: 119 TVAASEI-----GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWS 171
           T A   +      SL S    W+ + L L+ E     S WR Y  +  + +  D  +FW 
Sbjct: 74  TCAIRALLHDAQESLQSQ-SVWVPLLLALLHEYTTGTSRWRPYFSLWQDFSSLDHPMFWP 132

Query: 172 EEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDF 222
           EEE V  LQGT       G+ E V  +   ++ E   IILP    +  +F P   TL+ +
Sbjct: 133 EEERVRLLQGT-------GIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELY 185

Query: 223 LWAFGILRSRAFSR--------LRGQN-LVLIPLADLINHSPGITTEDYAYEIKGAGLFS 273
                 + + +F           +G N  +++P+AD++NH               A L  
Sbjct: 186 KQLVAFVMAYSFQEPLEEEDEDEKGPNPPMMVPVADILNHVAN----------HNASLEY 235

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
                 + T  P+  G+++   Y    +N +L   YGF E
Sbjct: 236 APTCLRMVTTQPISKGQEIFNTYG-QMANWQLLHMYGFAE 274


>gi|158508540|ref|NP_001025734.2| N-lysine methyltransferase SETD6 [Gallus gallus]
          Length = 447

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/280 (26%), Positives = 114/280 (40%), Gaps = 48/280 (17%)

Query: 60  PTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           PTA    F  W     V +SPK  I       G GL+A  D+   E++  VP    ++  
Sbjct: 17  PTA---GFVAWCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALLSQH 73

Query: 119 TVAASEI-----GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWS 171
           T A   +      SL S    W+ + L L+ E     S WR Y  +  + +  D  +FW 
Sbjct: 74  TCAIRALLHDAQESLQSQ-SVWVPLLLALLHEYTTGTSRWRPYFSLWQDFSSLDHPMFWP 132

Query: 172 EEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDF 222
           EEE V  LQGT       G+ E V  +   ++ E   IILP    +  +F P   TL+ +
Sbjct: 133 EEERVRLLQGT-------GIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELY 185

Query: 223 LWAFGILRSRAFSR--------LRGQN-LVLIPLADLINHSPGITTEDYAYEIKGAGLFS 273
                 + + +F           +G N  +++P+AD++NH               A L  
Sbjct: 186 KQLVAFVMAYSFQEPLEEEDEDEKGPNPPMMVPVADILNHVAN----------HNASLEY 235

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIE 313
                 + T  P+  G+++   Y    +N +L   YGF E
Sbjct: 236 APTCLRMVTTQPISKGQEIFNTYG-QMANWQLLHMYGFAE 274


>gi|255553959|ref|XP_002518020.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223543002|gb|EEF44538.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 471

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/211 (25%), Positives = 92/211 (43%), Gaps = 16/211 (7%)

Query: 53  ASATTNPPTAQVETFWQWLRDQKVVSPKSPIRPATFPE--GLGLVAQRDIAKNEVVLEVP 110
           AS+TT     ++  F +W++ Q +      +     P+  G+ + A R + + EVV  +P
Sbjct: 2   ASSTTR----RLRAFKRWMKSQGISWCSDALELIDAPDQDGIFVKALRALKEGEVVASIP 57

Query: 111 MKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVF 169
               +   T  A  I    S     + ++  L+ E      SPW  YL +LP+     + 
Sbjct: 58  KAACLTSRTSGARHIIEATS-FTGCLGLSFALMYEISLGHLSPWASYLHLLPDSECLPLV 116

Query: 170 WSEEEL-VELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP---NKQLFPRPITLDDFLWA 225
           W+ +E+   L GT+L       K  + +++    +E ILP      L P+      +  A
Sbjct: 117 WTLDEVDYFLSGTELHKIVKEDKALIYDDW----KECILPLVDVHHLNPQYFGAHQYFAA 172

Query: 226 FGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
             ++ SR+F       + ++PLADL NH  G
Sbjct: 173 RTLIASRSFQIDDYHGIGMVPLADLFNHKTG 203


>gi|14596097|gb|AAK68776.1| Unknown protein [Arabidopsis thaliana]
 gi|18377562|gb|AAL66947.1| unknown protein [Arabidopsis thaliana]
          Length = 364

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 84/355 (23%), Positives = 140/355 (39%), Gaps = 50/355 (14%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV- 194
           + + L L++E+   DS W  Y+  LPE     +F+  E++  LQ   LL        ++ 
Sbjct: 1   MKLGLRLLQERANADSFWWPYISNLPETYTVPIFFPGEDIKNLQYAPLLHQVNKRCRFLL 60

Query: 195 --QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL---------- 242
             + E  +  E++   +     + +      W    + +RAF RL G             
Sbjct: 61  EFEQEIRRTLEDVKASDHPFSGQDVNASALGWTMSAVSTRAF-RLHGNKKLQGGSSDDVP 119

Query: 243 VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSN 302
           +++PL D+ NHS     +  A  I+       + L  +     VK  + +L+ Y    SN
Sbjct: 120 MMLPLIDMCNHS----FKPNARIIQEQNGADSNTLVKVVAETEVKENDPLLLNYGC-LSN 174

Query: 303 AELALDYGF---------IESKSDR---NAYTLTLEISE---SDPFFGDKLDIAETNGLG 347
               LDYGF         IE K D    +A ++   +S    S P       +++ N  G
Sbjct: 175 DFFLLDYGFVIESNPYDTIELKYDEQLMDAASMAAGVSSPKFSSPAPWQHQLLSQLNLAG 234

Query: 348 ESAYFDIVLGRTLP--------PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSH 399
           E     + +G   P          +L    LV +   D+  L+S+           P   
Sbjct: 235 EMPNLKVTIGGPEPVEGRLLAALRILLCGELVEVEKHDSDTLKSLSAVA-------PFGI 287

Query: 400 ANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGEKRVL 454
           ANE  + R V   C  ALS F T I EDE +++ G +     +++  R  +K V+
Sbjct: 288 ANEIAVFRTVIALCVIALSHFPTKIMEDEAIIKQG-VSATAELSIKYRIQKKSVI 341


>gi|145349778|ref|XP_001419305.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579536|gb|ABO97598.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 457

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 93/419 (22%), Positives = 164/419 (39%), Gaps = 56/419 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-ISVALFLIREKK-K 148
           G GL A R +   E VLE+ +   I  D     E         PW + +A  L++EKK  
Sbjct: 45  GRGLEATRALRAGEGVLELKLASGIVDDAKGHPESARDAMKEAPWGVRLACRLLQEKKLG 104

Query: 149 EDSPWRVYLDILPE-CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           E S +  Y   LPE    S + + E+ + ++Q    +S    ++   +  +  + E+   
Sbjct: 105 EGSAYAAYARTLPERVPTSPIHYDEKAIADVQYPPAMSEIREMQAACRKWHETLREK--- 161

Query: 208 PNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN-----LVLIPLADLINHSPGITTEDY 262
             + L       + F  A G++ SR +     ++      VL+PLAD++NH   I T   
Sbjct: 162 APEALGDAYFDYEAFANAVGVVHSRTYGVASAEDNAGYFRVLLPLADMLNHGGDIVTSLT 221

Query: 263 AYEIKG---------------AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELAL 307
             E  G               + L + + +        ++ GE+ L+ Y   +SN    +
Sbjct: 222 RDETTGELTDMTTAATDNIAWSTLDAEEGVIQFAATRDIEEGEEALMSYG-ERSNDHFLI 280

Query: 308 DYGFIESKSDRNAYTL--TLE-------ISESDPFFGDKLDIAE----------TNGLGE 348
            YGF    +  +   L   LE       ++  + + G + +I E          T  L  
Sbjct: 281 YYGFAPDNNPHDDCVLFSNLEHAMVWHSVAHPELWDGPEGEIREKAANAAYDEVTKALEA 340

Query: 349 SAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLE---SIFRNT-IWGHLDLPVSHANEEL 404
               D  L    P      L+ +  G  DA LL    +++  T   G ++ P+     + 
Sbjct: 341 EGSADAKLAAAEP-----RLKALGAGRIDARLLSAFAAMYAGTEASGDINGPICGDELKF 395

Query: 405 ICRVVRDACKSALSGFHTTIEED-EQLLQGGNLDPRLRIAVGVRAGEKRVLQQIDEIFK 462
               +   C+  L+   T++ ED + L  G  +D   R+    RA +K++L +  + FK
Sbjct: 396 ARADIAARCEQLLAQMPTSLAEDVDALRSGACVDDAERVRFIYRACKKKILHETLDYFK 454


>gi|291390222|ref|XP_002711632.1| PREDICTED: SET domain containing 6 [Oryctolagus cuniculus]
          Length = 817

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/275 (24%), Positives = 120/275 (43%), Gaps = 43/275 (15%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-----GSLCSG 131
           +SPK  +       G G+VA+  + + E++  VP    ++  T +  ++     G+L S 
Sbjct: 403 LSPKVAVSRQGTVAGYGMVARESVQRGELLFAVPRAAILSQHTCSIGDLLERERGALQS- 461

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTL 188
              W+ + L L+ E +   SPW  Y  + PE    +  +FW EEE    LQGT       
Sbjct: 462 QSGWVPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGT------- 514

Query: 189 GVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDFLWAFGILRSRAFSRLRG- 239
           GV E V+ +   +  E   I+LP    +  LF P+  +L+ +     ++ + +F      
Sbjct: 515 GVPEAVEKDLASIRSEYYSIVLPFMEAHPDLFSPKVHSLELYHQLVALVMAYSFQEPLEE 574

Query: 240 -------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                   + +++P AD++NH   +   +   E      +S D L  + T  P+  G ++
Sbjct: 575 EEDEKEPNSPLMVPAADILNH---LANHNANLE------YSADYLRMVATQ-PIPKGHEI 624

Query: 293 LIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
              Y    +N +L   YGF+E   D    T  +++
Sbjct: 625 FNTYG-QMANWQLIHMYGFVEPYPDNTDDTADIQM 658


>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 693

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 89/202 (44%), Gaps = 31/202 (15%)

Query: 128 LCSGLKPWISVALFLIREKKKED-SPWRVYLDILPECTD--STVFWSEEELVELQGTQLL 184
           L  G++  I +ALF+  ++ KE  S W  YL  LP  ++  S +F+  ++L  LQGT L 
Sbjct: 98  LSQGVRKTIILALFVAHQQLKEKGSHWWPYLATLPRASELTSALFYHGDDLEWLQGTNLY 157

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-SRLRGQNL- 242
            T       V+ EY      I+     L     + D F WA+ ++ SRAF SR+    L 
Sbjct: 158 QTHQAYMNAVKEEY-DSAISILRDEGCLAAELYSWDLFCWAYTVIASRAFTSRVLSVYLS 216

Query: 243 ------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                       +L+PL D  NH P    E   +  + A +        L+   P+ + E
Sbjct: 217 RNPALKQDEEFQILLPLVDSSNHKPLAKIE---WRAEAAEI-------GLKVVEPIVSEE 266

Query: 291 QVLIQYD-LNKSNAELALDYGF 311
           ++   Y  LN  N +L   YGF
Sbjct: 267 EIHNNYGPLN--NQQLMTTYGF 286


>gi|171676308|ref|XP_001903107.1| hypothetical protein [Podospora anserina S mat+]
 gi|170936220|emb|CAP60879.1| unnamed protein product [Podospora anserina S mat+]
          Length = 495

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 106/447 (23%), Positives = 181/447 (40%), Gaps = 103/447 (23%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMK-----------------FWINPDTVAASEIGSLCSGLK 133
           G G+VAQ DIA + V+  +P                   F ++ D    S   S      
Sbjct: 45  GRGIVAQADIAADTVLFTIPRNSILCAATSPLKDILPEIFDLDNDDEDESGDESDGDNQN 104

Query: 134 PWISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKE 192
            W  + L LI E  +   S W+ YLD+LP   ++ +FW+  +L  LQ + + S  +G   
Sbjct: 105 SWTLLILILIHEYLQGSSSQWKPYLDVLPSTFNTPMFWTPSQLSFLQASAVTS-KIG--- 160

Query: 193 YVQNEYLKVEEEIILPNKQLFPR--------PITLDDFL------------WAFGILRSR 232
             Q E  K+    ILP  +  P+        P++ D  +            +AF + +  
Sbjct: 161 --QEEADKMIASKILPVIRSHPQIFFPSSATPLSDDQLIQLAHRMGSTIMSYAFDLEQDM 218

Query: 233 AF------------SRLRGQNLVLIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLLFS 279
                          R     L ++P+AD++N         ++A +   A         S
Sbjct: 219 EIPEQLENDDEWEEDREGKTMLGMVPMADILNADAEFNAHINHAEDALTA--------VS 270

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEIS----------- 328
           LR   P++ GE++L  Y    S+AEL   YG++   ++++A    +E+S           
Sbjct: 271 LR---PIRKGEEILNFYG-PLSSAELLRRYGYV---TEKHARWDVVELSWSLISSALQSK 323

Query: 329 -ESDPFFGDKL-DIAETNGLGESAYFDIVLGRTLP-PAMLQYLRLVA--------LGGTD 377
            ++ P   +K+  + + +   E A+   VL R  P P+    L   A        LG   
Sbjct: 324 LQAQPEVWEKVCALVQQDEEFEEAF---VLERQSPDPSSTGLLEEPAEFTSLPDELGEQF 380

Query: 378 AFLLESIFRNTIWGHLDLPVSHANEE-----LICRVVRDACKSALSGFHTTIEEDEQLLQ 432
              L++  + T   + D+ +   N++     L    + +A       + TT+E+D+Q L 
Sbjct: 381 KLCLKACKKITDSKN-DMALRALNDKDWRKRLYLETILEAVTQREREYGTTLEQDDQFLS 439

Query: 433 GGNLDPRLRIAVGVRAGEKRVLQQIDE 459
               D R ++AV VR GEK VL++  E
Sbjct: 440 ANPSDQREKMAVRVRRGEKLVLREARE 466


>gi|346320403|gb|EGX90003.1| SET domain protein [Cordyceps militaris CM01]
          Length = 381

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 107/234 (45%), Gaps = 36/234 (15%)

Query: 91  GLGLVAQRDIAKNEVVLEVP---MKFWIN-PDTVAASEIGSLCSGLKPWISVALFLIREK 146
           G+G+VA R + +NEV+L VP   ++   N P +V A   G+    +   ++ AL      
Sbjct: 31  GVGIVATRALKENEVILRVPTATLRSLANTPPSVLARLPGASVHAI---LATALCF---D 84

Query: 147 KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           +   + +R + ++LP   D           EL+   LL  T       Q+     +  ++
Sbjct: 85  EASSASFRAWRNVLPSRGDVRAVQPLCWPAELRA--LLPPTAAALLAKQSATFHKDWAVV 142

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAF--------SRLRGQNLVLIPLADLINHSP-GI 257
              +  +   ++ +DFL+A+ ++ +R+F         R +  ++VL P+ DL NH+P G 
Sbjct: 143 ---EAAYAGSLSREDFLYAWLLVNTRSFYHTTPTTAQRPKEDHMVLQPVVDLFNHAPTGF 199

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
                      AG F  D  F++ T     AGE++ I+Y  +  N  L ++YGF
Sbjct: 200 C----------AGAFD-DAAFTITTQAAHAAGEELFIKYG-SHGNDFLLVEYGF 241


>gi|255723423|ref|XP_002546645.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
 gi|240130776|gb|EER30339.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
          Length = 428

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 64/284 (22%), Positives = 114/284 (40%), Gaps = 62/284 (21%)

Query: 74  QKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA----------- 122
              +SPK  ++ A    G G+ A   + K E+++ +P  F +N  TV A           
Sbjct: 27  HSYISPKIDVKDAR-SSGRGIYATNSLKKGELIMNIPHSFLLNFTTVMAHISRYNGMQDE 85

Query: 123 ------------SEIGSLCSGL--------KPWISVALFLIREKKK-EDSPWRVYLDILP 161
                        +  ++ S L          +  ++++L  EK++  +S W+ +LD+LP
Sbjct: 86  SHIYVPFDNSDGDQFTNIYSKLTREEILELSSFQLLSIYLTFEKQRGTNSFWKPFLDMLP 145

Query: 162 ECTDST---VFWSEEELVELQGTQLLSTTLGVKEYVQNEYL----KVEEEIILPNKQLFP 214
              D     + WS+E   +L       ++L V++  +N+Y      ++ +  L    L P
Sbjct: 146 SMEDFALMPIHWSDET-CKLAPDSTQKSSLKVRDRFENDYKLICDLIQTKTDLDVTTLLP 204

Query: 215 RPITLDDFLWAFGILRSRAF------SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKG 268
           R     D L ++  + SR        S+    N  + P  D +NHS     +    +I G
Sbjct: 205 RQ----DVLLSWLCINSRCLYMNLPTSKNTADNFTMAPYVDFMNHS---CDDHCTLKIDG 257

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
            G       F + +       +QV + Y  + SN  L  +YGFI
Sbjct: 258 KG-------FQVSSTCSYNIDDQVYLSYGPH-SNDFLLCEYGFI 293


>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
          Length = 353

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/278 (23%), Positives = 113/278 (40%), Gaps = 38/278 (13%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-GSLCSGLKPWISV---ALFLIREK 146
           G GL   + +   E  L VP    I   T  +S + G L    +   ++    LFLI EK
Sbjct: 53  GRGLATLQKVTAGETFLRVPTCLLITTTTALSSSLHGFLVRHHRQLTAIEVLTLFLINEK 112

Query: 147 -KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVE--- 202
            +  DS WR ++D LP    + VF   + L  L  T        V   ++  +++++   
Sbjct: 113 LRGLDSEWRFFIDSLPVSYTTPVFLGSKLLARLPETMCRKAEAQVSR-IRRTFVRLQILL 171

Query: 203 EEIILPNKQL--FPRPITLDDFLWAFGILRSRA-FSRLRGQN-------LVLIPLADLIN 252
           +  +L +  L       T   F+WA+  + +R  FS+ R  +         L P  D +N
Sbjct: 172 KRALLDDSALLNLSENFTWHLFVWAWTAVNTRCIFSKHRTDHSFWDDDYCALAPFLDCLN 231

Query: 253 HSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           H       D    ++G+        F + T    +  +QV I Y  +  N +L L+YGF+
Sbjct: 232 HH---WKADVETTVEGS-------YFEIVTNNNYEPNDQVFISYG-SHDNKKLLLEYGFV 280

Query: 313 ESKSDRNA--------YTLTLEISESDPFFGDKLDIAE 342
            + +  +         Y L  + +++  +F  KL   E
Sbjct: 281 LADNPNDVVAITKGHLYKLNSQQNDTVLYFATKLSFLE 318


>gi|449303172|gb|EMC99180.1| hypothetical protein BAUCODRAFT_120475 [Baudoinia compniacensis
           UAMH 10762]
          Length = 492

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/220 (27%), Positives = 92/220 (41%), Gaps = 9/220 (4%)

Query: 126 GSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
           G++ S L    SV+   + E+    SP+  Y+  LP     T FW+  EL  L GT L  
Sbjct: 101 GAILSFLLVQASVSCPGLSERVGVHSPFTDYVKSLPSELLPT-FWTPAELELLVGTTLAP 159

Query: 186 TTLGVKEYVQNEY-LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVL 244
                   ++ EY L  E         L    +T DD+L    + RSRA          +
Sbjct: 160 AMSSKLRSLRREYDLLCERGANTRWYGLVEDQLTFDDWLQVDAMYRSRALD-YPAIGHCM 218

Query: 245 IPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAE 304
           +P  DL NH+ G  T    Y+    G    +    +R    +K G++V I Y   K   E
Sbjct: 219 VPCLDLANHAAGEETV-AIYDKDADG----NATLLIRNDKTLKEGDEVTISYGDEKGACE 273

Query: 305 LALDYGFIES-KSDRNAYTLTLEISESDPFFGDKLDIAET 343
           +   YGF+E+ +       L+L I  +D +   K+ +AE 
Sbjct: 274 MLFSYGFLEADRKSAETLFLSLSIPNNDSYRLPKIKVAEC 313


>gi|72389450|ref|XP_845020.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62176703|gb|AAX70803.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801554|gb|AAZ11461.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 586

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 48/101 (47%), Gaps = 10/101 (9%)

Query: 216 PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           P T+  F+WA+  L SR FS    +   +IP  D  NHS             GA +F   
Sbjct: 254 PFTMQQFIWAYNTLMSRGFS-YDPEVWAVIPWVDYFNHSLTNNATMRFDRCMGAYIFV-- 310

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
                 T  PV  G+QV +QY  + ++AEL L YGFI + S
Sbjct: 311 ------TTAPVSKGDQVFLQYG-SYTDAELVLWYGFITTPS 344


>gi|255719552|ref|XP_002556056.1| KLTH0H04004p [Lachancea thermotolerans]
 gi|238942022|emb|CAR30194.1| KLTH0H04004p [Lachancea thermotolerans CBS 6340]
          Length = 585

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 81/196 (41%), Gaps = 52/196 (26%)

Query: 156 YLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEY-------------------VQN 196
           Y+D LP+  DS + W+  EL +L G   L ++L +K Y                   VQ 
Sbjct: 102 YIDALPDEIDSPLVWNPSEL-DLLGNTNLRSSLRIKLYSIFNEWKLIMETLKKHRNEVQA 160

Query: 197 EYLKVEEEIILPNKQLFPRPIT--------------LDDFLWAFGILRSRAFSRL----- 237
           E L +EE +      ++ R IT                 FLW+  +  SRAF        
Sbjct: 161 EILNIEETLGQSEDHVY-RNITSKVFQHSSETDWWSFPAFLWSHMMFLSRAFPEYVINPS 219

Query: 238 -RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQY 296
               N+VL+P+ DL+NH       DY  +++      RD  F +R    V  GE++   Y
Sbjct: 220 TDPSNVVLLPIIDLLNH-------DYRSKVEWN---QRDGAFGVRKLETVLRGEEIFNNY 269

Query: 297 DLNKSNAELALDYGFI 312
              K N EL   YGF+
Sbjct: 270 G-GKGNEELLSGYGFV 284


>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 497

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 25/119 (21%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI---------GSLCSGLKP------- 134
           G G+V   DI  + V+  +P +  I P   A SE+         G   SG  P       
Sbjct: 40  GRGIVTTADIEPDTVLFTIPRQHIIYP---ANSELARKLPAVFAGVGGSGPTPDVDDDDG 96

Query: 135 -----WISVALFLIREK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTT 187
                W S+ L +I E  +   SPWR YLD+LP   ++ +FWS  EL ELQ + ++++ 
Sbjct: 97  RAQDSWTSLILVMIHEHLRGSASPWRPYLDVLPARFETPMFWSAAELAELQASPVVASV 155



 Score = 46.2 bits (108), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 39/64 (60%), Gaps = 2/64 (3%)

Query: 395 LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ--GGNLDPRLRIAVGVRAGEKR 452
           +P   A ++L+   VRDA  + L  + T++EED   L   GG    R+R+A+ VR GEKR
Sbjct: 403 VPDKKARDDLMLGAVRDAIDAKLREYGTSLEEDAATLAGLGGGASARVRMALEVRIGEKR 462

Query: 453 VLQQ 456
           +L++
Sbjct: 463 LLRE 466


>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
          Length = 428

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 73/291 (25%), Positives = 114/291 (39%), Gaps = 49/291 (16%)

Query: 88  FPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS----GLKPWISVALFL 142
           FP+ G G+VA   +   E  L++P    I+  T   S + S  +     L P   + LF+
Sbjct: 51  FPDTGRGVVALEKLVGGETFLKLPTSLLISTRTALQSLLHSFITRYHAKLTPIDVLTLFV 110

Query: 143 IREKK-KEDSPWRVYLDILPECTDSTVFWSE----------EELVELQGTQLLSTTLGVK 191
           + +K   E S W  ++D LP    + VF              E V  + T +  T L +K
Sbjct: 111 LDQKLLGEASRWWPFVDSLPRTFTTPVFLRRTVFESLPKDLREEVHTRITSIQRTFLKLK 170

Query: 192 EYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF--------SRLRGQNLV 243
             +      VEEE   P  Q      T ++F+WA+  + +R          S     +  
Sbjct: 171 VLLGGH---VEEE---PEVQSLSTGFTWNNFVWAWTAVNTRCIFAQGSNSSSLWENDHCA 224

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           L P  D +NH        +   I+ A +      F + +     A EQV I Y  + SN 
Sbjct: 225 LAPFLDCLNH-------HWKASIETAMVGEN---FEILSHKSHDANEQVFISYGPH-SNR 273

Query: 304 ELALDYGFIESKSDRNAYTLTLE--------ISESDPFFGDKLDIAETNGL 346
            L LDYGF+   +  +   +T +           + P F  KLD  E+  +
Sbjct: 274 RLFLDYGFVLPDNPNDVVVVTRDHLVKLYSLHENTMPHFQSKLDFLESKNV 324


>gi|308806125|ref|XP_003080374.1| tryptophan synthase beta-subunit (ISS) [Ostreococcus tauri]
 gi|116058834|emb|CAL54541.1| tryptophan synthase beta-subunit (ISS) [Ostreococcus tauri]
          Length = 863

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 85/212 (40%), Gaps = 35/212 (16%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTD---STVFWSEEELVELQGTQLLSTTLGVKE 192
           ++ AL+++   +     W  Y   LP   D   S +  +EEEL  +Q   + +    ++ 
Sbjct: 500 LACALWVLFATRAGGEVWESYASWLPNKRDGLPSMLLAAEEELEMMQNESMTARARELRG 559

Query: 193 YVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQN---------LV 243
            V+  + +V      P     P  +TL+D  WA+ ++ SRA +   G++          V
Sbjct: 560 LVRKAFDRV------PFAHGIPGSVTLEDLSWAYALVTSRAIASDVGKDPDGVDDTQVAV 613

Query: 244 LIPLADLINH--SPGITTEDYAYEIKGAGL-------------FSRDLLFSLRTPVPV-K 287
           + P  D+ NH     +T         G GL                     L T  P+  
Sbjct: 614 MAPCVDMANHVDVTNVTALKKIGATDGGGLRGAYWRLMTGGSVDGGGGACCLETNRPITS 673

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRN 319
           A E+V I Y  + +N EL + YGF   K +RN
Sbjct: 674 ADEEVTISYQPDATNDELMVSYGF-SLKGNRN 704


>gi|268535512|ref|XP_002632889.1| C. briggsae CBR-SET-29 protein [Caenorhabditis briggsae]
          Length = 319

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/206 (22%), Positives = 88/206 (42%), Gaps = 28/206 (13%)

Query: 123 SEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ 182
            ++      LKP   + +F   E   E S W  YL +LP+  D+  F   +  V      
Sbjct: 59  KKMAKFTEKLKPMEILTMFFCFEDF-ETSAWSPYLKVLPKEFDTPAFKGIDYDVNTLPLS 117

Query: 183 LLSTTLGVKEYVQ------NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-- 234
           +    +  K+ +        ++ +V ++I+   ++LFP  +T D  LWA+ ++ +R    
Sbjct: 118 IRKFWVDQKKEISEISEKVGDHYEVRKKIVFQLRRLFPE-LTHDKILWAWHVVNTRCIFV 176

Query: 235 --------SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF-SRDLLFSLRTPVP 285
                       G  + +IP  D++NH P         + +G  +   R+  + ++    
Sbjct: 177 ENEEHDNVDNSDGDTIAVIPYVDMLNHDPQ--------KYQGVAIHEKRNGRYVVQAKRQ 228

Query: 286 VKAGEQVLIQYDLNKSNAELALDYGF 311
           +  GEQV + Y  +  NA L ++YGF
Sbjct: 229 IMEGEQVFVCYGAH-DNARLLVEYGF 253


>gi|356499056|ref|XP_003518360.1| PREDICTED: LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 40-like
           [Glycine max]
          Length = 497

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 60/259 (23%), Positives = 102/259 (39%), Gaps = 44/259 (16%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC------SGLKPWISVALFLIREK 146
           GL A RD+ + E+VL VP    +  ++V   E   LC      S L P   + + L+ E 
Sbjct: 67  GLGAARDLGRGEIVLRVPKSALMTRESVMEDE--KLCDAVNRHSSLSPAQMLIVCLLYEM 124

Query: 147 KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
            K  S W  YL  +P+  D    + E E   LQ  + +  T       ++E+   E   +
Sbjct: 125 GKXTSRWHPYLVHMPQTYDILAMFGEFEKRALQVDEAMWVTEKAMLKAKSEW--KEAHAL 182

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGIT---TEDY 262
           + +    P+ +T   ++WA   + S+       +   L  + DL N+ +PG+     ED 
Sbjct: 183 MEDLMFKPQFLTFKAWVWAAATISSQTMHIPWDEAGCLCLVGDLFNYDAPGMEPSGIEDL 242

Query: 263 AYEIKGAGLFSRDLL-----------------------------FSLRTPVPVKAGEQVL 293
            + +  + +    LL                             +   +    K G+QVL
Sbjct: 243 EHFLSNSSIHDTSLLNGDNNIMMQTTLILTQRLTDGWFEEDVNAYCFYSRAHYKKGDQVL 302

Query: 294 IQYDLNKSNAELALDYGFI 312
           + Y +  +N EL   YGF+
Sbjct: 303 LCYGI-YTNLELVEHYGFL 320


>gi|222623626|gb|EEE57758.1| hypothetical protein OsJ_08284 [Oryza sativa Japonica Group]
          Length = 437

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 102/420 (24%), Positives = 170/420 (40%), Gaps = 76/420 (18%)

Query: 69  QWL-RDQKVVSPKSPIRPATFPE-GLGL---VAQRDIAKNEVVLEVPMKFWIN---PDTV 120
           +W+ R+   V P   +R    PE GLG+    A+ DI   +V++ +P +  +    P   
Sbjct: 59  RWVQREGGFVHPA--LRVVDHPEHGLGVSAAAAEGDIPPGDVLIALPGRLPLRLRRPAGA 116

Query: 121 AASEIGSLCSGLKP--W-ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           A + +  L   +    W + + L L++E+ K DS W  Y+  LPE     +F+  E++  
Sbjct: 117 ADAVLVQLADQVPEELWAMRLGLRLLQERAKSDSFWWPYIANLPETFTVPIFFPGEDIKN 176

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFPRPITLDDFLWAFGILRSRAF 234
           LQ   LL        ++     +V+ +   + L +     + +      WA     +RAF
Sbjct: 177 LQYAPLLHQVNKRCRFLLEFEKEVKHKLGTVPLEDHPFCGQDVNSSSLGWAMSAASTRAF 236

Query: 235 SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
            RL G+  +L+PL D+ NHS        A  ++   + S D+                  
Sbjct: 237 -RLHGEIPMLLPLIDMCNHS----FNPNARIVQEGNVDSPDMSV---------------- 275

Query: 295 QYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDI 354
                K+NA          +   +    LT ++  S  +  D L  ++ N  GE A   +
Sbjct: 276 -----KANA----------ASWQQEFLRLTFQLQPS--WQQDIL--SQLNLYGEGAILKV 316

Query: 355 VLG--RTLPPAMLQYLRLV------ALGGTDAFLLESIFRNTIWGHLDLPVSHANEELIC 406
            +G    +   +L  LR++      A+ G D   L S+           P+  A E    
Sbjct: 317 SIGGPEIVDGRLLAALRVIIAADPDAVSGHDLKTLMSLKEKA-------PLGPAVEASAL 369

Query: 407 RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL--RIAVGVRAGEKRVLQQIDEIFKER 464
           R V   C  AL  FHT I EDE +L+G   +P L   +AV  R  +K +L  + +    R
Sbjct: 370 RTVLALCTFALQHFHTKIMEDEAILKG---EPPLTTELAVQFRLQKKLLLLDVIQNLSRR 426


>gi|41054567|ref|NP_955894.1| N-lysine methyltransferase setd6 [Danio rerio]
 gi|82177062|sp|Q803K4.1|SETD6_DANRE RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|27882107|gb|AAH44440.1| SET domain containing 6 [Danio rerio]
          Length = 460

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 105/242 (43%), Gaps = 32/242 (13%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI---GSLC-SGLKPWISVALFLIREKKK 148
           G++A+ DI +  V+  +P +  ++  T    ++   G  C      W+ + L L+ E   
Sbjct: 50  GMLAKEDIEEGHVLFTIPREALLHQGTTKVKKVLEEGKKCLESASGWVPLLLSLMYEYTS 109

Query: 149 EDSPWRVYLDILPE--CTDSTVFWSEEELVEL-QGTQLLSTTLGVKEYVQNEY------- 198
             S W+ YL + P+    D  +FWSEEE  +L +GT +  + +     +Q+EY       
Sbjct: 110 STSHWKPYLSLWPDFRTLDQPMFWSEEECDKLLKGTGIPESVITDLRKLQDEYNSVVLPF 169

Query: 199 LKVEEEIILPNKQLFPRPITLDDFLWAFGI------LRSRAFSRLRGQNL-VLIPLADLI 251
           +K   ++  P K       +L  F+ A+                 +  NL +++P+AD++
Sbjct: 170 MKSHPDLWDPEKHNLELYKSLVAFVMAYSFQEPVEDDDEDEEDDEKKPNLPMMVPMADML 229

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH   I+  +   E     L     + S+R    +  GE+V   Y    +N +L   YGF
Sbjct: 230 NH---ISKHNANLEYTPECL----KMVSIRR---IGKGEEVFNTYG-QMANWQLLHMYGF 278

Query: 312 IE 313
            E
Sbjct: 279 AE 280


>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 269

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 64/261 (24%), Positives = 110/261 (42%), Gaps = 59/261 (22%)

Query: 241 NLVLIPLADLINHSPGITTEDY---AYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           N VLIPL D+ NH  G  T  Y   A +++   L               + GEQ+ I Y 
Sbjct: 38  NPVLIPLWDMANHVNGQITTGYNEAAQQVESLALGD------------YRKGEQIFIYYG 85

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLG 357
            N++NA+  +  GF+   +  +A  + L ++ ++  F  +  + E  GL  S  F++  G
Sbjct: 86  -NRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGLASSGDFNVQRG 144

Query: 358 RT---LPPAMLQYLRLVAL--------GGTDAFL----------LESIFRNTIWGHLDLP 396
                + P +L + R+  +         G DA            LE+  R  +W +L + 
Sbjct: 145 GGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQLLEPDCPGLEASLREKVWKYLSIR 204

Query: 397 VSHANEELICRVVRDACKSALSGFHTTIEEDEQLL--QGGNLDPRL----RIAVGVRAGE 450
           +                + AL    TT+++DE LL  QG     +L     + V  R  E
Sbjct: 205 L----------------QLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248

Query: 451 KRVLQQIDEIFKERELELDEL 471
           K++L +  E  K++E++  E+
Sbjct: 249 KKILSEALEHCKQKEVDKKEV 269


>gi|26325362|dbj|BAC26435.1| unnamed protein product [Mus musculus]
          Length = 162

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 136 ISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           I++A  L+ E+   +S W+ Y+  LP   D+ +++ EEE+  LQ TQ +       +   
Sbjct: 27  IALAFHLLCERASPNSFWQPYIQTLPSEYDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTA 86

Query: 196 NEY---LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF-------SRLRGQNLVLI 245
            +Y    KV +     NK       T +D+ WA   + +R         SR+    L LI
Sbjct: 87  RQYAYFYKVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRV---TLALI 143

Query: 246 PLADLINHSPGI 257
           PL D+ NH+ G+
Sbjct: 144 PLWDMCNHTNGL 155


>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 392

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 56/117 (47%), Gaps = 5/117 (4%)

Query: 87  TFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW-----ISVALF 141
            F EG GL A R+I   E+ L VP K  +  ++   S +GSL S  +       I++A  
Sbjct: 100 NFEEGFGLRATREIKAEELFLWVPRKLLMTVESAKNSVLGSLYSQDRILQAMGNITLAFH 159

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY 198
           L+ E+    S W  Y+  LP   D+ +++ E+E+  LQ TQ +       +    +Y
Sbjct: 160 LLCERANPSSFWLPYIQTLPSEYDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216


>gi|398405066|ref|XP_003853999.1| hypothetical protein MYCGRDRAFT_91971 [Zymoseptoria tritici IPO323]
 gi|339473882|gb|EGP88975.1| hypothetical protein MYCGRDRAFT_91971 [Zymoseptoria tritici IPO323]
          Length = 597

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 127/298 (42%), Gaps = 46/298 (15%)

Query: 59  PPTAQVETFWQWLRDQK-VVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP 117
           P T Q++T   WL D    V+P   +    + E  G+  + + A +   L    +    P
Sbjct: 2   PSTGQLQTLRLWLEDNGGYVNPAIDL---DWNETAGVHCRVNFASS---LGPDSRICTVP 55

Query: 118 DTVAASEIGSLCS---------GLKPWISVALFLIREK-KKEDSPWRVYLDILP--ECTD 165
            ++A S + +L           GL P      +L+ +   K  S WR YL  LP  E   
Sbjct: 56  HSLALSSLNALVDDSFSVFRNRGLAPEAIGYFYLMHQYINKAKSFWRPYLQTLPGPEHEH 115

Query: 166 STVFWSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLW 224
            T FW E+E ++ L  T +L TT   ++  +  Y K  +  +L   ++   P T     W
Sbjct: 116 LTPFWFEDEDLQWLADTDVLHTTKARQKLQEEHYAKGID--MLNRAKIDVEPYTCK--YW 171

Query: 225 A---FGILRSRA-----FSRLRGQNL---VLIPLADLINHSPGITTEDYAYEIKGAGLFS 273
           A    G    R       SR+  ++    VL P+ D+ NHSP     D+A++    G FS
Sbjct: 172 AAYKVGPQGQRQTALVDMSRVSAEDQDFPVLFPVIDIPNHSPTARV-DWAFD---PGRFS 227

Query: 274 RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
                +++ P+P   GE+    Y   KSN EL L YGF    +  +   LTL+   SD
Sbjct: 228 ----ITVKDPIP--GGEEAFNNYG-PKSNDELLLGYGFCIPNNLDDKVLLTLKPPPSD 278


>gi|330924024|ref|XP_003300479.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
 gi|311325361|gb|EFQ91406.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
          Length = 642

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 64/249 (25%), Positives = 102/249 (40%), Gaps = 48/249 (19%)

Query: 105 VVLEVPMKFWINPDTVAASEIGSL--------CSGLKP-WISVALFLIREKKK-EDSPWR 154
           VV   P+K  ++   +   E G L        C G  P  I   L LI ++ + + SPW 
Sbjct: 55  VVTTCPLKLTLSHLNLDPGEEGVLHIESPLRQCQGRIPDHILTYLLLIEQRDRGQASPWH 114

Query: 155 VYLDILPECTDST--VFWSEEELVELQGTQLLSTTLGVKEYVQNEY---LKVEEEIILPN 209
            Y+  LP   D T  ++++E ++  L GT L       +  +Q ++   + V EE+ +P 
Sbjct: 115 AYIACLPSPRDMTTPLWFNEGDMAFLAGTSLAPAAKERRAELQQQWERAVAVMEELSIP- 173

Query: 210 KQLFPRPITLDDFLWAFGILRSRAFSRL-----RGQNLVLIPLADLINHSPGITTE-DY- 262
                + I ++  LWA  +  SRAF        +    +L P+ D++NHS     E D+ 
Sbjct: 174 ---LAKGIDIESLLWAATVFTSRAFISTHILPEKETVPILFPVVDILNHSVSAKVEWDFQ 230

Query: 263 ---AYEIK---GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
              ++ +K   G     R  LF+   P                K N EL L YGF    +
Sbjct: 231 PRQSFALKCLDGHSFQPRQELFNNYAP----------------KQNDELLLGYGFCLEDN 274

Query: 317 DRNAYTLTL 325
               + L L
Sbjct: 275 PIEQFALKL 283


>gi|66819805|ref|XP_643561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
 gi|60471605|gb|EAL69561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
          Length = 526

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/174 (25%), Positives = 78/174 (44%), Gaps = 13/174 (7%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKK-KE 149
           GLG+++ +++  +++V ++P    ++  T + S I      ++  I  ++ LI E    E
Sbjct: 66  GLGVISLKELKVDDIVAKIPKSIILSIHTSSISNILEKYK-IENNIGTSIALIHEASLGE 124

Query: 150 DSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPN 209
            S W  Y+  LP   D  + W  E    L+GT +          +   Y  V E I+  N
Sbjct: 125 KSKWYGYISSLPRKVDVPILWDSESRKLLKGTAIEDVLNDDDILINQVYADVIESILSKN 184

Query: 210 -------KQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
                  K+L+    ++++F  A  I+ SRAF         L+PLAD+ NH   
Sbjct: 185 HPEIFGDKELY----SIENFKIANSIISSRAFCVDSYHGDSLVPLADIFNHQTA 234


>gi|332227974|ref|XP_003263165.1| PREDICTED: N-lysine methyltransferase SETD6 [Nomascus leucogenys]
          Length = 449

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 72/289 (24%), Positives = 122/289 (42%), Gaps = 44/289 (15%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F  W R   + +SPK  +       G G+VA+  +   E++L VP    ++  T + 
Sbjct: 22  VAGFLSWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQAGELLLVVPRAALLSQHTCSI 81

Query: 123 S-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEE- 174
                 E G+L S    W+ + L L+ E +   S WR Y  + PE    +  +FW EEE 
Sbjct: 82  GGLLDRERGALQSQ-SGWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEER 140

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFP-----RPITLDDFLWAF 226
              LQGT       GV E V+ +   +  E   I+LP  +  P     R  +L+ +    
Sbjct: 141 RCLLQGT-------GVPEAVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLV 193

Query: 227 GILRSRAFSRLRG--------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
            ++ + +F              + V++P AD++NH   +   +   E      +S + L 
Sbjct: 194 ALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNH---LANHNANLE------YSANCLR 244

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
            + T  P+  G ++   Y    +N +L   YGF+E   D    T  +++
Sbjct: 245 MVATQ-PIPKGHEIFNTYG-QMANWQLIHMYGFVEPYPDNTDDTADIQM 291


>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
          Length = 692

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 89/213 (41%), Gaps = 33/213 (15%)

Query: 117 PDTVAASEIGSLCSGLKPWISVALFLIREK-KKEDSPWRVYLDILPECTD--STVFWSEE 173
           P     S+I  L   ++  I +A F+  E+ K  DS W  YL  LP  ++  S +F+ + 
Sbjct: 89  PHDFHCSDI--LSERVRKSIILAFFVAHEQLKGRDSHWWPYLATLPRASELTSALFYQDS 146

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRA 233
           +L  LQGT L  T    +  V+ EY      I+     L     + D F WA+ ++ SRA
Sbjct: 147 DLDWLQGTNLYQTHQAYRNTVKEEY-DSAISILRDEGCLAVESYSWDIFCWAYTLIASRA 205

Query: 234 F-SRLRGQNL-------------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFS 279
           F SR+                  +++PL D  NH P    E +  E    G         
Sbjct: 206 FTSRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIE-WRAEATEIG--------- 255

Query: 280 LRTPVPVKAGEQVLIQYD-LNKSNAELALDYGF 311
           L+   P   GE++   Y  LN  N +L   YGF
Sbjct: 256 LKVIEPTFTGEEIHNNYGPLN--NQQLMTTYGF 286


>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
           UAMH 10762]
          Length = 633

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 92/213 (43%), Gaps = 42/213 (19%)

Query: 147 KKEDSPWRVYLDILPECTD-STVFW--SEEELVELQGTQLLSTTLGVKEYVQNEY---LK 200
            KE S W+ YLD+LP  ++ ST  W  +  +L  L GT +L T L  +E     Y   LK
Sbjct: 95  NKEQSFWKPYLDVLPSPSEFSTPLWFDAPADLAWLDGTDVLHTMLARREVYAQYYQSGLK 154

Query: 201 VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR--LRGQNLVLIPLADL-------- 250
           V  E  + +  L+    T D F WA     SR+F+   L  QN    P+           
Sbjct: 155 VLSESGI-DVTLY----TWDLFRWAITTFTSRSFTSRVLLPQNRKYWPVHRTSTNGRRQT 209

Query: 251 ----INHSPGITTEDYAYEIKGAGLFSRDL-------------LFSLRTPVPVKAGEQVL 293
               ++HSP    ED  + +   GL S +               FS+    P++AG +V 
Sbjct: 210 VLLDMSHSPA---EDLDFSVLFPGLDSGNHDPNAQVDWSFDANQFSIALVQPIEAGAEVC 266

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
             Y   K+N EL + YGF    + R+   LTL+
Sbjct: 267 NNYG-PKANDELLMGYGFCIPNNPRDEVLLTLK 298


>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 664

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/197 (25%), Positives = 88/197 (44%), Gaps = 36/197 (18%)

Query: 86  ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS-------- 137
           A +  G G+V  R++ K E ++ +P++  ++  +   S IG     LK   S        
Sbjct: 143 AVWGRGRGVVTTRNVTKGETLVAIPLEKCLSTFSARKSAIGE---ALKTITSREVTIDAV 199

Query: 138 VALFLIRE--KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQ 195
           +AL L+ E   ++E S W  ++ ILP   ++ + W+  EL +L+G+ L    +G ++ V 
Sbjct: 200 IALHLLHELYVQREKSEWWPWVSILPRDVETPLLWTPRELAQLEGSNL----IGFRDAVL 255

Query: 196 NEYLKVEEEIILPNKQLFPRPITLDDF-----LWAFGILRSRAF-------------SRL 237
             +    + +     Q FP     + F      WA  I+ SRA                 
Sbjct: 256 KGWTTQRDALFPKLTQKFPSLFPEEHFRTERWAWAMAIVWSRAADVPVPRPEAIFPSGDD 315

Query: 238 RGQNL-VLIPLADLINH 253
           + + L V++PL D+INH
Sbjct: 316 KSRELRVIVPLFDMINH 332


>gi|452987586|gb|EME87341.1| hypothetical protein MYCFIDRAFT_129844, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 491

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 114/286 (39%), Gaps = 37/286 (12%)

Query: 85  PATFP-EGLGLVAQRDI----AKNEVVLEVPMKFWINPDTV------------------- 120
           PA  P  G+GL+A R +    +++ V L +P    ++ + V                   
Sbjct: 36  PALIPGRGIGLLADRPVQYSSSESSVFLTIPEDLVLSMEAVKRHARFDTHFRELIESLDD 95

Query: 121 -AASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQ 179
              +  G++ S L    SV+   + E+         Y+  LP C     FW  EEL  L 
Sbjct: 96  FGRTPRGAILSFLLFQASVSCPHLAERVGVHCALTDYVKTLP-CELLPTFWRPEELQLLA 154

Query: 180 GTQLLSTTLGVKEYVQNEY-LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR 238
           GT L        + ++ EY L  E         L    ++ +D+L    + RSRA   L 
Sbjct: 155 GTTLAPAVAAKLKSLRREYDLLCESAARTRWYNLVEGLLSFEDWLHVDAMFRSRA---LD 211

Query: 239 GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
                +IP  DL +H+ G  T    Y+ +    F R     L     ++ G++V I Y  
Sbjct: 212 FYGSCMIPGMDLASHAAGDGTV-ARYDREDGRYFLR-----LIEGKSLQEGQEVTITYGD 265

Query: 299 NKSNAELALDYGFIESKSDR-NAYTLTLEISESDPFFGDKLDIAET 343
            K   E+   YGFI+   D      L+L I +SDP    K+ IA+ 
Sbjct: 266 EKGACEMLFSYGFIDDSMDTAETLFLSLSIPDSDPSRTAKMKIADC 311


>gi|348676124|gb|EGZ15942.1| hypothetical protein PHYSODRAFT_561656 [Phytophthora sojae]
          Length = 429

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 53/182 (29%), Positives = 80/182 (43%), Gaps = 19/182 (10%)

Query: 136 ISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQ---LLSTTLGVK 191
           +   L L+ E  + E+S +  Y+  LP        W  E    L+ T    +L   L +K
Sbjct: 89  VKTTLVLLAELARGEESGFHGYIQQLPTSISLPFSWGAESREMLRHTTAHLILDDKLVLK 148

Query: 192 EYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILRSRAFSRLRGQNLVLIPLADL 250
            Y         E ++     ++P  + TL+ F WA+ ++ SRAF    GQ   L+P+ D+
Sbjct: 149 MYAD-----YAEPLMKEFSTIWPAEVSTLEKFQWAYSMVSSRAFKVTDGQEPTLLPVIDM 203

Query: 251 INHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYG 310
            NH+     E+ A  I      S    F L T   V+  E V I Y  + SNA+L   YG
Sbjct: 204 ANHA----AENPAAHIVKTETGS----FQLTTLRKVEKDESVTISYG-DLSNAQLLCRYG 254

Query: 311 FI 312
           F+
Sbjct: 255 FV 256


>gi|322694547|gb|EFY86374.1| SET domain-containing protein, putative [Metarhizium acridum CQMa
           102]
          Length = 372

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/262 (25%), Positives = 108/262 (41%), Gaps = 41/262 (15%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWIN----P 117
           Q+E    W   + V    + I P   P  G+G+VA R +   E +L VP+         P
Sbjct: 3   QIERLLTWAETKGVAI--NGIGPKPLPGRGIGIVATRALKTGEDILTVPVNMLRTLKNTP 60

Query: 118 DTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
             +  +  GS    +     +A  L  E   E + WR    + P   D           E
Sbjct: 61  QPILHNLKGSTVHAI-----LATSLCLETDPEFAIWRA---VFPSEDDIRTCMPLSWPPE 112

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR- 236
           LQ   LL      K  +  +  K + +  L     +P  I+   FL+++ ++ SR F   
Sbjct: 113 LQ--HLLPP--NAKALLAKQKTKFDTDWALVTAA-YPS-ISRTQFLYSWHLVNSRTFYHV 166

Query: 237 -------LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
                  L+  ++VL P+ADL NHSP            G  +   D  F++ T  PV+ G
Sbjct: 167 TRATEKLLKADHMVLQPVADLFNHSP-----------DGCKVAFDDASFTITTTHPVEEG 215

Query: 290 EQVLIQYDLNKSNAELALDYGF 311
           +++ I+Y  + SN  L ++YGF
Sbjct: 216 DELFIRYG-SHSNDFLLVEYGF 236


>gi|401625518|gb|EJS43523.1| YHL039W [Saccharomyces arboricola H-6]
          Length = 585

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 95/222 (42%), Gaps = 51/222 (22%)

Query: 153 WRVYLDILPECTDSTVFWSEEELVELQGTQLLST---TLG--VKEYVQ-NEYLKVE---- 202
           ++ YLD+LP     T FWS +E+V L GT +  T   TL   VKE++   + L +E    
Sbjct: 103 YKPYLDVLPLNLPQTYFWSTDEVVNLHGTDVYLTMRDTLNKVVKEWIGLCQALSIEHASQ 162

Query: 203 ------------EEIILPNKQLFPR--PITLDD--------FLWAFGILRSRAFSRL--- 237
                       E  ++P +Q         LD         +LW++ I  SRAF R+   
Sbjct: 163 DKQFLSLFRGTGEATVVPLEQFCTHINSCKLDTLEWNSFVAYLWSYCIFNSRAFPRIILN 222

Query: 238 ----RGQNL---VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
                  NL    L P+ DL+NH   +        +K       +L F  + P    A E
Sbjct: 223 KSDTDSSNLNEGFLYPIVDLLNHKNDV-------PVKWKMNDQNELCFMSQCPA-FDANE 274

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
           ++   Y  + SN +  L+YGF +S +  +   LTL++  + P
Sbjct: 275 ELFNNYG-DISNEKCLLNYGFWDSSNKYDFSRLTLKLPSALP 315


>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
          Length = 748

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 37/204 (18%)

Query: 85  PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWI-------- 136
           PA F E  G+VA +DI  N  ++ VP    I+ +    S + S+     P +        
Sbjct: 37  PAAFGELTGVVATQDIPANTAIICVPQTLIISQEKCKLSSL-SIVYDKHPELFDENQTSD 95

Query: 137 ----SVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVK 191
                +  +L  EKKK E S +  Y+  + +  ++ + W++EEL +++   +L     V 
Sbjct: 96  AEFNILIFYLFNEKKKGEQSFFYPYIQAI-QTNNTLIDWTKEELSQIEDPIVLDEFAIVS 154

Query: 192 E-------YVQ---NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQ 240
           E       Y Q   NE++++  E+         RP    DF WA   + SR+F   L+  
Sbjct: 155 EDLKVLWNYAQDIFNEFIQIFGEV---------RPTDKQDFYWAAQSVMSRSFGWSLKST 205

Query: 241 NLVLIPLADLINHSPGITTEDYAY 264
           +  +IP+AD +NHS    T  + +
Sbjct: 206 S--MIPIADFLNHSNKACTHYFVH 227


>gi|307105209|gb|EFN53459.1| hypothetical protein CHLNCDRAFT_136721 [Chlorella variabilis]
          Length = 470

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 87/185 (47%), Gaps = 18/185 (9%)

Query: 135 WISVALFLIREKKKEDSPWRVYLDILPECTDSTVF--WSEEELVELQGTQLLSTTLGVKE 192
           ++ +A +L+  ++    PWR+Y+D+LP   + T    +  EE+ ELQ   L S     + 
Sbjct: 91  FVRLAAWLLWLRRNAQGPWRLYIDLLPREEEITTLMNYRPEEVGELQSPLLESRAALERS 150

Query: 193 YVQ--NEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR-LRGQNL-VLIPLA 248
            +   ++ L       L   QL      L D +WA  ++ SR+FS  + G+ + +++P A
Sbjct: 151 QIAGLHDRLFCTSGGELRALQL---AAGLQDTVWAACMVNSRSFSETVEGETVSLMVPCA 207

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL-NKSNAELAL 307
           D+ NH+      + +Y+       +    F L+    +  G +  I Y   +KSN  L  
Sbjct: 208 DMANHA---LAPNASYQF-----VAPADAFQLQALQDIARGAEACISYGCTHKSNEGLMR 259

Query: 308 DYGFI 312
           DYGF+
Sbjct: 260 DYGFV 264


>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
 gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
          Length = 524

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 13/155 (8%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDT--------VAASEIGSLCSGLKPWISVALFL 142
           G G VA +DI +  V+  +P    ++  T        + A +   L  G   W  + L +
Sbjct: 61  GRGAVAVKDIPEGHVLFTIPRDLTLSTRTSSLPLRFGMGAWKNAKLHEG---WAGLILCM 117

Query: 143 IREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           + E  +   S W  Y DILP   D+ +FW+EEEL EL+GT ++   +G  +  ++   K+
Sbjct: 118 MWEAAQGSSSKWSGYFDILPTSFDTPMFWTEEELAELRGTSVVE-KIGRADAEKDYKEKL 176

Query: 202 EEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR 236
              I    +   PR I     +  + ++ SR  SR
Sbjct: 177 IPAINSRPELFLPRDIHTRYSVEMYHVMGSRILSR 211


>gi|157872113|ref|XP_001684605.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68127675|emb|CAJ05793.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 629

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 87/368 (23%), Positives = 145/368 (39%), Gaps = 68/368 (18%)

Query: 149 EDSPWRVYLDILPECTDST-VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIIL 207
           E S WR  L   PE   +    W   +L EL G  ++   L  +  +Q    +++  ++L
Sbjct: 267 ERSHWRRLLTHCPERYPTIPTAWELRDLAELDGLDMVDDVLAKRSQMQAFVDQLQNSLLL 326

Query: 208 ----------------PNKQLFPRPITLDDFLWAFGILRSRAFS-RLRGQN-LVLIPLAD 249
                           P+          +  +WA     SRAF+  + G   + L+PLAD
Sbjct: 327 LFHRALSKLHGASAAAPSLADMSAAFVWEHLVWAQSTFDSRAFNLNVDGAVVMALVPLAD 386

Query: 250 LINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDY 309
           ++NH+          E  G G F+  +  +L        G ++ + Y     N EL   Y
Sbjct: 387 MVNHANHTDVLVRKVEPNG-GPFTMQVGAAL---TAADVGRELWMSYG-PLQNWELLQHY 441

Query: 310 GFI---ESKSDRNAYTLTLEISESDPFFGDKLDIAETN------GLGESAY--------- 351
           GF+   ++  D+  + LTL    ++P    + D   T+      G  ES++         
Sbjct: 442 GFLLGPDNVHDKLPFPLTLPAGSNEP--QSESDATATSAPLSREGGTESSWDARRLTLMG 499

Query: 352 -FDIVL-GRT------LPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEE 403
            + + L GR       +PPA L  L  V L   D F    I  +  +G  + P+S   E 
Sbjct: 500 RYALCLPGRCWIPHDGVPPAALLALLRVQLAQADEF---DIMESRRYGPFE-PLSPVTEA 555

Query: 404 LICRVVRDACKSALSGFHTTIEEDEQLL------QGGNLDP------RLRIAVGVRAGEK 451
            +  VV+   +  +  F TT+ EDE+ L      +G +LD          + + +R G K
Sbjct: 556 AVVSVVKSTVQCVMGSFPTTLAEDEETLAEVHGDEGCDLDAGDEPTKNHMLCLQLRVGLK 615

Query: 452 RVLQQIDE 459
           R+  +  E
Sbjct: 616 RIASRCLE 623


>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 473

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 78/176 (44%), Gaps = 20/176 (11%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN-----EYLKVEEEI 205
           SPW  Y+  +P       F+SEEEL  L+G+ L    L V   + +     E+L+   E 
Sbjct: 131 SPWTEYMKYMPPAISLPTFYSEEELELLRGSSL---RLAVHAKIASLEKEFEHLRRSTEG 187

Query: 206 ILPNKQLF----PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGITTE 260
           +   ++ +       +T +D+ +   + RSR     R     ++P  D+ NH S G    
Sbjct: 188 LDWCEKYWWDEDTGKLTFNDWKYVDALYRSRMVDLPR-HGHAMVPCIDMANHASEGTVKA 246

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            Y  +  G      + +  LR    ++A E+V I Y   KS +EL   YGF++  +
Sbjct: 247 LYDEDADG------NAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHT 296


>gi|255089515|ref|XP_002506679.1| predicted protein [Micromonas sp. RCC299]
 gi|226521952|gb|ACO67937.1| predicted protein [Micromonas sp. RCC299]
          Length = 584

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 66/304 (21%), Positives = 115/304 (37%), Gaps = 55/304 (18%)

Query: 58  NPPTAQVETFWQWLRDQKVVSPKSPIRPATFPEGL-GLVAQRDIAKNEVVLEVPMKFWIN 116
           + P A  + FW+W R + VV+ +   R     EG  G+VA  DI ++ VVL VP    ++
Sbjct: 10  DGPDADAD-FWRWARARGVVAVRCEAR--DVAEGWRGIVATEDIERDAVVLRVPGDILMS 66

Query: 117 PDTVAASE-------IGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILPECTDSTV 168
             ++   E              + P   +A+ L+ E  +   S W  Y+  LP   +   
Sbjct: 67  ARSMERDEQLRDALRTHGETRSMTPADKLAVHLLLEASRGRGSRWHEYISRLPRAYNLLC 126

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGI 228
            W+  E   LQ    ++     ++  +  + +  + +      L  R  ++D + WA   
Sbjct: 127 CWTRRERAMLQDPAAIAVARRARDATRQSWERARDVLAALGLTLANRWGSIDAWRWARCT 186

Query: 229 LRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYA------------------------- 263
           + SR           L P+ DL N++P      +A                         
Sbjct: 187 VSSRTVYVPYDAAGALCPVGDLFNYAPPPPPHRHAIVGTPLEGGDEGKDDDEGKDDDKRG 246

Query: 264 ------------YEIKGAGLF---SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
                       + + G G +   SR+ +F  R P    AGEQ+++ Y  + +N  L   
Sbjct: 247 EDDDKGGEEGKRWTVSGDGAWDEASREYVFRARRPYV--AGEQIMLCYGRH-TNLGLLEH 303

Query: 309 YGFI 312
           YGF+
Sbjct: 304 YGFL 307


>gi|255076477|ref|XP_002501913.1| predicted protein [Micromonas sp. RCC299]
 gi|226517177|gb|ACO63171.1| predicted protein [Micromonas sp. RCC299]
          Length = 294

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 77/326 (23%), Positives = 131/326 (40%), Gaps = 56/326 (17%)

Query: 160 LPECTDSTVFWSEEELVELQGTQLLSTTLGVK-------EYVQNEYLKVEEEIILP-NKQ 211
           +P    S    ++EEL  LQ   +    +G K       E ++ ++ +V + I    + Q
Sbjct: 1   MPTSYTSLYLATDEELEALQDENVRRMAMGSKANYAAGWEMIKTQHPRVADAIDGSVDDQ 60

Query: 212 LFPRPITLDDFLWAFGILRSRAFSR--LRGQNLVLIPLADLINHS--P----GITTEDYA 263
           +     T ++F WA     +RA S     G    ++P  DL NHS  P    G++ +  +
Sbjct: 61  I---AATQEEFDWARATAHTRAMSGKVAGGPCAFIVPGVDLANHSFAPNCEYGVSGDGGS 117

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
           +++      +R++      P+P ++G++VLI Y     NA L L YGF++ ++      +
Sbjct: 118 FQLTWDTTATREMPKG--PPLP-ESGDEVLICYGARMPNALLMLHYGFMDPENPNEQLPM 174

Query: 324 TLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLES 383
              I  +       +  A               GR           +   G T A   E 
Sbjct: 175 ECMIPGARKIRAKTVSAA---------------GRA----------IAEEGDTKA---EW 206

Query: 384 IFRNTIWGHLDLPVSHANE---ELIC-RVVRDACKSALSGFHTTIEEDEQLLQGGNLDPR 439
             R  +   +  PV  + +   +L C   +R A  + L+ F TT EEDE  L  G L  R
Sbjct: 207 AARQMM--QMANPVDDSADKAADLACIGAMRGAADAKLASFPTTAEEDEVALSAGELGER 264

Query: 440 LRIAVGVRAGEKRVLQQIDEIFKERE 465
           +R+ V  R   KR ++       + E
Sbjct: 265 MRMCVEYRLSVKRNVEAFGRFLAKME 290


>gi|255075393|ref|XP_002501371.1| set domain protein [Micromonas sp. RCC299]
 gi|226516635|gb|ACO62629.1| set domain protein [Micromonas sp. RCC299]
          Length = 518

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 76/173 (43%), Gaps = 9/173 (5%)

Query: 92  LGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE-IGSL--CSGLKPWISVALFLIREKKK 148
           +G++A+R+I   E+V  +P    +     A  + +G L    G   W+++  F +  ++ 
Sbjct: 84  VGVLARRNIDVGEIVARIPTSQCLTCGGCAHQKLVGDLEISLGKGEWLALLGFALLLERG 143

Query: 149 ED----SPWRVYLDILPECTDS-TVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
            D    S W  YL  LP      T  W E     L GT +       +   + E+ +   
Sbjct: 144 IDAGKSSSWNAYLRYLPAAEQGVTALWPENRKRYLAGTDVDLALRDERAQAKTEW-ETHI 202

Query: 204 EIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPG 256
           E IL   +      T +D+L A  ++ SRAF+      + L+PLADL NH  G
Sbjct: 203 EPILSRSEYAESGFTFEDYLSARSVVSSRAFTICPKVGVGLVPLADLFNHRTG 255


>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
          Length = 1078

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 90/413 (21%), Positives = 164/413 (39%), Gaps = 84/413 (20%)

Query: 80  KSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA---------SEIGSLCS 130
           K P+   T P   G+ + RDI + E +L +P   W N  ++           ++I  L  
Sbjct: 40  KQPLLSLT-PNEKGIFSSRDIKEGEELLSLP---WYNSLSMNKVQQQLPWLFNKIQDLEL 95

Query: 131 GLKPWISVALFLIREKKKEDS-PWRVYLDILPECTDSTVFWSEEELVELQGT----QLLS 185
             +  + VAL   R    + S  +  +   +PE  +S +F+S+ E   L G+     L++
Sbjct: 96  TAEDGLVVALLYYRYCMDDLSFDYSEWFSAMPEVLNSGLFFSDAEAELLNGSPAYIDLMN 155

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNL--- 242
             L  KE      L    + +   +Q     +T D   WA+ ++ SR        NL   
Sbjct: 156 QRLDAKE------LFGRLKSLFKEQQFSKCAMTYDRLKWAYSVVDSRKI-YTEAPNLDAN 208

Query: 243 -------VLIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
                  VL P  D  NH+       D+ Y+     + +           P+K GEQ+ +
Sbjct: 209 GNPFITVVLAPFLDYFNHAEDAQAAYDFDYDESAIKVVALQ---------PIKKGEQIFL 259

Query: 295 QYDLNKSNAELALDYGFI-ESKSDRNAYTLTLE-----ISESDPFFGDKLDIAETNGLGE 348
            Y     N++L + YGFI +S + ++   + +E     I  SDP   +K ++        
Sbjct: 260 NYGNQDCNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIPASDPQLIEKTEL-------- 311

Query: 349 SAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRV 408
                      L  A  Q  R+     +   L E + + + +      +S+ N  L+   
Sbjct: 312 -----------LTKAFEQNERMKLFKDS---LTEELLKISKY------LSYKNFSLL-PY 350

Query: 409 VRDACKSALSGFHTTIEEDEQLLQGG----NLDPRLRIAVGVRAGEKRVLQQI 457
           ++      +  + TT+EED  +++       L  R ++++ +R  EK  L++I
Sbjct: 351 LKSLIDMKMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSIIMRLQEKETLKEI 403


>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
          Length = 480

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 78/176 (44%), Gaps = 20/176 (11%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN-----EYLKVEEEI 205
           SPW  Y+  +P       F+SEEEL  L+G+ L    L V   + +     E+L+   E 
Sbjct: 113 SPWTEYMKYMPPAISLPTFYSEEELELLRGSSL---RLAVHAKIASLEKEFEHLRRSTEG 169

Query: 206 ILPNKQLF----PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGITTE 260
           +   ++ +       +T +D+ +   + RSR     R     ++P  D+ NH S G    
Sbjct: 170 LDWCEKYWWDEDTGKLTFNDWKYVDALYRSRMVDLPR-HGHAMVPCIDMANHASEGTVKA 228

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            Y  +  G      + +  LR    ++A E+V I Y   KS +EL   YGF++  +
Sbjct: 229 LYDEDADG------NAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHT 278


>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
           C-169]
          Length = 685

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 96/212 (45%), Gaps = 19/212 (8%)

Query: 140 LFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
           L+ + ++  +DS +  +   LPE   + +  +EEE+  L+GT   +T +  +++++ +Y 
Sbjct: 34  LYTMIDRHDKDSDFAPFWASLPEVFMTGLSATEEEVSMLEGTPAHTTFVEARQHIREQY- 92

Query: 200 KVEEEIILPNKQLFPRPI-----TLDDFLWAFGILRSRAFSRLRGQNLV---LIPLADLI 251
           +  + ++      +P  I     T D F+WA  +  S A         V   L+P+A L+
Sbjct: 93  RAAQPVLQALTAAYPDDITPDLVTEDKFIWACELWYSYAIEVEYVDGAVRQTLVPIAHLL 152

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NHSP      + + ++   L +      LR      AGEQ  + Y     N +L L YGF
Sbjct: 153 NHSP------WPHIVRYGRLDAATDSLRLRAFRHCAAGEQCFLSYG-PLPNLKLLLFYGF 205

Query: 312 IESKSDRNAYTLTLEISESDPFFGDKLDIAET 343
               +  +   +T E  +++   GD  D+ E 
Sbjct: 206 ALPDNPHDTVPITFEAEKNE---GDVTDMLEA 234


>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
          Length = 498

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 78/176 (44%), Gaps = 20/176 (11%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQN-----EYLKVEEEI 205
           SPW  Y+  +P       F+SEEEL  L+G+ L    L V   + +     E+L+   E 
Sbjct: 131 SPWTEYMKYMPPAISLPTFYSEEELELLRGSSL---RLAVHAKIASLEKEFEHLRRSTEG 187

Query: 206 ILPNKQLF----PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINH-SPGITTE 260
           +   ++ +       +T +D+ +   + RSR     R     ++P  D+ NH S G    
Sbjct: 188 LDWCEKYWWDEDTGKLTFNDWKYVDALYRSRMVDLPR-HGHAMVPCIDMANHASEGTVKA 246

Query: 261 DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
            Y  +  G      + +  LR    ++A E+V I Y   KS +EL   YGF++  +
Sbjct: 247 LYDEDADG------NAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHT 296


>gi|10437194|dbj|BAB15011.1| unnamed protein product [Homo sapiens]
          Length = 449

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 120/291 (41%), Gaps = 48/291 (16%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F  W R   + +SPK  +       G G+VA+  +   E++  VP    ++  T + 
Sbjct: 22  VACFLSWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALLSQHTCS- 80

Query: 123 SEIGSLC-------SGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEE 173
             IG L         G   W+ + L L+ E +   S WR Y  + PE    +  +FW EE
Sbjct: 81  --IGGLLERERVALQGQSGWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEE 138

Query: 174 E-LVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFP-----RPITLDDFLW 224
           E    LQGT       GV E V+ +   +  E   I+LP  +  P     R  +L+ +  
Sbjct: 139 ERRCLLQGT-------GVPEAVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQ 191

Query: 225 AFGILRSRAFS--------RLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDL 276
              ++ + +F              + V++P AD++NH   +   +   E      +S + 
Sbjct: 192 LVALVMAYSFQVPLEEEEDEKEPNSPVMVPAADILNH---LANHNANLE------YSANC 242

Query: 277 LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
           L  + T  P+  G ++   Y    +N +L   YGF+E   D    T  +++
Sbjct: 243 LRMVATQ-PIPKGHEIFNTYG-QMANWQLIHMYGFVEPYPDNTDDTADIQM 291


>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
          Length = 749

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 58/229 (25%), Positives = 101/229 (44%), Gaps = 24/229 (10%)

Query: 156 YLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR 215
           Y+++LPE  D  + ++ +EL  ++GT+L +    +  ++         ++I    Q FP+
Sbjct: 58  YINVLPETYDCPLSYTIDELNLMKGTKLYAAVEKINAFLMKVVDYYNNKLI----QQFPQ 113

Query: 216 PI-TLDDFL----WAFGILRSRAFSRLRGQNL----VLIPLADLINHSPGITTEDYAYEI 266
              + DD      WA     SRAF  +  Q       LIP  D  NH     T+     I
Sbjct: 114 YFQSFDDLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNH----CTQAKVTYI 169

Query: 267 KGAGLFSRDLLFSLRTPVP-VKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTL 325
                 ++   FS +T    VK GEQ+   Y + +SN +L L YGF+E  +  +   L +
Sbjct: 170 SN----TQTETFSFQTNEELVKPGEQIFNNYRI-RSNEKLLLGYGFVEENNPCDNLLLRI 224

Query: 326 EISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALG 374
                D  + +  +I +   +    +F + L   +P  +++ LR+V L 
Sbjct: 225 YFEVDDNQYNEIEEILKQEEIKSFDFF-LKLDEDIPLELMRILRIVNLS 272


>gi|400597177|gb|EJP64912.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 468

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 51/115 (44%), Gaps = 7/115 (6%)

Query: 219 LDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
           L D+       RSR    L      ++P  D++NHS    T  Y  E  G      D+  
Sbjct: 168 LSDWFLVDAWYRSRCL-ELPQAGHAMVPALDMVNHSTTNATAYYEEEASG------DVSL 220

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPF 333
            +R  V V AG+++ I Y  +KS AE+   YGFI+  S     TL L+    DP 
Sbjct: 221 RIRPGVSVAAGDEINISYGDSKSAAEMLFSYGFIDPDSKTRQLTLHLDSFPDDPL 275


>gi|390367697|ref|XP_787519.3| PREDICTED: N-lysine methyltransferase setd6-like
           [Strongylocentrotus purpuratus]
          Length = 466

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 52/229 (22%), Positives = 95/229 (41%), Gaps = 46/229 (20%)

Query: 61  TAQVETFWQWLRDQKVV-SPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           +   + F  W   + ++ + K  +R +      G++A  DI+K E +  VP    ++P T
Sbjct: 29  STDCQMFLSWCEQEGIIQNSKVTVRRSGSCAQCGMIALDDISKGETLFTVPRSVLLHPAT 88

Query: 120 ---VAASEIGSLCSGLKP---WISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWS 171
              V A  +      L+    W+ + L ++ E     S WR YLD+ P+ +  D  +FW 
Sbjct: 89  CSPVVAQRLEEDEDSLETESGWVPLILAVMYEHTNRSSRWRPYLDLFPDYSELDQPMFWD 148

Query: 172 EEELV-ELQGTQLLSTTLGVKEYVQNEYLKVE---EEIILP--------------NKQLF 213
              +  EL+GT       G+ E VQ +   ++    ++ LP              N  L+
Sbjct: 149 SNYMQPELRGT-------GIAEAVQRDLRNIDRDYHDVALPFIKKNADLFSEEKHNLDLY 201

Query: 214 PRPITLDDFLWAFGILRSRAFSRLRGQNL---------VLIPLADLINH 253
            R ++   F+ A+    S  +      +          +++PLAD +NH
Sbjct: 202 KRTVS---FIMAYSFTESPDYDEDDDDSDDDDEETHPPMMVPLADALNH 247


>gi|159481640|ref|XP_001698886.1| hypothetical protein CHLREDRAFT_177526 [Chlamydomonas reinhardtii]
 gi|158273378|gb|EDO99168.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 384

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 151/404 (37%), Gaps = 89/404 (22%)

Query: 107 LEVPMKFWINPDTVAASEIGSLCSGLKPW-ISVALFLIREKKKEDSP-WRVYLDIL---P 161
           L +P   W      A+S I  L     PW I  AL L+     +    W  Y D+L   P
Sbjct: 17  LSMPPARWPRLRGGASSSIDPL-----PWDILQALALVDGLAGDGGEFWARYCDVLLPAP 71

Query: 162 ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDD 221
           E     + W  + L +LQ   + +      E  Q+  + +  E++ P       P     
Sbjct: 72  ERLTLPMCWEPQRLAQLQHRDIATAA----EAQQDRLMGLFPELMEPLA-----PDVPSW 122

Query: 222 FLWAFGILRSRAFSRLRGQNLVLIPLADLINHS-------PG--------ITTEDYAYEI 266
           F WAF  +RSRAF R        +P  DL NH+       PG         T    + ++
Sbjct: 123 FQWAFACVRSRAF-RAGPDAFAFVPFLDLANHADAPQAVPPGPSGASGGAATPAPPSNQV 181

Query: 267 KGAGLFSRDL--------------LFSLRTPVPVKAGEQVLIQYDLNK--SNAELALDYG 310
             A   +                 L++LR    + AGE+V I Y   +  +N      YG
Sbjct: 182 ANADFRASPAGAGSGEAGAGEYFELYALRD---IVAGEEVTISYAGPEGYTNQRFMAQYG 238

Query: 311 FIE---SKSDRNAYTLTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQY 367
           F+    + +DR    LT E+                    +SA  D+ + + L   +   
Sbjct: 239 FVPVGGNAADRVKLELTPEM--------------------QSAPLDLEVLQEL---LGDA 275

Query: 368 LRLVALGGTDAFL---LESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTI 424
           L + AL GTD +L   L+S+      G   +  + A      + +    ++ ++   TT+
Sbjct: 276 LFMSALRGTDPYLTAALKSLPLVDGGGSNGVKAATAGSVRTAQALLQQVEAQIAEGTTTL 335

Query: 425 EEDEQLLQG------GNLDPRLRIAVGVRAGEKRVLQQIDEIFK 462
           E DE LL G         DPR   AV  R   KR+L +   + +
Sbjct: 336 EADEALLGGAEGRALAEADPRQAAAVAYRVERKRLLDKTAALLR 379


>gi|393220404|gb|EJD05890.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 494

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 61/223 (27%), Positives = 96/223 (43%), Gaps = 44/223 (19%)

Query: 136 ISVALFLIR----EKKKEDSPWRVYLDILPECTDST-VFWSEEELVELQGTQ-------L 183
           +S+ L L R    +   +D+ + VY+D LP   DS  + W  +E +   G         L
Sbjct: 105 LSIHLALYRPGGEDSCSKDAHFGVYIDTLPTEFDSHPLTWLVKEKLGKAGAHEVKLLERL 164

Query: 184 LSTTLGVKEYVQNEYL-----------KVEEEIILPNKQLFPRPITLDDFLWAFGILRSR 232
             + L   E V   Y            K  EE+   N+ LF      DDFLW + ++ +R
Sbjct: 165 PPSVLSELEAVATRYWHDWDIAVRAKEKYSEELRSKNEALF----NEDDFLWGWLVVNTR 220

Query: 233 A-----FSR-LRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR---------DLL 277
           +     F+R L   NL L P+ D  NH+   +     ++ + + + SR         D  
Sbjct: 221 SLYADLFNRNLPQSNLTLCPVIDFANHTASSSVPYAVWKDRPSKVLSRPRGPSKHANDYT 280

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNA 320
           F L     +  GE++LIQY    SN  L  +YGF++  +D+ A
Sbjct: 281 F-LPPKQGLGQGEELLIQYG-KHSNRRLFAEYGFVDENADKEA 321


>gi|390477743|ref|XP_003735352.1| PREDICTED: N-lysine methyltransferase SETD6 [Callithrix jacchus]
          Length = 449

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 73/289 (25%), Positives = 121/289 (41%), Gaps = 44/289 (15%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F  W R   + +SPK  +       G G+VA+  +   E++  VP    ++  T + 
Sbjct: 22  VAGFLSWCRRVGLELSPKVEVSRQGTVAGYGMVARESVQAGELLFVVPRAAILSQYTCSI 81

Query: 123 S-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD--STVFWSEEEL 175
                 E G+L S    W+ + L L+ E +   S WR Y  + PE       +FW EEE 
Sbjct: 82  GGLLERERGALQSQ-SGWVPLLLALLHELQAPASHWRPYFALWPELGHLKHPMFWPEEER 140

Query: 176 VE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFP-----RPITLDDFLWAF 226
              LQGT       GV E V+ +   +  E   I+LP  +  P     R  +L+ +L   
Sbjct: 141 RRLLQGT-------GVPEAVEKDLDSIRSEYHSIVLPFMEAHPDLFSLRVHSLELYLQLV 193

Query: 227 GILRSRAFSRLRG--------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
            ++ + +F              + V++P AD++NH   +   +   E      +S D L 
Sbjct: 194 ALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNH---LANHNANLE------YSADCLR 244

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
            + T  P+  G ++   Y    +N +L   YGF+E   D    T  +++
Sbjct: 245 MVATQ-PIPKGHEIFNTYG-QMANWQLIHMYGFVEPYPDNTDDTADIQM 291


>gi|401886592|gb|EJT50619.1| nucleus protein [Trichosporon asahii var. asahii CBS 2479]
          Length = 533

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 90/227 (39%), Gaps = 66/227 (29%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP---------WISVALF 141
           G G VA RDI     +  +P  + ++  T       +L + LKP         W  + L 
Sbjct: 34  GNGAVATRDIPSGTALFSIPSSYLLSEHT------STLSTHLKPEDWASLEGGWTRLILA 87

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           L+ E  + +SPWR YLD +P    + ++W   +L  L+GT  +   +G +     +Y   
Sbjct: 88  LMWEDSRAESPWRAYLDAMPGSFSTPMWWPAPDLALLKGTD-IENRIG-RASADRDY--- 142

Query: 202 EEEIILPNKQLFPRPITLDDF-------------LWAFGILRSRAFSRLR---------- 238
            +E + P    +P  + + DF               +F + R+R  +  R          
Sbjct: 143 -DERVAPLLAAYPG-VFVGDFSKECYHRQGSRVLSRSFTVPRARVDAGYRKEKEEKEARS 200

Query: 239 ------------GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFS 273
                        Q  V++PLAD++N          AYE+  A LFS
Sbjct: 201 EDSDEEEEEEEEEQVAVMVPLADMLNA---------AYEMDNARLFS 238


>gi|406698545|gb|EKD01780.1| nucleus protein [Trichosporon asahii var. asahii CBS 8904]
          Length = 533

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 90/227 (39%), Gaps = 66/227 (29%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP---------WISVALF 141
           G G VA RDI     +  +P  + ++  T       +L + LKP         W  + L 
Sbjct: 34  GNGAVATRDIPSGTALFSIPSSYLLSEHT------STLSTHLKPEDWASLEGGWTRLILA 87

Query: 142 LIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           L+ E  + +SPWR YLD +P    + ++W   +L  L+GT  +   +G +     +Y   
Sbjct: 88  LMWEDSRAESPWRAYLDAMPGSFSTPMWWPAPDLALLKGTD-IENRIG-RASADRDY--- 142

Query: 202 EEEIILPNKQLFPRPITLDDF-------------LWAFGILRSRAFSRLR---------- 238
            +E + P    +P  + + DF               +F + R+R  +  R          
Sbjct: 143 -DERVAPLLAAYPG-VFVGDFSKECYHRQGSRVLSRSFTVPRARVDAGYRKEKEEKEARS 200

Query: 239 ------------GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFS 273
                        Q  V++PLAD++N          AYE+  A LFS
Sbjct: 201 EDSDEEEEEEEEEQVAVMVPLADMLNA---------AYEMDNARLFS 238


>gi|443924621|gb|ELU43617.1| SET domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 882

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 78/208 (37%), Gaps = 39/208 (18%)

Query: 139 ALFLIREKKKEDSPWRVYLDILP-ECTDSTVFWSEEELVE-------LQGTQL------L 184
            L L      E S W  YL  LP E       W E E  +       LQ T+       L
Sbjct: 142 GLILEARALGEQSAWAGYLQSLPREQVPVAALWDENEDKDSQHAWLWLQATETAREIHEL 201

Query: 185 STTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVL 244
           ST   + + + N Y +V      P         TLDDF  A+ ++ SR+F       L +
Sbjct: 202 STGCTISQTISNFYYQVAN----PTLNQAGHKATLDDFRRAWSLVSSRSFRVDSYHGLAM 257

Query: 245 IPLADLINHSPGITTEDYAY---------------EIKGAGLFSR-----DLLFSLRTPV 284
           +P+AD  NH  G    D+                 EI+G+   +      +    +    
Sbjct: 258 VPIADAFNHI-GENQTDFDVCPICGSLQSCPHDEEEIQGSSNLTEGKTVLENTCDMVMNA 316

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFI 312
           PV  G+++   YD +  N+ L   YGFI
Sbjct: 317 PVSTGDEIFNSYDTSLPNSTLLARYGFI 344


>gi|403376910|gb|EJY88443.1| hypothetical protein OXYTRI_16493 [Oxytricha trifallax]
          Length = 787

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 88/216 (40%), Gaps = 37/216 (17%)

Query: 66  TFWQWLRDQKVVSP--KSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
            F +WL D  V+ P    P+      + +G+ A++DI   +  L VP +  I+  TV  S
Sbjct: 46  NFQKWLEDNGVLHPGVDYPVAFGRQGQLIGMAARKDIPPQKAFLFVPQRLMISEVTVRNS 105

Query: 124 EIGSLCSGL---------KPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEE 173
           +I  L S             ++ +  F+  E  K E S W  Y  I+   +D  + WS++
Sbjct: 106 KIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEASFWYPYFQII-NLSDLPMLWSDQ 164

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRS-- 231
           E+ E Q   L       K   +NE+  V E         F +  T D++      L +  
Sbjct: 165 EIQEFQDQVLQKDIQDYKVEYENEWKLVYE--------AFSKDETYDEYYPGISNLENKE 216

Query: 232 --RAFSRLRGQNLV------------LIPLADLINH 253
             RAF     Q++V            +IP AD INH
Sbjct: 217 MIRAFYDRAYQSVVTRCFGWGLPKTSMIPFADCINH 252


>gi|344290687|ref|XP_003417069.1| PREDICTED: N-lysine methyltransferase SETD6-like [Loxodonta
           africana]
          Length = 452

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/261 (25%), Positives = 109/261 (41%), Gaps = 43/261 (16%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS-----EIGSLCSG 131
           +SPK  +       G G+VAQ  +   E++  VP    ++  T         E G+L S 
Sbjct: 39  LSPKVAVSRQGTVAGYGMVAQESVQPGELLFAVPRAAILSQHTCCIGGLLERERGALQSQ 98

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTL 188
              W+ + L L+ E +   SPW  Y  + PE +  +  +FW EEE    LQGT       
Sbjct: 99  -SGWVPLLLALLHELQAPSSPWSPYFALWPELSRLEHPMFWPEEEWRRLLQGT------- 150

Query: 189 GVKEYVQNEYLKVEEE---IILPNKQLFP-------RPITLDDFL------WAFGILRSR 232
           GV E V+ +   +  E   I+LP  +  P       R + L   L      ++F      
Sbjct: 151 GVPEAVEKDLANIRSEYYSIVLPFMEAHPELFSPCVRSLELYQQLVALVMAYSFQEPLEE 210

Query: 233 AFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                   + +++P AD++NH   +   +   E      +S D L  + T  P+  G+++
Sbjct: 211 EEDEKEPNSPLMVPAADILNH---LANHNAHLE------YSPDCLRMVATQ-PIPKGQEI 260

Query: 293 LIQYDLNKSNAELALDYGFIE 313
              Y    +N +L   YGF+E
Sbjct: 261 FNTYG-QMANWQLIHMYGFVE 280


>gi|322709574|gb|EFZ01150.1| SET domain-containing protein, putative [Metarhizium anisopliae
           ARSEF 23]
          Length = 372

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 67/262 (25%), Positives = 108/262 (41%), Gaps = 41/262 (15%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWIN----P 117
           ++E    W   + V    + I P   P  G+G+VA R +   E +L VP+         P
Sbjct: 3   EIERLLNWAEKKGVTI--NGIGPKPLPGRGIGIVATRALKTGEDILTVPVNMLRTLKNTP 60

Query: 118 DTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
             +  S  GS    +     +A  L  E   E + WR    + P   D           E
Sbjct: 61  QPILHSLKGSTVHAI-----LATSLCLETDPEFATWRA---VFPTEDDIRTCMPLSWPSE 112

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRL 237
           LQ   LL      K  +  +  K + +  L     +P  I+   FL+++ ++ SR F  +
Sbjct: 113 LQ--YLLPP--NAKALLAKQKTKFDTDWALITAA-YPS-ISRSQFLYSWHLVNSRTFYHV 166

Query: 238 --------RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
                   +  ++VL P+ADL NHSP            G  +   D  F++ T  PV+ G
Sbjct: 167 TRATEKLPKADHMVLQPVADLFNHSP-----------DGCKVAFDDASFTITTTHPVEQG 215

Query: 290 EQVLIQYDLNKSNAELALDYGF 311
           +++ I+Y  + SN  L ++YGF
Sbjct: 216 DELFIRYG-SHSNDFLLVEYGF 236


>gi|407832777|gb|EKF98587.1| hypothetical protein TCSYLVIO_010514 [Trypanosoma cruzi]
          Length = 572

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 53/96 (55%), Gaps = 10/96 (10%)

Query: 216 PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRD 275
           PITL+ FLWA+  L +R F+    +   L+P  D  N++          E++GA +F  +
Sbjct: 247 PITLEHFLWAYNTLMTRGFA-YYSEVWSLMPWVDYFNYALNSNATMKYDELRGAYIF--E 303

Query: 276 LLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           +LF      P+++GEQ+ +QY    ++ EL L YGF
Sbjct: 304 VLF------PIESGEQIFLQYG-AYTDMELLLWYGF 332


>gi|356524495|ref|XP_003530864.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 472

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 102/442 (23%), Positives = 178/442 (40%), Gaps = 45/442 (10%)

Query: 47  TCSLHSASATTNPPTAQVETFWQWL-RDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNE- 104
           TC+  SAS     P   +    +W+ R+   V     I       GLGLVA+ +I +   
Sbjct: 18  TCASTSASRLVPQPPDLI----KWVTREGGFVHRAVKITQLDSSNGLGLVAKEEIPRGSD 73

Query: 105 -VVL--EVPMKFW-INPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDIL 160
            +VL   +P++F  +  D      +          + + L L++E+ K  S W  Y+  L
Sbjct: 74  LIVLPHHLPLRFTSLQQDPSLLHHLARQVPEELWAMKLGLKLLQERAKVGSFWWPYISNL 133

Query: 161 PECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFP---RPI 217
           PE     +F+S E++  L    +L        ++ +   +V+  ++   +   P   + +
Sbjct: 134 PETYTVPIFFSGEDIKNLHYAPILHQVNKRCRFLLDFEREVKRTLVSLTQDKHPFGGQEV 193

Query: 218 TLDDFLWAFGILRSRAFSRLRGQN---------LVLIPLADLINHSPGITTEDYAYEIKG 268
                 WA   + SRAF RL G+           +++PL D+ NHS        A  ++ 
Sbjct: 194 DASSLGWAMSAVSSRAF-RLYGEKDPNGIRIDIPMMLPLIDMCNHS----FNPNARIVQE 248

Query: 269 AGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF---------IESKSDR- 318
               +  +   +     +K  + +L+ Y    +N    LDYGF         IE K D  
Sbjct: 249 QDTSNSRMQVKVVAETAIKEDDPLLLCYGC-LNNDLFLLDYGFVMHSNPYDCIELKYDGA 307

Query: 319 --NAYTLTLEISE---SDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPAMLQYLRLVAL 373
             +A +    +S    S P    +L +++ N  GE+A   + LG            L  L
Sbjct: 308 LLDAASTAAGVSSPNFSAPVPWQELILSQLNLAGETADLKVSLGGQETVEARLLAALRVL 367

Query: 374 GGTDAFLLESIFRNTIWG-HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQ 432
             ++   ++    +T+     + P+  ANE  + R +   C  AL  F T I +DE LL+
Sbjct: 368 LSSNVETVQKYDLSTLQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLLK 427

Query: 433 GGNLDPRLRIAVGVRAGEKRVL 454
            G       +A+  R  +K V+
Sbjct: 428 QGA-SGSTELAIQYRIQKKSVI 448


>gi|431912319|gb|ELK14453.1| SET domain-containing protein 6, partial [Pteropus alecto]
          Length = 847

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 75/301 (24%), Positives = 125/301 (41%), Gaps = 50/301 (16%)

Query: 58  NPPTAQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWIN 116
           +P    V  F  W R   + +SPK  +       G G+VA+  +   E++  VP    ++
Sbjct: 408 DPDLDPVAGFLSWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAVLLS 467

Query: 117 PDTVAAS-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPE--CTDSTVF 169
             T + S     E G+L S    W+ + L L+ E +   SPW  Y  + PE    +  +F
Sbjct: 468 QHTCSISGLLERERGALQS-QSGWVPLLLALLHELQAPASPWTPYFALWPELGSLEHPMF 526

Query: 170 WSEEELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFP-------RPIT 218
           W EEE    LQGT       GV E V+ +   +  E   I+LP  +  P       R + 
Sbjct: 527 WPEEERRRLLQGT-------GVPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLE 579

Query: 219 LDDFLWAFGILRSRAFSRLRGQNL------------VLIPLADLINHSPGITTEDYAYEI 266
           L   L A  +  S+A      + L            +++P AD++NH   + + +   E 
Sbjct: 580 LYHQLVALVMAYSQALYGSFQEPLEEEDDEKEPNSPLMVPAADILNH---LASHNANLE- 635

Query: 267 KGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
                +S + L  + T  P+  G ++   Y    +N +L   YGF+E   +    T  ++
Sbjct: 636 -----YSPNYLRMVATQ-PIPKGHEIFNTYG-QMANWQLIHMYGFVEPYPNNTDDTADIQ 688

Query: 327 I 327
           +
Sbjct: 689 M 689


>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
 gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
          Length = 517

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 48/197 (24%), Positives = 83/197 (42%), Gaps = 24/197 (12%)

Query: 142 LIREKKKEDSPWRVYLDILPE-CTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
           +  + +   S W  YL+++P+  T   ++W   E  EL+GT      +G  +   +   +
Sbjct: 192 MYEKSRGRQSRWAPYLNLIPDDMTHMPLYWKHREFKELRGTAAYDKMMGKVQCPADAPTQ 251

Query: 201 VE---EEIILPNKQLFPR------PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLI 251
           V     E++ P  Q  P           D + WA   + S +F     +   ++P+ DL+
Sbjct: 252 VPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASYSFILGDDKYQAMVPVWDLL 311

Query: 252 NHSPGITTEDYAYEIKGAGLF---SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
           NH  G       +  K   L    +RD+L           GE+++  Y    SNAEL   
Sbjct: 312 NHITGRVNVRLHHCAKRHVLHMIATRDIL----------RGEELVNNYG-ELSNAELLRG 360

Query: 309 YGFIESKSDRNAYTLTL 325
           YGF+E+++  N   + L
Sbjct: 361 YGFVEARNRNNHVQVPL 377


>gi|302841288|ref|XP_002952189.1| hypothetical protein VOLCADRAFT_92848 [Volvox carteri f.
           nagariensis]
 gi|300262454|gb|EFJ46660.1| hypothetical protein VOLCADRAFT_92848 [Volvox carteri f.
           nagariensis]
          Length = 618

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 55/109 (50%), Gaps = 15/109 (13%)

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSR-LRGQNLVL-IPLADLINHSPGITTEDYAY 264
           L + +L  +P+   D  WA  ++R+R FS  + G+ L L +P AD+ NHS         +
Sbjct: 247 LRHLKLAAKPV---DTQWAMSMVRTRTFSEDVNGEGLTLMVPYADMANHS---------F 294

Query: 265 EIKGAGLFSRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
           +       +RD   F LR   P+  GE+  I Y  +K N E+  DYGF+
Sbjct: 295 QYNSTFCMARDNERFELRLLSPLGPGEEASICYGEDKPNFEVMRDYGFV 343


>gi|301122457|ref|XP_002908955.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099717|gb|EEY57769.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 423

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 81/186 (43%), Gaps = 27/186 (14%)

Query: 136 ISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV 194
           +   L L+ E  +K+ S +  Y+  LP      + W E +       ++L  T       
Sbjct: 88  VKTTLALLAELARKDTSDFHGYIQQLPTAISLPLSWDENQ------RKMLKDTTAFPILD 141

Query: 195 QNEYLKVEEEIILPNKQLFP-----RPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLAD 249
               LK+ E+  +P    FP        TL  F WA+ I+ SRAF    G    L+P+ D
Sbjct: 142 DKLVLKLYEDYAVPFANEFPVIWPTEVSTLKKFQWAYSIVSSRAFKVANGLEPTLLPVID 201

Query: 250 LINHSPGITTEDYAYEI--KGAGLFSRDLLFSLRTPVPVKAGEQVLIQY-DLNKSNAELA 306
           + NHS     E+ A  I    +G F    L +LR    V+  E V I Y DL  SNA+L 
Sbjct: 202 MANHS----AENPAAHIVKTESGSFQ---LVALRE---VEKKEPVTISYGDL--SNAQLL 249

Query: 307 LDYGFI 312
             YGF+
Sbjct: 250 CRYGFV 255


>gi|255726968|ref|XP_002548410.1| predicted protein [Candida tropicalis MYA-3404]
 gi|240134334|gb|EER33889.1| predicted protein [Candida tropicalis MYA-3404]
          Length = 498

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 61/128 (47%), Gaps = 7/128 (5%)

Query: 63  QVETFWQWLRDQKV-VSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           + E+F  WL    V VSPK  I       +G G++A  DI  NE +  +P    IN    
Sbjct: 18  KTESFLNWLISNGVKVSPKISIHDYRLSNQGRGIIANEDIEANEQLFTLPRSVLINVVNN 77

Query: 121 AASE----IGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELV 176
           +  E    +      +  W+S+ + L  E K   + W+ Y +I+P+  +  ++W  EEL 
Sbjct: 78  SLIEKYPNVQDNLKSMDQWLSLIIILSYEFKF-GNKWKNYFNIIPDEFNQLIYWKNEELK 136

Query: 177 ELQGTQLL 184
           +L+ + +L
Sbjct: 137 DLEPSCIL 144


>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
 gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
           SB210]
          Length = 726

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 93/215 (43%), Gaps = 19/215 (8%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPE-GL-GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
            F  WL+   VV       P  F + GL G  A +DIA     + +P K  I+ D    S
Sbjct: 37  NFNNWLKANGVVYDGVEY-PVAFGQYGLIGAAATKDIAPLTAFISIPNKIIISYDRARFS 95

Query: 124 EIGSLCSGLKPWIS-----------VALFLIREK-KKEDSPWRVYLDILPECTDSTVFWS 171
           E+ S     +   S           + +F + E+ K + S W  Y +IL E  ++ + W+
Sbjct: 96  ELKSFFKQSEDLFSEKENDEAGVNVLTVFFMYERLKGKKSLWHEYFEIL-ENNETILTWT 154

Query: 172 EEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI-TLDDFLWAFGILR 230
            EE+  +    +       KE V +E     +E++      F +   T + FLWA+ I+ 
Sbjct: 155 AEEINRIPDPYIQKQAREYKEQV-DELWDELKELLHSQPNFFQKATATKELFLWAYNIVM 213

Query: 231 SRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYE 265
           SR F   + +   ++P AD +NH+    T  Y ++
Sbjct: 214 SRCFGYTQ-KGTSIVPFADCLNHNKYHATYFYIHK 247


>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
           7435]
          Length = 846

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 5/100 (5%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI----GSLCSGLKPWISVALFLIRE 145
           EG G+VA RDI ++EV+  +     +N  T + + +    G +   L  W  + L L  E
Sbjct: 36  EGRGVVATRDINEDEVLFSISKDNVLNIQTSSLANVKDNNGVILKRLNHWEGLILCLAYE 95

Query: 146 KK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
           +   E S W+ YLD LP   ++ +FW+E++L  L+ + +L
Sbjct: 96  RSLGELSRWKGYLDTLPTSFNNLIFWNEQDLNSLKPSLIL 135


>gi|412992279|emb|CCO19992.1| predicted protein [Bathycoccus prasinos]
          Length = 640

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 67/276 (24%), Positives = 120/276 (43%), Gaps = 36/276 (13%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPW- 135
           VS  +P   ++  +  GL A  DI   E +LE+P +  ++  T+  SE G     L+   
Sbjct: 219 VSKVAPCLLSSMNDARGLCATEDIRAGENILEIPRRMLLDAGTICISEQGPFGDLLRILE 278

Query: 136 -----ISVALFLIRE----KKKEDSPWRVYLDILPECTD--STVFWSEEEL-VELQGTQL 183
                  + L++++E    K K+++ W +Y   LP+ +   + + W E+ + V L  T +
Sbjct: 279 RCGADTIMTLWIMKERMKMKTKQETFWSLYFLSLPDGSQKLTPLSWPEDIVRVGLGNTPI 338

Query: 184 LSTTLGVKEYVQNEY-------LKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSR 236
             T +  ++ V+N Y       L    E    N++ F    + D ++ A  +  S A + 
Sbjct: 339 FETVMHERQKVRNGYDALLPSLLANCPESFEGNQEEF---WSYDQYISALELWMSYAMTV 395

Query: 237 LRGQNL------VLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGE 290
               N       VL P+A   NH  GI    Y + +  + L   D          ++  E
Sbjct: 396 KPVHNSDSGTIDVLSPVAFFCNH--GI----YPHCVHYSQLRLSDECLVFPAMRDIEKNE 449

Query: 291 QVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLE 326
           ++++ Y   KSN EL L YGF    +  ++  +TL+
Sbjct: 450 EIMLSYGA-KSNGELLLFYGFCIDDNPYDSIDITLD 484


>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
 gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
          Length = 348

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 69/134 (51%), Gaps = 7/134 (5%)

Query: 323 LTLEISESDPFFGDKLDIAETNGLGESAYFDIVLGRTLPPA--MLQYLRLVALGGTDAFL 380
           L   ++  DP + DK  +A+ NG      F I +G+       +L YLRL  +  +D   
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYV--SDPSE 246

Query: 381 LESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRL 440
           ++S+  +   G +  PVS   E+ +   + D  K  L+G+ T++ EDE +L   NL+P+ 
Sbjct: 247 MQSVISSQ--GPI-CPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKK 303

Query: 441 RIAVGVRAGEKRVL 454
           R+A  +   EK++L
Sbjct: 304 RVATQLVRLEKKIL 317


>gi|213406233|ref|XP_002173888.1| SET domain-containing protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212001935|gb|EEB07595.1| SET domain-containing protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 431

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 78/336 (23%), Positives = 131/336 (38%), Gaps = 42/336 (12%)

Query: 61  TAQVETFWQWLRD-QKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           TA+++    +LRD  K ++  S   P    +G+GL ++ D++    + EVP K+ ++   
Sbjct: 6   TARLQLIEAFLRDFSKSLNVYSEFLPG---KGVGLYSKNDVSTGSTIFEVPRKYILDLQG 62

Query: 120 VA--ASEIGSLCSGLKPWISVALFLIR-------------EKKKEDSPWRVYLDILPECT 164
           V   A   GS    LK  +S     +                + + + W  Y   LP  T
Sbjct: 63  VYKFAEANGSFAVALKE-VSARSGEVELRLLFLLLYYFGASTQTDPTQWDSYFQSLPLTT 121

Query: 165 DSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLW 224
                W EEE  +LQGT L        + ++ E++  E   +L  KQ +    TL  +  
Sbjct: 122 TLPTEWIEEERNQLQGTSLWEPINAKLQVLKEEWM--EWRTLL--KQWWNIEPTLQGWTH 177

Query: 225 AFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPV 284
              + RSRA     G    ++P+ D+ NHS              A     +    L    
Sbjct: 178 VDALYRSRALEEPDGSE-AIVPVVDICNHSHN----------SNAKWRFGETSVELVATQ 226

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGFI---ESKSDRNAYTLTLEISESDPFFGDKLDIA 341
            + + E++L+ Y   K  AE    YGF+   E   D+ +    L   +++    D LD+ 
Sbjct: 227 HISSKEEILLCYGNTKGAAEYLFSYGFLCPEEHYEDKYSEIKLLIPRDTE----DALDVW 282

Query: 342 ETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTD 377
           +    G     D+V   +       +L L +L   D
Sbjct: 283 KRRLFGRPPLLDVVRDASGIYWYAPFLYLASLNAED 318


>gi|154290554|ref|XP_001545870.1| hypothetical protein BC1G_15621 [Botryotinia fuckeliana B05.10]
          Length = 336

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 114/269 (42%), Gaps = 57/269 (21%)

Query: 104 EVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISV-ALFLIREKKKEDSPWRVYLDILPE 162
           +++++VP+K         A+++ S  + L P I+V ALF +         W   +  LP 
Sbjct: 4   DILVQVPIKV-----LRKATDVPSQFAALAPDIAVHALFALSLDSLLGPEW---IATLPS 55

Query: 163 CTDST----VFW--SEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP 216
             D      +FW  S +EL+      LL T +   E + + +  +         + FP P
Sbjct: 56  KQDMHSSMPLFWDISLQELLPYSSRALLMTQM---ENITSAWTAI--------CKTFPEP 104

Query: 217 -ITLDDFLWAFGILRSRAFSRL------------RGQNLVLIPLADLINHSPGITTEDYA 263
            IT D+F++ + I+ SR F  L            +   L L P AD INHS   T +   
Sbjct: 105 PITYDEFIYNYSIVNSRTFYYLSPTIKPSKPQPSKENRLALNPFADYINHSSEPTVDA-- 162

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
             +  AG       ++L    P+K G +V I Y  + +N  L ++YGFI   +  +  TL
Sbjct: 163 -TLSRAG-------YTLTASQPIKQGSEVHISYG-SHNNDFLLVEYGFILEDNRWDEVTL 213

Query: 324 TLEISESDPFFGDKLDIAETNGLGESAYF 352
                  DP+    L + +   L E+ + 
Sbjct: 214 -------DPWITPLLSVEQKEHLEETGFL 235


>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 691

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 87/197 (44%), Gaps = 30/197 (15%)

Query: 153 WRVYLDILPECTDST--VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           W  Y+  LP+  D T  +++   +L  L+GT L       +  ++ +Y    EE+    K
Sbjct: 129 WYPYICTLPQPGDLTTPLYYEGADLRWLEGTSLAPAREQKESLLKEKYQSTFEEL---RK 185

Query: 211 QLF--PRPITLDDFLWAFGILRSRAFS-----------RLRGQNL-VLIPLADLINHSPG 256
             F      T + +LWA  I  SRAFS            L  +N+ VL+P  D++NH P 
Sbjct: 186 SGFGDAEKYTWELYLWASTIFVSRAFSAKVLAGVVPHAELPEENVSVLLPFIDVLNHRPL 245

Query: 257 ITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS 316
              E  A E        RD+LF +     V AGE+V   Y   ++N +L ++YGF    +
Sbjct: 246 AKVEWRAGE--------RDVLFVVLE--HVAAGEEVANNYG-PRNNEQLMMNYGFCLQNN 294

Query: 317 DRNAYTLTLEISESDPF 333
             +  TL+L      P 
Sbjct: 295 PCDYRTLSLRAPPGSPL 311


>gi|242009061|ref|XP_002425311.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212509085|gb|EEB12573.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 399

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 91/194 (46%), Gaps = 31/194 (15%)

Query: 134 PWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEEE----------LVELQGTQ 182
           P + +++FL+ E   K DS +  Y+  LP+   +  F ++ E          LV  Q T 
Sbjct: 91  PQLRLSIFLMYENHLKNDSKYFNYIQTLPQSYSNVYFCTDSEIQLLPDLIKKLVVTQKTD 150

Query: 183 LLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF----SRLR 238
           L      ++  + +E     ++ I   K+L+ R     +F+WA+  + +R+       +R
Sbjct: 151 LEFLFEKLQNNLNDEICSHCDKSI---KKLYNRY----EFIWAWFTVNTRSVYYEDKSMR 203

Query: 239 GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDL 298
            ++L L P  D+ NHS    T+ Y         F  + L+ L+T    +  +Q+ I+Y  
Sbjct: 204 KKSLALAPFLDMFNHSSDANTKMYID-------FDNE-LYILKTLNSFRKHQQIFIKYGP 255

Query: 299 NKSNAELALDYGFI 312
           + SN +L ++YGFI
Sbjct: 256 H-SNLKLLIEYGFI 268


>gi|321257099|ref|XP_003193469.1| nucleus protein [Cryptococcus gattii WM276]
 gi|317459939|gb|ADV21682.1| nucleus protein, putative [Cryptococcus gattii WM276]
          Length = 491

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 72/261 (27%), Positives = 109/261 (41%), Gaps = 50/261 (19%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINP------DTVAASEIGSLCSGLKPWISVALFLIR 144
           G G VA +DI +   +  V     ++P      D + ASE   L  G   W  + L ++ 
Sbjct: 41  GYGAVAVKDIEEGTPLFHVTDNLILSPYTSDLKDHLDASEWDQLNKG---WAQLILVMMW 97

Query: 145 EK-KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE 203
           E  K   S W  YL  +P   ++ +FW+E++  +L GT  ++  +G +E  + EY  +  
Sbjct: 98  ETIKGSKSRWAGYLTNMPVMFETPMFWTEQQRDQLSGTD-IADRIG-REDAEAEYTSLLA 155

Query: 204 EIILPNKQLFP--RPITLDDFLWAFGILRSRAFSRL---------RGQN----------- 241
             I  +  LFP   P T  D   AF I  SR  SR          R Q+           
Sbjct: 156 PFIKAHPDLFPVDSPHTTID---AFHIQGSRILSRSFTVPLHRFGRSQSQSQSDGNETES 212

Query: 242 --------LVLIPLADLINHSPGITTED-YAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                   +V+IP AD++N + G      Y  E    G     ++ S R    VK  EQ+
Sbjct: 213 DDEEEEEVVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTRL---VKQSEQI 269

Query: 293 LIQYDLNKSNAELALDYGFIE 313
              YD +  N+EL   YG ++
Sbjct: 270 YNTYD-SPPNSELLRKYGHVD 289


>gi|115402591|ref|XP_001217372.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114189218|gb|EAU30918.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 309

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 94/230 (40%), Gaps = 39/230 (16%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           P    E F +W   Q +    + + PA FP  GLG++A RDI ++E ++ VP+K  +  +
Sbjct: 7   PGEDHERFTEWALSQGIEV--NGVSPARFPGRGLGMIATRDIEEDEAIVTVPLKAMLTIE 64

Query: 119 TVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD----STVFWSEEE 174
            V AS      +G      VA FL     ++ SP+ ++    P   D      + W    
Sbjct: 65  RVPASFTSRFPAGTPVHAIVAAFLTNGNAEDLSPYELWRRTWPSRQDFEDSMPILWPSSL 124

Query: 175 LV---ELQGTQLLSTTL-GVKEYVQNEYLKVEEEIILPNKQLFPR--------------- 215
            V   +  G  +L  ++ G    +  +  K+E E   P++ L P+               
Sbjct: 125 RVPGGQASGATVLPPSISGAWNTITKQ--KLEHEYETPHQNLLPQQEKRLQQAWDSVVAV 182

Query: 216 -PIT-LDDFLWAFGILRSRAFSRL---------RGQNLVLIPLADLINHS 254
            P T  + F + + I+ +R F  L             + ++P AD  NHS
Sbjct: 183 FPDTDWESFSYHWLIVNTRCFYYLMPGEEPPEDSNDAMAMLPFADYFNHS 232


>gi|18041979|gb|AAL57769.1|AF388528_1 hypothetical protein RDA279 [Rattus norvegicus]
          Length = 328

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 138/341 (40%), Gaps = 41/341 (12%)

Query: 141 FLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
           FL+ E+     S W+ YLDILP+     V   E E+V+L    L +     +  VQ+ + 
Sbjct: 1   FLVSERHAGSHSLWKSYLDILPKSYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFA 59

Query: 200 KVEEEIILPNKQLFPRPI----TLDDFLWAFGILRSRA-FSRLRGQNLV--------LIP 246
               +     + LF   +    +   FLWA+  + +RA + + R Q  +        L P
Sbjct: 60  S-SRDFFSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAP 118

Query: 247 LADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELA 306
             DL+NHSP +  +        A    +   + +RT    +  ++  I Y  +  N  L 
Sbjct: 119 FLDLLNHSPHVQVK--------AAFNEKTRCYEIRTASRCRKHQEAFICYGPH-DNQRLL 169

Query: 307 LDYGFIESKSDRNAYTLTLEI-----SESDPFFGDKLDIAETNGLGESAYFDIVLGRTLP 361
           L+YGF+   +      ++ E+       +D     KL I E +G       ++  G   P
Sbjct: 170 LEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFTG----NLTFGWDGP 225

Query: 362 PAMLQYLRLVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFH 421
              L     +     + F   + ++  + G +   +S  NE+    V +  C   ++  H
Sbjct: 226 SWRLLTALKLLCLEAERF---TSWKKVLIGDV---ISDRNEKASLGVAQKICSDFIAEIH 279

Query: 422 TTIEEDEQLLQ-GGNLDPRLRIAVGVRAGEKRVLQQIDEIF 461
           + + +   + +  G+L  +L +   +R  E R+LQ   EI 
Sbjct: 280 SELRKVSDMKERKGSLISQLCLVEALRMEELRILQASAEIL 320


>gi|390596436|gb|EIN05838.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 398

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 107/267 (40%), Gaps = 53/267 (19%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDT---------------VAASEIGSLCSGLKP- 134
           GL ++A  DI  +E V+  P    +  +                +A+  +    +GL   
Sbjct: 6   GLSVIAAEDIPADESVVSAPFSLAVTHEVAKNALEDISRAYNSHLASDNLKRALAGLSER 65

Query: 135 ---WISVALFLIREKKKEDS--PWRVYLDILPECTDST--VFWSEEELVELQGTQLLSTT 187
               I + L    E + + +    + YLD LPE +  T  + ++E+EL   +GT L   T
Sbjct: 66  QIICIYICLHWNEELRAQTTILGHQSYLDTLPEPSQLTTPLHFTEDELAAFKGTNLYGAT 125

Query: 188 LGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLW-----AFGILRSRAF-SRLRGQN 241
           L  +   + E++  +E + L N      P   D+F W     A   L SRAF S L   N
Sbjct: 126 LDRERQWKTEWVGCKEVVSLLN------PKWTDEFTWTRYLTASTYLSSRAFPSTLLSPN 179

Query: 242 ----------LVLIPLADLINHSPG------ITTEDYAYEIKGA-GLFSRDLLFSLRTPV 284
                      VL+P  D +NH+ G      I +     ++  A    S DL  SL    
Sbjct: 180 PTLQSSPSSYPVLLPGIDALNHARGAPVSWVIKSRSAQSQVNAAPDTGSSDLSISLVLHS 239

Query: 285 PVKAGEQVLIQYDLNKSNAELALDYGF 311
               G ++   Y   K N+EL L YGF
Sbjct: 240 ATPKGHELFNNYG-PKPNSELILGYGF 265


>gi|145511243|ref|XP_001441549.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408799|emb|CAK74152.1| unnamed protein product [Paramecium tetraurelia]
          Length = 731

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 76/189 (40%), Gaps = 23/189 (12%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS------------VAL 140
           GLVA   I  NE ++ VP    +       S +  +      + S            +A 
Sbjct: 64  GLVASEKILSNETLVSVPRDLLLTTRHAFESPLKQMFLDHPQYFSNQFYPSWEDHQLMAF 123

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEY----VQN 196
            L   ++  +S W + +  LP   D  VFW+ EE   L   QL+   L  K+Y    ++ 
Sbjct: 124 ILYEYQRGPESEWHLLISNLPRDIDYLVFWNPEEQELLDDQQLVK--LARKQYQEFVIEY 181

Query: 197 EYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP 255
           E LK    I     QLF P  +T ++  W +  L +R F +     + ++P  +L NH  
Sbjct: 182 ETLKC---ITDKYPQLFKPETVTFENARWVYTHLVTRCFGKYLAY-VTMVPFCELFNHEC 237

Query: 256 GITTEDYAY 264
                D+ Y
Sbjct: 238 TDVFYDFEY 246


>gi|320588047|gb|EFX00522.1| set domain containing protein [Grosmannia clavigera kw1407]
          Length = 508

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 35/52 (67%), Gaps = 1/52 (1%)

Query: 134 PWISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
           PW S+ L +I E  + D SPW+ YLD+LP   D+ +FW+E EL ELQ + ++
Sbjct: 67  PWTSLILIMIFEFLRGDKSPWKSYLDVLPATFDTPMFWTENELRELQASPVV 118


>gi|164662537|ref|XP_001732390.1| hypothetical protein MGL_0165 [Malassezia globosa CBS 7966]
 gi|159106293|gb|EDP45176.1| hypothetical protein MGL_0165 [Malassezia globosa CBS 7966]
          Length = 610

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 65/149 (43%), Gaps = 31/149 (20%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKP--------------WI 136
           G GLVA R IA+NE +  +P    +N DT A ++        KP              W 
Sbjct: 33  GRGLVATRPIAENERLFTIPRSMLMNLDTSALAKTCQAAEKAKPPSSGCSWLDIMNRGWC 92

Query: 137 SVALFLIREKKKEDS-----------PWRVYLDILPECTDSTVFWSEEELVELQGTQLLS 185
            + L L+ E  +  S            W  Y  I+P+   + +FW+ ++L EL+GT  + 
Sbjct: 93  PIILMLMWEHWRASSLQTNKQDMASISWGPYFGIMPKEFSTPMFWNSDQLEELKGTD-VE 151

Query: 186 TTLGVKEYVQNEYLKVEEEIILPNKQLFP 214
             +G  E  +++Y     E +LP  Q +P
Sbjct: 152 DKIGRAE-AESDY----HECVLPYIQQYP 175


>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 306

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 48/181 (26%), Positives = 82/181 (45%), Gaps = 9/181 (4%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI---GSLCSGLKPWISVALFLIREKK 147
           GLG+ A +D+ + + + E+P    ++      ++I     +  GL   I++   L   K 
Sbjct: 1   GLGVFAVQDLCEGQRLCEIPKTAVLSVQNTGIADILEQHRIRGGLGLIIAIMYELSIGK- 59

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEY-LKVEEEII 206
             +S W  YL+ L +     +FW+E+E   LQGT+        +E  Q ++   V   + 
Sbjct: 60  --ESFWHGYLEELHKREYLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFETHVPPLVE 117

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGIT--TEDYAY 264
               +L     TL+ F  A   + SRAF       + ++PLAD+ NH   I   ++DY  
Sbjct: 118 QHADRLRADSFTLESFRVAASWVASRAFGVDSFHGMSMVPLADIFNHKAAIVQFSDDYVV 177

Query: 265 E 265
           E
Sbjct: 178 E 178


>gi|403416334|emb|CCM03034.1| predicted protein [Fibroporia radiculosa]
          Length = 571

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 75/324 (23%), Positives = 129/324 (39%), Gaps = 49/324 (15%)

Query: 85  PATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGL-----KPWISVA 139
           P T P G+ + +  DI  +  ++ +P    ++  +   ++     SGL        +S+A
Sbjct: 50  PDTRP-GIAVFSHDDIPSDTTLMTIPRSALLSVRSCTIAQHIPPISGLDDDEMSAKLSLA 108

Query: 140 LFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEELV----ELQGTQLLSTTLGVKEYV 194
           L L  E  +  DS W  YL  LP+     +FW   ++       + ++ +  T   KE  
Sbjct: 109 LALFSELVRGRDSRWFGYLQALPKTVPLALFWGTRDVFPDDDSREASEWIVGTEVEKEVR 168

Query: 195 QNEYLK----VEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADL 250
              YL       E +  P       P+T + FL A+ ++ SRA        L ++P+AD 
Sbjct: 169 SRYYLDQIATYYETVASPVLSKLEIPVTPEYFLHAYALVSSRALRADAYHGLCMVPIADA 228

Query: 251 INHSPGITT----EDYAYEIKGAGLFSRDLLFSLRTPVPVK------------------- 287
            NH+  + +    E  ++     G    D     R P   +                   
Sbjct: 229 FNHASVMNSHARFEVISHVCPTCG-SQNDCSHGSRRPPESQLSAYLQKNDACAVIGAFEA 287

Query: 288 AGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLG 347
             +++L+ Y L KSNA L + +GF    ++ N  T++ + SE      D L   + +G  
Sbjct: 288 NSQEILMCYGLAKSNASLLVHHGFALEFNEDN--TISWDASE----LKDDLLAIQASGQK 341

Query: 348 ESAYFDIVLGRTLPPAMLQYLRLV 371
             A   I+ G +L      +LRLV
Sbjct: 342 RDATNTIMSGISLD----SFLRLV 361


>gi|159476254|ref|XP_001696226.1| hypothetical protein CHLREDRAFT_167112 [Chlamydomonas reinhardtii]
 gi|158282451|gb|EDP08203.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 548

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 54/115 (46%), Gaps = 11/115 (9%)

Query: 153 WRVYLDILPECTDST--VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP-- 208
           WR Y  +LP  T  T   FW EEEL +LQ   L S     +  V + Y +    I  P  
Sbjct: 221 WRRYRALLPSATQQTSLTFWREEELQQLQDASLASEAAAWQATVLSAYRRF---IDAPEF 277

Query: 209 NKQLFPRPITLDDFLWAFGILRSRAF---SRLRGQNL-VLIPLADLINHSPGITT 259
             ++ P  +TL ++L A G + SRAF   S   G+ L   +P   L N+ PG  T
Sbjct: 278 RTEVGPGAVTLREWLEAVGAVESRAFGFKSDADGRELHAYVPFFCLANYRPGAPT 332


>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 450

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 70/135 (51%), Gaps = 11/135 (8%)

Query: 63  QVETFWQWLRDQKVVSPKSPIRPATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           +++ F  W++   V S    IRP++  E  G GL A +D A+  V++  P+   I P TV
Sbjct: 14  RLQRFRDWMQINGVQSRFCEIRPSSNGENAGFGLFATKDNAQG-VLMVTPLLLAITPMTV 72

Query: 121 AAS-EIGS-LCSGLKPW-----ISVALFLIREKKKED-SPWRVYLDILPECTDSTVFWSE 172
               E+G   C  ++       + + LFL+ E+ +   S W  YL+ILP    + + +SE
Sbjct: 73  LQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFGTPLSFSE 132

Query: 173 EELVELQGTQLLSTT 187
           EEL EL+GT L   T
Sbjct: 133 EELSELKGTHLFQAT 147


>gi|68467835|ref|XP_722076.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|68468152|ref|XP_721915.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|46443858|gb|EAL03137.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|46444024|gb|EAL03302.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
          Length = 433

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 70/304 (23%), Positives = 114/304 (37%), Gaps = 70/304 (23%)

Query: 63  QVETFWQWLR---DQKVVSPKSPIRPA-----TFPEGLGLVAQRDIAKNEVVLEVPMKFW 114
           ++ET   W++   D+K +S  + I P          G G+ A   + K E++L +P  F 
Sbjct: 8   KLETLLHWVKNKDDEKKISNHTYISPKIDVKDVRSSGRGIYAVEPLKKGELILNIPHSFL 67

Query: 115 INPDTVAASEIGSLCSGLKPWISV------------------------------ALFLIR 144
           +N  TV A         +   I V                              +L+L  
Sbjct: 68  LNFTTVMAHIAKYNGMAIDSHIHVPFDKSEDEYTEIYRTLTKEEILELSSFQLLSLYLTF 127

Query: 145 EKKK-EDSPWRVYLDILPECTDSTVF---WSEEELVELQGTQLLSTTLGVKEYVQNEYLK 200
           E+K+   S W+ +LD+LP   D  +    W +E    L  +  +     V+    N+Y  
Sbjct: 128 ERKRSHKSFWKPFLDMLPSMDDFELMPIDWPQEVCTLLPSSTEVRNK-KVRSRFDNDYQV 186

Query: 201 VEEEIILPNKQ------LFPRPITLDDFLWAFGILRSRAF------SRLRGQNLVLIPLA 248
           + E I     +      L PR     + L ++  + SR        S+    N  + P  
Sbjct: 187 ICELIKTKIDKDGDVTTLLPR----QEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYV 242

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALD 308
           D +NHS     +    +I G G       F +RT      G+QV + Y  + SN  L  +
Sbjct: 243 DFMNHS---CDDHCTLKIDGKG-------FQVRTTSQYNTGDQVYLSYGPH-SNDFLLCE 291

Query: 309 YGFI 312
           YGF+
Sbjct: 292 YGFV 295


>gi|302307608|ref|NP_984333.2| ADR237Cp [Ashbya gossypii ATCC 10895]
 gi|299789080|gb|AAS52157.2| ADR237Cp [Ashbya gossypii ATCC 10895]
 gi|374107548|gb|AEY96456.1| FADR237Cp [Ashbya gossypii FDAG1]
          Length = 574

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 119/282 (42%), Gaps = 56/282 (19%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCS-GLKPWISVALFLIREKKKEDSP 152
            V  R   +++ ++ +P K  I  + +A   +G L S  L P     LFL +     D+ 
Sbjct: 37  FVKSRIEGRSQPLISIPKKLLITRE-MAIKSLGELGSKNLGPNALTQLFLAKLMFATDTG 95

Query: 153 ---------WRVYLDILPECTD--STVFWSEEELVELQGTQL-LSTTLGVKEYVQNEYLK 200
                    ++ YL +LP   +  +  FW+  EL+ L+G  + L     +++ V NE+ +
Sbjct: 96  PKDAELQQFFKPYLSVLPTHKEMHTPYFWTNSELLLLRGMDIYLKAKKNLRQLV-NEWHE 154

Query: 201 VEEEIILPNKQLF--------------------PRPITLD--DF---LWAFGILRSRAFS 235
           +     L N   F                      P T D  DF   LWA  I  SRAF 
Sbjct: 155 LVTAGELRNDTKFYDLFNSSENFDAGEYISNQLADPTTTDWTDFPAYLWASSIFSSRAFP 214

Query: 236 RL------RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
            L            L P+ DL+NHS G T   ++Y  + A +       +  T   ++ G
Sbjct: 215 TLILGTTTDLNEAFLNPIIDLLNHSAG-TNVTWSYNEQVAAV-------TFSTAQTLETG 266

Query: 290 EQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI-SES 330
           +++   Y  +KSN EL L+YGF+   ++ +  TL   I SES
Sbjct: 267 DELYNNYG-DKSNDELLLNYGFVLPNNEHDKSTLCFRIPSES 307


>gi|57335971|emb|CAH25387.1| putative ribulose-1,5 bisphosphate carboxylase/ oxygenase
           N-methyltransferase [Guillardia theta]
          Length = 122

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%)

Query: 391 GHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAGE 450
           G ++LPVS  NE+L+   +    + AL    ++  ED  +++  +   R   A  VR GE
Sbjct: 3   GFVNLPVSKDNEKLMLETLIATFEGALDDIGSSESEDMSIVRDASSTYRQVQAAYVRIGE 62

Query: 451 KRVLQQIDEIFKERELELDELEYYQERRLKDLGL 484
           +  L++   + ++   E+D  EYYQERRLK L L
Sbjct: 63  RSALKKTIYLLEQEMEEMDSKEYYQERRLKSLNL 96


>gi|388578758|gb|EIM19096.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 413

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 145/335 (43%), Gaps = 40/335 (11%)

Query: 61  TAQVETFWQWLRDQKV-VSPKSPIRPATFPEG-LGLVAQRDIAKNEVVLEVPMKFWINPD 118
           T+ +++F  WL++ K  + P   I  +  PEG    +A+ DI ++  ++ V     I   
Sbjct: 4   TSTIDSFKSWLKENKCDIHPNLDIIYS--PEGGYKWLARNDIPEDAEIVSVNKNICITES 61

Query: 119 TVAASEIGSLCSGLKPWISVALFL--------IREKKKEDSPWRVYLDILPEC--TDSTV 168
           T   +       GL   + +A+++        + E  K     R Y+DILPE   T +T+
Sbjct: 62  TCKEAFKNLNNEGLPEKLLIAVYISLHYIYDQLPESLKSKLHHRRYVDILPEIGQTLTTL 121

Query: 169 FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGI 228
           +W+++EL   + T L + T   +   +++Y  V++     + ++F    T D F  +  +
Sbjct: 122 YWTDDELEYTKPTSLFNATKEREIQWKSDYEVVKKWSRANDVEVF----TWDVFKHSLTM 177

Query: 229 LRSRAF-SRLRGQNLV----LIPLADLINH--SPGITTEDYAYEIKGAGLFSRDLLFSLR 281
           + SRAF S+L   + +    L+PL D+ NH     I   D  Y     G  +      ++
Sbjct: 178 ISSRAFPSKLIQDDEISSPMLVPLWDIGNHKSQSAIVWTDVKY----TGTDN----IGMK 229

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIA 341
            P   +   +V   Y    +N EL L YGF  +  + N   +   I         K DI 
Sbjct: 230 LPQGAQKDNEVFNNYGGKPTN-ELLLAYGF--AVDNINYDVVPFRIGAGVSLSESKKDIL 286

Query: 342 ETNGLGES----AYFDIVLGRTLPPAMLQYLRLVA 372
           + +GL         F+I L   LP  +   +RL+A
Sbjct: 287 KKHGLLNEDCTLKTFNINLNEGLPLGLKFLMRLLA 321


>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
           GS115]
 gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
           GS115]
          Length = 538

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 54/100 (54%), Gaps = 5/100 (5%)

Query: 90  EGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI----GSLCSGLKPWISVALFLIRE 145
           EG G+VA RDI ++EV+  +     +N  T + + +    G +   L  W  + L L  E
Sbjct: 36  EGRGVVATRDINEDEVLFSISKDNVLNIQTSSLANVKDNNGVILKRLNHWEGLILCLAYE 95

Query: 146 KK-KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLL 184
           +   E S W+ YLD LP   ++ +FW+E++L  L+ + +L
Sbjct: 96  RSLGELSRWKGYLDTLPTSFNNLIFWNEQDLNSLKPSLIL 135


>gi|402908594|ref|XP_003917022.1| PREDICTED: N-lysine methyltransferase SETD6 [Papio anubis]
          Length = 456

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 121/289 (41%), Gaps = 44/289 (15%)

Query: 64  VETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAA 122
           V  F  W R   + +SPK  +       G G+VA+  +   E++  VP    ++  T + 
Sbjct: 22  VAGFLNWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALLSQHTCSI 81

Query: 123 S-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEE- 174
                 E G+L S    W+ + L L+ E +   S WR Y  + PE    +  +FW EEE 
Sbjct: 82  GGLLERERGALQSQ-SGWVPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEER 140

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEE---IILPNKQLFP-----RPITLDDFLWAF 226
              LQGT       GV E V+ +   +  E   I+LP  +  P     R  +L+ +    
Sbjct: 141 RCLLQGT-------GVPEAVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLV 193

Query: 227 GILRSRAFSRLRG--------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLF 278
            ++ + +F              + V++P AD++NH   +   +   E      +S + L 
Sbjct: 194 ALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNH---LANHNANLE------YSANCLR 244

Query: 279 SLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
            + T  P+  G ++   Y    +N +L   YGF+E   D    T  +++
Sbjct: 245 MVATQ-PIPKGHEIFNTYG-QMANWQLIHMYGFVEPYPDNTDDTADIQM 291


>gi|428164251|gb|EKX33284.1| hypothetical protein GUITHDRAFT_166511 [Guillardia theta CCMP2712]
          Length = 294

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 62/125 (49%), Gaps = 17/125 (13%)

Query: 136 ISVALFLIREKKK-EDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYV 194
           + +AL LI EK++ E+S W  Y+ ILP   D  + +S++EL ELQ    +      ++Y 
Sbjct: 168 LRMALRLIYEKRRGEESKWYQYIQILPTNFDVPLLFSQDELKELQDPLFIHEVEIEQKYF 227

Query: 195 QNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS------RLRGQNLVLIPLA 248
           + E  ++ +          P P + ++  WA     SR F+      R  GQ   + P+A
Sbjct: 228 EYERRRMAD--------FMPLPPSKEELGWALACAGSRTFTADFGDGRPVGQ--CMCPIA 277

Query: 249 DLINH 253
           D++NH
Sbjct: 278 DMVNH 282


>gi|428182191|gb|EKX51052.1| hypothetical protein GUITHDRAFT_134587 [Guillardia theta CCMP2712]
          Length = 365

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/242 (23%), Positives = 99/242 (40%), Gaps = 32/242 (13%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAAS--EIGSLCSGLKPWISVALFLIREKKKED 150
           G+ A + I + EV++ VP     + +T   +  ++      ++    + L L+    K  
Sbjct: 19  GMYATKKIDRGEVIVSVPPSLLFSYETAGGALKDVWKRTKDMQELDRLTLLLLYFSSKVR 78

Query: 151 SPWRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILP 208
           S W  +L  +P   +    V WS ++L E    +  S+     E  ++ Y ++    + P
Sbjct: 79  SRWDFFLCGIPGMNELGPAVLWSPKKLNETCEREEYSSLCSFVENRRSMYKRLWRTEVAP 138

Query: 209 NKQLFPRPITLDD-----FLWAFGILRSRAFSRLRGQ--------------NLVLIPLAD 249
             + FP   +  D     +LWA   + SR +   R +                V+ P+A+
Sbjct: 139 LPRKFPHIFSQQDTGYSNYLWAIAAVLSRMWLMRRFEEPEFYPNGTWIGPAKWVMAPVAE 198

Query: 250 LINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDY 309
           L+NH P      +  +        R     + + V  + GEQV + Y  NK N EL L+Y
Sbjct: 199 LLNHKPRAGHIRWGSQ--------RRPHLEVVSDVSYRPGEQVFVSYG-NKCNLELLLEY 249

Query: 310 GF 311
           GF
Sbjct: 250 GF 251


>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 503

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/181 (27%), Positives = 78/181 (43%), Gaps = 21/181 (11%)

Query: 69  QWLRDQKVVSPKSPIRPATFPE--GLGLVAQRDIAKNEVVLEVPMKFWIN---------- 116
           +W + +     +S +    FP   G G VA  DI +   +  +P    ++          
Sbjct: 6   EWFKRRGGSIDESAMGFTVFPADGGRGAVALGDIPEGHALFTLPRSLTLSTRTSSLPLRI 65

Query: 117 PDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELV 176
           P+     E+G      K W+ + L ++ E+ + DS W  YL  LP+   + +FWSEEEL 
Sbjct: 66  PEEWKKYELG------KGWVGLILCMMWEEAQPDSLWAEYLASLPDVFSTPMFWSEEELS 119

Query: 177 ELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFS 235
           EL  T ++   +G K+  + EY       +     +F P  I     L  F I+ SR  S
Sbjct: 120 ELNATAIID-KIG-KDEAEKEYHNKLLPTVRSRPDIFQPEAIESTYSLRRFHIMGSRILS 177

Query: 236 R 236
           R
Sbjct: 178 R 178


>gi|448082040|ref|XP_004195035.1| Piso0_005574 [Millerozyma farinosa CBS 7064]
 gi|359376457|emb|CCE87039.1| Piso0_005574 [Millerozyma farinosa CBS 7064]
          Length = 504

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 13/126 (10%)

Query: 67  FWQWLRDQKV-VSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMK--FWINPDTVAA 122
           F +WLR+  V +SPK  I+   +  +G GLVA  DI + E +  +P      I  +T+ +
Sbjct: 13  FIEWLRNDGVEISPKVKIQDLRSQNQGRGLVATEDIEEEETLFTLPESTILSIKNNTLLS 72

Query: 123 SE--IGSLCSGLKPWISVALFLIREKKK-EDSPWRVYLDILP------ECTDSTVFWSEE 173
            +  +      L  W S+ + L+ E     DS W+ Y ++LP        +D  +FWSEE
Sbjct: 73  QKPHLRESLEALSSWESLIIILLYEVYALPDSKWKHYFEVLPIRDSENYKSDQLMFWSEE 132

Query: 174 ELVELQ 179
           EL  L+
Sbjct: 133 ELAHLK 138


>gi|238505934|ref|XP_002384169.1| SET domain-containing protein, putative [Aspergillus flavus
           NRRL3357]
 gi|220690283|gb|EED46633.1| SET domain-containing protein, putative [Aspergillus flavus
           NRRL3357]
          Length = 418

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 114/277 (41%), Gaps = 41/277 (14%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           P  + E F +W   Q +++  + + PA FP  GLG++A R+I ++E ++ VP+K  +  +
Sbjct: 7   PGEEHEQFTEWAISQGIIA--NGVGPARFPGRGLGMIATRNIEEDEAIVTVPLKAMLTSE 64

Query: 119 TVAASEIGSLCSGLKPWISVALFLIR---EKKKEDSPWRVYLDILPECTDST-VFWSEEE 174
            + +        G       A FL     E  +E + WR       +  DS  + WS E 
Sbjct: 65  RIPSYFTSKFPDGTPTHALYAAFLTNGNAEDLEEFNAWRKTWPSRQDFEDSMPILWS-ES 123

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-----------LFPRPITLDDFL 223
           L       + S    ++   + +Y    + ++   +Q           +FP     + F 
Sbjct: 124 LRNYLPPSISSHWHSIQSRDKLQYETTHQNLLAQQEQRLRTAWDIVVSIFP-DTDWETFS 182

Query: 224 WAFGILRSRAFSRL---------RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR 274
           + + I+ +R+F  L         R   + L+P AD  NHS      D A  +K  G    
Sbjct: 183 YHWLIVNTRSFFYLMPGQEPPEDRNDAMALLPFADYFNHS------DVACNVKFDG---E 233

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           + +F  R       GE++ + Y  +  N  L  +YGF
Sbjct: 234 NYVF--RATKHYDEGEEIYMSYGPH-PNDFLFAEYGF 267


>gi|115387935|ref|XP_001211473.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114195557|gb|EAU37257.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 412

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 63/236 (26%), Positives = 99/236 (41%), Gaps = 49/236 (20%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           P  + E F QW     V    + I PA FP  G G++A R I +NE +L VP K  +N D
Sbjct: 5   PGLEHEAFTQWALSNGV--QINGIAPARFPGRGTGMIATRTIEENETILMVPPKSMVNID 62

Query: 119 TVAASEIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECTD----STVFWS--- 171
           T+  S +  L +G      +A +L  E  K+   WR   ++ P+  D      + W    
Sbjct: 63  TIPESFVRRLPAGASKHAIIAAYLTLEAPKDLDAWR---NVWPQWADFEESMPILWPAHL 119

Query: 172 -------EEELVELQGTQLLSTTLG---------------VKEYVQNEYLKVEEEIILPN 209
                  +EE +   G ++L                    VKEY Q+   + EE +    
Sbjct: 120 RRSDSKYQEEHIS-PGPKILPPAASGLWNTFSNDPLEDDPVKEY-QDLLAQQEERLTCAW 177

Query: 210 KQL---FPRPITLDDFLWAFGILRSRAFSRLR-GQN-------LVLIPLADLINHS 254
             +   FP     + F++ + I+ SR+F  +  G++       L L+P AD  NH+
Sbjct: 178 DNVVAAFPN-TDRETFVYHWLIVNSRSFYYVSPGKDVGEWVDALTLVPFADYFNHA 232


>gi|149237793|ref|XP_001524773.1| hypothetical protein LELG_03805 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146451370|gb|EDK45626.1| hypothetical protein LELG_03805 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 572

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 66/127 (51%), Gaps = 8/127 (6%)

Query: 66  TFWQWLRDQKV-VSPKSPIRPAT-FPEGLGLVAQRDIAKNEVVLEVPMKFWIN--PDTVA 121
            + +WL+     +S K  I   T   +G G++A  +I K+E++  +P    +N   +++ 
Sbjct: 17  NYLEWLKQNNYDISDKVAIHDYTSVKQGRGVIALANIDKDEIIATIPKLALLNVIQNSLV 76

Query: 122 AS--EIGSLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECT-DSTVFWSEEELVE 177
           A   E+      L  W ++ + L+ E + +E S W+ YL++LP    D  +FWS  EL +
Sbjct: 77  AKYPELKYGLLHLNHWEALIIILLYELQNREQSKWKSYLNVLPTSNFDQLMFWSSNELNQ 136

Query: 178 LQGTQLL 184
           LQ + +L
Sbjct: 137 LQPSCIL 143


>gi|119194277|ref|XP_001247742.1| hypothetical protein CIMG_01513 [Coccidioides immitis RS]
          Length = 718

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 77/180 (42%), Gaps = 43/180 (23%)

Query: 149 EDSPWRVYLDILPECTDSTV--FWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
           ++S W  Y+  LPE +  T   ++S+E+L  L+GT LL               K+ E ++
Sbjct: 142 DESFWAPYIRSLPEDSQLTRLEYYSDEDLEWLEGTNLL---------------KLRENML 186

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSR---------------LRGQNLVLIPLADLI 251
           +  K   P  I  + FLWA  I+ SRAFS                  G+  VL+PL D+ 
Sbjct: 187 IKLKTTVPNKIFRERFLWASSIIISRAFSSEVLKDYVKNSKSINVTGGEFSVLVPLLDMT 246

Query: 252 NHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           NH P +   ++       GL     L           G++V   Y   ++N  L L+YGF
Sbjct: 247 NHQP-LAQVEWRTSQGVVGLIVHKTLL---------PGQEVPNNYG-PRNNERLMLNYGF 295


>gi|388579093|gb|EIM19422.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 381

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 79/179 (44%), Gaps = 16/179 (8%)

Query: 146 KKKEDSPWRVYLDILP-ECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLKV 201
            + E S W  Y++ LP E  D    W      E+ G +L   L +    KE ++  + + 
Sbjct: 79  HRGERSTWLSYIESLPKEAPDIASLWLPTP--EMDGMELAKELDSLCNTKEGIRGRHSRA 136

Query: 202 E-EEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTE 260
           + E+I   +       I+ +D+L+A+ I  SR F+     ++ L+P+ADL NH+   T E
Sbjct: 137 QLEDIFHTSLAPLADGISFNDYLYAYAITSSRLFTIDNYHDVGLVPVADLFNHTEESTVE 196

Query: 261 DYAYEIKGAGLFSR--------DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
            +  +    G                 + T   ++  +++   Y    SN +LA+DYGF
Sbjct: 197 -FESDTLVCGCCGNLSHSQTNDKQTVDIVTTETIEPHQELFNSYGDTLSNVQLAMDYGF 254


>gi|66806627|ref|XP_637036.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
 gi|60465490|gb|EAL63575.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
          Length = 532

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 58/231 (25%), Positives = 91/231 (39%), Gaps = 38/231 (16%)

Query: 217 ITLDDFLWAFGILRSRAFSRLRGQ-------------NLVLIPLADLINHSPGITTEDYA 263
           I  + + W +G ++SR +   R               +  L+PLADL NHS  + TE   
Sbjct: 225 IDFNLYQWCWGTIQSRTYYYDRNMKELPKHLQLEDKDDCALVPLADLFNHSSDVNTETKF 284

Query: 264 YEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
            E K          + + T    +   QV I Y    SN  L   YGFI   +  ++  L
Sbjct: 285 DEKKQC--------YQVITKTKFEKDSQVFISYG-KHSNFTLMNYYGFIIENNSNDSIPL 335

Query: 324 TLEISESDPF-----------FGDKLDIAETNGL---GESAYFDIVLGRTLPPAMLQYLR 369
             E +  D             +  K+ I E  GL   GE++ F + + + LP +      
Sbjct: 336 VQEDAIPDIILEKEMKQDLKSYERKMSILEQYGLSVYGENSKFLVSMDKELPFSWNYLSI 395

Query: 370 LVALGGTDAFLLESIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGF 420
           L  L  T   L   +  N    H DLP+S++N E +   +++     L+ F
Sbjct: 396 LKVLYMTKEELNNQLELNLF--HYDLPISNSNYEKVLYFLKNLSIIQLTQF 444


>gi|223994955|ref|XP_002287161.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976277|gb|EED94604.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 902

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 116/257 (45%), Gaps = 51/257 (19%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFW--------INPD-TVAASEIGSLCSGLKPWISVALFLI 143
           GL A R   K+E++ ++P            +N D TVA   +  L      W     ++ 
Sbjct: 99  GLYATRPFKKDEIICKIPSDCALALVDPSVVNDDFTVADGAVNFL-----QW-----YVN 148

Query: 144 REKKKEDSPWRVYLDILP----ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYL 199
             + K+   W  YLD LP        +  F+S+ E+ +L+   ++      KE  +N+ +
Sbjct: 149 NPQAKQ--MWSAYLDTLPTREMHFDPTPDFYSDGEIDQLEFPMIVKQ---AKER-KNQIM 202

Query: 200 KVEEEIILPNKQL-----------FPRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLA 248
           ++ +   +P  +L           F   I+++D      +++  A S  +    VL+P  
Sbjct: 203 ELAQSSGIPFDELQFATWLVASRSFMIKISVND-----PMVKEGALSTTQNTIRVLLPYL 257

Query: 249 DLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLN--KSNAELA 306
           D+INHS    ++DY  E+        D  F++R   P+K G++V I Y  +  +++A L 
Sbjct: 258 DMINHS----SDDYNAELHLIDPEKDDAWFAIRAVRPIKKGKEVTISYGASGVETSAGLL 313

Query: 307 LDYGFIESKSDRNAYTL 323
           ++YGF+  ++  + Y L
Sbjct: 314 MNYGFVADENRIDTYLL 330


>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 579

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 63/238 (26%), Positives = 101/238 (42%), Gaps = 37/238 (15%)

Query: 18  FVSSSKTLKLLTSLTKKPSFRLKSRAFAATCSLHSASATTNPPTAQVETFWQWLRDQKV- 76
           F S +K+  +L +  KKP+    ++A  +   L S S        ++++   W +   V 
Sbjct: 53  FSSGAKSSTILCA--KKPN----TKAKPSVAPLQSLSRKN-----RIQSVLDWAQRADVQ 101

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINP-------------DTVAAS 123
           VS +  +       GLG  A  +I  N+V+L VP    +               D VA+S
Sbjct: 102 VSKEIALDSRVAEYGLGWYASTNIPTNQVLLSVPSNRALTVEIPGEGPDDRSVLDLVASS 161

Query: 124 EIGSLCSGLK-PW-ISVALFLIR----EKKKEDSPWRVYLDILPECTDSTVFWSEEELVE 177
           + GS       PW + ++L++ +    +  KE    R +LD LP   D+ + WSE    E
Sbjct: 162 DSGSKTEVRALPWFVQMSLYIYKLDQVDADKEGVDMRPWLDSLPRSFDTVIHWSEANRQE 221

Query: 178 LQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFS 235
           LQ   ++ T +  +E     Y +      L         +T + FLW   I RSRAFS
Sbjct: 222 LQYDSMV-TAVASQEQDWKRYYQS-----LLQAGASSSSLTWEQFLWGCEIARSRAFS 273


>gi|71995786|ref|NP_497604.2| Protein SET-27 [Caenorhabditis elegans]
 gi|373220599|emb|CCD73865.1| Protein SET-27 [Caenorhabditis elegans]
          Length = 502

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/291 (23%), Positives = 120/291 (41%), Gaps = 47/291 (16%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAAS 123
           ++TF  W  D   ++  +    +T   GL L A   I K+ +V  VP    I  D    S
Sbjct: 74  IKTFLAW-ADGVGIARNNVTIGSTKTAGLSLQATGPIPKSHIVARVPRHAMITLDLAKKS 132

Query: 124 -------EIGSLCSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPECTDSTVFWSEEEL 175
                  E   +  G+   + +ALFL  +  + E S W+ Y+ ILP    + +F+SEE+L
Sbjct: 133 SLLKKAFEKDPIVGGMDN-VGLALFLACQWIQNEKSKWKSYISILPTTFPTPLFYSEEQL 191

Query: 176 VELQGTQLLSTTLGVKEYVQNEY----LKVEEEIILPNKQ-------------LFPRPIT 218
           ++L+ + +    +     +  ++    L + +  I    Q              +  P  
Sbjct: 192 LQLKPSPIFEEAILFYRTISRQFCYFLLAIAKNKIYEAAQRRKDARNAMETPIFYNVPFN 251

Query: 219 LDDF-----LWAFGILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDY 262
           + +F      WA G++ +R  + +  +N V           LIP+ D+ NH   + T+  
Sbjct: 252 VANFTPKLYFWAVGVVTTRV-NMVPSENQVGEDGNPVIIPALIPVLDMANHE-NVLTDVL 309

Query: 263 AYEIKGAGLFS-RDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
              I+    +S  +    + +   VKAG +V I Y   +S  E  L  GF+
Sbjct: 310 TEPIEDLVCYSPEEECAVITSHCDVKAGNEVTIFYGC-RSKGEHLLHNGFV 359


>gi|448530918|ref|XP_003870180.1| Rms1 lysine methyltransferase [Candida orthopsilosis Co 90-125]
 gi|380354534|emb|CCG24050.1| Rms1 lysine methyltransferase [Candida orthopsilosis]
          Length = 614

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 68/143 (47%), Gaps = 8/143 (5%)

Query: 50  LHSASATTNPPTAQVETFWQWLRDQKV-VSPKSPIRPAT-FPEGLGLVAQRDIAKNEVVL 107
           + +AS   +        +  WL+   V +SPK  I   T   +G G++A  DI + EV+ 
Sbjct: 1   MMTASQYKDAFNKSTANYLDWLKSHNVDLSPKITIHDYTSINQGRGVIAVEDIEEGEVLA 60

Query: 108 EVPMKFWIN-PDTVAASEIGSL---CSGLKPWISVALFLIRE-KKKEDSPWRVYLDILPE 162
            +P    IN    +   E  +L      L  W ++ + L+ E + KE S W  Y+D+LP+
Sbjct: 61  TIPKTALINVKQNLLVHEYPNLKHYLMRLPHWDALIIILLYELRNKEQSQWLEYIDVLPQ 120

Query: 163 CT-DSTVFWSEEELVELQGTQLL 184
              +  +FWS  EL  LQ + +L
Sbjct: 121 KGFNQLMFWSPNELNLLQPSYVL 143


>gi|406602709|emb|CCH45757.1| hypothetical protein BN7_5343 [Wickerhamomyces ciferrii]
          Length = 569

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 92/218 (42%), Gaps = 45/218 (20%)

Query: 151 SPWRVYLDILPECTD--STVFWSEEELVELQGTQL-LSTTLGVKEYVQNEYLKV------ 201
           S ++ YLDILPE  D  S +FWS+ E   ++GT   L+    + + V   Y  V      
Sbjct: 102 SHFKPYLDILPEGHDTLSPLFWSDVERNTIKGTDAHLAIDYKISQLVDEWYSIVSKIDSK 161

Query: 202 ------EEEIIL------PNKQLFPRPITLDD-------FLWAFGILRSRAFSRLRGQNL 242
                 +E+I           + F + + L         +LWA  I  SRAF  L   N 
Sbjct: 162 FQPKTFQEDISFYKGFTKGKSEQFWKYLNLPQSWTSFPAYLWASAIFNSRAFPFLLAGNE 221

Query: 243 V--------LIPLADLINHSPGITTEDYAYEIKGAGLFSRD-LLFSLRTPVPVKAGEQVL 293
           +        L+P+ DL+NH       D    +K   L S +   F  +T   +K G+++ 
Sbjct: 222 ICRDLNEAFLVPIFDLLNH-------DNEANVKWDSLDSSNGKNFIFKTEQKLKNGDEIY 274

Query: 294 IQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESD 331
             Y   K+N EL   YGF    +  +  TL L I E++
Sbjct: 275 NSYG-PKTNQELMFGYGFAIENNKEDRATLALRIPEAN 311


>gi|317151155|ref|XP_001824477.2| ribosomal N-lysine methyltransferase [Aspergillus oryzae RIB40]
 gi|391868702|gb|EIT77912.1| hypothetical protein Ao3042_05894 [Aspergillus oryzae 3.042]
          Length = 407

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 114/277 (41%), Gaps = 41/277 (14%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFP-EGLGLVAQRDIAKNEVVLEVPMKFWINPD 118
           P  + E F +W   Q +++  + + PA FP  GLG++A R+I ++E ++ VP+K  +  +
Sbjct: 7   PGEEHEQFTEWAISQGIIA--NGVGPARFPGRGLGMIATRNIEEDEAIVTVPLKAMLTSE 64

Query: 119 TVAASEIGSLCSGLKPWISVALFLIR---EKKKEDSPWRVYLDILPECTDST-VFWSEEE 174
            + +        G       A FL     E  +E + WR       +  DS  + WS E 
Sbjct: 65  RIPSYFTSKFPDGTPTHALYAAFLTNGNAEDLEEFNAWRKTWPSRQDFEDSMPILWS-ES 123

Query: 175 LVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQ-----------LFPRPITLDDFL 223
           L       + S    ++   + +Y    + ++   +Q           +FP     + F 
Sbjct: 124 LRNYLPPSISSHWHSIQSRDKLQYETTHQNLLAQQEQRLRTAWDIVVSVFP-DTDWETFS 182

Query: 224 WAFGILRSRAFSRL---------RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSR 274
           + + I+ +R+F  L         R   + L+P AD  NHS      D A  +K  G    
Sbjct: 183 YHWLIVNTRSFFYLMPGQEPPEDRNDAMALLPFADYFNHS------DVACNVKFDG---E 233

Query: 275 DLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGF 311
           + +F  R       GE++ + Y  +  N  L  +YGF
Sbjct: 234 NYVF--RATKHYDEGEEIYMSYGPH-PNDFLFAEYGF 267


>gi|303288325|ref|XP_003063451.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455283|gb|EEH52587.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 478

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 63/160 (39%), Gaps = 26/160 (16%)

Query: 168 VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRP------ITLDD 221
           + WS+E++       LLS T      +      V + I     + F          ++DD
Sbjct: 133 IGWSDEDIERRLRGDLLSRTRATLARIH----AVADAIARSRSKAFGDADDAFFVFSVDD 188

Query: 222 FLWAFGILRSRAFSRLR---------GQNLVLIPLADLINHSPGITTEDYAYEIKGAGLF 272
             WA  +  SRA + LR         G    L+PL D+ NH  G T      E       
Sbjct: 189 LKWAHAVFWSRAMT-LRFPRKGFTGGGDVDALVPLVDMCNHRAGSTATLEIVEDDAG--- 244

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
             D  + LR  V  KAG++V I Y   K N EL   +GF+
Sbjct: 245 --DAFYELRAGVATKAGDEVFINYGA-KGNEELLRCHGFV 281


>gi|225561151|gb|EEH09432.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 494

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/208 (23%), Positives = 89/208 (42%), Gaps = 13/208 (6%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           +E  K  + W  Y+  LP        ++ EE   L+GT L   L + +   E   ++  +
Sbjct: 124 KEHIKVSNAWTEYIKFLPASYSLPTLYTIEERELLRGTSLELALDSKILSLEKEFDQLRE 183

Query: 201 VEEEIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
              +I   ++  +      +T +D+     + RSRA     G+   ++P  D++NH+ G 
Sbjct: 184 ATTDISFCDRDWWNEDTGRLTFEDWKLVDAMYRSRAL-EFPGKGHSMVPCIDMVNHASG- 241

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS- 316
                 YE    G    + +  LR    ++ G+++ I Y   K  +E+   YGF++S S 
Sbjct: 242 GRPSALYETDENG----NAVLQLRRGQGLEVGDEITITYGDEKGASEMVFSYGFLDSSST 297

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETN 344
           +     L L+I   DP    K+ +  T 
Sbjct: 298 NARQLFLDLDIPADDPLRPAKIAVCNTT 325


>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
           6054]
 gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 453

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 70/320 (21%), Positives = 124/320 (38%), Gaps = 83/320 (25%)

Query: 63  QVETFWQWLRD----------QKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMK 112
           + E   +W+++             +SPK  I+      G G+ A+ DI +++ +L +P  
Sbjct: 7   RTEALLEWIKNTEDETFKPSTHSFISPKIEIKQFE-QSGRGIAAKDDIKRSQQILRIPHS 65

Query: 113 FWINPDTVAASEIGSLCSGLK---PWIS-------------------------------- 137
           F +N  TV  S I    S +K   P+                                  
Sbjct: 66  FLLNFTTVV-SHITRHNSNIKLKEPYYLGIYVPLESTNNNDKFTNIYKSLELQDLLALTS 124

Query: 138 ---VALFLIREKKK-EDSPWRVYLDILPECTD---STVFWSE------EELVELQGTQLL 184
              ++L+L  E+++   S W+ +L++LP+ +D   + + W        EEL++       
Sbjct: 125 FQLLSLYLCFERQRIHSSFWKPFLEMLPDISDFSLNPLIWQVLQVDQWEELIQFLPESAK 184

Query: 185 STTLGVKEYVQNEYLKVEEEI--ILPNKQLFPRP----ITLDDFLWAFGILRSRAF---- 234
                V E    +Y+ V   +  IL + +L        I +D FLWA+  + SR      
Sbjct: 185 RRAEDVYERFLEDYVVVRALVSRILDDLKLSESSADEYIPVDLFLWAWMCINSRCLYMTI 244

Query: 235 --SRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
              +    N  + P  D +NHS     ++ +  I   G       F +RT  P   G+Q+
Sbjct: 245 PQGKTNADNFTMAPYVDFLNHS---CNDECSILIDTTG-------FHVRTTTPYMPGDQL 294

Query: 293 LIQYDLNKSNAELALDYGFI 312
            + Y  +  N  L  +YGF+
Sbjct: 295 FLSYGPH-CNEFLLCEYGFV 313


>gi|303273258|ref|XP_003055990.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462074|gb|EEH59366.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 580

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 67/260 (25%), Positives = 100/260 (38%), Gaps = 52/260 (20%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWI----NPDTVAASEIGSLCSGLKPWISVALFLIREK 146
           G GLVA   IA+ E ++ VP +  +    N          +L   L+  ISV+       
Sbjct: 95  GRGLVATAPIAQGETLVSVPWRESVHVIENGHDDDLRLALALLDVLEAGISVS------N 148

Query: 147 KKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
               + WR Y  +LP  T +  FWS E +  LQ  + +  T  + E     Y+       
Sbjct: 149 DARVNAWREYRVMLPISTGAAAFWSSENVSALQLPEAVKKTSELHEKFM-RYVDEVSSSS 207

Query: 207 LPNKQLFPRPITLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLINHS---PGITTE 260
             +  +  R     D +WA  ++ SR+FS +R   G    L+P ADL NH    P    E
Sbjct: 208 EVSPGVRSRA----DVMWALSMVHSRSFS-VRTPFGHARALVPFADLFNHKTEDPTAARE 262

Query: 261 -DYAYEIKG-------------------AGLFSRD---------LLFSLRTPVPVKAGEQ 291
            D    + G                    G   RD          +F +R+       E+
Sbjct: 263 TDVTLSMSGRVNDNGTRDNEPWIVDWGTKGEMDRDDVRDAGADSAMFHMRSIWNFDVNEE 322

Query: 292 VLIQYDLNKSNAELALDYGF 311
           V I Y  ++++AEL   YGF
Sbjct: 323 VFITYG-HETSAELLSSYGF 341


>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
 gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
          Length = 441

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 75/143 (52%), Gaps = 11/143 (7%)

Query: 91  GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLC---SGLKPWISVALFLIREKK 147
           G G +A  D+   +V LE+P+   I+ + V  S++  +     G+     + L+ +REK 
Sbjct: 168 GRGAIASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKH 227

Query: 148 KEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII- 206
             DS ++ Y D L E   + + +  + ++EL GT LL   +  KE ++  Y    +E+I 
Sbjct: 228 NLDSKFKPYFDSLQENFCTGLSFGVDAIMELDGTLLLDEIMQAKELLRERY----DELIP 283

Query: 207 -LPN-KQLFPRPI-TLDDFLWAF 226
            L N +++FP  + T + +LWA+
Sbjct: 284 LLSNHREVFPPELYTWEHYLWAY 306


>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
 gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
          Length = 634

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 22/208 (10%)

Query: 129 CSGLKP-WISVALFLIREKKK-EDSPWRVYLDILP--ECTDSTVFWSEEELVELQGTQLL 184
           C G  P  I   L LI ++ K ++SPW  Y+  LP  E   + +++ +E++  L GT L 
Sbjct: 93  CQGKIPDHILTYLLLIEQRNKGKESPWHAYIACLPGAESMTTPLWFDDEDMAFLAGTSLA 152

Query: 185 STTLGVKEYVQNEYLKVEEEI-ILPNKQL-FPRPITLDDFLWAFGILRSRAF---SRLRG 239
                 KE     Y + E+ + I+ +  +     +  +  LWA  I  SRAF     L  
Sbjct: 153 P---AAKERKSLYYQQWEQALGIMKDAGVALADEVDFESLLWAATIFTSRAFISTHILPD 209

Query: 240 QNLV--LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
              V  L P+ D++NHS     E   +E +    FS  LL          AG+++   Y 
Sbjct: 210 HETVPLLFPIVDILNHSVSAKVE---WEFQPLASFSLKLLEGDT----FTAGQELFNNY- 261

Query: 298 LNKSNAELALDYGFIESKSDRNAYTLTL 325
             K N EL L YGF    +    + L L
Sbjct: 262 APKQNDELLLGYGFCLEHNPIEQFPLKL 289


>gi|145501218|ref|XP_001436591.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403732|emb|CAK69194.1| unnamed protein product [Paramecium tetraurelia]
          Length = 716

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 76/189 (40%), Gaps = 23/189 (12%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWIS------------VAL 140
           GLVA   I  NE ++ VP    +       S +  +      + S            +A 
Sbjct: 64  GLVASEKILSNETLVSVPRDLLLTTRHAFESPLKQMFIDHPQYFSNQFQSSWEDHQLMAF 123

Query: 141 FLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEY----VQN 196
            L   ++  +S W + +  LP   D  VFWS EE   L   +L+   L  K+Y    ++ 
Sbjct: 124 ILYEYQRGPESEWHLLISNLPRDIDYLVFWSHEEQELLDDEKLIK--LARKQYSEFLLEY 181

Query: 197 EYLKVEEEIILPNKQLF-PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSP 255
           E LK    I     Q F P  +TL++  W +  L +R F +     + ++P  +L NH  
Sbjct: 182 ETLKC---ITDKYPQHFKPETVTLENARWVYTHLVTRCFGKYLAY-VTMVPFCELFNHEC 237

Query: 256 GITTEDYAY 264
                D+ Y
Sbjct: 238 TDVFYDFEY 246


>gi|410983655|ref|XP_003998153.1| PREDICTED: N-lysine methyltransferase SETD6, partial [Felis catus]
          Length = 417

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 98/436 (22%), Positives = 174/436 (39%), Gaps = 76/436 (17%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVA-----ASEIGSLCSG 131
           +SPK  +       G G+VA+  +   E++  VP    ++  T +       E G+L S 
Sbjct: 4   LSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAALLSQHTCSIGGLLEQERGALQSQ 63

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTL 188
              W+ + L L+ E +   SPW  Y  + PE    +  +FW EEE    LQGT       
Sbjct: 64  -SGWVPLLLALLHELQAPASPWSPYFAMWPELGRLEHPMFWPEEERRRLLQGT------- 115

Query: 189 GVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDFLWAFGILRSRAFSRLRG- 239
           GV E V+ +   +  E   I+LP    +  LF PR  +L+ +     ++ + +F      
Sbjct: 116 GVPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEE 175

Query: 240 -------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQV 292
                   + +++P AD++NH   +   +   E      +S + L  + T  P+  G ++
Sbjct: 176 EEDEKEPNSPLMVPAADILNH---LANHNANLE------YSPNCLRMVATQ-PIPKGHEI 225

Query: 293 LIQYDLNKSNAELALDYGFIESKSD-------------RNAYTLTLEISESDPFFGDKLD 339
              Y    +N +L   YGF+E   D             R A  L  ++        ++ D
Sbjct: 226 FNTYG-QMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALLGTKVEAERLLLYERWD 284

Query: 340 -IAETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGH------ 392
            + +   +GE   F  V+GR       +    + +    A           WG       
Sbjct: 285 FLCKLEMVGEEGAF--VIGREEVLTEEELTTTLKVLCMPAEEFREFKDQDGWGDDKREEE 342

Query: 393 ----LDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGG----NLDPRLRIAV 444
                ++P   A+     +++RD+    L  + T ++ ++ LL        L  R + A+
Sbjct: 343 SLTVTNIPRLKASWR---QLLRDSVLLTLQTYATDLKSEQDLLSNKEVYTKLSSREQQAL 399

Query: 445 GVRAGEKRVLQQIDEI 460
            VR G+K +L Q+ E+
Sbjct: 400 QVRYGQKMILHQLLEL 415


>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
           B]
          Length = 510

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 71/148 (47%), Gaps = 14/148 (9%)

Query: 64  VETFWQWLRDQKVVSPKSPIRPATFPE-GLGLVAQRDIAKNEVVLEVPMKFWINPDT--- 119
           ++ F  W R+Q      S +  A FP+ G G +A RDI +   +  +P    ++  T   
Sbjct: 1   MDNFTTWFREQGGELDASAMGFAIFPDSGRGAIALRDIPEGHALFSIPRGLTLSLRTSSL 60

Query: 120 -----VAASEIGSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILPECTDSTVFWSEE 173
                + A +   L  G   W+ + L ++ E+     S W  Y+  +P+   + +FWSEE
Sbjct: 61  PTRFGIEAWKQHGLHEG---WVGLILCMMWEESLGGSSKWSEYMSSMPDTFTTPMFWSEE 117

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKV 201
           E+ EL+GT ++   +G  +  ++ Y K+
Sbjct: 118 EIQELKGTAVVD-KIGRDDAERDYYEKL 144



 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 58/244 (23%), Positives = 96/244 (39%), Gaps = 58/244 (23%)

Query: 241 NLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNK 300
           ++ ++P+AD++N           +E + A LF  +    +    P+ AGEQ+   Y  + 
Sbjct: 258 DVAMVPMADMLNAR---------FESENAKLFYEEHYLKMVATKPINAGEQIWNTYG-DP 307

Query: 301 SNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIAETNG--------------- 345
            N++L   YG ++           + + E     G+  D+ E                  
Sbjct: 308 PNSDLLRRYGHVD----------VVPLGEPLSGEGNPADVVEIRADLVVSAVRKARKAAG 357

Query: 346 -----------LGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWGHLD 394
                        +   F ++    +P  +L ++RL++L   +         N +     
Sbjct: 358 DLQVRVDFWLEEADDDTFVLMTDCEVPEELLSFIRLLSLTKDEW--------NKVKAKGK 409

Query: 395 LPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLL---QGGNLDPRLRIAVGVRAGEK 451
           LP      EL+  +V D  K  L  + TTIEEDE LL      NL    R AV VR GEK
Sbjct: 410 LPKGKLELELLPAIV-DVLKERLKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEK 468

Query: 452 RVLQ 455
           R+L+
Sbjct: 469 RILR 472


>gi|68488193|ref|XP_712057.1| hypothetical protein CaO19.10177 [Candida albicans SC5314]
 gi|46433419|gb|EAK92860.1| hypothetical protein CaO19.10177 [Candida albicans SC5314]
          Length = 552

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/310 (24%), Positives = 124/310 (40%), Gaps = 80/310 (25%)

Query: 67  FWQWLRDQKV-VSPKSPIRP-ATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASE 124
           F  WL    V +SPK  I       +G G++A  DI  +E++ ++P    +N D    S 
Sbjct: 11  FQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN--NSL 68

Query: 125 IGSLCSGLKP------WISVALFLIREKKKEDSP-----------WRVYLDILPECTDST 167
           I S  S LK       WI + + L  E K + +P           W  YL+ILP+  +  
Sbjct: 69  IKSYPSVLKKLRVLDQWIGLIIVLGFEIKFKFNPSDNNDNHNRSFWYEYLNILPDQFNQL 128

Query: 168 VFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLF------PRPITLDD 221
           ++W++EEL  LQ + +L   +G KE   N Y ++   I + N+ L         P+T ++
Sbjct: 129 IYWNDEELNHLQPSCILD-RIG-KENNLNMYNQI---ISIINQDLSGVEEFKSSPLTFEE 183

Query: 222 FLWAFGILRSRAF------SRLRGQNLV-------------------------------- 243
           +     I+ S +F      S+   +N                                  
Sbjct: 184 YNKVATIIMSYSFDVEVPKSKKMTKNGTNEKGNDEEDEEEDEDKEDDDDDEEEDNEYYKS 243

Query: 244 LIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNA 303
           ++P AD +N    +             ++S D L  +    P+  GEQV   Y  +  N+
Sbjct: 244 MVPFADTLNADTHLNN--------AILIYSTDQLI-MTCIKPIAKGEQVYNTYS-DHPNS 293

Query: 304 ELALDYGFIE 313
           EL   YG++E
Sbjct: 294 ELLRRYGYVE 303


>gi|298707719|emb|CBJ26036.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 444

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 8/120 (6%)

Query: 222 FLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLR 281
            LWAF ++RSRAF+   G     +P  D+ NH       ++ Y   G    S+  +F L+
Sbjct: 126 LLWAFAMVRSRAFA-ADGDRFAFVPFLDMANHGFADPAANFTYISGGE---SQPGVFQLQ 181

Query: 282 TPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDPFFGDKLDIA 341
               + AGE+V I Y    +  +L + YGF    +      L L +  SD    D + +A
Sbjct: 182 AMRNISAGEEVTISYGEQLNAEQLMVQYGFPAPPN----VPLELPLCGSDALVRDTVAVA 237


>gi|392576309|gb|EIW69440.1| hypothetical protein TREMEDRAFT_62307 [Tremella mesenterica DSM
           1558]
          Length = 510

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 123/308 (39%), Gaps = 59/308 (19%)

Query: 60  PTAQVETFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDT 119
           P   +E    WL D   V   S +      +G G+ A   I + ++V  +P    ++  T
Sbjct: 10  PPPSIEILQAWLDDHGAVI-NSHVEITESFDGWGIRATETIDEGDLVCMIPKTAVLSHRT 68

Query: 120 VAASEIG---SLCSGLK---PWISVALFLIREKKKE-DSPWRVYLDILP-ECTDSTVFWS 171
            +   +    +   G++   P + ++L ++ E + + DSP+  YL  LP E     VFWS
Sbjct: 69  SSLPLLAPPPTSVDGIEAHWPILQLSLAILHELRLDVDSPFFGYLQSLPRELILLPVFWS 128

Query: 172 EEE---------LVELQGTQL-------LSTTLGVKEYVQNEYLKVEEEIILPNKQLFPR 215
            +          L+ L+GT+         S  LG+ E   N + +    + LP+ +  P 
Sbjct: 129 VKRIAGEDGRLGLLWLKGTEAERDMARRSSIGLGLNEL--NTFYQTHR-LHLPSTRFHPS 185

Query: 216 PITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTE-----DYAYEIKGAG 270
           P   + F   F ++ +RAF       + L P ADL+NHS    T+     D  +      
Sbjct: 186 PSPFEAFYHTFALVSTRAFVIDLYHTVGLCPFADLLNHSSTPHTQLASDKDVCHICGSLP 245

Query: 271 LFSRDLLFSLRTP-----VP---------------------VKAGEQVLIQYDLNKSNAE 304
               DLL +   P     VP                     V+ GE++   YD N  +A 
Sbjct: 246 SCKHDLLNANDVPRRLEQVPPHTRKIMEETIDCVELKLERSVRGGEEIWNWYDDNIGDAR 305

Query: 305 LALDYGFI 312
             +++GF+
Sbjct: 306 SLVEWGFV 313


>gi|145356662|ref|XP_001422546.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582789|gb|ABP00863.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 484

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 108/264 (40%), Gaps = 24/264 (9%)

Query: 62  AQVETFWQWLRDQKVVSPKSPIRPATFPEG----LGLVAQRDIAKNEVVLEVPMKFWINP 117
           A+ + F  W+R + V   +   R A   +G        A+  +A    +  +P    +  
Sbjct: 4   ARDDDFLDWMRARGVT--RDDARCAVRGDGDRGRFVECARATLAVGTTLATIPKSACLTT 61

Query: 118 DTVAASEI-GSLCSGLKPWISVALFLIREKK-KEDSPWRVYLDILP--ECTDSTVFWSEE 173
            T A   I G+  SG+   + + + +  E+     S +  YL  LP  E   ST    +E
Sbjct: 62  ATSALGAIEGARGSGV---VGLCVAVAHERSLGAKSTFAAYLKTLPAREALPSTFAADDE 118

Query: 174 ELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPI----TLDDFLWAFGIL 229
               L+GT +       +  + ++Y     E+    K+ F R      TL++F  A  ++
Sbjct: 119 ARRALRGTTVEGMLGADERAIADDYHAFAREM----KERFARRGVKMPTLEEFRDAATLV 174

Query: 230 RSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
            SRAF         L+P ADL NH  G  ++   + + G      D L +L T    K G
Sbjct: 175 ASRAFFVDDALGQGLVPFADLFNHKGG--SDGAHFNVVGCDDADADAL-TLVTCRTAKRG 231

Query: 290 EQVLIQYDLNKSNAELALDYGFIE 313
           E++   +  +  N  L   YGF+E
Sbjct: 232 EELFNSFGDDHDNTVLLYKYGFVE 255


>gi|311257193|ref|XP_003127001.1| PREDICTED: N-lysine methyltransferase SETD6 [Sus scrofa]
 gi|335289289|ref|XP_003355838.1| PREDICTED: N-lysine methyltransferase SETD6-like [Sus scrofa]
          Length = 448

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 73/290 (25%), Positives = 124/290 (42%), Gaps = 43/290 (14%)

Query: 62  AQVETFWQWLRDQKV-VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTV 120
           A V  F  W R   + +SPK  +       G G+VA+  +   E++  VP    ++  T 
Sbjct: 20  APVAGFLSWCRRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFVVPRAAVLSQHTC 79

Query: 121 AAS-----EIGSLCSGLKPWISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEE 173
           + S     E G+L S    W+ + L L+ E +   SPW  Y  + PE    +  +FW EE
Sbjct: 80  SISGLLERERGALQSQ-SGWVPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEE 138

Query: 174 ELVE-LQGTQLLSTTLGVKEYVQNEYLKVEEE---IILP----NKQLF-PRPITLDDFLW 224
           E    LQGT       GV E V+ +   +  E   I+LP    +  LF PR  +L+ +  
Sbjct: 139 ERRRLLQGT-------GVPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQ 191

Query: 225 AFGILRSRAFSRLRG-------QNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLL 277
              ++ + +F             + +++P AD++NH   +   +   E      +S + L
Sbjct: 192 LVALVMAYSFQEPLEEEDEKEPNSPLMVPAADILNH---LANHNANLE------YSPNCL 242

Query: 278 FSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEI 327
             + T   +  G ++   Y    +N +L   YGF+E   D    T  +++
Sbjct: 243 RMVATQ-SIPKGHEIFNTYG-QMANWQLIHMYGFVEPYPDNKDDTADIQM 290


>gi|46117158|ref|XP_384597.1| hypothetical protein FG04421.1 [Gibberella zeae PH-1]
          Length = 456

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 95/394 (24%), Positives = 158/394 (40%), Gaps = 90/394 (22%)

Query: 135 WISVALFLIREKKKEDSPWRVYLDILPECTDSTV-----FWSEEELVELQGTQLLSTTLG 189
           ++ +  FL+RE     S W  Y+  LP+  D        FW +E+    +GT +      
Sbjct: 92  FVLIKHFLLRE-----SFWTPYIQALPQPNDVDSWSLPPFWPDEDAELFEGTNIEVGVAN 146

Query: 190 VKEYVQNEYLKVEEEIILPNKQLFP---RPITLDDFLWAFGILRSRAF-----------S 235
           +K  V  E+    +  +L      P   +  TL  + WA+ I  SR+F            
Sbjct: 147 IKANVMREFRAGCD--LLDRDDWEPQLLKQFTLPLYQWAYSIFSSRSFRPSLVLGPEDQQ 204

Query: 236 RL-RGQNL----VLIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLLFSLRTPVPVKAG 289
           RL  G  L    VL+PL D+ NH   +TT+  +  + K +         SL+     + G
Sbjct: 205 RLPEGVKLDDFSVLMPLFDVGNHD--MTTQVRWERDEKSSDC-------SLKVGKAYQPG 255

Query: 290 EQVLIQYDLNKSNAELALDYGFI--ESKSDRNAYTLTLEISESDPFFGDKLDIAETNGLG 347
           EQ+   Y + K+NAEL L YGF+  E++   N Y   + + +  P  G+           
Sbjct: 256 EQIFNNYSM-KTNAELLLGYGFMLPETEELHNDY---VHVRKRQPAQGEAT--------- 302

Query: 348 ESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFL--LESIFRNTIWG-HLDLPVSHANEEL 404
           E  Y  +   R     + +  + V L  + + L   + +  + +W     L       + 
Sbjct: 303 EEYYISLRPIRHASSLLARSKQAVQLDDSTSVLGAFQHVQHDMVWDIFCTLAPPEQRAQF 362

Query: 405 IC--------------RVVRDA----CKSALSGFHTTIEEDEQLLQ-------GGNLDPR 439
           IC              +V  D      ++A    H  ++E E+LL+       GG+L   
Sbjct: 363 ICEGSEQEQQNKFFSGQVSEDGRMFMQQTAAIIQHKVMQELERLLETDVEVVGGGDLTRN 422

Query: 440 LRIAVGVRAGEKRVLQ------QIDEIFKERELE 467
            ++A+  RA  K+VL+       +DE+F E E E
Sbjct: 423 QQLALDYRARCKKVLETTLEAMDMDELFGEDEEE 456


>gi|345328941|ref|XP_001507526.2| PREDICTED: LOW QUALITY PROTEIN: N-lysine methyltransferase
           SETD6-like [Ornithorhynchus anatinus]
          Length = 495

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 82/199 (41%), Gaps = 39/199 (19%)

Query: 135 WISVALFLIREKKKEDSPWRVYLDILPECT--DSTVFWSEEELVE-LQGTQLLSTTLGVK 191
           W+ + L L+ E +  DSPW  Y  + P+    D  +FW +EE    LQGT       GV 
Sbjct: 144 WVPLLLALLHEYQASDSPWHHYFSLWPDLNDLDHPMFWPKEERGRLLQGT-------GVP 196

Query: 192 EYVQNEYLKVEEE---IILP----NKQLFPR-PITLDDFLWAFGILRSRAFS-------- 235
           E V+ +   +  E   I+LP    +  LFP    +L+ +     ++ + +F         
Sbjct: 197 EAVEKDLANISHEYSSIVLPFTEAHPDLFPAGSCSLELYCRLVAVVMAYSFQEPLEEEEE 256

Query: 236 RLRGQNLVLIPLADLINHSPGITTE-DYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLI 294
                  +++P+AD++NH        +YA E              +    P+  G ++  
Sbjct: 257 DEEPNPPLMVPVADILNHVANHNANLEYAPEC-----------LRMVATRPIPKGHEIFN 305

Query: 295 QYDLNKSNAELALDYGFIE 313
            Y    +N +L   YGF E
Sbjct: 306 TYG-QMANWQLVHMYGFAE 323


>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 245

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 111/245 (45%), Gaps = 34/245 (13%)

Query: 224 WAFGILRSRAFSRLRGQNLV-----------LIPLADLINHSPGITTEDYAYEIKGAGLF 272
           WA   + +R       QNLV           LIP+ D+ NH  G  T D+          
Sbjct: 8   WAVSTVMTR-------QNLVPSPDGSRMIHALIPMWDMCNHENGRITTDFN------ATS 54

Query: 273 SRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTLTLEISESDP 332
            R   ++LR     + GEQ+ I Y   ++N++  +  GF+   ++++ + L L IS++D 
Sbjct: 55  DRCECYALRN---FQKGEQIFISYG-PRTNSDFFVHSGFVYMDNEQDGFKLRLGISKADS 110

Query: 333 FFGDKLDIAETNGLGESAYFDIVLG-RTLPPAMLQYLRLVALGGTDAFLLESIFRNTIWG 391
              ++ ++     L     F +  G   +   +L +LR+ ++    A L   +  + ++ 
Sbjct: 111 LQKERTELLGKLDLPSVGEFLLKPGTEPISDMLLAFLRVFSM--RKAELAHWLRSDKVFD 168

Query: 392 --HLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQGGNLDPRLRIAVGVRAG 449
             H+D  +    EE + + +    +  ++ + TT++ED +LL+   L    ++ V +R  
Sbjct: 169 LKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLELLE-TTLPQVKKMTVQLRVT 227

Query: 450 EKRVL 454
           EK++L
Sbjct: 228 EKKIL 232


>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
          Length = 462

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 90/234 (38%), Gaps = 29/234 (12%)

Query: 94  LVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSP- 152
           ++A  D+     +   P    I P T A + +G   + L    ++ L+L+  K    SP 
Sbjct: 37  IIATTDLPAGTCIASCPHTLAITP-TSARAALGHHATELSDHQAMVLYLVLHKHP--SPA 93

Query: 153 ----WRVYLDILP--ECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEII 206
                + Y+D LP      + ++++  E+  LQGT L       +   Q E++ V     
Sbjct: 94  VCCLHQAYVDTLPPRSAMRTPLWFNPAEVQLLQGTNLAGAVTDRQRDWQLEWMTVLRRA- 152

Query: 207 LPNKQLFPRPI--TLDDFLWAFGILRSRAF-SRL-----RGQNLVLIPLADLINHSPGIT 258
                LF      T    LWA  IL SRAF S L     +    VL P  D  NH     
Sbjct: 153 -GQSGLFKASFEETWPSALWAATILSSRAFPSHLIDGNEQASTPVLFPGVDAFNHQQARK 211

Query: 259 TEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
                    G         F+L    P  AG+QV   Y   KSN E  L YGFI
Sbjct: 212 VTWQTSSASGR--------FNLVQDEPTAAGQQVFNNYG-PKSNEEFLLGYGFI 256


>gi|313230987|emb|CBY18985.1| unnamed protein product [Oikopleura dioica]
          Length = 265

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 42/255 (16%)

Query: 77  VSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI-----GSLCSG 131
           VS K  IR    P G  + + + I K++ +LE+     I+ +++  S+I      SL   
Sbjct: 14  VSEKIEIREG--PYGSSIFSTQKIHKDDTILEIKTSSIISVNSILGSDILRKFMASLKIS 71

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVK 191
           L P+  +A++L   +   ++    Y+  L +  DS + W   +++EL         L +K
Sbjct: 72  LTPYDLLAVYLTLSRLTGENA--AYMSSLQKTFDSPLTWPASDVLELP--------LDIK 121

Query: 192 EYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAF----------GILRSRAFSRL---- 237
            +V +    +++     N++L P       F W +          G+L  +A + L    
Sbjct: 122 FFVMSTLSHMKKVFSAINEKL-PEINDFKFFKWCYLNVITRNFNGGVLAIKAPTWLEINS 180

Query: 238 RGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYD 297
           R QN  LIP  D+ NH     + +  Y +K       D  F L     + + +++ I Y 
Sbjct: 181 RNQNASLIPGLDICNH---WNSPNCRYSLK-------DDRFFLTAKDTIDSNKEIFISYG 230

Query: 298 LNKSNAELALDYGFI 312
            +K + EL   YGF+
Sbjct: 231 YSKGDHELLSTYGFV 245


>gi|258572574|ref|XP_002545049.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237905319|gb|EEP79720.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 519

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 91/217 (41%), Gaps = 31/217 (14%)

Query: 151 SPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEE---EIIL 207
           S W  Y+  LP+      F+++ EL  LQGT L        + ++ E+ ++ +   +I  
Sbjct: 136 SAWTGYIQFLPKSYPLPTFYTDGELEILQGTSLKPALESKLDSLEREFEQLRQFTADIPW 195

Query: 208 PNKQLF---PRPITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAY 264
             +  +      +T  D+     + RSRA   +      ++P  D+ NH+ G  T +  Y
Sbjct: 196 CKENWWDGETGQLTFHDWKTVDAMYRSRALD-IPEIGHAMVPCVDMANHASGDET-NAIY 253

Query: 265 EIKGAGLFSRDLLFSLRTPVPVKAGEQVLI------------------QYDLNKSNAELA 306
           E+ G G    +++  LR    +  G++V I                  +Y   K  AE+ 
Sbjct: 254 EVDGNG----NVVLQLRYGRSLGEGDEVTITSVICPAISQAPSDLTWCRYGDEKGAAEMI 309

Query: 307 LDYGFIESK-SDRNAYTLTLEISESDPFFGDKLDIAE 342
             YGFIE   +D     L LEI + DP    K+ + +
Sbjct: 310 FSYGFIEGAITDARQMFLELEIPDDDPLKPAKMVVCD 346


>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
 gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
          Length = 518

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 101/447 (22%), Positives = 161/447 (36%), Gaps = 100/447 (22%)

Query: 77  VSPKSP-------IRPATFPE--GLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGS 127
           + P+SP       +  + FP+  G GL A RD+ K E+VL VP    I  D++   E   
Sbjct: 58  LHPQSPTSCLGHSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDE--K 115

Query: 128 LC--------SGLKPWISVALFLIREKKKEDSPWRV-YLDILPECTDSTVFWSEEELVEL 178
           LC        S L P   +A+ L+ E  K  S W   YL  LP   D             
Sbjct: 116 LCSFVNNNTYSSLSPTQILAVCLLYEMGKGKSSWWYPYLMHLPRSYD------------- 162

Query: 179 QGTQLLSTTLGVKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAFSRLR 238
               +L++        ++E+   E   ++   +L P+ +T   ++WA   + SRA     
Sbjct: 163 ----VLASFKKAVSKAKSEW--KEANSLMDALKLKPQLLTFRAWIWASATISSRALHIPW 216

Query: 239 GQNLVLIPLADLINH---------------------------SPGITTEDYAYEIKGAGL 271
            +   L P+ DL N+                           S G TT+D+  +    GL
Sbjct: 217 DEAGCLCPVGDLFNYAAPGEESNDLENVVHWMNASSLEDSSLSNGETTDDFIGDQPDIGL 276

Query: 272 -------FSRDL-LFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKSDRNAYTL 323
                  F  ++  +        K G QVL+ Y    +N EL   YGF+ +++  +   +
Sbjct: 277 ERLTDGGFDENMAAYCFYARKNYKKGTQVLLGYG-TYTNLELLEHYGFLLNENPNDKVFI 335

Query: 324 TLEISESDPFFGDKLDI-AETNGLGESAYFDIVLGRTLPPAMLQYLRLVALGGTDAFLLE 382
            LE S        K+ +    +G    A    +     PP   + +  +   G+      
Sbjct: 336 PLEPSMYSFISWPKVSMYIHQDGKPSFALLSALRLWATPPNQRRSISHLVYSGSR----- 390

Query: 383 SIFRNTIWGHLDLPVSHANEELICRVVRDACKSALSGFHTTIEEDEQLLQG----GNLDP 438
                         +S  NE  + + +   C   LS   T IEED  LL       N D 
Sbjct: 391 --------------LSVYNEISVLKWISKNCAMILSNLPTVIEEDSLLLSTINKIENFDK 436

Query: 439 RLRIAVGVRAGEKRVLQQIDEIFKERE 465
              + V    GE R   +  ++ K + 
Sbjct: 437 PTEL-VCTSGGEARAFLEASDLQKGKN 462


>gi|154277400|ref|XP_001539541.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150413126|gb|EDN08509.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 531

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/208 (23%), Positives = 89/208 (42%), Gaps = 13/208 (6%)

Query: 144 REKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQL---LSTTLGVKEYVQNEYLK 200
           +E  K  + W  Y+  LP        ++ EE   L+GT L   L + +   E   ++  +
Sbjct: 187 KEHIKVSNAWTEYIKFLPASYSLPTLYTIEERELLRGTSLELALDSKILSLEKEFDQLRE 246

Query: 201 VEEEIILPNKQLFPRP---ITLDDFLWAFGILRSRAFSRLRGQNLVLIPLADLINHSPGI 257
              +I   ++  +      +T +D+     + RSRA     G+   ++P  D++NH+ G 
Sbjct: 247 ATTDISWCDRDWWNEDTGRLTFEDWKLVDAMYRSRALE-FPGKGHSMVPCIDMVNHASG- 304

Query: 258 TTEDYAYEIKGAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIESKS- 316
                 YE    G    + +  LR    ++ G+++ I Y   K  +E+   YGF++S S 
Sbjct: 305 GRPSALYETDENG----NAVLQLRRGQRLEVGDEITITYGDAKGASEMVFSYGFLDSGST 360

Query: 317 DRNAYTLTLEISESDPFFGDKLDIAETN 344
           +     L L+I   DP    K+ +  T 
Sbjct: 361 NARQLFLDLDIPADDPLRPAKIAVCNTT 388


>gi|308811352|ref|XP_003082984.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
 gi|116054862|emb|CAL56939.1| SET domain-containing protein-like (ISS), partial [Ostreococcus
           tauri]
          Length = 468

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 112/275 (40%), Gaps = 42/275 (15%)

Query: 66  TFWQWLRDQKVVSPKSPIRPATFPEGLGLVAQRDIAKNEVVLEVPMKFWINPDTVAASEI 125
           +F  W + + ++   S         G  +VA  D+    ++  +P    +   T A   +
Sbjct: 13  SFNAWTKTRDIIVHASLRLDGDDRRGRRVVATNDVPCGTLLCSIPKASCLMTSTSA---L 69

Query: 126 GSLCSGLKP----WIS-----VALFLIREKKKEDSPWRVYLDI------LPECTDSTVFW 170
            +LC+  +     W+      VA      K    S +  YL I      LP C +S    
Sbjct: 70  ATLCASERAKEALWMLEEAALVACVAYERKIGSKSAFARYLGIVPRREALPGCRESE--- 126

Query: 171 SEEELVE--LQGTQLLSTTLGVKEYVQNEYLKV----EEEIILPNKQLFPRPITLDDFLW 224
            E++ V   L+GT +  T    +E ++ ++ +V    EE + +P       P TL++F+ 
Sbjct: 127 GEDDEVSRALRGTSVRLTLEDDREAMEEDHARVVEFYEEHVKVP-------PPTLNEFIE 179

Query: 225 AFGILRSRAFSRLRGQNLVLIPLADLINHSPGITTEDYAYEIKGAGLFSRDL-----LFS 279
           A  ++ SRAF     +   ++P AD+ NH          ++++G      D         
Sbjct: 180 AASLVASRAFFVDEKEGQGMVPFADMFNHRGNGGAH---FDVEGCDAEEGDQDAYPDALR 236

Query: 280 LRTPVPVKAGEQVLIQYDLNKSNAELALDYGFIES 314
           L +   +  GE+V   +  +  NA L   YGF+E+
Sbjct: 237 LVSCRDIARGEEVFNSFGDDHDNAMLYTKYGFVET 271


>gi|133902101|ref|NP_490849.4| Protein SET-29 [Caenorhabditis elegans]
 gi|373219869|emb|CCD70787.1| Protein SET-29 [Caenorhabditis elegans]
          Length = 401

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 87/193 (45%), Gaps = 34/193 (17%)

Query: 132 LKPWISVALFLIREKKKEDSPWRVYLDILPECTDSTVFWSEEELVELQGTQLLSTTL--G 189
           LKP   + +F   E   E S W  YL +LP+  ++  F         +G      TL   
Sbjct: 76  LKPMEILTMFFAFEDT-EHSAWSPYLKVLPKEFNTPAF---------KGIDYDVNTLPLS 125

Query: 190 VKEYVQNEYLKVEEEIILPNKQLFPRPITLDDFLWAFGILRSRAF----------SRLRG 239
           +++Y  ++  ++ E I    ++LFP  ++ D  LWA+ ++ +R                G
Sbjct: 126 IRKYWIDQKKEISE-ISEKLRRLFPE-LSHDKILWAWHVVNTRCIFVENEEHDNVDNSDG 183

Query: 240 QNLVLIPLADLINHSPGITTEDYAYEIKGAGLF-SRDLLFSLRTPVPVKAGEQVLIQYDL 298
             + +IP  D++NH P    E Y    +G  L   R+  + ++    ++ GEQ+ + Y  
Sbjct: 184 DTIAVIPYVDMLNHDP----EKY----QGLALHEKRNGRYVVQAKRQIQEGEQIFVCYGA 235

Query: 299 NKSNAELALDYGF 311
           +  NA L ++YGF
Sbjct: 236 H-DNARLLVEYGF 247


>gi|428167728|gb|EKX36682.1| hypothetical protein GUITHDRAFT_117107 [Guillardia theta CCMP2712]
          Length = 508

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/225 (24%), Positives = 100/225 (44%), Gaps = 29/225 (12%)

Query: 93  GLVAQRDIAKNEVVLEVPMKFWINPDTVAASEIGSLCSGLKPWISVALFLIREKKKEDSP 152
           GL+A+++ +  ++++ VP+K  IN  T ++S             S A+ L    +K    
Sbjct: 55  GLIAEKEFSPGDIIVTVPLKHAINLGTASSST-----------ESAAILLRERYRKSRRR 103

Query: 153 WRVYLDILPECTD--STVFWSEEELVELQGTQLLSTTLGVKEYVQNEYLKVEEEIILPNK 210
           +  YL  LP+  +  +     E+++  LQ   L+      +      Y  +  +  +   
Sbjct: 104 FDPYLQSLPKIEELMTVDLADEDDIWWLQSPDLIEAAWRWRNATLAGYRSIGSKSFM--- 160

Query: 211 QLFPRPITLDDFLWAFGILRSRAFSRL--RGQNL-VLIPLADLINHSPGITTEDYAYEIK 267
            L  R +TL+++ WA  I+ SR+ + +   G  L  LIP+ DL NH      E+  + ++
Sbjct: 161 -LGGRHLTLNEYRWAVSIISSRSLAIVAPNGDMLKYLIPVMDLANHQ-----EESKHHVR 214

Query: 268 GAGLFSRDLLFSLRTPVPVKAGEQVLIQYDLNKSNAELALDYGFI 312
              L      F L    P+K  E++ I Y   + + E  L YGF+
Sbjct: 215 ---LADGARAFHLVCGQPIKPKEEIRISYGPLRGD-ETVLFYGFL 255


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.400 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,866,845,508
Number of Sequences: 23463169
Number of extensions: 338523276
Number of successful extensions: 857549
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 335
Number of HSP's successfully gapped in prelim test: 1086
Number of HSP's that attempted gapping in prelim test: 854707
Number of HSP's gapped (non-prelim): 1893
length of query: 497
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 350
effective length of database: 8,910,109,524
effective search space: 3118538333400
effective search space used: 3118538333400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)