BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017168
         (376 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Vitis vinifera]
          Length = 509

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/367 (88%), Positives = 342/367 (93%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 143 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 202

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
           AYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 203 AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 262

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVV
Sbjct: 263 IQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVV 322

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV 
Sbjct: 323 WCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVS 382

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLA
Sbjct: 383 VGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLA 442

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPA
Sbjct: 443 GYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPA 502

Query: 370 PYAPLLN 376
           PY PLL 
Sbjct: 503 PYTPLLK 509


>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Vitis vinifera]
          Length = 515

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/367 (88%), Positives = 342/367 (93%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 149 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 208

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
           AYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 209 AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 268

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVV
Sbjct: 269 IQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVV 328

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV 
Sbjct: 329 WCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVS 388

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLA
Sbjct: 389 VGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLA 448

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPA
Sbjct: 449 GYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPA 508

Query: 370 PYAPLLN 376
           PY PLL 
Sbjct: 509 PYTPLLK 515


>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
          Length = 428

 Score =  679 bits (1753), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/366 (88%), Positives = 342/366 (93%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 62  AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 121

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
           AYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 122 AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 181

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVV
Sbjct: 182 IQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVV 241

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV 
Sbjct: 242 WCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVS 301

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLA
Sbjct: 302 VGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLA 361

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPA
Sbjct: 362 GYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPA 421

Query: 370 PYAPLL 375
           PY PLL
Sbjct: 422 PYTPLL 427


>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
 gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
          Length = 502

 Score =  665 bits (1715), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/367 (86%), Positives = 339/367 (92%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE EL
Sbjct: 136 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAEL 195

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
           AYLTGSPTKAE+L+RA+GIKREY ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA
Sbjct: 196 AYLTGSPTKAEVLDRADGIKREYEELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVA 255

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQKVSLARRFALVPLGPPLLAYSS CKAML AVD AV+LVVDRPYKAGE IVV
Sbjct: 256 IQSCVVHLQKVSLARRFALVPLGPPLLAYSSNCKAMLTAVDGAVELVVDRPYKAGEPIVV 315

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLL+NYGFVDEDNPYDR+ VEAALNTEDPQYQDKRMVAQRNGKLSVQVF V+
Sbjct: 316 WCGPQPNSKLLLNYGFVDEDNPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVY 375

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
           AG+EKEA+SD+LPYLRLGYVSD SEMQSVISS GP+CPVSPCME+AVLDQL  YF+ RLA
Sbjct: 376 AGKEKEAVSDILPYLRLGYVSDPSEMQSVISSQGPVCPVSPCMEQAVLDQLTVYFRTRLA 435

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GY  ++SEDE ML D NL+PKKRVATQLVR+EKKML ACLQ T D+I  LPD T+ PCPA
Sbjct: 436 GYCTSISEDELMLADPNLNPKKRVATQLVRLEKKMLKACLQATVDLINQLPDHTMPPCPA 495

Query: 370 PYAPLLN 376
           PYAPLL 
Sbjct: 496 PYAPLLK 502


>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 521

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/366 (86%), Positives = 341/366 (93%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE EL
Sbjct: 155 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDEL 214

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAFVA
Sbjct: 215 DYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFVA 274

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAGESI V
Sbjct: 275 VQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELVVDRPYKAGESIAV 334

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTEDPQYQDKRMVAQRNG+LS+Q F+V+
Sbjct: 335 WCGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVY 394

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
           AG+EKEA+ DMLPYLRLGYV+  SEMQSVISS GP+CPVSPCMERA+L+Q+ADYFK RLA
Sbjct: 395 AGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPVCPVSPCMERAMLEQVADYFKRRLA 454

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L++CL+VT D I  LPD TVSPCPA
Sbjct: 455 GYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEVTIDFINQLPDHTVSPCPA 514

Query: 370 PYAPLL 375
           PYAPLL
Sbjct: 515 PYAPLL 520


>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 1 [Glycine max]
 gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 2 [Glycine max]
          Length = 502

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 307/366 (83%), Positives = 336/366 (91%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLW ++EL
Sbjct: 136 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWLKSEL 195

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A
Sbjct: 196 DYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAA 255

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVV
Sbjct: 256 IQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVV 315

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLLINYGFVDE+N  DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+
Sbjct: 316 WCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVY 375

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
           AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCMERA LDQLADYFKARLA
Sbjct: 376 AGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCMERAALDQLADYFKARLA 435

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I  LPD T+SPCPA
Sbjct: 436 GYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQATTDFINQLPDHTISPCPA 495

Query: 370 PYAPLL 375
           PYAPLL
Sbjct: 496 PYAPLL 501


>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Glycine max]
          Length = 503

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 307/367 (83%), Positives = 335/367 (91%), Gaps = 1/367 (0%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLWS++EL
Sbjct: 136 AELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWSKSEL 195

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A
Sbjct: 196 DYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAA 255

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVV
Sbjct: 256 IQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVV 315

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNSKLLINYGFVDE+N  DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+
Sbjct: 316 WCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVY 375

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
           AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCMERA LDQLADYFKARLA
Sbjct: 376 AGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAALDQLADYFKARLA 435

Query: 310 GYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 368
           GYP  L+EDE+MLTD  NL+PKKRVATQ VR+EKKML+ACLQ T D I  LPD T+SPCP
Sbjct: 436 GYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQATIDFINQLPDHTISPCP 495

Query: 369 APYAPLL 375
           APYAPLL
Sbjct: 496 APYAPLL 502


>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
 gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
          Length = 501

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 302/366 (82%), Positives = 332/366 (90%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNK SELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+EL
Sbjct: 135 AELLTTNKFSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL 194

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
           AYL GSP K EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF A
Sbjct: 195 AYLEGSPLKDEIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAA 254

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVV
Sbjct: 255 VQSCVVHLQNVSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVV 314

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPN+KLL NYGFVDEDN  DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+
Sbjct: 315 WCGPQPNTKLLTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVY 374

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+SDM+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF  RLA
Sbjct: 375 TGKEREAVSDMIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLA 434

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
            YP TL+EDE+MLTD +L+PK+RVATQLVR+EKKML+ACLQ   D+I  LPD +VSPCPA
Sbjct: 435 AYPTTLAEDESMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPA 494

Query: 370 PYAPLL 375
           PYAP L
Sbjct: 495 PYAPSL 500


>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
 gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
 gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
          Length = 514

 Score =  637 bits (1642), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 300/367 (81%), Positives = 329/367 (89%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL
Sbjct: 148 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 207

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 208 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 267

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVV
Sbjct: 268 IQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVV 327

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V 
Sbjct: 328 WCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVR 387

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+
Sbjct: 388 VGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLS 447

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T  ED+A+L D +L P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPA
Sbjct: 448 GYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPA 507

Query: 370 PYAPLLN 376
           PYAP L 
Sbjct: 508 PYAPSLK 514


>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 299/367 (81%), Positives = 328/367 (89%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL
Sbjct: 150 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 209

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 210 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 269

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVV
Sbjct: 270 IQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVV 329

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V 
Sbjct: 330 WCGPQPNAKLLLNYGFVDEDNPYDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVR 389

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+C +SPCMERAVLDQLA+YF  RL+
Sbjct: 390 VGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCTMSPCMERAVLDQLANYFMRRLS 449

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T  ED+A+L D +L P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPA
Sbjct: 450 GYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILAACLTTTVDLLNQLPDTAISPCPA 509

Query: 370 PYAPLLN 376
           PYAP L 
Sbjct: 510 PYAPSLK 516


>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
          Length = 514

 Score =  633 bits (1633), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 299/367 (81%), Positives = 328/367 (89%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL
Sbjct: 148 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 207

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 208 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 267

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +QSCVVHLQ V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVV
Sbjct: 268 IQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVV 327

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTE PQYQDKRMVAQRNGKLS QVF V 
Sbjct: 328 WCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVR 387

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+
Sbjct: 388 VGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLS 447

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T  ED+A+L D +L P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPA
Sbjct: 448 GYPTTPKEDDALLADPSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPA 507

Query: 370 PYAPLLN 376
           PYAP L 
Sbjct: 508 PYAPSLK 514


>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
          Length = 505

 Score =  606 bits (1562), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 284/367 (77%), Positives = 321/367 (87%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 139 AELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESEL 198

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 199 NYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 258

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+V
Sbjct: 259 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIV 318

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV 
Sbjct: 319 WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVC 378

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA
Sbjct: 379 VGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 438

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
            YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPA
Sbjct: 439 DYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPA 498

Query: 370 PYAPLLN 376
           P+AP L 
Sbjct: 499 PFAPELK 505


>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
          Length = 536

 Score =  605 bits (1561), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 284/367 (77%), Positives = 321/367 (87%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 170 AELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESEL 229

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 230 NYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 289

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+V
Sbjct: 290 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIV 349

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV 
Sbjct: 350 WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVC 409

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA
Sbjct: 410 VGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 469

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
            YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPA
Sbjct: 470 DYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPA 529

Query: 370 PYAPLLN 376
           P+AP L 
Sbjct: 530 PFAPELK 536


>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
 gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
          Length = 509

 Score =  605 bits (1560), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 284/367 (77%), Positives = 321/367 (87%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 143 AELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESEL 202

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 203 NYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 262

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+V
Sbjct: 263 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIV 322

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV 
Sbjct: 323 WCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVC 382

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA
Sbjct: 383 VGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 442

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
            YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPA
Sbjct: 443 DYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPA 502

Query: 370 PYAPLLN 376
           P+AP L 
Sbjct: 503 PFAPELK 509


>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 516

 Score =  604 bits (1557), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 285/367 (77%), Positives = 318/367 (86%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 150 AELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLWTESEL 209

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVA
Sbjct: 210 DYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVA 269

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAGE I+V
Sbjct: 270 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAGEPIIV 329

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q F V 
Sbjct: 330 WCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQKFQVC 389

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EKE IS+MLPYLRLGY+SD  EMQ ++SS G  CPVSPC ERAVLDQL  Y K+RLA
Sbjct: 390 VGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLA 449

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPA
Sbjct: 450 GYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTVSPCPA 509

Query: 370 PYAPLLN 376
            YAP L 
Sbjct: 510 LYAPNLK 516


>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 512

 Score =  603 bits (1555), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 282/366 (77%), Positives = 321/366 (87%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 146 AELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESEL 205

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 206 DYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 265

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++
Sbjct: 266 VQSCVVHLQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIII 325

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+
Sbjct: 326 WCGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVY 385

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EK+ +++MLPYLRLGY+S+  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA
Sbjct: 386 VGKEKQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLA 445

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPA
Sbjct: 446 GYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPA 505

Query: 370 PYAPLL 375
           PYAP L
Sbjct: 506 PYAPEL 511


>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
 gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
          Length = 512

 Score =  599 bits (1544), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 279/366 (76%), Positives = 321/366 (87%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 146 AELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESEL 205

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVA
Sbjct: 206 DYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVA 265

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++
Sbjct: 266 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAGEPIII 325

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q F+V+
Sbjct: 326 WCGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQNFNVY 385

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EK+ +++MLPYLRLGY+SD  EMQS++SS G  CP+SPC ERAVLDQL  Y ++RLA
Sbjct: 386 VGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYLESRLA 445

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T + I  LPD TVSPCPA
Sbjct: 446 GYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTVSPCPA 505

Query: 370 PYAPLL 375
           PYAP L
Sbjct: 506 PYAPEL 511


>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 523

 Score =  596 bits (1537), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 280/367 (76%), Positives = 317/367 (86%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 157 AELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESEL 216

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL GSP + E++ R EGIK+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVA
Sbjct: 217 DYLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVA 276

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+V
Sbjct: 277 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIV 336

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V 
Sbjct: 337 WCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVC 396

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G+EK+ IS+MLPYLRLGY+SD  EMQ ++SS G  CPVSPC ERAVLDQL  Y K+RLA
Sbjct: 397 VGKEKQTISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLA 456

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP  L EDEAML D +L PKK VAT+LVR+EKKML+ CL+   + I  LPD TVSPCPA
Sbjct: 457 GYPTNLDEDEAMLADGSLEPKKEVATRLVRLEKKMLHGCLEAANEFISGLPDHTVSPCPA 516

Query: 370 PYAPLLN 376
            YAP L 
Sbjct: 517 LYAPELK 523


>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
          Length = 507

 Score =  596 bits (1536), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 281/367 (76%), Positives = 319/367 (86%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS  EL
Sbjct: 141 AELLTTNKLSELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEEL 200

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            Y TGSP K  +LER  GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVA
Sbjct: 201 KYFTGSPMKEIMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVA 260

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQ V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVV
Sbjct: 261 VQSCVVHLQNVNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVV 320

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPN++LL+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++
Sbjct: 321 WCGPQPNARLLLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIY 380

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            GREKEA+ DMLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLA
Sbjct: 381 IGREKEAVLDMLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLA 440

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
           GYP T SEDE +L D   +PK++VATQLV +EKKMLN+CL    ++I  LPD+ V+PCP+
Sbjct: 441 GYPTTASEDEIVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPS 500

Query: 370 PYAPLLN 376
           PY+P+L 
Sbjct: 501 PYSPILK 507


>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
          Length = 537

 Score =  556 bits (1433), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 268/335 (80%), Positives = 288/335 (85%), Gaps = 18/335 (5%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL
Sbjct: 148 AELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAEL 207

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA
Sbjct: 208 DYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVA 267

Query: 130 VQSCVVHLQ------------------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 171
           +QSCVVHLQ                   V LARRFALVPLGPPLLAY S CKAML AVD 
Sbjct: 268 IQSCVVHLQVVLVASSNLDCYASSCTQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDG 327

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
           AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDK
Sbjct: 328 AVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDK 387

Query: 232 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 291
           RMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPC
Sbjct: 388 RMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPC 447

Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 326
           MERAVLDQLA+YF  RL+GYP T  ED+A+    N
Sbjct: 448 MERAVLDQLANYFMRRLSGYPTTPKEDDALEASCN 482


>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
 gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
          Length = 432

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 252/370 (68%), Positives = 298/370 (80%), Gaps = 4/370 (1%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  EL
Sbjct: 62  AELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEEL 121

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
             Y TGS  K  +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFV
Sbjct: 122 DEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFV 181

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
           AVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A  D V+L VDR YK GE I+
Sbjct: 182 AVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQIL 241

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
           VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F +
Sbjct: 242 VWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQI 301

Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
             GREKEA+ +MLPYLRLG+VSD   M+SV S+ GP CPVS C ERAVLDQLA YF+ R+
Sbjct: 302 FKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERI 361

Query: 309 AGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS- 365
           A Y  T+ ED A+L D   +++PK+RVATQL+ +EK++L+  L V       LPD +V+ 
Sbjct: 362 AKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAP 421

Query: 366 PCPAPYAPLL 375
           PC   + P L
Sbjct: 422 PCCGDFVPKL 431


>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
 gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
          Length = 432

 Score =  517 bits (1332), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 252/370 (68%), Positives = 298/370 (80%), Gaps = 4/370 (1%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  EL
Sbjct: 62  AELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEEL 121

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
             Y TGS  K  +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFV
Sbjct: 122 DEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFV 181

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
           AVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A  D V+L VDR YK GE I+
Sbjct: 182 AVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQIL 241

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
           VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F +
Sbjct: 242 VWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQI 301

Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
             GREKEA+ +MLPYLRLG+VSD   M+SV S+ GP CPVS C ERAVLDQLA YF+ R+
Sbjct: 302 FKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERI 361

Query: 309 AGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS- 365
           A Y  T+ ED A+L D   +++PK+RVATQL+ +EK++L+  L V       LPD +V+ 
Sbjct: 362 AKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAP 421

Query: 366 PCPAPYAPLL 375
           PC   + P L
Sbjct: 422 PCCGDFVPKL 431


>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 523

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 248/367 (67%), Positives = 291/367 (79%), Gaps = 4/367 (1%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  EL
Sbjct: 159 AELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREEL 218

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
             Y TGS  K  +LER  GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFV
Sbjct: 219 NEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFV 278

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
           AVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I 
Sbjct: 279 AVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIA 338

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
           VWCGPQPNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F +
Sbjct: 339 VWCGPQPNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQI 398

Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
           + G+E EA+ DMLPY+RL +++D  EM++V  + GP+CPVS C ERAVL+QL  YF+ RL
Sbjct: 399 YKGKEMEAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRL 458

Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 368
           AGY ++ + +     D   + KKRVA +L+ +EK +L   L    ++I  LPD  +SPC 
Sbjct: 459 AGYKSSHATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCI 515

Query: 369 APYAPLL 375
            PY P L
Sbjct: 516 GPYLPNL 522


>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 427

 Score =  501 bits (1289), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 243/367 (66%), Positives = 284/367 (77%), Gaps = 4/367 (1%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  EL
Sbjct: 63  AELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPEEL 122

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
             Y TGS  K  +LER  GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFV
Sbjct: 123 NEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFV 182

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
           AVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD  YK G+ I 
Sbjct: 183 AVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDPIA 242

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
           VWCGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F +
Sbjct: 243 VWCGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTFQI 302

Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
           + G+E EA+ DMLPY+RLG+++D  E+++V  +  P+C VS C ERAVL+Q+  +F+ RL
Sbjct: 303 YKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFERRL 362

Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 368
           AGY    S D     D     K+ VA +L+ +EK +L   L    ++I  LPD  +SPC 
Sbjct: 363 AGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISPCI 419

Query: 369 APYAPLL 375
            PY P L
Sbjct: 420 GPYLPNL 426


>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 425

 Score =  473 bits (1218), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 218/274 (79%), Positives = 248/274 (90%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTTNKLSELACLALYLMYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL
Sbjct: 152 AELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESEL 211

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL GSP + E++ R EGIK+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVA
Sbjct: 212 DYLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVA 271

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           VQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+V
Sbjct: 272 VQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIV 331

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           WCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V 
Sbjct: 332 WCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVC 391

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 283
            G+EK+ IS+MLPYLRLGY+SD  EMQ ++SS G
Sbjct: 392 VGKEKQTISEMLPYLRLGYISDPDEMQCILSSEG 425


>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
 gi|194693232|gb|ACF80700.1| unknown [Zea mays]
 gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
 gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 252

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 173/242 (71%), Positives = 206/242 (85%)

Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
           ++  QKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
           Q NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKE 129

Query: 254 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
           K+ +++MLPYLRLGY+S+  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPT 189

Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 373
           TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPAPYAP
Sbjct: 190 TLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAP 249

Query: 374 LL 375
            L
Sbjct: 250 EL 251


>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
 gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 252

 Score =  363 bits (933), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 166/240 (69%), Positives = 203/240 (84%)

Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
           Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129

Query: 254 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
           K+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189

Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 373
           TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 190 TLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249


>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 182/341 (53%), Positives = 240/341 (70%), Gaps = 9/341 (2%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AELLTT+KLSELACL LYLMYEKK G++S W  +I+ELDR +GRGQ+  +SPLLW E ++
Sbjct: 80  AELLTTDKLSELACLTLYLMYEKKNGRQSVWYEFIKELDRIQGRGQMGAKSPLLWDEGQV 139

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
             YL GSP  AEI ER +GI++EY ELDTVWFMAGSLF+ YPYD+PTEAF+ ++F+Q F 
Sbjct: 140 DEYLAGSPLVAEIKERLKGIEKEYAELDTVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFA 199

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGES 186
           AVQ+ VVHLQ V L++RFALVPLGPPLL+YSS  KAML    +A  VQL VDR Y  GE 
Sbjct: 200 AVQASVVHLQGVPLSKRFALVPLGPPLLSYSSTAKAMLTYNREAKEVQLAVDRSYTKGEP 259

Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           I  WCGPQPN +LL+NYG V ++NP+D++ +   L   DP +Q KR V Q+N   + Q F
Sbjct: 260 IEAWCGPQPNRRLLLNYGIVTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTF 319

Query: 247 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYF 304
            +   R+K     +LPYLRL + +D + ++  +++L   C  P+SP  ER VL QLA + 
Sbjct: 320 QLQ--RDKGLPELLLPYLRLAHCTDAASLK--LATLDTCCAAPISPENERTVLHQLASHL 375

Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           + RL  Y  T  EDE ++      P+++VA +L+R+EK +L
Sbjct: 376 QDRLDRYKTTCEEDEVIIRSTTAGPRQKVAARLLRIEKAIL 416


>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 281

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 166/269 (61%), Positives = 203/269 (75%), Gaps = 29/269 (10%)

Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
           Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129

Query: 254 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
           K+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189

Query: 314 TLSEDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKM 344
           TL+EDEAM                             L D +L PK+ VAT+LVR+EKKM
Sbjct: 190 TLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKM 249

Query: 345 LNACLQVTADMIMLLPDVTVSPCPAPYAP 373
           L+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 250 LHACLQATNEFITDLPDHTVSPCPAQYAP 278


>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
          Length = 435

 Score =  327 bits (839), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 169/341 (49%), Positives = 228/341 (66%), Gaps = 10/341 (2%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AEL+TT KLSELACL LYL YEKK+GK+  W  +I+ELDR +GRG    +SPLLW E + 
Sbjct: 71  AELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLWDEGQA 130

Query: 70  A-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
           A  L GSP   EI  R +GI++EY ELDTVW++AGSLF + P+  PTE F+F +F+QAF 
Sbjct: 131 AELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVFRQAFT 190

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGES 186
           AVQS VVHLQ V+L +RFALVP+GPPLL YSS  KAML    ++  V+L VDR Y+ GE+
Sbjct: 191 AVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAYQPGEA 250

Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           ++ WCGPQPNS+LLINYG VDE NPYD+L +   + ++DP Y+ KR      G  + Q F
Sbjct: 251 VLAWCGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLSTQQTF 310

Query: 247 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI--SSLGPICPVSPCMERAVLDQLADYF 304
            + A     A   +LPYLRL + +  ++++ V      GP+ P +   E  VL+QL  + 
Sbjct: 311 QLQAAASLPA--QLLPYLRLVHSTREADVEGVKWEEEAGPVAPEN---ELTVLNQLITHL 365

Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           + R + Y  T+ EDEA++ D    P+  VA +L+++EK +L
Sbjct: 366 RLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406


>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
 gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
          Length = 488

 Score =  312 bits (800), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 169/342 (49%), Positives = 220/342 (64%), Gaps = 9/342 (2%)

Query: 8   YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
           + AELLTT+KLSELACLALYLMYEKK  KKSFW PYI+ELD+Q+ RG  A ESPLLW + 
Sbjct: 106 FVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESPLLWGDQ 165

Query: 68  EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
           EL + L GSP    + +R  GI++EY  LDTVWFMAGSLF +YP+D+PTE F+FE+F+QA
Sbjct: 166 ELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSFELFQQA 225

Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAG 184
           F  VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS  K M+   +D  +V+LVV  P +AG
Sbjct: 226 FAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVSGPVEAG 285

Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQRNGKLS 242
             +  WCGPQPNS+LL+NYG VDE NP+D+L       L T DP +  KR V    G  +
Sbjct: 286 RPVAAWCGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLSEAGLAT 345

Query: 243 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 302
            Q F V   R       +LPY+ L   +   ++ SV  S          +E A L  L  
Sbjct: 346 QQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDTAGHDRELEAAALAALMA 401

Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
           Y + R A Y   L  D  ++ D +  P+++VA +L ++EK +
Sbjct: 402 YVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443


>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
 gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
          Length = 348

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 130/160 (81%), Positives = 148/160 (92%)

Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
           ++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248

Query: 277 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
           SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308

Query: 337 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 376
           LVR+EKK+LNACLQ TAD+I  LPD++VSPCPAPYAP+L 
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPILK 348



 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 121/128 (94%), Positives = 125/128 (97%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE ELA
Sbjct: 63  ELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAELA 122

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
           YLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+
Sbjct: 123 YLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAI 182

Query: 131 QSCVVHLQ 138
           QSCVVHLQ
Sbjct: 183 QSCVVHLQ 190


>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
          Length = 130

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 108/128 (84%), Positives = 117/128 (91%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL 
Sbjct: 1   ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
           YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61  YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120

Query: 131 QSCVVHLQ 138
           QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128


>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Ectocarpus siliculosus]
          Length = 521

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 167/359 (46%), Gaps = 27/359 (7%)

Query: 8   YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
           + A+++T + LSE   +AL  ++EK +GK+SFW  YI  L          V    LW+E 
Sbjct: 165 FGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-----EVYPTYLWAEE 218

Query: 68  ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           +LA L GSP  A        ++ EY  ++        L  ++P  +P E  T+E F+ AF
Sbjct: 219 DLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEILPREVHTYEEFQWAF 272

Query: 128 VAVQSCVVHLQKVSLARRFALVPL------GPPLLAY-SSKCKAMLAAVDDAVQLVVDRP 180
             + S  + L  +S     ALVP        P   +Y  ++ + +  +  D V +  DR 
Sbjct: 273 AMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLFFSKTDEVVVYADRS 332

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
           YK  E + +  GP+ NS LL+ YGF  + NPY+ + V  +L+  D  Y+ K+      G 
Sbjct: 333 YKKMEQVYISYGPKGNSDLLLLYGFSLDRNPYNSVDVTVSLDENDELYERKKAFLSEAGL 392

Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 300
              + F ++  R  +   ++L YLRL  ++        +  L      +   E  VLD L
Sbjct: 393 PPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLRGRTLEDLSFEKKQTDVNELMVLDSL 449

Query: 301 ADYFKARLAGYPATLSEDEAMLTDYN----LHPKKRVATQLVRMEKKMLNACL-QVTAD 354
            +  KA +AGYP T  +D  ++ D      L   +R+A +  R EK +L   +  VT D
Sbjct: 450 VEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQRMAVKHRRQEKVILRRTIAAVTKD 508


>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 638

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/335 (30%), Positives = 158/335 (47%), Gaps = 27/335 (8%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPT 77
            ++  L L+LMYE+  G+KS W PY++              SP+LW+ E +   L GSP 
Sbjct: 151 GDVIGLTLWLMYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPA 203

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
             E+ +R+  ++ EY +L        S F + P   P E F+ E FK AF  + S  V+L
Sbjct: 204 LEEVQQRSAALEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYL 256

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQ 194
               L   FALVP    L  + +  +A L  +  D AV   VDR YK GE +    G  +
Sbjct: 257 PSADL---FALVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRER 312

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
            N+ LLI YGFVDE+N  D L +E  L   D     K+ + Q+    S Q F ++  R  
Sbjct: 313 SNADLLITYGFVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDR-- 370

Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
              + +L Y+RL  + D +    ++     +  +    E   L  L    + +L  Y   
Sbjct: 371 -FPTQLLTYMRLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGG 427

Query: 315 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
           + ++  +L +  +  ++RVA QL   EKK+L + +
Sbjct: 428 VDDEIRLLKNKKISQRERVAAQLRLCEKKILTSTM 462


>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Galdieria sulphuraria]
          Length = 490

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 100/341 (29%), Positives = 166/341 (48%), Gaps = 39/341 (11%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTK 78
           E   + L L+YEK +G  SF+ PY+  L        L   +PL LWS  +L  L GSPT 
Sbjct: 165 EYVSIGLLLLYEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTL 218

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           +   +  + + REY       ++  ++  Q P +  ++   F+ F+ AF  + S  +   
Sbjct: 219 SACEQLRDKLLREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFP 271

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVW 190
               ++R ALVP    LL +S  C A +          V +AV + VDR Y+  E + V 
Sbjct: 272 S---SKRIALVPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVS 326

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
            GP+ N +LL+ YGF  E NP+D + +   L+  DP Y +K  + +  GK  +Q F ++ 
Sbjct: 327 YGPRSNQELLLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYM 386

Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
            R      +M  +LR   +   +++Q+   ++     VS   E + LD+L +Y   +L  
Sbjct: 387 DRYP---VEMAEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRK 438

Query: 311 YPATLSEDEAMLTDY----NLHPKKRVATQLVRMEKKMLNA 347
           YP +L +DE ++ D      L   +R+A +    EK++L+A
Sbjct: 439 YPTSLEDDEKIIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479


>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 575

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 168/369 (45%), Gaps = 52/369 (14%)

Query: 16  NKLSELACLALYLMYEKK-QGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAY 71
           ++++E   +A +L+YE+  +G++S W PY+    ++D         V     W + +LA+
Sbjct: 166 SEINEYLAMACHLIYERNVRGEESPWKPYLDVLPDIDE--------VNPTFTWPDEDLAF 217

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVA 129
           L GSP  A        ++REY+ L     + G   L  +YP   P EAF F+ ++ AF  
Sbjct: 218 LNGSPVIAATKSLQMKLRREYDAL-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTM 272

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA----------MLAAVDDAVQLVVDR 179
           + S  + L+ +      ALVP    L+ +S   +A          +  + D+ V L  DR
Sbjct: 273 LFSRAIRLRSLKQGETLALVPYAD-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADR 331

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE--------------- 224
            Y+  E I +  GP+ N++LL+ YGF  E NP++ + V  ++                  
Sbjct: 332 GYRRMEQIYISYGPKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPV 391

Query: 225 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 284
           DP  ++K    ++ G+ +   F  +A R      +ML YLRL  ++        ++    
Sbjct: 392 DPLAEEKAAFLEQVGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDY 448

Query: 285 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRM 340
              +S   E AVL  +      +L+ YP +  +D A++ D +L       +R+A +  R 
Sbjct: 449 SRTISLGNEAAVLTSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRN 508

Query: 341 EKKMLNACL 349
           EK++L   +
Sbjct: 509 EKRLLKRTI 517


>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
 gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
          Length = 481

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 100/333 (30%), Positives = 152/333 (45%), Gaps = 27/333 (8%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-GSPTK 78
           EL  LAL+LMYE+++ + S W PY++           +  SPLLW + E   L  GS   
Sbjct: 117 ELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLLWEQEEQEELLRGSSAL 169

Query: 79  AEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           A++ ++   +++ ++ L DT+         +   D P E FTF  FK AF  V S  V+L
Sbjct: 170 AKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSAFKAAFSVVLSRAVYL 220

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPN 196
               L   FALVP G  +   SS+        +  V+L VD+ YK G+ +   +     +
Sbjct: 221 PSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTS 277

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
           +  LI YGF+DE +  D + +E  L + D     KR + Q  G    Q F V+  R    
Sbjct: 278 ADFLIRYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNR---F 334

Query: 257 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 316
            + +L Y RL  + D+     +      I  V    E   L  L    + +L  +  T+ 
Sbjct: 335 PTQLLTYTRLARIQDSGLFAKITFEKDLI--VCQTNEYETLMLLMADCRTKLLSFSDTME 392

Query: 317 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
           +D   L   NL  K+RVA QL   EK++L   +
Sbjct: 393 DDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425


>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
 gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
 gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
          Length = 497

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 76/246 (30%), Positives = 117/246 (47%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L  N + +   LA YL+ E    K S W  YI  L RQ         S L WS+ E
Sbjct: 128 AGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PYSLLYWSQAE 180

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+      M   +F +YP   P E F  E FK +F
Sbjct: 181 LDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVFNIESFKWSF 234

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGE 185
             + S +V L   S+  + ALVP    ++ +S + +  L     +  +V   DRPY+ GE
Sbjct: 235 GILFSRMVRLP--SMDGKNALVPWAD-MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGE 291

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  + ++ G    
Sbjct: 292 QVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGS 351

Query: 244 QVFHVH 249
           Q F + 
Sbjct: 352 QCFPIR 357


>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
 gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
          Length = 494

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 76/246 (30%), Positives = 117/246 (47%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L  N + +   LA YL+ E    K S W  YI  L RQ         S L WS+ E
Sbjct: 125 AGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PYSLLYWSQAE 177

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+      M   +F +YP   P E F  E FK +F
Sbjct: 178 LDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVFNIESFKWSF 231

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGE 185
             + S +V L   S+  + ALVP    ++ +S + +  L     +  +V   DRPY+ GE
Sbjct: 232 GILFSRMVRLP--SMDGKNALVPWAD-MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGE 288

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  + ++ G    
Sbjct: 289 QVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGS 348

Query: 244 QVFHVH 249
           Q F + 
Sbjct: 349 QCFPIR 354


>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
 gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 77/243 (31%), Positives = 113/243 (46%), Gaps = 21/243 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W+  E
Sbjct: 134 AGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAE 186

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+L         +F +YP+  P E F  E FK +F
Sbjct: 187 LDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVFNMETFKWSF 240

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  R ALVP    +L +SS+ +  L        V    DRPY+ GE
Sbjct: 241 GILFSRLVRLP--SMDGRVALVPWAD-MLNHSSEVETFLDYDKSSKGVVFTTDRPYQPGE 297

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++G    
Sbjct: 298 QVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLSVS 357

Query: 244 QVF 246
           Q F
Sbjct: 358 QCF 360


>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
          Length = 516

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 151/343 (44%), Gaps = 34/343 (9%)

Query: 15  TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
            +  +E   +AL L+ E+ +G +SFW  YI  L      G     +   W   ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216

Query: 75  SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
           SP  +        ++ E+         A  L      D   E FTFE ++ AF  + S  
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265

Query: 135 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 184
           + L+        A+VP        P  +     + +  A      +D V L  DR YK  
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325

Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSV 243
           E + +  GP+ N+ LL+ YGF  + NP++ + +    + +D  Y  K   A+  G+ +S 
Sbjct: 326 EQVFISYGPKSNADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSS 385

Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLAD 302
             F ++A R  +   +++ +LR+   + D    + +      +  +S   E AVLD + D
Sbjct: 386 AAFPLYADRFPD---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRD 442

Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
              A +A YPA   +D   + D  L   +R+A +LV  EK++L
Sbjct: 443 ACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482


>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
          Length = 648

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/372 (25%), Positives = 163/372 (43%), Gaps = 50/372 (13%)

Query: 18  LSELACLALYLMYEK-KQGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLT 73
           ++E   +A  L++EK  +G+ SF+  Y+    E+D         V     W + +LA+L 
Sbjct: 247 INEYLAVACQLIHEKFVKGEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLE 298

Query: 74  GSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           GSP  A        ++REY++L     + G   L  ++P   P E +TFE ++ AF  + 
Sbjct: 299 GSPVVAATRSLQMKLRREYDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLF 353

Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA----------VQLVVDRPY 181
           S  + L+ + +  R A+VP    L+ +S+  +A + A +            V L  DR Y
Sbjct: 354 SRAIRLRNLQVGERLAMVPYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGY 412

Query: 182 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-------------TEDPQY 228
           +  E + +  G + N++LL+ YGF  E NPY+ + V  ++               EDP  
Sbjct: 413 RQMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLA 472

Query: 229 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 288
            +K       G+     F  +A R      +ML YLRL  ++        +S       +
Sbjct: 473 DEKLEFLLSVGRDQTVDFPCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTI 529

Query: 289 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKM 344
           S   E +VL  +    K +L  +P T  +D A++ D  +       +R+A +  R EK++
Sbjct: 530 SSANEASVLRSVVAAVKYQLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRL 589

Query: 345 LNACLQVTADMI 356
           L   L      I
Sbjct: 590 LKRTLAALEKQI 601


>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
 gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
          Length = 600

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 95/328 (28%), Positives = 147/328 (44%), Gaps = 25/328 (7%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
           EL  LAL+LMYE+++ + S W PY++            +   L   E +   L GS   A
Sbjct: 117 ELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLWEQEEQEELLRGSSALA 170

Query: 80  EILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           ++ ++   +++ ++ L DT+         +   D P E FTF  FK AF  V S  V+L 
Sbjct: 171 KVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSAFKTAFSVVLSRAVYLP 221

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
              L   FALVP G  +   SS+        +  V+L VD+ YK G+ +   +     ++
Sbjct: 222 SAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSA 278

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
             LI YGF+DE +  D + +E  L + D     KR + Q  G    Q F ++  R     
Sbjct: 279 DFLIRYGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR---FP 335

Query: 258 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 317
           + +L Y RL  + D+     +      I  VS   E   L  L    + +L     T+ +
Sbjct: 336 TQLLTYTRLARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLMADCRTKLLSSSDTMED 393

Query: 318 DEAMLTDYNLHPKKRVATQLVRMEKKML 345
           +   L   NL  K+RVA QL   EK++L
Sbjct: 394 EMQTLRRKNLSYKQRVAAQLRLKEKRIL 421


>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 504

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 116/246 (47%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E++    + +   LA YL+ E    K S W  YI  L RQ         S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PYSLLYWTRTE 187

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   Y +L +       +F ++P+  P E F  E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVFNDETFKWSF 241

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  RFALVP    +L ++ + +  L        V    DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGE 298

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++G  + 
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDALKKHGLSTP 358

Query: 244 QVFHVH 249
           Q F V 
Sbjct: 359 QCFPVR 364


>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
 gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
          Length = 434

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 94/364 (25%), Positives = 158/364 (43%), Gaps = 44/364 (12%)

Query: 16  NKLSELACLALYLMYEKKQ-GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           + ++E   +A  L++EK   G +S W  Y+  L          V     W + +LA+L G
Sbjct: 85  DGINEYLAIACQLIHEKYVLGDESEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDG 139

Query: 75  SPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
           SP  A        ++REY+ L     + G   L  ++P   P E FT+E +  AF  + S
Sbjct: 140 SPVVAATRSLQMKLRREYDAL-----LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFS 194

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD----------DAVQLVVDRPYK 182
             + L+ + +  R A+VP    L+ +S+   A + A +          + V L  DR Y+
Sbjct: 195 RAIRLRNLQVGERLAMVPYAD-LINHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYR 253

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE-------------DPQYQ 229
             E + +  G + N++LL+ YGF  E NPY+ + V  ++                DP  Q
Sbjct: 254 QMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQ 313

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 289
           +K       G+     F  +A R      +ML +LRL  ++        ++       +S
Sbjct: 314 EKVEFLASVGRDQTVDFPCYADRYP---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTIS 370

Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKML 345
           P  E AVL  + +  K +L  YP +  +D  ++ D  L       +R+A +  R EK++L
Sbjct: 371 PANEAAVLSSVVEAVKYQLNLYPQSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLL 430

Query: 346 NACL 349
              L
Sbjct: 431 KRTL 434


>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 499

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 113/246 (45%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L  N + +   LA YL+ E    + S W  YI  L RQ         S L W++ E
Sbjct: 130 AGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PYSLLYWTQAE 182

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+L         +F +YP   P E F  E FK +F
Sbjct: 183 LDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVFNIESFKWSF 236

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+    ALVP    +L +S   +  L        +    DRPY+ GE
Sbjct: 237 GILFSRLVRLP--SMGGNVALVPWAD-MLNHSCDVETFLDYDKTSKGIVFTTDRPYQPGE 293

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  + +  G  + 
Sbjct: 294 QVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLELLKNYGLSAS 353

Query: 244 QVFHVH 249
           Q F + 
Sbjct: 354 QCFPIQ 359


>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
          Length = 504

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 115/246 (46%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E++    + +   LA YL+ E    K S W  YI  L RQ         S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PYSLLYWTRTE 187

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   Y +L +       +F ++P   P E F  E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVFNDETFKWSF 241

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  RFALVP    +L ++ + +  L        V    DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVIFTTDRPYQPGE 298

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++G  + 
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTP 358

Query: 244 QVFHVH 249
           Q F V 
Sbjct: 359 QCFPVR 364


>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
          Length = 504

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 115/246 (46%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E++    + +   LA YL+ E    K S W  YI  L RQ         S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PYSLLYWTRTE 187

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   Y +L +       +F ++P   P E F  E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVFNDETFKWSF 241

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  RFALVP    +L ++ + +  L        V    DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGE 298

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++G  + 
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTP 358

Query: 244 QVFHVH 249
           Q F V 
Sbjct: 359 QCFPVR 364


>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 504

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 115/246 (46%), Gaps = 21/246 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E++    + +   LA YL+ E    K S W  YI  L RQ         S L W+ TE
Sbjct: 135 AGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PYSLLYWTRTE 187

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   Y +L +       +F ++P   P E F  E FK +F
Sbjct: 188 LDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVFNDETFKWSF 241

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  RFALVP    +L ++ + +  L        V    DRPY+ GE
Sbjct: 242 GILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGE 298

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++G  + 
Sbjct: 299 QVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDALKKHGLSTP 358

Query: 244 QVFHVH 249
           Q F V 
Sbjct: 359 QCFPVR 364


>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 508

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 78/247 (31%), Positives = 117/247 (47%), Gaps = 23/247 (9%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L    + +   LA+YL+ E    K S W  YI  L RQ         S L W+  E
Sbjct: 139 AGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYWTRAE 191

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+L         +F +YP   P E F  E FK +F
Sbjct: 192 LDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETFKWSF 245

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
             + S +V L   S+  + ALVP    +L +S + +  L   D + Q VV   DR Y+ G
Sbjct: 246 GILFSRLVRLP--SMDGKVALVPWAD-MLNHSCEVETFLD-YDKSSQGVVFTTDRQYEPG 301

Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
           E + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++G  +
Sbjct: 302 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKHGFSA 361

Query: 243 VQVFHVH 249
            Q F V 
Sbjct: 362 SQCFPVR 368


>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 75/247 (30%), Positives = 118/247 (47%), Gaps = 23/247 (9%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L  N + +   LA YL+ E    + S W  YI  L RQ         S L W+  E
Sbjct: 128 AGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PYSLLYWTRAE 180

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+L         +F ++P+  P E F  E FK +F
Sbjct: 181 LDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVFNMETFKWSF 234

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
             + S +V L   S+  + ALVP    +L +S + +  L   D + Q VV   DR Y+  
Sbjct: 235 GILFSRLVRLP--SMDEKIALVPWAD-MLNHSCEVETFLD-YDKSSQGVVFTTDRTYQPS 290

Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
           E + +  G + N +LL++YGFV  +  NP D++ +  +L   D  Y++K    +++G  +
Sbjct: 291 EQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKHGLST 350

Query: 243 VQVFHVH 249
            Q F + 
Sbjct: 351 SQCFPIQ 357


>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 568

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 157/375 (41%), Gaps = 63/375 (16%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           CLAL+L+ EK     SFW PYI+ L +  G       + L ++  EL  L GSPT    +
Sbjct: 148 CLALHLLVEK-HNHSSFWTPYIKTLPKSYG-------TCLYFTLEELEGLRGSPTFTSAI 199

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQK 139
           +    +  +Y  +         LFQ     +   AFT++ F  A  AV   Q+ V     
Sbjct: 200 KVIATVAIQYTYIH-------DLFQIRKDILHINAFTWDEFIWAMSAVGSRQNQVPQWGH 252

Query: 140 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
            +L+  +AL+P            +       D+ +    R YK GE + ++ GP+PNS L
Sbjct: 253 NALSE-YALIPAWDMCNHDHGDLQTFWDVNSDSTESHAMRAYKKGEQVYIFYGPRPNSDL 311

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAIS 258
           L++ GFV E+N +D L +   L  +    +DK R++   N K+  Q +    G       
Sbjct: 312 LLHAGFVYENNRFDALAIRVRLAPDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----LAV 367

Query: 259 DMLPYLRLGYVSDTSEMQSVI----------------SSLGPICPVSPCMERAVLDQLAD 302
           D++ +LR+ +  +  E+Q V+                +S G +        R  L+   +
Sbjct: 368 DLMAFLRI-HAMNEQELQQVLGAYDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNE 426

Query: 303 YFKARLAG---------YPATLS---------EDEAMLTDYNLHPKKRVATQLVRMEKKM 344
               +LA          YP TL          ED+A L   +L P  R  T L   EK++
Sbjct: 427 LAALQLAEAKCLSLLSLYPTTLQVANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEI 486

Query: 345 LNACLQVTADMIMLL 359
           LN  L    D I LL
Sbjct: 487 LNRTL----DAIRLL 497


>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Spinacia oleracea]
          Length = 491

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 92/340 (27%), Positives = 159/340 (46%), Gaps = 24/340 (7%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N L     +AL+LM EKK G  S W PYI  L         +  S + WSE EL+ L GS
Sbjct: 122 NGLKPWVSVALFLMREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGS 174

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS 132
                 L   E +  E+ +L+    +     Q +P+D+  + F   F + + +AF  ++ 
Sbjct: 175 QLLNTTLGVKELVANEFAKLEEEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEG 232

Query: 133 CVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-W 190
             + L  + LA      P +  P  A+  +  A L + +    L    P KAG+ +++ +
Sbjct: 233 QSLVL--IPLADLANHSPDITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQY 289

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
              + N++L ++YG  +  +  +   +   +   D  Y DK  +A+ NG      F +  
Sbjct: 290 DLNKSNAELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL 349

Query: 251 GREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFK 305
             E+   ++MLPYLRL  +   D   ++S+   S  G +  P+SP  E  +   + D   
Sbjct: 350 --EQPLPANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACT 407

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           + L+GY  T++EDE +L + ++ P+  +A  +   EKK+L
Sbjct: 408 SALSGYSTTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 447


>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 431

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 129/268 (48%), Gaps = 27/268 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSP 76
           L E   LA++L+ E  + + S W PY   L +          S L W+E E+  +LT SP
Sbjct: 83  LPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQWTEEEVNTWLTASP 135

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
            + + LE    +   Y +L        ++F ++P   P++ +T   FK AF  + S +V 
Sbjct: 136 VREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAFKWAFGILFSRLVR 189

Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGP 193
           L  V    + ALVP    +L +S +  + L    +   +V  V DR Y++GE + +  G 
Sbjct: 190 LPSVG---KLALVPWAD-MLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQVFISYGK 245

Query: 194 QPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
           + + +L + YGF+  E N +D + +E  ++++DP ++ K   A   G  S Q F V   R
Sbjct: 246 RSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQRFPV---R 302

Query: 253 EKEAISDMLPYLRL--GYVSDTSEMQSV 278
           +    + +L Y RL     SD +++  +
Sbjct: 303 KDGFPAQLLAYARLIASRTSDPAQLSRI 330


>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
           carboxylase/oxygenase large subunit N-methyltransferase,
           chloroplastic-like [Cucumis sativus]
          Length = 497

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/247 (30%), Positives = 118/247 (47%), Gaps = 23/247 (9%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A E+L  N + +   +A YL+ E    K S W  YI  L RQ         S L W+  E
Sbjct: 128 AGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PYSLLYWTREE 180

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           L  YL  S  +   +ER   +   YN+L         +F ++P   P E F  E FK +F
Sbjct: 181 LDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVFNIETFKWSF 234

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
             + S +V L   S+  + ALVP    +L ++ + +  L   D A Q VV   DR Y+ G
Sbjct: 235 GILFSRLVRLP--SMDGKVALVPWAD-MLNHNCEVETFLD-YDKASQGVVFTTDRAYQPG 290

Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
           E + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K    +++G  +
Sbjct: 291 EQVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRA 350

Query: 243 VQVFHVH 249
            Q F + 
Sbjct: 351 SQCFPIQ 357


>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
           [Spinacia oleracea]
 gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase II [Spinacia oleracea]
          Length = 495

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 159/342 (46%), Gaps = 24/342 (7%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N L     +AL+LM EKK G  S W PYI  L         +  S + WSE EL+ L GS
Sbjct: 122 NGLKPWVSVALFLMREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGS 174

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS 132
                 L   E +  E+ +L+    +     Q +P+D+  + F   F + + +AF  ++ 
Sbjct: 175 QLLNTTLGVKELVANEFAKLEEEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEG 232

Query: 133 ---CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
               ++ L  + + +      +  P  A+  +  A L + +    L    P KAG+ +++
Sbjct: 233 QSLVLIPLADLWVQQANHSPDITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLI 291

Query: 190 -WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
            +   + N++L ++YG  +  +  +   +   +   D  Y DK  +A+ NG      F +
Sbjct: 292 QYDLNKSNAELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDI 351

Query: 249 HAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADY 303
               E+   ++MLPYLRL  +   D   ++S+   S  G +  P+SP  E  +   + D 
Sbjct: 352 VL--EQPLPANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDA 409

Query: 304 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
             + L+GY  T++EDE +L + ++ P+  +A  +   EKK+L
Sbjct: 410 CTSALSGYSTTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 451


>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 590

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 87/342 (25%), Positives = 155/342 (45%), Gaps = 48/342 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L+ +L+ E+K  K+S W PY+  L           + P+ +SE +L++L GSP + ++ E
Sbjct: 230 LSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQVRE 282

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   IKR+Y+++ +V       F +Y         TFE F  A +   S V  LQ ++  
Sbjct: 283 KKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-INEQ 328

Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSK 198
           +  A VPL   L     + K      DD      +Q + D P   GE +    G + NS+
Sbjct: 329 KTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCNSR 384

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKEAI 257
             +NYGF++ DN  + + +    + EDP  + K+ +    G +   +V+ +    +++ +
Sbjct: 385 FFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQNV 442

Query: 258 SDMLPYLRLGYVSDTS-------------EMQSVISSLGP--ICPVSPCMERAVLDQLAD 302
           S+ + YLR   + D S             E    +S   P    P+S   E  +  ++++
Sbjct: 443 SEFMSYLRFILIRDNSKLLMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRISN 502

Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
             +  ++ Y  TL ED+ +L   NL   +R    L   EK++
Sbjct: 503 MCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544


>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 507

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 165/365 (45%), Gaps = 37/365 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL ++E E+ YL  +    ++ 
Sbjct: 81  TLAFHLLCERAD-PNSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVF 132

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FTF+ ++ A  +V +    +  
Sbjct: 133 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 185

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +S         DD  + V  + +K+GE I ++ G + N++
Sbjct: 186 EDGSRVTLALIPLWDMCNHTNSLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAE 245

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  E+N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 246 FVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISA 304

Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+         G++     +  + +      PVS   E  +  +L  + +AR  
Sbjct: 305 QLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARAS 360

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
             L  Y  T+ +D  +L   ++     +A +L R+EK++L   L+  +D   L    +  
Sbjct: 361 LLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEE 420

Query: 366 PCPAP 370
             P P
Sbjct: 421 GTPLP 425


>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
 gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
          Length = 463

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/258 (25%), Positives = 109/258 (42%), Gaps = 47/258 (18%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L ++ +SE A LAL L++EK+ G+ S W PYI  L R        + S   W + ELA +
Sbjct: 98  LLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELAMI 152

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E + R   I+ E+NE+  +       FQ+Y + +     ++  FK A+V    
Sbjct: 153 QESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASFKHAYVTATV 204

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--------------------- 171
           C     ++    + A+VP     + +     AML    D                     
Sbjct: 205 CS-RAWRIDGLEKLAMVPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALF 262

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
             QL  D+ Y AGE + +  GP  N+ L +++GF    NP+D++ +   ++  D   ++K
Sbjct: 263 CAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 322

Query: 232 RMVAQRNGKLSVQVFHVH 249
                      +Q  H H
Sbjct: 323 -----------LQYLHAH 329


>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
          Length = 512

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/365 (23%), Positives = 168/365 (46%), Gaps = 34/365 (9%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 74  ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 125

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 126 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 178

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 179 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 238

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 239 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 297

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 298 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 357

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPA 369
            Y  T+ ED+++L +++L  + ++A +L   EK++L   ++  A    + P + +SP   
Sbjct: 358 TYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDG 412

Query: 370 PYAPL 374
             APL
Sbjct: 413 GKAPL 417


>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
 gi|194699272|gb|ACF83720.1| unknown [Zea mays]
 gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 113/242 (46%), Gaps = 21/242 (8%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
            +++  N + +   +A YL+ E      S W+ YI  L RQ         S L W+  EL
Sbjct: 135 GDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PYSLLYWTRAEL 187

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
            AYL  SP +   ++R   +   YN+L         +F ++P   P E +  E F  +F 
Sbjct: 188 DAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVYNIETFLWSFG 241

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES 186
            + S +V L   S+  R ALVP    +L +S + +  L        +    DR Y+ GE 
Sbjct: 242 ILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQ 298

Query: 187 IVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
           + +  G + + +LL++YGFV ++  NP D + +  +L+  D  Y++K    +RNG    +
Sbjct: 299 VFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESE 358

Query: 245 VF 246
            F
Sbjct: 359 SF 360


>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
 gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
          Length = 467

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/235 (28%), Positives = 112/235 (47%), Gaps = 20/235 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A +++    L E   LA YL+ E   GK S W PYI  L R+ G       S LLW+  +
Sbjct: 99  AGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLWTALD 151

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           + A+L+ +  K   L+    ++  +N+L+   FM          + P E F  E FK AF
Sbjct: 152 VEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESFKWAF 205

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGE 185
             + S +V L   SL ++ AL+P G  +L + ++    L   +   ++   +DR Y++ +
Sbjct: 206 GILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNK 262

Query: 186 SIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
            + +  G + N +LL+ YGFV    N  D + +   L+  D  Y+ K    + +G
Sbjct: 263 EVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317


>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
 gi|223949395|gb|ACN28781.1| unknown [Zea mays]
 gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 489

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 98/340 (28%), Positives = 152/340 (44%), Gaps = 38/340 (11%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL L+ E  +G  S W PY+  L RQ        +S + WSE EL  + G+    ++L
Sbjct: 130 AVALLLLSEVARGADSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLL 178

Query: 83  ERAEGIKREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHL 137
               G+K EY  +E D+V     S  +    D+   + TF+ F  AF  ++S V   +  
Sbjct: 179 STTVGVK-EYVQSEFDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRG 233

Query: 138 QKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
            K++L     LV   P + +  S    K K +    +    L      K+G+ I + +  
Sbjct: 234 DKLALIPFADLVNHSPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDL 292

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
            + N++L ++YGFV+ +   D   V   ++  DP Y DK  +A+ NG      F V    
Sbjct: 293 DKSNAELALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL-- 350

Query: 253 EKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 305
            +     MLPYLRL  +  T           SV   L    P+SP  E ++   + D  K
Sbjct: 351 NEPLPPQMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACK 408

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           + LA Y  T+ EDE +    NL P+  +A  +   EKK+L
Sbjct: 409 SALADYHTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVL 448


>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
 gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
          Length = 439

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 154/345 (44%), Gaps = 36/345 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +ALYL++EK +   S W  YIR L R        ++SPL WSE ELA L G+   + + 
Sbjct: 66  AVALYLLHEKAK-PHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSMN 117

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------H 136
              E +KREY+++ T       + +  P       +T E F  AF  ++S         +
Sbjct: 118 GFKEFLKREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDN 171

Query: 137 LQKVSLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
           L  V LA      F L    P    +  K   + A  +           +  E ++ +  
Sbjct: 172 LALVPLADFVNHGFGLTNEDP---GWKVKSAGVFARQETLTLQAAANCAEKQEVLIQYGK 228

Query: 193 PQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
            + N++L  +YGFVD D  N  D   +   ++  +    DK  +AQ  G  S   F+++ 
Sbjct: 229 KKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY- 287

Query: 251 GREKEAISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFK 305
            R +    DM+ YLRL   + SD+  ++++  +        P+S   E A+ + + +  +
Sbjct: 288 -RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCR 346

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           A L  Y +T+ ED  +L    L  +K++A  +   EK++L   LQ
Sbjct: 347 ATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391


>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
          Length = 459

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 156/360 (43%), Gaps = 53/360 (14%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA++L+ E   G+ SFW PYI  L         +  + L ++  EL  L+GS    E L
Sbjct: 121 ALAIFLILELSAGESSFWHPYISVLPD-------SFNTVLYFNIEELELLSGSAVLDEAL 173

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQK 139
           +    I R+Y     + F    L +  P+    + FT+++++ A  AV   Q+ V   + 
Sbjct: 174 KLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLYRWAVSAVMTRQNAVPWTES 229

Query: 140 VSL----------ARRFALVPL--------GPPLLAYSS-----KCKAMLAAVDDAVQLV 176
             L          A   ALVPL        G  L  Y S     +C AM           
Sbjct: 230 DGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSASMVRCYAM----------- 278

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
             R +  GE + ++ G + N++  I+ GFV EDN YD + ++  ++ +DP +  K  + +
Sbjct: 279 --RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCE 336

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
            +  LS+        R++    D+  +LR+  + D S+ ++   S   I   S    R  
Sbjct: 337 -DHDLSLSGTFALVARDRPVSEDLSTFLRILVLKDASQPEAF--SAEHILTSSDSNARDA 393

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           L  L    +  L  +P +  E E ++ D   + + ++A +L  +E K+L + L+   + +
Sbjct: 394 LTFLVVRIELLLKAFPKSDEEYEDIIKDGASNARVKMAARLRLLESKVLASVLETLGNHV 453


>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 582

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 87/365 (23%), Positives = 164/365 (44%), Gaps = 37/365 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL ++E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FTF+ ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +K+GE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  E+N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISA 379

Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+         G++     +  + +      PVS   E  +  +L  + +AR  
Sbjct: 380 QLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARAS 435

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
             L  Y  T+ +D  +L   ++     +A +L R+EK++L   L+  +D   L    +  
Sbjct: 436 LLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEE 495

Query: 366 PCPAP 370
             P P
Sbjct: 496 GTPLP 500


>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
           tropicalis]
 gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 581

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 87/364 (23%), Positives = 164/364 (45%), Gaps = 37/364 (10%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ E+     SFWLPYI+ L  +        ++PL ++E E+ YL  +    ++  
Sbjct: 156 LAFHLLCERAD-PNSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFS 207

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + +   R+Y      +F    + Q +P    +P  ++FTF+ ++ A  +V +    +   
Sbjct: 208 QYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTE 260

Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
             +R   AL+PL       +          DD  + V  + +K+GE I ++ G + N++ 
Sbjct: 261 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEF 320

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
           +I+ GF  E+N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + 
Sbjct: 321 VIHNGFFFENNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQ 379

Query: 260 MLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR--- 307
           +L +LR+         G++     +  + +      PVS   E  +  +L  + +AR   
Sbjct: 380 LLAFLRVFCMNEDELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASL 435

Query: 308 -LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSP 366
            L  Y  T+ +D  +L   ++     +A +L R+EK++L   L+  +D   L    +   
Sbjct: 436 LLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEG 495

Query: 367 CPAP 370
            P P
Sbjct: 496 TPLP 499


>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Oryza sativa Japonica Group]
 gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 150/349 (42%), Gaps = 56/349 (16%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL L+ E  +G  S W PY+  L RQ        +S + WSE EL  + G+    ++L
Sbjct: 135 AVALLLLREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLL 183

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVV 135
               G+K EY +         S F+    +I +E         TF  F  AF  ++S V 
Sbjct: 184 STTMGVK-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVF 233

Query: 136 HLQKVSLARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAG 184
              +     + AL+P    L+ +S            K K +    D    L      K+G
Sbjct: 234 AELR---GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSG 288

Query: 185 ESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
           E I + +   + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG    
Sbjct: 289 EQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGET 348

Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAV 296
             F +  G  +     MLPYLRL  +  T           +V   L    PVS   E A+
Sbjct: 349 AYFDIVLG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAI 404

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
              + +  K+ L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 405 CQVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 453


>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
          Length = 497

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 155 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 206

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 207 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVXTRQNQIPT 259

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 260 EDGSRVTLALIPLWDXCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 319

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 320 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVFALHFT-EPPISA 378

Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+          ++   S +  + +      PVS   E  +   L D     L 
Sbjct: 379 QLLAFLRVFCXTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 438

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L +++L  + + A +L   EK++L   ++  A
Sbjct: 439 TYKTTIEEDKSVLKNHDLSVRAKXAIKLRLGEKEILEKAVKSAA 482


>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
 gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
          Length = 595

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 157/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P  + +P  E+FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPQAHKLPLKESFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEGELKEHLLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+  L ++ L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKTFLKNHGLSVRATMAVKLRLGEKEILEKAVESAA 483


>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 66/235 (28%), Positives = 110/235 (46%), Gaps = 21/235 (8%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
            +++  N + +   +A YL+ E      S W+ YI  L RQ         S L W+  EL
Sbjct: 135 GDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PYSLLYWTRAEL 187

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
            AYL  SP +   ++R   +   YN+L         +F ++P   P E +  E F  +F 
Sbjct: 188 DAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVYNIETFLWSFG 241

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES 186
            + S +V L   S+  R  LVP    +L +S + +  L        +    DR Y+ GE 
Sbjct: 242 ILFSRLVRLP--SMDGRVVLVPWAD-MLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQ 298

Query: 187 IVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
           + +  G + + +LL++YGFV ++  NP D + +  +L+  D  Y++K    +RNG
Sbjct: 299 VFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNG 353


>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 389

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 69/236 (29%), Positives = 114/236 (48%), Gaps = 23/236 (9%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
            +++ +  + +   LA YL+ E      S W  YI  L RQ         S L W+ TE+
Sbjct: 46  GDVMKSYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PYSLLYWTRTEI 98

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
            AYL  SP +   + R   +   YN+L         +F ++P   P E +  E F+ +F 
Sbjct: 99  DAYLVASPIRERAISRIGDVIGTYNDL------RDRIFSKHPELFPEEVYNMENFRWSFG 152

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGE 185
            + S +V L   S+  + ALVP    +L ++ +  A L   D + Q +V   DR Y+ GE
Sbjct: 153 ILFSRLVRLP--SMDGKVALVPWAD-MLNHNPEVDAFLD-FDKSSQGIVFTTDRSYQPGE 208

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
            + +  G + + +LL++YGFV ++  NP D +    +LN  D  Y++K    +R+G
Sbjct: 209 QVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFSVSLNKSDDCYREKLQALKRHG 264


>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
 gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
          Length = 430

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/345 (26%), Positives = 154/345 (44%), Gaps = 36/345 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +ALYL++EK +   S W  YIR L R        ++SPL WSE ELA L G+   + I 
Sbjct: 66  AVALYLLHEKAK-PHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSIN 117

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------H 136
              E +KREY+++ T       + +  P       +T E F  AF  ++S         +
Sbjct: 118 GFKEFLKREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDN 171

Query: 137 LQKVSLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
           L  V LA      F L    P    +  K   + A  +           +  E ++ +  
Sbjct: 172 LALVPLADFVNHGFGLTNEDP---YWHVKSAGVFARQETLTLQAAANCAEKQEVLMQYGK 228

Query: 193 PQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
            + N++L  +YGFVD D  N  D   +   ++  +    DK  +AQ  G  S   F+++ 
Sbjct: 229 KKGNAQLATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY- 287

Query: 251 GREKEAISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFK 305
            R +    DM+ YLRL   + SD+  ++++  +        P+S   E A+ + + +  +
Sbjct: 288 -RNQGPPEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCR 346

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           A L  Y +T+ ED  +L    L  +K++A  +   EK++L   LQ
Sbjct: 347 ATLREYSSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391


>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
          Length = 649

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 150/349 (42%), Gaps = 56/349 (16%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL L+ E  +G  S W PY+  L RQ        +S + WSE EL  + G+    ++L
Sbjct: 289 AVALLLLREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLL 337

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVV 135
               G+K EY +         S F+    +I +E         TF  F  AF  ++S V 
Sbjct: 338 STTMGVK-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVF 387

Query: 136 HLQKVSLARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAG 184
              +     + AL+P    L+ +S            K K +    D    L      K+G
Sbjct: 388 AELR---GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSG 442

Query: 185 ESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
           E I + +   + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG    
Sbjct: 443 EQIYIQYDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGET 502

Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAV 296
             F +  G  +     MLPYLRL  +  T           +V   L    PVS   E A+
Sbjct: 503 AYFDIVLG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAI 558

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
              + +  K+ L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 559 CQVIRNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 607


>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
 gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
          Length = 499

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 114/237 (48%), Gaps = 25/237 (10%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
            E++  N + +   +A YL+ E      S W  YI  L RQ         S L W+  EL
Sbjct: 131 GEVMKRNSVPDWPLIATYLISEASLEGSSRWSSYIAALPRQ-------PYSLLYWTRAEL 183

Query: 70  -AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
            AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E F  +F
Sbjct: 184 DAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHSDLF-------PEEVYNIETFLWSF 236

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAG 184
             + S +V L   S+  + ALVP    +L +S + +  L   D + Q +V   DR Y+ G
Sbjct: 237 GILFSRLVRLP--SMDEKVALVPWAD-MLNHSPEVETFLD-FDKSSQGIVFTTDRSYQPG 292

Query: 185 ESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
           E + +  G + + +LL++YGFV ++  NP D + +  +L+  D  Y++K    +RNG
Sbjct: 293 EQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNG 349


>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
          Length = 594

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L +++L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
 gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
 gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
 gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
          Length = 594

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L +++L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
          Length = 541

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 138/316 (43%), Gaps = 20/316 (6%)

Query: 38  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 97
           SFWLPY++ L         +  SPL ++  +L  L  SPT +E++ +   I R+Y     
Sbjct: 168 SFWLPYLKILPH-------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFN 220

Query: 98  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLL 156
           + F    L  + P  I  +   ++ ++ A  +V +    +  +   R   AL+PL     
Sbjct: 221 L-FQGHELASKLP--IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCN 277

Query: 157 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 216
             + +     +  +D  +        AG  + ++ G + N++LLI+ GFV   N  DRL 
Sbjct: 278 HTNGQITTDFSLKNDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLT 337

Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
           +   ++  DP +  K  V  R    + ++F +H G      SD L +LR+  +++  +++
Sbjct: 338 IRLGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLR 395

Query: 277 SVISSLGPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 329
           + ++    I         VS   ER     LA      L  YP +  ED  +L   +L  
Sbjct: 396 TALACRQQISKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLST 455

Query: 330 KKRVATQLVRMEKKML 345
             R+A QL   EK +L
Sbjct: 456 HARLAVQLRHCEKNIL 471


>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
          Length = 585

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  +AFT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPVSA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L D++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLKDHDLSVRAAMAIKLRLGEKEILEKAVKSAA 483


>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
          Length = 488

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/339 (23%), Positives = 153/339 (45%), Gaps = 25/339 (7%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 163 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 214

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVS 141
            + +   R+Y      +F        + + +P  ++FT+E ++ A  +V +    +    
Sbjct: 215 SQYKNTARQY-----AYFYKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTED 269

Query: 142 LAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
            +R   AL+PL       S          DD  + V  + ++AGE I ++ G + N++ +
Sbjct: 270 GSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFV 329

Query: 201 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 260
           I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +
Sbjct: 330 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQL 388

Query: 261 LPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
           L +LR+  +++          S +  + +      PVS   E  +   L D     L  Y
Sbjct: 389 LAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTY 448

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
             T+ ED++ L +++L  +  +A +L   EK++L   ++
Sbjct: 449 KTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVK 487


>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
          Length = 594

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       S          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVKSAA 483


>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
          Length = 601

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 163 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 214

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 215 SQYKNTARQY-----AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 267

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       S          DD  + V  + ++AGE I ++ G + N++
Sbjct: 268 EDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 327

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 328 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 386

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 387 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 446

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 447 TYKTTIEEDKSFLKNHDLSARATMAIKLRLGEKEILERAVKSAA 490


>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
 gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
          Length = 467

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 110/235 (46%), Gaps = 20/235 (8%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           A +++    L E   LA YL+ E   GK S W PYI  L R+ G       S LLW+  +
Sbjct: 99  AGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLWTALD 151

Query: 69  L-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           +  +L+ +  K   L+    ++  +N+L+   FM          + P E F  + FK AF
Sbjct: 152 VETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSFKWAF 205

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGE 185
             + S +V L   SL ++ AL+P G  +L + ++    L   +   ++   +DR Y++  
Sbjct: 206 GILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGYESNR 262

Query: 186 SIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
            + +  G + N +LL+ YGFV    N  D + +   L+  D  Y+ K    + +G
Sbjct: 263 EVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317


>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3 [Ovis aries]
          Length = 596

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 157/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 163 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 214

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 215 SQYKNTARQY-----AYFY--RVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 267

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       S          DD  + V  + ++AGE I ++ G + N++
Sbjct: 268 EDGSRVTLALIPLWDMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 327

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 328 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 386

Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 387 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLK 446

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 447 TYKTTIEEDKSFLKNHDLSARATMAVKLRLGEKEILERAVKSAA 490


>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
 gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
          Length = 445

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 145/336 (43%), Gaps = 30/336 (8%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ EK +G  SFW P++  L       +     PL +SE E A + GS      LE
Sbjct: 89  LASWLLEEKHRGADSFWKPFVDSLP------EAYPHVPLFYSEQERARMKGSQ-----LE 137

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           R   ++R+  E +         + Q    +P  E F FE +  A +++ S +  L+    
Sbjct: 138 RLVEVQRQSFEQE---------YAQLREKLPEYERFGFEEYVWARISLYSRLFSLK--GG 186

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLL 200
            +  +LVPL   +  +      + +  +D    +++  R   AG  I    G + +   L
Sbjct: 187 LQGPSLVPLSD-MFNHRQPPDVLWSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFL 245

Query: 201 INYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
           ++ GFV D +   D + +   L   DP    K+ +       +   F V    +  A   
Sbjct: 246 LHSGFVPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWS 305

Query: 260 MLPYLRLGYVSDTSEM---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 316
           +  +LR+ + S    +     ++S    I PVS   E  VL  LA   + RL  +P TL 
Sbjct: 306 VFSFLRMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLE 365

Query: 317 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 352
           EDE +L +  L P +R    L R EK++L   L++T
Sbjct: 366 EDERLLREGPLSPNERSCVLLRRQEKRLLGDYLELT 401


>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
 gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
          Length = 595

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
          Length = 595

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
 gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
 gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
          Length = 595

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           troglodytes]
 gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
           troglodytes]
 gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
           paniscus]
 gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           paniscus]
 gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
 gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
 gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
 gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
          Length = 594

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Nomascus leucogenys]
 gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Nomascus leucogenys]
          Length = 595

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
           gorilla gorilla]
          Length = 594

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKTVKSAA 483


>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Pongo abelii]
 gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Pongo abelii]
          Length = 595

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
          Length = 805

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 83/343 (24%), Positives = 158/343 (46%), Gaps = 29/343 (8%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++  
Sbjct: 336 LAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFS 387

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +   
Sbjct: 388 QYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTE 440

Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
             +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++ 
Sbjct: 441 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEF 500

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
           +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + 
Sbjct: 501 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQ 559

Query: 260 MLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAG 310
           +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L  
Sbjct: 560 LLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKT 619

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
           Y  T+ ED + L +++L  +  +A +L   EK++L   ++  A
Sbjct: 620 YKTTVEEDRSFLRNHDLSVRAAMAVKLRLGEKEILERAVKSAA 662


>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
           novemcinctus]
          Length = 589

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAN-PNSFWQPYIQSLPGE-------YDTPLYFEEDEVRYLHSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
          Length = 597

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 158 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVF 209

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 210 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 262

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 263 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 322

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 323 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 381

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 382 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 441

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 442 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 485


>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Callithrix jacchus]
          Length = 610

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 171 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVF 222

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 223 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 275

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 276 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 335

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 336 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 394

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 395 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 454

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 455 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 498


>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
          Length = 597

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 158 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 209

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 210 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 262

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 263 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 322

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 323 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 381

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 382 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 441

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 442 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 485


>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 160/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+++L + +L  + ++A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
           pulchellus]
          Length = 485

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 80/337 (23%), Positives = 147/337 (43%), Gaps = 23/337 (6%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA++L+ E   G+ SFW PYI  L         +  + L +S  EL  L GS    E L+
Sbjct: 152 LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLYFSVEELELLHGSTVLDEALK 204

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVVHLQKVS 141
               I R+Y+    + F    L +  PY    + FT+++++ A  AV +    V L   +
Sbjct: 205 LHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMTRQNAVPLTDTA 260

Query: 142 LARR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
                       A+VPL         K         + ++    R ++ G+ + ++ G +
Sbjct: 261 GGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLRCYAMRDFEKGQEVTIFYGRR 320

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
            N++  I+ GFV  +N +D + ++  ++ +DP Y  K  +   +      +F +   RE+
Sbjct: 321 TNAEFFIHNGFVFPENRHDSVDIKLGISKQDPLYAVKAKLCDDHELTPSGIFAL-VPRER 379

Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
               D+  +LR+  + D S+  S       I   +    R  L+ L    +  L  +P +
Sbjct: 380 PVCEDLSTFLRILVLKDASQAASFTDE--HIMVATDDNAREALNFLIVRIQLLLRAFPKS 437

Query: 315 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 351
             E E ++ D   + + ++A QL  +E+K+L A L+ 
Sbjct: 438 DQEYENIIADEGSNGRLKMAAQLRLLERKILTAVLET 474


>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 507

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 67/222 (30%), Positives = 108/222 (48%), Gaps = 23/222 (10%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEIL 82
           LA YL+ E      S W  YI  L RQ         S L W+ TE+ AYL  SP +   +
Sbjct: 152 LATYLISEASLEGSSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAI 204

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
            R   +   YN+L         +F ++P   P + +  E F+ +F  + S +V L+  S+
Sbjct: 205 SRISDVIGTYNDLRD------RIFSKHPDLFPEKVYNMENFRWSFGILFSRLVRLE--SM 256

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKL 199
             + ALVP    +L +S +  A L   D + Q +V   DR Y+ GE + +  G + + +L
Sbjct: 257 GGKVALVPWAD-MLNHSPEVDAFLD-YDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGEL 314

Query: 200 LINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
           L++YGFV ++  NP D +    +L   D  Y++K    +++G
Sbjct: 315 LLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQALKKHG 356


>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
          Length = 507

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/333 (25%), Positives = 153/333 (45%), Gaps = 25/333 (7%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ EK +  +S W  Y+  L            S + WSE EL  + G+      L
Sbjct: 124 SVALFLIREKLR-DESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTL 175

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVS 141
              E ++ E+ +++    +  S    +P  +    + F I + +AF  ++      Q + 
Sbjct: 176 GVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG-----QNLV 230

Query: 142 LARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
           L     L+   P +    Y+ + K A L + D    L      KAGE +++ +   + N+
Sbjct: 231 LIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNA 290

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
           +L ++YGF++     +   +   ++  DP + DK  +A+ NG   +  F +  G+   A 
Sbjct: 291 ELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA 350

Query: 258 SDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKARLAGYP 312
             MLPYLRL  +   D   ++S+  +   G +  PVS   E  +   + D  K+ L+GY 
Sbjct: 351 --MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYL 408

Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
            T+ EDE +  + NLHP+  +A  +   EKK+L
Sbjct: 409 TTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441


>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
          Length = 591

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 154/345 (44%), Gaps = 31/345 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R + AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVISSLGPI----------CPVSPCMERAVLDQLADYFKARL 308
            +L +LR+ +     E++  +   G I           PVS   E  +   L D     L
Sbjct: 380 QLLAFLRV-FCMTEEELREHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLL 438

Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             Y  T+ ED+A+L    L  +  +A +L   EK++L   ++  A
Sbjct: 439 KTYKTTIEEDKAVLRSPALSARAAMAVKLRLGEKEILEKAVRSAA 483


>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
          Length = 595

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCER-ASPNSFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
           guttata]
          Length = 593

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 160/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL + E E+ +L  +    ++ 
Sbjct: 156 TLAFHLLCERAN-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT++ ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LA 309
            +L +LR+  +S+    + +I   ++G I  +         D   +L  + +AR    L 
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+  D++ L  ++L P   +A +L   EK++L   ++  A
Sbjct: 440 TYKTTVEVDKSFLETHDLTPHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
           africana]
          Length = 597

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ +L  +    ++ 
Sbjct: 156 TLAFHLLCERAN-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRHLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+A L  ++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKAFLKGHDLSIRATMAVKLRLGEKEILERAVKSAA 483


>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 154/338 (45%), Gaps = 25/338 (7%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L     +AL+L+ EK +  +S W  Y+  L            S + WSE EL  + G+  
Sbjct: 119 LKPWVSVALFLIREKLR-DESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQL 170

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVH 136
               L   E ++ E+ +++    +  S    +P  +    + F I + +AF  ++     
Sbjct: 171 SNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG---- 226

Query: 137 LQKVSLARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
            Q + L     L+   P +    Y+ + K A L + D    L      KAGE +++ +  
Sbjct: 227 -QNLVLIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDL 285

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
            + N++L ++YGF++     +   +   ++  DP + DK  +A+ NG   +  F +  G+
Sbjct: 286 DKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQ 345

Query: 253 EKEAISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 307
              A   MLPYLRL  +   D   ++S+  +   G +  PVS   E  +   + D  K+ 
Sbjct: 346 SLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSA 403

Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           L+GY  T+ EDE +  + NLHP+  +A  +   EKK+L
Sbjct: 404 LSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441


>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
          Length = 502

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 112/243 (46%), Gaps = 23/243 (9%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
             +L  N + +   +A YL+ E      S W  YI  L RQ         S L W+  EL
Sbjct: 134 GNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYWTRPEL 186

Query: 70  -AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
            AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E F+ +F
Sbjct: 187 DAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLETFRWSF 239

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  R ALVP    +L +S + +  L        +    DR Y+ GE
Sbjct: 240 GILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGE 296

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + + +LL++YGFV ++  NP D + +  +LN  D  Y++K    +RNG    
Sbjct: 297 QVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEF 356

Query: 244 QVF 246
           + F
Sbjct: 357 ESF 359


>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
          Length = 502

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 112/243 (46%), Gaps = 23/243 (9%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
             +L  N + +   +A YL+ E      S W  YI  L RQ         S L W+  EL
Sbjct: 134 GNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYWTRPEL 186

Query: 70  -AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
            AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E F+ +F
Sbjct: 187 DAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLETFRWSF 239

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGE 185
             + S +V L   S+  R ALVP    +L +S + +  L        +    DR Y+ GE
Sbjct: 240 GILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGE 296

Query: 186 SIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            + +  G + + +LL++YGFV ++  NP D + +  +LN  D  Y++K    +RNG    
Sbjct: 297 QVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEF 356

Query: 244 QVF 246
           + F
Sbjct: 357 ESF 359


>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
           garnettii]
          Length = 595

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIYDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKFVLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
           boliviensis boliviensis]
          Length = 513

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 159/344 (46%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 74  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVF 125

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 126 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 178

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 179 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAE 238

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 239 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 297

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 298 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 357

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  + ++A +L   EK++L   ++  A
Sbjct: 358 TYKTTIEEDKFVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401


>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
          Length = 585

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 154/344 (44%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 145 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRSLQSTQAVHDVF 196

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 197 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 249

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 250 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 309

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 310 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-MEPPISA 368

Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+          ++   + +  + +      PVS   E  +   L D     L 
Sbjct: 369 QLLAFLRVFCMTEEELKDHLLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLK 428

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T  ED++ L +++L  + R+A +L   EK++L   +   A
Sbjct: 429 TYKTTSEEDKSFLKNHDLSVRARMAIKLRLGEKEILEKAVTSAA 472


>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
           porcellus]
          Length = 789

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 158/343 (46%), Gaps = 29/343 (8%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ E+     SFWLPYI+ L  +        ++PL + E E+  L  +    ++  
Sbjct: 157 LAFHLLCERAN-PNSFWLPYIQTLPSE-------YDTPLYFEEEEVQCLQSTQAIHDVFS 208

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +   
Sbjct: 209 QYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTE 261

Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
             +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++ 
Sbjct: 262 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEF 321

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
           +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + 
Sbjct: 322 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQ 380

Query: 260 MLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAG 310
           +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L  
Sbjct: 381 LLAFLRVFCMTEEELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKT 440

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
           Y  T+ ED+A+L    L  + ++A +L   EK++L   +Q  A
Sbjct: 441 YKTTIEEDKAVLKGPELPTRMKMAVKLRLGEKEILERTVQSAA 483


>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
 gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
          Length = 520

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 152/351 (43%), Gaps = 39/351 (11%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTK 78
           EL  LAL+L  E+ +G  S W PY++ L       +   ++PL W++  + A L GSP  
Sbjct: 121 ELIGLALWLCAERIKGGASEWAPYVKTL-------RANPDAPLFWTDAKDFALLKGSPVA 173

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           A+ +ER++  + EY  +  V        +  P   P EA+ F    +   A+ +      
Sbjct: 174 ADAIERSKSARTEYASITEV-------IKSDPSSYPPEAYEFLTEARFVDALATVCAKAT 226

Query: 139 KVSLARRFALVPL----------GPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYKA 183
            +  A+ +ALVPL           P +L  S+     +C      VD A  ++      A
Sbjct: 227 WLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDVDTASVVLRCATKAA 286

Query: 184 GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 241
             S V+      + N +L +N G+VD+ +P D + ++  + T D  +  K+ V +  G  
Sbjct: 287 ANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTDIQTSDRLFTAKKQVLEGMGFT 346

Query: 242 SV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 300
           +  Q F V+  R     + +  YLR   V D  EM +V      I  VS   E  +L  L
Sbjct: 347 AADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVSFEEDKI--VSVMNEYEILQIL 401

Query: 301 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 350
               +  +A Y     ++  +L   +  P + +    +RM EKK++ A + 
Sbjct: 402 MGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLRMSEKKLIGATMN 452


>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 114/242 (47%), Gaps = 26/242 (10%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           ELL T  ++E A LAL+++ E+  G+ S W PYI  L         A+ S + W + EL 
Sbjct: 35  ELLPTG-VTEWARLALFILVEQHLGQASQWAPYINCLPTCG-----ALHSTVFWKKEELE 88

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
            +  +    E ++R   I  E+  +  V        Q+ P+ I  E      FKQA+   
Sbjct: 89  LVRFTSLHRETMQRRAVIGSEFASVLPV-------LQKCPH-IFGERVLHSKFKQAYATG 140

Query: 131 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIV 188
           +S      + S   R   VP       + S C+A+L+  ++    +++ D+ Y  GE +V
Sbjct: 141 KSL-----RRSSNTRILTVPF-VDFFNHDSNCRALLSYDEERACAEVIADKNYARGEQVV 194

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 248
           +  G  PN+ L +++GF    NPYD++ V  AL+  DP  + K  +   +G  +V    V
Sbjct: 195 ISYGRLPNTTLALDFGFTISCNPYDQVEVWMALSHRDPLRKMKLALLHAHGMPTV----V 250

Query: 249 HA 250
           HA
Sbjct: 251 HA 252


>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
           ferrumequinum]
          Length = 594

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFGEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y   + ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTNIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
 gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
          Length = 449

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 150/341 (43%), Gaps = 27/341 (7%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ EK + + SFW PYI  L           + PL +   E A L G      +  
Sbjct: 95  LASFLLQEKHR-EGSFWKPYIDSLPESYS------QMPLFYGSEEHALLKGCFALTLLTH 147

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +A+ ++ +Y           SL Q  P     E FT   F  A ++V S +  L+K    
Sbjct: 148 QAQSLREDYL----------SLCQNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFL 194

Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLI 201
            +  LVP+   +L +      +    +D    V+  +    AG+ +    G + N  +L+
Sbjct: 195 GQ-TLVPMAD-MLNHRRPPDVLWETTEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLL 252

Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 261
           ++GFV +DN +D   +   +   DP    K+M+       + + F +             
Sbjct: 253 HFGFVTDDNEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAF 312

Query: 262 PYLRLGYV--SDTSEMQS-VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 318
            +LR+     +D  ++ S V+S    + P+S   E  VL+ LA   +ARL+ +P +L++D
Sbjct: 313 SFLRIAAAVPNDIEDISSRVMSGERALGPLSVENEENVLELLAATCQARLSIFPTSLAQD 372

Query: 319 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 359
           E +L   +L P  R    + R EK+++   L++T   + LL
Sbjct: 373 EELLRGESLSPNARNCVLVRRAEKQLIEDYLEMTRVCLKLL 413


>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 588

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  +AFT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
          Length = 589

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  +AFT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLRNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
          Length = 591

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 82/342 (23%), Positives = 155/342 (45%), Gaps = 37/342 (10%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++  
Sbjct: 157 LAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFS 208

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + +   R+Y      +F    + Q +P+   +P  +AFT+E ++   V   S  +    +
Sbjct: 209 QYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRLGLV---SLALGRWAL 258

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
            L     +   G P +      +      DD  + V  + ++AGE I ++ G + N++ +
Sbjct: 259 GLECGVGIARCGKPQITTGYNLE------DDRCECVALQDFRAGEQIYIFYGTRSNAEFV 312

Query: 201 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 260
           I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +
Sbjct: 313 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQL 371

Query: 261 LPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAGY 311
           L +LR+  +++    + ++         +LG    PVS   E  +   L D     L  Y
Sbjct: 372 LAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTY 431

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             T+ ED+A L ++NL  +  +A +L   EK++L   ++  A
Sbjct: 432 KTTIEEDKAFLKNHNLSVRATMAIKLRLGEKEILEKAVKSAA 473


>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
           rubripes]
          Length = 598

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 155/345 (44%), Gaps = 39/345 (11%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LAL+L+ E+     SFWLPYIR L ++        ++PL + + E+  L G+    ++L
Sbjct: 156 TLALHLLCERAN-PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVL 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-- 137
            +     R+Y      +F    L Q +P    +P  ++FTF+ ++ A  +V +    +  
Sbjct: 208 SQYRNTARQY-----AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 260

Query: 138 ---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
              ++V+LA    L+PL       +          DD  + V  + YK  E I ++ G +
Sbjct: 261 EDGRQVTLA----LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTR 316

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
            N++ +I+ GF  ++N +D++ ++  ++  +  Y  K  V  R G     +F ++   E+
Sbjct: 317 SNAEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQ 375

Query: 255 EAISDMLPYLRLGYVSDTSEMQ---------SVISSLGPI-CPVSPCMERAVLDQLADYF 304
              + +L +LR+ +     E++         + I +LG +  PVS   E  +   L    
Sbjct: 376 PISAQLLAFLRV-FCMKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRV 434

Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
              L  Y  T  ED + L    L P  R+A QL   EK +L   L
Sbjct: 435 ALLLKAYKTTSEEDSSTLEKSELSPHSRMAIQLRLAEKWILEKAL 479


>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
          Length = 492

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 150/328 (45%), Gaps = 29/328 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQL 337
            Y  T+ ED+++L + +L  + ++A +L
Sbjct: 440 TYKTTIEEDKSVLKNQDLSVRAKMAIKL 467


>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
          Length = 585

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 83/344 (24%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  +AFT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSA 379

Query: 259 DMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + +        I +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
           caballus]
          Length = 609

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 81/344 (23%), Positives = 154/344 (44%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 170 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAVHDVF 221

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 222 SQYKNTARQY-----AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 274

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 275 EDGSRVTLALIPLWDMCNHTTGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAE 334

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 335 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 393

Query: 259 DMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+          ++   + +  + +      PVS   E  +   L D     L 
Sbjct: 394 QLLAFLRVFCMTEEELKDHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLK 453

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED A L + +L  +  +A +L   EK++L   ++  A
Sbjct: 454 TYKTTVEEDRAFLKNSDLSVRATMAIKLRLGEKEILEKAVKSAA 497


>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 591

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 TLAFHLLCERAD-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  +AFT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++    + ++         +LG    PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED++ L +++L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKSFLKNHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
 gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
          Length = 389

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/222 (26%), Positives = 107/222 (48%), Gaps = 20/222 (9%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L +  +S  A LAL+L+  +K+ + S W PYI  L         ++ S + W++ EL YL
Sbjct: 36  LLSKDVSAWAKLALFLLAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYL 90

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             SP   E ++R + ++ E+   +    +        P+   +     E FK A+  V S
Sbjct: 91  KVSPVYRETVQRKDVVRMEFAAAENALLLC-------PHIFGSRVSALE-FKHAYATVCS 142

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
               ++ +   +  ALVP       + + C+AML+  +D    ++V DR Y  G+ +V+ 
Sbjct: 143 RAWGIETI---KSLALVPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVIS 198

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDPQYQDK 231
            G   N+ L +++GF    NP+D++  +  +L+ +DP    K
Sbjct: 199 YGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDPLRDSK 240


>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 444

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 151/343 (44%), Gaps = 37/343 (10%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+L+ E+     SFWLPYIR L ++        ++PL + + ++  L G+    ++L 
Sbjct: 75  LALHLLCERAD-PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLS 126

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL--- 137
           +     R+Y      +F    L Q +P    +P  ++FTF+ ++ A  +V +    +   
Sbjct: 127 QYRNTARQY-----AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTE 179

Query: 138 --QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
             ++V+LA    L+PL       +          DD  + V  + YK  E I ++ G + 
Sbjct: 180 DGRQVTLA----LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRS 235

Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
           N++ +I+ GF  ++N +D++ ++  ++  +  Y  K  V  R G     VF ++   E  
Sbjct: 236 NAEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPP 294

Query: 256 AISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
             + +L +LR+          Y+     +  +++      PVS   E  +   L      
Sbjct: 295 ISAQLLAFLRVFCMMEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAAL 354

Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
            L  Y  T  ED + L   +L P  R+A QL   EK +L   L
Sbjct: 355 LLKAYKTTAEEDSSTLDKTDLSPHSRMAVQLRLAEKAILEKAL 397


>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
          Length = 593

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 85/348 (24%), Positives = 160/348 (45%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +         +PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAN-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT++ ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+  +S+    + +I         +LG    PVS   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             L  Y  T+ +D++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
 gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 81/333 (24%), Positives = 143/333 (42%), Gaps = 25/333 (7%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+L+ E+     S WLPYI+ L  +        ++PL + E E+ +L  +    ++L 
Sbjct: 157 LALHLLCERAN-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLS 208

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           + +   R+Y      +F            +P  +AFTF+ ++ A  +V +    +     
Sbjct: 209 QYKNTARQY-----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADG 263

Query: 143 AR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
           +R   AL+PL       +          DD  + V  + YK GE I ++ G + N++ +I
Sbjct: 264 SRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVI 323

Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 261
           + GF  EDN +DR+ ++  ++  +  Y  K  V  R G  +  +F +H   E    + +L
Sbjct: 324 HNGFFFEDNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLL 382

Query: 262 PYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
            +LR+          Y+     +  + +      PVS   E  +   L       L  Y 
Sbjct: 383 AFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYK 442

Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
               ED +ML   +L    R+A +L   EK++L
Sbjct: 443 TASEEDRSMLEKPDLSLHSRIAIKLRLAEKEIL 475


>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
           3-like [Meleagris gallopavo]
          Length = 593

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/348 (24%), Positives = 160/348 (45%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAN-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT++ ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+  +++    + +I         +LG    P+S   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             L  Y  T+ +D+  L  ++L     +A +L   EKK+L   ++  A
Sbjct: 436 LLLKTYKTTVEDDKLFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483


>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
 gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
          Length = 529

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 147/358 (41%), Gaps = 48/358 (13%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPT 77
            EL  LAL+L  E+ +G  S W PY+  L           + PLLW+  EL   L GSP 
Sbjct: 118 GELVGLALWLCLERHKGPLSEWAPYVATLP------SAGSDHPLLWTAGELQTLLQGSPV 171

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVH 136
           + + + R E    EY  +           +  P D P +A+ F + + AFV A+ + +  
Sbjct: 172 REQAVSRLESADDEYASI-------ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLAR 223

Query: 137 LQKVSLARRFALVPL-------------------------GPPLLAYSSKCKAMLAAVDD 171
              ++ A  +A+VPL                         G P LA ++      AA + 
Sbjct: 224 AVWLNAANCYAMVPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATEC 283

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
              +  +   +    + V    +    L +  G VDE +  D L   A+    D  Y+ K
Sbjct: 284 VAVVSANDAQQTARVVCVDPLARNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAK 343

Query: 232 RMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 290
           R + +  G     Q F V A R       +L Y+R   V D  E+ SV      I  VSP
Sbjct: 344 RQILEGMGMSADGQTFPVFADRMP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSP 398

Query: 291 CMERAVLDQLADYFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 347
             E  VL  L    +  LA Y ++  E E + L +  L  ++RVA +L   EK+++NA
Sbjct: 399 MNEYEVLQLLMQDAREMLAEYESSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456


>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
          Length = 466

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 28  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  E+FT+E ++ A  +V +    +  
Sbjct: 80  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 251

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 311

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 312 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 492

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 152/341 (44%), Gaps = 38/341 (11%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            ++L ++ E  +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +
Sbjct: 132 SVSLLILREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTM 184

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E ++ E++ ++    + G     +P     +  TF+ F  AF  ++S V    +   
Sbjct: 185 GVKEYVQSEFDNVEAK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR--- 234

Query: 143 ARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-W 190
             + AL+P    L+ +S+           + K  L   D    L      K+GE + V +
Sbjct: 235 GDKLALIPFAD-LINHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQY 292

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
              + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F V  
Sbjct: 293 DLDKSNAELALDYGFTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVL 352

Query: 251 GREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFK 305
           G  +     M+ YLRL  +  T    ++++  +   G +  PVS   E ++   +    K
Sbjct: 353 G--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACK 410

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
           + L  Y  T+ EDE +L   +L  + ++A ++   EKK+L 
Sbjct: 411 SALTAYHTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVLQ 451


>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
 gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
          Length = 505

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 91/337 (27%), Positives = 141/337 (41%), Gaps = 59/337 (17%)

Query: 17  KLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYL 72
           +L E + LA +L  +    + G    +  YIR L R+ G       S L W E E+   L
Sbjct: 131 ELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------SVLDWPEDEVETLL 183

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
            GSP++    ER E +     E+ +           +P DI   A  +     AF  + S
Sbjct: 184 KGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALRW-----AFDILFS 227

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
            ++ L   ++    ALVP    +L +   C A +     AV L  DR Y AGE +    G
Sbjct: 228 RLIRLD--AMGGELALVPWAD-MLNHKPGCAAFIDLNGSAVNLTTDRAYAAGEQVWASYG 284

Query: 193 PQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH- 249
            +P+S+LLI+YGF  E  +NP D   +   ++  DP  Q K  V +R G   V+ F +  
Sbjct: 285 QRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRRMGLSPVETFPLRL 344

Query: 250 AGREKEAI-------------SDMLPYLRLGYVSDTSEMQSVISSL------------GP 284
            G  ++ +             S++    R  +    +  QS+  S+            G 
Sbjct: 345 NGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVRGLAQGQARGKQGV 404

Query: 285 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 321
           I    P  E AV + LAD     L+ YP +L +D+ +
Sbjct: 405 ILGGVPG-EIAVREMLADMCAEALSAYPNSLEKDKGI 440


>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
 gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
           gallus]
 gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
          Length = 593

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 161/348 (46%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 TLAFHLLCERAN-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT++ ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+  +++    + +I         +LG    P+S   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             L  Y  T+ +D++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483


>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 142/333 (42%), Gaps = 25/333 (7%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+L+ E+     S WLPYI+ L  +        ++PL + E E+ +L  +    ++L 
Sbjct: 157 LALHLLCERAN-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLS 208

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           + +   R+Y      +F            +P  +AFTF+ ++ A  +V +    +     
Sbjct: 209 QYKNTARQY-----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADG 263

Query: 143 AR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
           +R   AL+PL       +          DD  + V  + YK GE I ++ G + N++ +I
Sbjct: 264 SRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVI 323

Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 261
           + GF  EDN +DR+ ++  ++  +  Y  K  V  R G  +  +F +H   E    + +L
Sbjct: 324 HNGFFFEDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLL 382

Query: 262 PYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
            +LR+          Y+     +  + +      PVS   E  +   L       L  Y 
Sbjct: 383 AFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYK 442

Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
               ED +ML   +L    R+  +L   EK++L
Sbjct: 443 TASEEDRSMLEKPDLSLHSRITIKLRLAEKEIL 475


>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
 gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=Endothelial differentiation inhibitory protein D10;
           AltName: Full=SET domain-containing protein 3
 gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
 gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
 gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
 gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
 gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
 gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
          Length = 594

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  E+FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379

Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
           latipes]
          Length = 606

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 148/335 (44%), Gaps = 29/335 (8%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+L+ E+     SFW PYIR L ++        ++PL + + ++  L G+    ++L 
Sbjct: 157 LALHLLCERGD-PASFWSPYIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLN 208

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + +   R+Y      +F    L Q +P    +P  + F+F+ ++ A  +V +    +  V
Sbjct: 209 QYKNTARQY-----AYFY--KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTV 261

Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
             +R   AL+PL       +          DD  + V  + YK  E I ++ G + N++ 
Sbjct: 262 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEF 321

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
           +I+ GF  +DN +DR+ ++  ++  +  Y  K  V  R G  +  VF +H   +    + 
Sbjct: 322 VIHNGFFFQDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQ 380

Query: 260 MLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
           +L +LR+          Y+     +  + +      PVS   E  +   L       L  
Sbjct: 381 LLAFLRVFCMTEEELKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKT 440

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           Y  T  ED ++L   +L    R+A QL   EK++L
Sbjct: 441 YKTTSEEDRSILEKPDLSLHTRLAVQLRLAEKQIL 475


>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
          Length = 597

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 159 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 210

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  E+FT+E ++ A  +V +    +  
Sbjct: 211 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 263

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 264 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 323

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 324 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 382

Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 383 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 442

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 443 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 486


>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/227 (25%), Positives = 105/227 (46%), Gaps = 19/227 (8%)

Query: 15  TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           +  ++E + LAL+ +  K  GK S W PYIR L   RG     +++ + W + EL  L  
Sbjct: 99  SEDVNEWSRLALFQLLHKHAGKASPWEPYIRCLPPLRG-----LQNTVFWRDEELELLRQ 153

Query: 75  SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
           S    +   R   I  +++ +  V         +YP ++  E  T E FK A+    S  
Sbjct: 154 SNVYDQTEHRKTLISNQFDLVQAV-------VNKYP-ELFGETVTLESFKHAYCVASSRS 205

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCG 192
             ++ +       +VP    +  + S  +A+LA  ++    ++V D+ Y  G  +V+  G
Sbjct: 206 WGVEALG---SITMVPF-VDMFNHDSSARALLAYYEEEGYAEVVADKDYNQGSQVVITYG 261

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
             PNS L +++GF   DNP+D + +     + DP   +K  + + +G
Sbjct: 262 TLPNSSLALDFGFTLPDNPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308


>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
           variabilis]
          Length = 398

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/346 (27%), Positives = 153/346 (44%), Gaps = 34/346 (9%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L++E+  G  S W PYI  L    G       SP+ W E +LA L GS     + 
Sbjct: 44  AIALFLLHERAMGNASRWAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQ 96

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VH 136
                 ++ +++L    F   S      +D P   F F+ F  A   V++         +
Sbjct: 97  GYRAYFQQRFDQLQAEVFGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGN 150

Query: 137 LQKVSLARRFALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGP 193
           +  V LA      P  PP  A +  K    L        LV++      AG++I +  GP
Sbjct: 151 IALVPLADMVRSQPSWPPDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGP 210

Query: 194 Q-PNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-H 249
           Q  + +LL+++G +D   + P   L +E  L+ ED  Y DK  + + N +L+    H+  
Sbjct: 211 QKSDGQLLVDHGVIDPLVNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILR 267

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKA 306
           A R  +A    +  L     +D   ++S+  +        PVS   ER    QL D   A
Sbjct: 268 ADRAPDAGLLPVLRLLNLSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTA 327

Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRM-EKKMLNACLQ 350
            LA YP ++ ED A++   +L P  R  + + VR+ EK+ L+A L+
Sbjct: 328 ALAAYPTSIDEDLALMASGSLQPGSRRQSAVRVRLGEKEALDATLR 373


>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 144/350 (41%), Gaps = 43/350 (12%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTK 78
           EL  LAL+L  E+ +G  S W PY++ L           ++PL W+E E  A L GSP  
Sbjct: 120 ELVGLALWLCAERIKGGASDWAPYVKTL-------AANPDAPLFWTEAEDFALLKGSPIV 172

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVV 135
            + +ER+   + EY  +  V        +  P   P EA   FT E F  A   V +   
Sbjct: 173 NDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEAYEFFTEERFVDALATVCAKAT 225

Query: 136 HLQKVSLARRFALVPL-------GPPLLAY---SSKCKAMLAAVD---DAVQLVVDRPYK 182
            L   S    +ALVPL       G P+      S+K      A D   D+  +V+    K
Sbjct: 226 WLPTASC---YALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAVVK 282

Query: 183 A-GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
           A   S VV   P  + N +L +N G VD+ +P D L +   +   D  +  K+ V +  G
Sbjct: 283 APANSRVVQLDPLQRNNGELFLNTGRVDQKHPGDYLYMRTEIQPSDRLFSAKKQVLEGMG 342

Query: 240 -KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 298
                Q F V+   E    + +  YLR   V D  EM +V      I  VS   E  +L 
Sbjct: 343 FTAENQYFPVY---EDRMPTQLYSYLRFARVQDPGEMMAVSFEEDKI--VSVMNEYEILQ 397

Query: 299 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 348
            L    +  ++ Y     ++  +L   +    + +    +RM +K L  C
Sbjct: 398 LLMGDCRELMSEYDTNEEDELNLLKLSDTMRVREIEAAKLRMSEKKLIGC 447


>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
           griseus]
 gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
          Length = 577

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 155/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
 gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
          Length = 594

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  E+FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 SVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379

Query: 259 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
 gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
          Length = 371

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 94/206 (45%), Gaps = 35/206 (16%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEIL 82
           L+L+L+ EK + ++S W PY+R L       QL  +ES + W + ELA+L  SPT  E +
Sbjct: 51  LSLFLLSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETM 104

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           E  + IK E++ L+   F        +  D+  E  +   F  A+   Q           
Sbjct: 105 ECLKIIKSEFHVLEANVF-------PWCRDVLGEV-SLTDFMHAYSTDQ----------- 145

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
                 +P       +   C+  L+     D    V D+ YKAG+ I +  G  PNS L 
Sbjct: 146 ------IPFA-DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILA 198

Query: 201 INYGFVDEDNPYDRLVVEAALNTEDP 226
           ++YGF    NP++++ V   ++  DP
Sbjct: 199 VDYGFAVASNPHEQVEVPMGVSLTDP 224


>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
          Length = 450

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 10  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 62  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 174

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 175 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 233

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 234 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 293

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 294 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337


>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
          Length = 468

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 28  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 80  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 251

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 311

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 312 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 156 ALAFHLLCERAS-PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 380 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 439

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 440 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
 gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
          Length = 540

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 86/354 (24%), Positives = 145/354 (40%), Gaps = 40/354 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LALYL+ E      S   PY++ L +          +   W   +   L GSP     + 
Sbjct: 153 LALYLIKESTNPDSSI-APYLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVN 204

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
              G  R+Y      +F    LF   P  + T  FT+E F  A   VQS       V   
Sbjct: 205 YTRGSMRQY-----CYFY--QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGG 254

Query: 144 RRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
           +  AL+P        ++  K    +  V   +     + YK GE + ++ GP+PNS+  +
Sbjct: 255 QEMALIPFWDFCNHSSHGGKITTFIDPVKHVLTCSAAKSYKKGEQVYMYYGPRPNSQFYL 314

Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQ--YQDK-RMVAQRNGKLSVQVFHVHAGREKEAI- 257
             GF  + N  D    +  L+ ED +    DK  ++ +R G    Q   +      E + 
Sbjct: 315 FQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLRVGQTVSLSQNPSSEKLP 374

Query: 258 SDMLPYLRLGYVS--DTSEM-----------QSVISSLGP--ICPVSPCMERAVLDQLAD 302
           ++++P+ R+  +S  +T ++                 + P     +S   E+     L D
Sbjct: 375 AEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAFNIISEENEKKAFKLLLD 434

Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
             KARL+GYP TL++DE  + + N   ++R    ++  EKK+L   ++    +I
Sbjct: 435 SLKARLSGYPTTLAQDEQEMKN-NPTTQRRYVLYILINEKKILERNIKYVEQLI 487


>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
 gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
          Length = 389

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 106/217 (48%), Gaps = 22/217 (10%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L +  +S  A LAL+L+  +K+ + S W PYI  L         ++ S + W++ EL YL
Sbjct: 36  LLSKDVSAWAKLALFLLAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYL 90

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             SP   E ++R + ++ E+   + V      L QQ    + +   T       ++ V S
Sbjct: 91  KVSPVYRETVQRKDVVRMEFAAAENVCM----LMQQVKLFVCSRILT------DYITVCS 140

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
               ++ +   +  ALVP       + + C+AML+  +D    ++V DR Y  G+ +V+ 
Sbjct: 141 RAWGIETI---KSLALVPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVIS 196

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLV-VEAALNTEDP 226
            G   N+ L +++GF    NP+D++  +  +L+ +DP
Sbjct: 197 YGQLSNATLALDFGFALPFNPHDQVAGIWLSLSEKDP 233


>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
 gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
 gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
 gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 463

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 113/240 (47%), Gaps = 22/240 (9%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
           +L +N++  +  LA  L+ EKK G+KS W+PYI  L +        + S + W E EL+ 
Sbjct: 105 VLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSM 159

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           +  S    E +++   I+++++      F+A +  Q  P  I TE    E F  A+  V 
Sbjct: 160 IRCSAVHQETVKQKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVG 211

Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVW 190
           S     +    ++R +L+P    +         +L   D+ + ++  DR Y  G+ + + 
Sbjct: 212 S-----RAWENSKRISLIPFADFMNHDGLSASIVLRDEDNQLSEVTADRNYSPGDEVFIK 266

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 247
            G   N+ L++++GF    N +D + ++  +  +DP    K  + Q +   +V+   +FH
Sbjct: 267 YGEFSNATLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326


>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 482

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 89/347 (25%), Positives = 159/347 (45%), Gaps = 35/347 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ EK + ++S W  Y+  L +       + +S + WSE ELA L G+   +  L
Sbjct: 122 SVALFLIREKYE-EESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTL 173

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E ++ E+ +L+    +          D+ +   T + F  AF  ++S      +   
Sbjct: 174 GVKEYVENEFLKLEQEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR--- 223

Query: 143 ARRFALVPL------GPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
            +   L+PL       P +    Y+ + K A L + D    L      KAGE + + +  
Sbjct: 224 GQNLVLIPLADLINHNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDL 283

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
            + N++L ++YGFV+ +   +   +   +   DP + DK  +A+ N       F V  G+
Sbjct: 284 NKSNAELALDYGFVESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQ 343

Query: 253 EKEAISDMLPYLRLGYV--SDTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKAR 307
              A   ML YLRL  +  SD   ++S+ ++   G +  PVS   E  +   + D  K+ 
Sbjct: 344 TLPA--GMLQYLRLVALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSA 401

Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
           L+G+  T+ EDE +L +  L P+  +A ++   EK++L    Q+  D
Sbjct: 402 LSGFSTTIEEDEKLLEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKD 448


>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Monodelphis domestica]
          Length = 595

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 158/348 (45%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL + E E+ +L  +    ++ 
Sbjct: 156 TLAFHLLCERA-NPSSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +  GE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+  +++    + +I         +LG    PVS   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             L  Y  T+ ED++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 483


>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
           harrisii]
          Length = 602

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 158/348 (45%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +        ++PL + E E+ +L  +    ++ 
Sbjct: 163 TLAFHLLCERA-NPSSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVF 214

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT+E ++ A  +V +    +  
Sbjct: 215 SQYKNTARQY-----AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 267

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +  GE I ++ G + N++
Sbjct: 268 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAE 327

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 328 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 386

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+  +++    + +I         +LG    PVS   E     +L  + +AR  
Sbjct: 387 QLLAFLRVFCMTEEELKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 442

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             L  Y  T+ ED++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 443 LLLKTYKTTIEEDKSFLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 490


>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/358 (24%), Positives = 156/358 (43%), Gaps = 50/358 (13%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L++E      S W PY+  L +       +++SPL WS+ ELA L G+     + 
Sbjct: 114 AVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEELAELVGTQLLGSVT 165

Query: 83  ERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
              E ++ EYN L + V      +F    Y       TF+ FK AF  ++S         
Sbjct: 166 GYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAFGILRSRTFSPLT-- 216

Query: 142 LARRFALVPL------------GPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
                ALVP+            G P  +   +S+   +     D + +     + AGE +
Sbjct: 217 -GEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSAGEQV 275

Query: 188 VVWCGP-QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRN 238
           ++  G  + N+ L ++YGFV+ D           D L +   ++ +D    DK  + + N
Sbjct: 276 LMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADILEIN 335

Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCME 293
           G      F +  G+      +M+ +LRL  +S  D+  ++++    + G +  PVS   E
Sbjct: 336 GFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVSRDNE 393

Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 351
            A+   + +  KA L GY  T+ +D  +L   +L  +  +A  +VR+ +K +   LQ 
Sbjct: 394 EALCTSMLEGLKAALDGYSTTVEQDMELLARGDLSTRMEIAV-VVRLGEKRVMQELQT 450


>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
           methyltransferase; Short=rbcMT; Flags: Precursor
 gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
 gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
          Length = 491

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 151/335 (45%), Gaps = 28/335 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ EK +   S W  Y+  L +       + +S + WSE EL+ + G+   +  +
Sbjct: 130 SVALFLLREKWR-DDSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTM 181

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQ 138
              + ++ E+ +++    +     Q +P+ I  + F   F I + +AF  +++  ++ + 
Sbjct: 182 SVKDYVQNEFQKVEEEVILRNK--QLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVP 239

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
              L    A V       A+  +  A L + D    L      KAG+ + + +   + N+
Sbjct: 240 FADLTNHNARVTTEDH--AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSNA 297

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
            + ++YGF++  +  D   +   ++  D  Y DK  +A+ NG      F +  G+     
Sbjct: 298 DMALDYGFIEPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT 357

Query: 258 SDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
             M+PYLRL  +  T           SV   LG   PVS   E  +   + D  K+ L+G
Sbjct: 358 --MIPYLRLVALGGTDAFLLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSG 413

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           Y  T+ EDE ++ + NL  + ++A  +   EK++L
Sbjct: 414 YHTTIEEDEKLMEEGNLSTRLQIAVGIRLGEKRVL 448


>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
 gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
          Length = 500

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 148/337 (43%), Gaps = 31/337 (9%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+++ EK     SFW PYI               +PL + E E+ +L GS   +++L 
Sbjct: 170 LALHVLCEK-YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLN 221

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + + I R+Y      +F    LFQ  P    +P  E FTF+ ++ A   V +    +   
Sbjct: 222 QYKSIARQY-----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWAVSTVMTRQNQV-PT 273

Query: 141 SLARRF--ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
           S  R    AL+P+       + +         D+ + +  R +     + ++ G + N++
Sbjct: 274 SDGRHLITALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAE 333

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            LI+ GFV  +N +DR+ V+  ++  D  +  K  V  R G  +   F VH G++     
Sbjct: 334 FLIHNGFVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPP 392

Query: 259 DMLPYLRL-----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLA 309
           ++L +LR+     G + D  TSE QS  +S LG   C V+   E      L       + 
Sbjct: 393 ELLVFLRVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIR 452

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
            Y  ++ + E  L   ++    R A QL   E ++L+
Sbjct: 453 SYRTSIKDVETELQAPDMTYHSRAALQLKLAEMQILS 489


>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
          Length = 592

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 84/348 (24%), Positives = 158/348 (45%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFWLPYI+ L  +         + L + E E+ YL  +    +I 
Sbjct: 156 TLAFHLLCER-YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P    +P  ++FT++ ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPA 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  + +KAGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379

Query: 259 DMLPYLRLGYVSDTSEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR-- 307
            +L +LR+  +++    + +I         +LG    PVS   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMTEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435

Query: 308 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             L  Y  T+ +D+ +L   +L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTIHDDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483


>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
          Length = 485

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 86/319 (26%), Positives = 142/319 (44%), Gaps = 27/319 (8%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
           LL    L   + LAL+L  ++  G    W PY+R L  + G         L WSE E+ +
Sbjct: 122 LLEEQPLPAWSVLALWLAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEW 174

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           L GS   ++ LE     +  + E+  V   A +  +   +     AF     + AF  + 
Sbjct: 175 LCGSQLHSDALEIRAAAEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLL 230

Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVV 189
           S +V L    L  + AL+P    LL +     + L  +A + AV L  +R Y+AGE +++
Sbjct: 231 SRLVRL--AGLGDQEALLPWAD-LLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLI 287

Query: 190 WCGPQPNSKLLINYGFVDE--DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 245
             G + + +LL++YGF  +   NP+D  RL++E  L   D     K    +++G  + Q+
Sbjct: 288 SYGQKTSGELLLSYGFCPDLGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQL 345

Query: 246 FHVHAGREKEAISDMLPY--LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 303
           F +   R   A  +++ Y       V    E + +   L     + P ++ A L+ +   
Sbjct: 346 FPL---RMAAAPFELVHYTAFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAA 402

Query: 304 FKARLAGYPATLSEDEAML 322
            KA LA YP +   D A L
Sbjct: 403 CKAALAAYPRSFDGDRAEL 421


>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
 gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
          Length = 374

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 95/211 (45%), Gaps = 42/211 (19%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEIL 82
           L+L+L+ EK + ++S W PY+R L       QL  +ES + W   ELA+L  SPT  E +
Sbjct: 51  LSLFLLSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETM 104

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           E  + IK E++ L     +A    Q +P+                     C   L +VSL
Sbjct: 105 ECLKIIKSEFHLLT----LANK--QVFPW---------------------CRDALGEVSL 137

Query: 143 -----ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQP 195
                A     +P       +   C+  L+     D    V D+ YKAG+ I +  G  P
Sbjct: 138 TDFMHAYSTDQIPFA-DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTP 196

Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
           NS L ++YGF    NP++++ V   ++  DP
Sbjct: 197 NSILAVDYGFAVASNPHEQVEVPMGVSLTDP 227


>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
 gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase, chloroplastic; AltName:
           Full=Aldolases N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase-like; Short=AtLSMT-L;
           Short=LSMT-like enzyme; Flags: Precursor
 gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
          Length = 482

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 86/348 (24%), Positives = 156/348 (44%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ EK + ++S W  Y+  L +       + +S + WSE ELA L G+   +  L
Sbjct: 122 SVALFLIREKYE-EESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTL 173

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E ++ E+ +L+    +          D+ +   T + F  AF  ++S      +   
Sbjct: 174 GVKEYVENEFLKLEQEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR--- 223

Query: 143 ARRFALVPLGPPLLAYSSKCK----------AMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
            +   L+PL   L+ ++   K          A L + D    L      KAGE + + + 
Sbjct: 224 GQNLVLIPLAD-LINHNPAIKTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYD 282

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
             + N++L ++YGFV+ +   +   +   +   DP + DK  +A+ N       F +  G
Sbjct: 283 LNKSNAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDG 342

Query: 252 REKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKA 306
           +   A   ML YLRL  +   D   ++S+ ++   G +  PVS   E  +   + D  K+
Sbjct: 343 QTLPA--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKS 400

Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
            L+G+  T+ EDE +L    L P+  +A ++   EK++L    Q+  D
Sbjct: 401 ALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 448


>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 463

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 104/221 (47%), Gaps = 16/221 (7%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L   ++  +A LA  ++ EKK G+ S W PYI  L +Q   G+L   + + W+E+EL  +
Sbjct: 96  LIGEEVGNIAKLATVILIEKKLGQGSEWYPYISCLPQQ---GEL--HNTVFWTESELEMI 150

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E +++   I++++  +  ++  +   F    Y     A T  +F    V +  
Sbjct: 151 RPSSVYQETIDQKSQIEKDFLAIKHIFECSHQSFGDSTYKDFMHACTLVLFDHFNVELP- 209

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-------AVQLVVDRPYKAGE 185
             V  +        AL+P     L +    +A++ + DD       ++Q++ DR Y  GE
Sbjct: 210 --VGSRAWGSTNGLALIPFAD-FLNHDGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGE 266

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
            +++  G   N+ L++++GF    N YD++ ++  +   DP
Sbjct: 267 QVLIRYGKFSNATLMLDFGFTIPYNIYDQVQIQFDIPKHDP 307


>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
          Length = 474

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 33/342 (9%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ EK + ++S W  Y+  L +       + +S + WSE ELA L G+   +  L
Sbjct: 122 SVALFLIREKYE-EESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTL 173

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E ++ E+ +L+    +          D+ +   T + F  AF      +++ + ++ 
Sbjct: 174 GVKEYVENEFLKLEQEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTS 221

Query: 143 ARRFALVPLGPPL----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
              F  +   P +     AY  K  A L + D    L      KAGE + + +   + N+
Sbjct: 222 MFEFEQINHNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNA 280

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
           +L ++YGFV+ +   +   +   +   DP + DK  +A+ N       F +  G+   A 
Sbjct: 281 ELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA- 339

Query: 258 SDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVSPCMERAVLDQLADYFKARLAGYP 312
             ML YLRL  +   D   ++S+ ++   G +  PVS   E  +   + D  K+ L+G+ 
Sbjct: 340 -GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFD 398

Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
            T+ EDE +L    L P+  +A ++   EK++L    Q+  D
Sbjct: 399 TTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 440


>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
          Length = 394

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 116/248 (46%), Gaps = 20/248 (8%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+ YL  +    ++ 
Sbjct: 156 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVF 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 139
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 208 SQYKNTARQY-----AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 260

Query: 140 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
              +R   AL+PL       +          DD  + V  R + AGE I ++ G + N++
Sbjct: 261 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAE 320

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 379

Query: 259 DMLPYLRL 266
            +L +LR+
Sbjct: 380 QLLAFLRV 387


>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 789

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 149/346 (43%), Gaps = 61/346 (17%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAE 80
           L++Y++ EKK   KSFW P++           L VE    P+L+++ EL +L GSP   +
Sbjct: 426 LSIYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQ 475

Query: 81  ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQ 138
           + ER E I ++Y              Q     IP  A   T + F  A +   S +  L 
Sbjct: 476 VKERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL- 520

Query: 139 KVSLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIV 188
            ++  R  A VPL         AY++      K   ML A +D          + G+ I 
Sbjct: 521 FINKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIY 571

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFH 247
             CG + NS+ L+NYGFV ++N  + + +    + +D     K +M+ +R  K    +F 
Sbjct: 572 YSCGRKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFR 629

Query: 248 VHAGREKEAISDMLPYLRLGYVSDT------SEMQSVISSLGPIC--PVSPCMERAVLDQ 299
           +H   E++++ +   +LR   + D        EM S      P+   P S   E+ +  +
Sbjct: 630 IHINYEEKSVLEFFGFLRFVLIRDYIVLEKFHEM-SEGKEFDPLRTPPFSIENEKQMWTE 688

Query: 300 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           +       +  YP TL ED+ +L    L   ++    L   EK++L
Sbjct: 689 IHKICAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEIL 734


>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 101/213 (47%), Gaps = 19/213 (8%)

Query: 15  TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           T+++  +  LA  L+ EKK+G+KS W+PYI  L +        + S + W E E + +  
Sbjct: 108 TDEVGNIGKLAAVLIREKKKGQKSRWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRC 162

Query: 75  SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
           S    E +++   I++E++      F+A +  Q YP  I  E    E F  A+  V S  
Sbjct: 163 SAVHKETVKQKAQIEKEFS------FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-- 212

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGP 193
              +    ++  +L+P    +         +L+  D+ + ++  DR Y  G+ + +  G 
Sbjct: 213 ---RAWETSKGISLIPFADFMNHDGLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGE 269

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
             N+ L++++GF    N +D + ++  +  +DP
Sbjct: 270 FSNATLMLDFGFTVPYNIHDEVQIQMDVPNDDP 302


>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Strongylocentrotus purpuratus]
          Length = 682

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 147/339 (43%), Gaps = 27/339 (7%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA++++ EK +   SFW PY+  L         +   PL ++  E+    GS    E L+
Sbjct: 158 LAIFILSEKLK-SDSFWKPYLDVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLK 209

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSL 142
           + + I R+Y  L    F   +L +     I  E FT++ ++ A   V +    +  K   
Sbjct: 210 QHKNIARQYAYL----FKLLNLPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGK 264

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
               +L+PL       + + K       D+   +  R +  GE I +  G + ++ LL+ 
Sbjct: 265 GMSLSLIPLWDMCNHANGEMKTDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLY 324

Query: 203 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDM 260
            GFV   N YD + ++  L++ D  Y  K  +     KL V  Q +H+ AG+E   + ++
Sbjct: 325 SGFVYPGNVYDGMAIQLGLSSSDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-EL 382

Query: 261 LPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
           L +LR+  + D          +  Q++ S +     +S   E      LA         Y
Sbjct: 383 LTFLRIFCMQDLELRDRLLGDNRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQY 442

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
             ++ EDE  L D NL  ++R A QL+ +EK  L   L+
Sbjct: 443 KTSIQEDEEKLKDGNLSAQERSALQLLLIEKCTLENVLE 481


>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
          Length = 581

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 155/370 (41%), Gaps = 65/370 (17%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L+ +L+ EK +   SFW PY+  L       Q     P+ ++  +L +L GSP   +I +
Sbjct: 215 LSTFLLQEKSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQIND 267

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   +K++YN++  V       F QY         +F  F  A +   S +  +  +   
Sbjct: 268 KLSDLKKDYNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGV 313

Query: 144 RRFALVPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +  A VPL        P L +  YS + +  +   D+ +    DR    G+ I    G +
Sbjct: 314 KTDAFVPLADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRK 365

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQRNGK 240
            NS+ L+NYGFV +DN  + + V  A    DP  Q K              R++   +G 
Sbjct: 366 CNSRFLLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGI 425

Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCME 293
             +  F       +  + + + Y+R   + D +++Q +++      + P  I P+    E
Sbjct: 426 NEITHFL------EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNE 479

Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
             + D +       L+ YP TL +D+ +L   +L   +R    L   EK++L    Q + 
Sbjct: 480 LDMWDLIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSE 539

Query: 354 DMIMLLPDVT 363
            M  LL +  
Sbjct: 540 KMKQLLSNFN 549


>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 147/341 (43%), Gaps = 40/341 (11%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            ++L ++ E  +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +
Sbjct: 89  SVSLLILREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTM 141

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E ++ E++ ++     AG +      D+     TF+ F  AF  ++S V    +   
Sbjct: 142 GVKEYVQSEFDNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR--- 191

Query: 143 ARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
             + AL+P    +            +  K K  L   D    L      K+GE I V + 
Sbjct: 192 GDKLALIPFADLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYD 250

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
             + N++L ++YGF + ++  D   +   ++  DP Y+DK  +A+ NG      F V  G
Sbjct: 251 LDKSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG 310

Query: 252 REKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYF 304
             +     M+ YLRL  +  T            V   L    PVS   E ++   + +  
Sbjct: 311 --ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNAC 366

Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           K+ LA Y  T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 367 KSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407


>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 453

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 93/352 (26%), Positives = 151/352 (42%), Gaps = 44/352 (12%)

Query: 14  TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 73
           +   L     ++L ++ E  +G  S W PY+  L RQ        +S + WSE EL  + 
Sbjct: 84  SGGDLRPWVSVSLLILREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQ 136

Query: 74  GSPTKAEILERAEGIKREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           G+    ++L    G+K EY  +E D V   AG +      D+     TF+ F  AF  ++
Sbjct: 137 GT----QLLSTTMGVK-EYVQSEFDNV--EAGII--NVNKDLFPGTITFDDFLWAFGVLR 187

Query: 132 SCVVHLQKVSLARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPY 181
           S V    +     + AL+P    +            +  K K  L   D    L      
Sbjct: 188 SRVFPELR---GDKLALIPFADLINHDGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDV 243

Query: 182 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
           K+GE I V +   + N++L ++YGF + ++  D   +   ++  DP Y+DK  +A+ NG 
Sbjct: 244 KSGEQIYVQYDLDKSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGM 303

Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCME 293
                F V  G  +     M+ YLRL  +  T            V   L    PVS   E
Sbjct: 304 GETAYFDVVLG--ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNE 359

Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
            ++   + +  K+ LA Y  T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 360 ESICQVIQNACKSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411


>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 524

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/336 (24%), Positives = 133/336 (39%), Gaps = 57/336 (16%)

Query: 17  KLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYL 72
           +L E + LA +L  +    + G    +  YIR L R+ G       S L W E E+   L
Sbjct: 148 ELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------SVLDWPEDEVDKLL 200

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
            GSP++    ER + +    +E+ +           +P        T    + AF  + S
Sbjct: 201 KGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------ITVGALRWAFDILFS 244

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
            ++ L   ++    ALVP    +L +   C A +    DAV L  DR Y  GE +    G
Sbjct: 245 RLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDRSYVKGEQVWASYG 301

Query: 193 PQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
            +P+S+LLI+YGF  E  +NP D   +   ++  DP    K  V +  G   V+ F +  
Sbjct: 302 QRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLRL 361

Query: 251 GR--------------EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP------ 290
                             E  S++    +  +    +  QS+  S+  +           
Sbjct: 362 NGYPRQLLQYASFILCNPEKPSELKGLAQSAFTGSANIGQSIFDSVRGLTNGKARGKQGV 421

Query: 291 -----CMERAVLDQLADYFKARLAGYPATLSEDEAM 321
                  E AV + LAD     L+ YP TL +D+ +
Sbjct: 422 ILGGVAGEIAVREMLADLCAEALSAYPNTLEKDKGL 457


>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
 gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 353

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 106/233 (45%), Gaps = 27/233 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA  L+ EKK G+KS W+PYI  L +        + S + W E EL+ +  S    E ++
Sbjct: 2   LAAVLIREKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVK 56

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   I+++++      F+A +  Q  P  I TE    E F  A+  V S     +    +
Sbjct: 57  QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103

Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 197
           +R +L+P    +         +L   D+       +Q+  DR Y  G+ + +  G   N+
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNA 163

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 247
            L++++GF    N +D + ++  +  +DP    K  + Q +   +V+   +FH
Sbjct: 164 TLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216


>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           terrestris]
          Length = 484

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 88/345 (25%), Positives = 149/345 (43%), Gaps = 32/345 (9%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA+ L+ EK + + S W PY+  L            + L  +  ++  L GSPT    L+
Sbjct: 158 LAIALLIEKHK-EYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALK 209

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQ 138
           +   I R+Y      +F    LFQ+    +     + FT+E +  A   V  +  ++  +
Sbjct: 210 QCRNIARQY-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSK 262

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
             SL    AL+P+        SK      A  +  +    R +K  E I +  GP+ NS 
Sbjct: 263 DGSLMIH-ALIPMWDMCNHEDSKITTDFNATLNCCECYALRDFKKAEQIFISYGPRTNSD 321

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
             ++ GFV  DN  D   +   ++  DP ++++  +  +    +V  F +  G   E IS
Sbjct: 322 FFVHSGFVYMDNEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG--TEPIS 379

Query: 259 D-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGY 311
           D +L +LR+ +     E+   I S  +  +  +   +E  V + +  +   RL    A Y
Sbjct: 380 DTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANY 438

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           P TL ED  +L +  L   K++A QL   EK++L   L+     I
Sbjct: 439 PTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482


>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
 gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
          Length = 421

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 91/219 (41%), Gaps = 36/219 (16%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L ++ +SE A LAL L++EK+ G+ S W PYI  L R        + S   W + EL  +
Sbjct: 98  LLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELTMI 152

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E + R   I+ E+NE+ +V F          +D  + A            V+ 
Sbjct: 153 QESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDTDNGSTEVEE 207

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
             V+   + +A   A                          QL  D+ Y AGE + +  G
Sbjct: 208 VKVYSDCLYIALFCA--------------------------QLFADKNYAAGEQVTISFG 241

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
           P  N+ L +++GF    NP+D++ +   ++  D   ++K
Sbjct: 242 PLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280


>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
 gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
          Length = 336

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 103/230 (44%), Gaps = 24/230 (10%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           ++ +E   ++L +M EK++G+ S W PYI  L      G+  + S + W   ELA L  S
Sbjct: 104 DQANEFTRVSLLVMAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECS 158

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
           P      ER   ++ EY E+  V     S    Y  D+     + E FK  +  V S   
Sbjct: 159 PIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAW 210

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGE-----SIVV 189
                S      ++PL      +  + + + +  DD   +VV  R Y+ G+      + +
Sbjct: 211 GQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHI 266

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 238
             G   N+ L ++YGFV  DNP+D   +   + +EDP  + K + +AQ N
Sbjct: 267 CYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316


>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
 gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
          Length = 403

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 103/230 (44%), Gaps = 24/230 (10%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           ++ +E   ++L +M EK++G+ S W PYI  L      G+  + S + W   ELA L  S
Sbjct: 104 DQANEFTRVSLLVMAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECS 158

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
           P      ER   ++ EY E+  V     S    Y  D+     + E FK  +  V S   
Sbjct: 159 PIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAW 210

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGE-----SIVV 189
                S      ++PL      +  + + + +  DD   +VV  R Y+ G+      + +
Sbjct: 211 GQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHI 266

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 238
             G   N+ L ++YGFV  DNP+D   +   + +EDP  + K + +AQ N
Sbjct: 267 CYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316


>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 679

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 15/209 (7%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
             A +LM +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159

Query: 83  ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 137
           ER +  +++Y+    +LD + F      +QY +++   A T  I  +AF A V S  V  
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
             +      AL+PL   L  +    K    A D+ + L+V   + AG+ I    GP+ N 
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNE 274

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDP 226
           +LLINYGF    NP D  +V   +  + P
Sbjct: 275 QLLINYGFCIAGNPTDYRIVLLGVKPDSP 303


>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 247

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 92/196 (46%), Gaps = 23/196 (11%)

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 232
           +   R + AGE +++  GP+ N  LL  YGFV++DNP D   +   ++   D   +D  R
Sbjct: 40  VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99

Query: 233 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 277
           ++ +  GKL           V  V  GR     EKE    ++P  RL  V D    E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158

Query: 278 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVA 334
              SL      +SP  E A  D L         G+  TL+EDEA L+    +L  +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218

Query: 335 TQLVRMEKK-MLNACL 349
               RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233


>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
          Length = 571

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 95/365 (26%), Positives = 154/365 (42%), Gaps = 54/365 (14%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTK 78
           EL  LAL+L  EK + K S W PY++ L+           S L ++E E    L G+   
Sbjct: 160 ELVQLALWLCCEKAKAKGSEWWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSID 212

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHL 137
            E  +R +  K EY  L            + P   P + + F + + AF+ A+       
Sbjct: 213 KEARQRRDSAKEEYEALRAA-------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARA 264

Query: 138 QKVSLARRFALVPL--------GPPLLA-------------------YSSKCKAMLAAVD 170
           Q ++ A  +A+VPL         PP ++                    + +C      VD
Sbjct: 265 QWLNSANCYAMVPLMDAIPICGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVD 324

Query: 171 DA-VQLVVDRPYKAGESIV-VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
            A V L  +    AG  I+ +    + NS+L +++G VD+ +P D       L+  DP Y
Sbjct: 325 SASVVLCANTRASAGSKILQIDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLY 384

Query: 229 QDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 287
             K+ V +  G     Q F V+  R      + L YLR   V+++ E+ +V  +   +  
Sbjct: 385 AAKKSVLEAQGFADKGQTFPVYKDR---MPREFLSYLRFARVTNSEELFAVSFTEDKV-- 439

Query: 288 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLN 346
           VSP  E   L  L    + R++ Y  T  EDE +L    ++  K R A++L R EK+++ 
Sbjct: 440 VSPMNEYETLQLLMADCRDRMSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVG 498

Query: 347 ACLQV 351
             +  
Sbjct: 499 EMMNA 503


>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Oreochromis niloticus]
          Length = 607

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 85/354 (24%), Positives = 146/354 (41%), Gaps = 52/354 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+L+ E+     SFWLPYIR L ++        + PL + + ++  L G+    ++L 
Sbjct: 157 LALHLLCERAN-PASFWLPYIRSLPQE-------YDIPLYYQQEDVQLLLGTQAVQDVLS 208

Query: 84  RAEGIKREY----------NELDTVWF---------MAGSLFQQYPYDIPTEAFTFEIFK 124
           + +   R+Y            L +V           M G LF Q+       A +  + +
Sbjct: 209 QYKNTARQYAYFYKLVQDKGMLGSVELRLFASLTPVMGGKLFDQW-------AVSSVMTR 261

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
           Q  +  +       +V+LA    L+PL       +          DD  + V  + YK  
Sbjct: 262 QNQIPTEDG----SRVTLA----LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKEN 313

Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
           E I ++ G + N++ +I+ GF  +D+ +DR+ ++  ++  +  Y  K  V  R G  +  
Sbjct: 314 EQIYIFYGTRSNAEFVIHNGFFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASY 373

Query: 245 VFHVHAGREKEAISDMLPYLRLG---------YVSDTSEMQSVISSLGPICPVSPCMERA 295
           VF +H   E    + +L +LR+          Y+     +  + +      PVS   E  
Sbjct: 374 VFALHCN-EPPISAQLLAFLRVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIK 432

Query: 296 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
           +   L       L  Y  T  ED +ML   +L    R+A QL   EK++L   L
Sbjct: 433 LWTFLETRAALLLKTYKTTSEEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486


>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 151/348 (43%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ E  +G  S W  Y+  L ++        +S + WSE ELA + G+   +  L
Sbjct: 143 SVALFLIRENLKGD-SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTL 194

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E +K E+ +++    +          D+     T + F  AF  ++S      +   
Sbjct: 195 NVKEYVKSEFLKVEEEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR--- 244

Query: 143 ARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
            +   L+P    +           A+  K  A L + D    L      KAG+ + + + 
Sbjct: 245 GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYD 304

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
             + N+ L ++YGF+++ +  +   +   +   D  + DK  +A+ NG      F +   
Sbjct: 305 LKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL- 363

Query: 252 REKEAISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKA 306
            E+     MLP+LRL  +  T    ++S+   S  G +  PVS   E  +   + +  +A
Sbjct: 364 -ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEA 422

Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
            L+GY  T+ EDE  L + NL  + R+A  +   EK++L   +Q+  D
Sbjct: 423 ALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469


>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 519

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 142/352 (40%), Gaps = 58/352 (16%)

Query: 24  LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           L +YL++++K  G  SF+ PY   L             P+ WS  EL  L GS   ++I 
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           +R + I+ +Y  +  V    G+L             T + FK A + V S    LQ +  
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216

Query: 143 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
            R  ALVP    L  Y  +  K     V     +   +  +AG  +    G + N + L+
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLL 276

Query: 202 NYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
           NYGF  EDN        + + +E  ++  D  +QDK     R               E  
Sbjct: 277 NYGFAVEDNRELDGFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETN 322

Query: 256 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 315
            IS        G V+     Q+V  S+G   P S   E      +    + RLA YP T+
Sbjct: 323 QIS--------GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTI 371

Query: 316 SEDEAMLTDYNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 361
           S+D A L D   +P+   +R A   VR EK++L+      +   DM+  + D
Sbjct: 372 SQDMADLQDEASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423


>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
           74030]
          Length = 483

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 149/363 (41%), Gaps = 51/363 (14%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L   +M E +Q   S W PY+  L ++       ++S + WSE ELA L  S    +I 
Sbjct: 89  ALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWSEEELAELQASSVAKKIG 140

Query: 83  ERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
             +  E   +  + L    F         S+   Y +DIP E    E  KQ     +   
Sbjct: 141 RSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE----EPAKQENGGAEGET 196

Query: 135 VHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
             L       +K  L+    ++PL   L A + +  A +   ++ +++   +P  AGE I
Sbjct: 197 DDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYENEDLEMRTIKPIMAGEEI 252

Query: 188 VVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAA----LNTEDPQY-------------- 228
               G  P S LL  YG+V E+   YD + + +A    L TE PQ               
Sbjct: 253 FNDYGQLPRSDLLRRYGYVTENYAQYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAE 312

Query: 229 -QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 287
            +++  +A R G L        A  E+ AI D L  L   ++ D   ++++++S   +  
Sbjct: 313 AEERVALADREGILEDSYDVNIANAEERAIPDELLALLYLFLLDNENLEAIVTSQSALPS 372

Query: 288 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
            S      V   L    + R A Y  TL EDE +L   NL  +  +A Q+   EK++L  
Sbjct: 373 RSKLATELVGKVLVKVLRHREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRL 432

Query: 348 CLQ 350
            ++
Sbjct: 433 AVE 435


>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 151/348 (43%), Gaps = 37/348 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ E  +G  S W  Y+  L ++        +S + WSE ELA + G+   +  L
Sbjct: 143 SVALFLIRENLKGD-SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTL 194

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              E +K E+ +++    +          D+     T + F  AF  ++S      +   
Sbjct: 195 NVKEYVKSEFLKVEEEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR--- 244

Query: 143 ARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WC 191
            +   L+P    +           A+  K  A L + D    L      KAG+ + + + 
Sbjct: 245 GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYD 304

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
             + N+ L ++YGF+++ +  +   +   +   D  + DK  +A+ NG      F +   
Sbjct: 305 LKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL- 363

Query: 252 REKEAISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKA 306
            E+     MLP+LRL  +  T    ++S+   S  G +  PVS   E  +   + +  +A
Sbjct: 364 -ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEA 422

Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
            L+GY  T+ EDE  L + NL  + R+A  +   EK++L   +Q+  D
Sbjct: 423 ALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469


>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Sus
           scrofa]
          Length = 326

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 7/150 (4%)

Query: 101 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 160
           M   +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    ++ +S 
Sbjct: 73  MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129

Query: 161 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLV 216
           + +  L     +  +V   DRPY+ GE + +  G + N +LL++YGFV ++  NP D + 
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVE 189

Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           +  +L   D  Y++K  + ++ G    Q F
Sbjct: 190 LSLSLKKSDESYKEKLELLKKYGLSGSQCF 219


>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
          Length = 577

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 87/375 (23%), Positives = 169/375 (45%), Gaps = 50/375 (13%)

Query: 8   YAAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
           Y A L+  N++  +    CLALYL  E+     S + PYI  + ++         + L +
Sbjct: 140 YLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTILYF 191

Query: 65  SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFE 121
              E+ YL G+   +  + + + I R++  L  V+   GS  ++    +P +A   FTF+
Sbjct: 192 KPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAFTFD 249

Query: 122 IFKQAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKCKAM 165
            ++    AV +       H+  V          AL+P+        GP   AY+      
Sbjct: 250 TYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN------ 303

Query: 166 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
             A+   ++ +  + +K GE + +  G + NS LLI+ GFV +++P+D++ +   ++ +D
Sbjct: 304 --ALTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVSQKD 361

Query: 226 PQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEMQSV 278
           P Y  K +++ + N ++S Q               +L +LR+ ++++        + ++ 
Sbjct: 362 PLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQKNE 421

Query: 279 ISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 335
           +SSL  I     V    +  V + L +  K  L G+       E M+ D +L  + ++A 
Sbjct: 422 LSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSKLAL 481

Query: 336 QLVRMEKKMLNACLQ 350
           Q    E ++L+AC+ 
Sbjct: 482 QFRIEEHRILSACVN 496


>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
 gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
          Length = 1037

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 100/397 (25%), Positives = 160/397 (40%), Gaps = 83/397 (20%)

Query: 17   KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
            +L   + L L LMYE  QG+KS W PY   L         + ++P+ WSE+EL  L  S 
Sbjct: 660  RLDSWSSLILVLMYEYLQGEKSQWKPYFDVL-------PSSFDTPMFWSESELDQLQASH 712

Query: 77   TKAEILE-RAE-------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
             + +I +  AE                   G  R  ++L  +    GS    Y +D+  +
Sbjct: 713  MRHKIGKADAESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLEND 772

Query: 117  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 176
                E     +V  +     +          +VP+   +L   ++  A +   ++++ + 
Sbjct: 773  EDEEEEETDGWVEDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVT 822

Query: 177  VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVA 235
              RP KAGE I  + GP PNS+LL  YG+V E +  YD  VVE   +  +        V 
Sbjct: 823  SLRPIKAGEEIFNYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VM 873

Query: 236  QRNGKLSVQVFHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP--------- 284
            + N  +S QV     H   E+E   D     R     +T E+ S  +  GP         
Sbjct: 874  RLNFGISGQVLEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPED 928

Query: 285  --------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYN 326
                    +  V      A+ D+          LA   +A  + YP + SED  +L   +
Sbjct: 929  LQEQLKTFLKGVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQD 988

Query: 327  LHPKKRVATQLVRMEKKMLNACLQVTA--DMIMLLPD 361
            L  + R+A ++   EKK+L   +  T+  D+ M + D
Sbjct: 989  LSQRTRMAIEVRLGEKKLLQEAIASTSSVDVEMTVDD 1025


>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
 gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
          Length = 377

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/195 (26%), Positives = 89/195 (45%), Gaps = 26/195 (13%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL ++ E+ +G+ S W PYI  L +        +++  LW +TEL+YL  SP   +  E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQP-----AELDNTFLWEDTELSYLKASPLYGKTRE 168

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           R E I  E+ ++     +   LF +          + E FK  +  V S     + +++ 
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214

Query: 144 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
               LV +  P+L +     +S  K     + +   +  DR Y   + I +  G   N++
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAE 272

Query: 199 LLINYGFVDEDNPYD 213
           L ++YGF   +NPYD
Sbjct: 273 LALDYGFTVPENPYD 287


>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
          Length = 605

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 80/356 (22%), Positives = 149/356 (41%), Gaps = 44/356 (12%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L+ +L+ EKK  + SFW PY+  L +           P+ +++++L +L GSP   ++ +
Sbjct: 240 LSTFLLQEKK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKD 292

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   +K++Y ++  V   A    Q           +F+ F  A +   S +  +  +   
Sbjct: 293 KITDLKKDYCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGV 338

Query: 144 RRFALVPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +  A VPL        P L +  YS + +  +   D+ ++         G+ I    G +
Sbjct: 339 KTDAFVPLADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSK 390

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
            NS+ L+NYGFV +DN  + + V    +      Q K  +++   +   + F +      
Sbjct: 391 CNSRFLLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPND 449

Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKARLA 309
            + SD + ++R   + +  E  +++     I P     +S   E A  + + +     L 
Sbjct: 450 VSFSDFMSFIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALN 509

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
            YP TL +D  +L    L   +R    L   EKK+LN   Q +  M  L  +   S
Sbjct: 510 QYPTTLEQDLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565


>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
          Length = 230

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 12/213 (5%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
           +AL+PL       +         V D  + +  + +KAGE + ++ G + N+ L ++ GF
Sbjct: 14  YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 73

Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--MLPY 263
           V E+N YD   +   ++  DP  Q +  +    GKLS+      + R+  +  D  +L +
Sbjct: 74  VFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPIDGQLLAF 130

Query: 264 LRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 317
           LR+ +  +  ++   I+S     LG I C +   +E      L    K  L+ Y  TL E
Sbjct: 131 LRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKTTLDE 189

Query: 318 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           DE +L +    P + +A ++   EK+++   L+
Sbjct: 190 DEKLLAEAQATPNRLLAIKMRATEKRIIRETLE 222


>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Chroomonas mesostigmatica
           CCMP1168]
          Length = 464

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 71/339 (20%), Positives = 141/339 (41%), Gaps = 31/339 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L+E   LA+  + E+  G+KS W  Y   L +++    +       W  +++ +L GS  
Sbjct: 134 LNEYDSLAIKAIQERAIGEKSKWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKV 188

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
                   E IK ++  ++   F    L        P + F  + ++ A   + S  + L
Sbjct: 189 LNASFYLKEKIKIQFLRIEKTIFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFL 242

Query: 138 QKVSLARRFALVPLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
           Q +   ++ ALVP        P        K +  + ++ + +  D+ Y   + I    G
Sbjct: 243 QNM---KKIALVPYADFINHNPFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYG 299

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVH 249
            + N +LL+ YGF+ E NP+D + +  AL+T+D  Y  K       ++  +++  VF+  
Sbjct: 300 QKTNLELLVLYGFIIERNPFDSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYK 359

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
             +E      +  ++RL             S+L      +  +E+ +   +    K  L 
Sbjct: 360 YPKE------LYEFMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLK 413

Query: 310 GYPATLSEDEAMLTDYN---LHPKKRVATQLVRMEKKML 345
            Y  T++E++ +    N   L   ++ + +  + EKK+L
Sbjct: 414 AYNKTINEEKILNNLSNIIVLTKNQKTSIKQRKCEKKIL 452


>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
           norvegicus]
          Length = 416

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 114/247 (46%), Gaps = 20/247 (8%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++  
Sbjct: 96  LAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFS 147

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + +   R+Y     V        Q +P+   +P  ++FT+E ++ A  +V +    +   
Sbjct: 148 QYKNTARQYAYFYKV-------IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTE 200

Query: 141 SLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
             +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++ 
Sbjct: 201 DGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEF 260

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 259
           +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + 
Sbjct: 261 VIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQ 319

Query: 260 MLPYLRL 266
           +L +LR+
Sbjct: 320 LLAFLRV 326


>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 492

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 77/340 (22%), Positives = 147/340 (43%), Gaps = 33/340 (9%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+L+ E  + + SFW  YI  L            + L +   +   +  SPT     +
Sbjct: 163 LALFLILEHVKNENSFWHHYISSLPSD-------YNTVLYFDLNDFLEMKNSPTFEMATK 215

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 140
             + I R+Y      +F   +LFQ    +   I    FT+++++ A   V +    +   
Sbjct: 216 HCKNIARQY-----AYF--NNLFQNSNDEASLILRNVFTYQLYRWAVSTVMTRQNFIPSS 268

Query: 141 SLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
           S +         L+PL   +  +++   +    VD +  L   +P+K GE ++++ G + 
Sbjct: 269 STSNDVENGINGLIPLWD-MCNHTNGYLSTQYKVDRSECLAC-KPFKKGEQVLIFYGERS 326

Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
           NS  L++ GFV ++NP+D   +   ++  D  +  +  + +  G      F++++G E  
Sbjct: 327 NSDFLVHNGFVYDENPHDSFRLRLGISKSDKLHGLRCELLKDLGIPDSGDFYLYSGSEP- 385

Query: 256 AISDMLPYLRLGY--VSDTSEMQSVISSLGPI----CPVSPCMERAVLDQLADYFKARLA 309
              ++L +LR+    V + +  +S  S L  +    C +   +E  V + L D     L 
Sbjct: 386 VRENLLAFLRIFNMDVENLNHWKSHSSRLSDLMWKDCALDTKIESKVWNFLYDRINLLLK 445

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
            Y     E E  + D N    +R+    ++ EKK L++ L
Sbjct: 446 TYKG--DEVEVRVEDSNSTECRRLVRAQLKCEKKFLSSIL 483


>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
          Length = 380

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 123/262 (46%), Gaps = 14/262 (5%)

Query: 105 LFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSS 160
           + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL       + 
Sbjct: 9   VIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNG 68

Query: 161 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
                    DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  
Sbjct: 69  LITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 128

Query: 221 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD--------- 271
           ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++         
Sbjct: 129 VSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 187

Query: 272 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 331
            S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +++L  + 
Sbjct: 188 DSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRA 247

Query: 332 RVATQLVRMEKKMLNACLQVTA 353
           ++A +L   EK++L   ++  A
Sbjct: 248 KMAIKLRLGEKEILEKAVKSAA 269


>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
          Length = 667

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 100/220 (45%), Gaps = 26/220 (11%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 198
           +AL+PL          C      +  A   V+DR        +KAGE + ++ G + N+ 
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 503

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
           L ++ GFV E+N YD   +   ++  DP  Q +  +    GKLS+      + R+  +  
Sbjct: 504 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 560

Query: 259 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 310
           D  +L +LR+ +  +  ++   I+S     LG I C +   +E      L    K  L+ 
Sbjct: 561 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 619

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           Y  TL EDE +L +    P + +A ++   EK+++   L+
Sbjct: 620 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLE 659


>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
          Length = 350

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 114/248 (45%), Gaps = 11/248 (4%)

Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
           E+FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 12  ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
            V  + ++AGE I ++ G + N++ +++ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 72  CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 131

Query: 235 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLGPI- 285
             R G  +  VF +H   E    + +L +LR+  +++    + ++         +LG   
Sbjct: 132 LARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFTLGNSE 190

Query: 286 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
            PVS   E  +   L D     L  Y  T+ ED+A L    L P+  +A +L   EK++L
Sbjct: 191 YPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLGEKEIL 250

Query: 346 NACLQVTA 353
              ++  A
Sbjct: 251 EKAVRSAA 258


>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
          Length = 467

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 91/392 (23%), Positives = 158/392 (40%), Gaps = 61/392 (15%)

Query: 5   LFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
           LF    +      L   + L L +MYE   G +S W PYI  L        L  ++P+ W
Sbjct: 78  LFESQGDEDEEQALDSWSALILIMMYEFFLGDESKWKPYIDVL-------PLTFDTPMFW 130

Query: 65  SETELAYLTGSPTKAEILE-RAEGIKREY--------------------NELDTVWFMAG 103
           SE EL+YL  S T  +I +  AE + R                       +L  +    G
Sbjct: 131 SEEELSYLQASATVNKIGKADAEEMFRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMG 190

Query: 104 SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK 163
           S    Y +D+  E    +     +V  +     +  V++A           +L   ++  
Sbjct: 191 STIMAYAFDLENEEAENDEESDGWVEDREGKSMMGMVAMA----------DILNADAEFN 240

Query: 164 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRL 215
           A +   D+ + +   R  KAGE I+ + GP PNS+LL  YG++ E +        P+D  
Sbjct: 241 AHVNHGDEELTVTSIRDIKAGEEILNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD-- 298

Query: 216 VVEAALNTEDPQYQD------KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 269
            V+ +L +E    QD      +RM   ++    + V    +G      +   P +  G  
Sbjct: 299 AVQHSLMSELGVPQDIMAETMERM--DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMP 356

Query: 270 SDTSE-MQSVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
            D  E +++ I  L    G +       +  +   + +  +   + Y  T++EDE +L  
Sbjct: 357 PDLKEQLKATIKLLQKLDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQ 416

Query: 325 YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
            +L  ++R+A Q+   EKK+L       ++M+
Sbjct: 417 DSLTRRQRMAVQVRLGEKKLLQEACDHFSEMV 448


>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
 gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
          Length = 679

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 98/213 (46%), Gaps = 23/213 (10%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
             A +LM +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           ER +  + +Y   D+ +   G+    +P     E +T+E++  A   + S     + +S 
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211

Query: 143 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
           A +          AL+PL   L  +    K    A D  + L+V   + AG+ I    GP
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGP 270

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
           + N +LLINYGF    NP D  +V   +  + P
Sbjct: 271 RNNEQLLINYGFCIAGNPTDYRIVHLGVKPDSP 303


>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
 gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
          Length = 206

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 78/160 (48%), Gaps = 13/160 (8%)

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
           + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +  G  
Sbjct: 10  KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67

Query: 254 KEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 305
            E++   MLPYLRL  +  T           +V   L    PVS   E A+   + +  K
Sbjct: 68  -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACK 124

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           + L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 125 SALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164


>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
          Length = 430

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 147/361 (40%), Gaps = 36/361 (9%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LAL+LM+E+ +G +SF+ P+I  +           + P+ W+E EL  L G+       
Sbjct: 83  VLALHLMHERSKGGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQ 135

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
              + ++R++  +        + F      +PT   T + +  A   + S       VS 
Sbjct: 136 LMKQHLERDFENIHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSK 187

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---------VDRPYKAGESIVVWCGP 193
             ++  V L P +  ++     +   +DD V            V     AG ++ +  G 
Sbjct: 188 GGKYLHV-LCPAMDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQ 245

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 253
             N+KLL +YGFV  +N    +     +   DP ++ K+ V   N     Q +  H    
Sbjct: 246 YSNAKLLYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLL 305

Query: 254 KEAISD-MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
              + + +L  LR+  +++    Q   +    I  V    E AV + L    + +L+ Y 
Sbjct: 306 SNDVDERLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYA 363

Query: 313 ATLSEDEAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPY 371
            TL EDEA+L +     K R+A  + VRME K      QVT  +I  L     S    P 
Sbjct: 364 TTLEEDEAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPD 417

Query: 372 A 372
           A
Sbjct: 418 A 418


>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
 gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
          Length = 489

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 93/341 (27%), Positives = 146/341 (42%), Gaps = 39/341 (11%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL L+ E  +G  S W PY+  L RQ              S    A L  S  + ++L
Sbjct: 130 AVALLLLSEVARGADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLL 177

Query: 83  ERAEGIKREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHL 137
               G+K EY  +E D+V     S  +    D+   + TF+ F  AF  ++S V   +  
Sbjct: 178 STTVGVK-EYVQSEFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRG 232

Query: 138 QKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCG 192
            K++L     LV   P + +  S    K K +    +    L      K+G+ I + +  
Sbjct: 233 DKLALVPFADLVNHSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDL 291

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
            + N++L ++YGFV+ +   D   V   ++  DP Y DK  +A+ N       F +    
Sbjct: 292 DKSNAELALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL-- 349

Query: 253 EKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFK 305
           ++     MLPYLRL  +  T           SV   L    P+SP  E ++   + D  K
Sbjct: 350 DEPLPPQMLPYLRLLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACK 407

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
           + LA Y  T+ EDE +    NL P+  +A  +   EKK+L 
Sbjct: 408 SALAAYHTTIEEDEELSERENLQPRLTIAIGVRAGEKKVLQ 448


>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           impatiens]
          Length = 484

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 148/345 (42%), Gaps = 32/345 (9%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA+ L+ EK + + S W PY+  L            + L  +  ++  L GSPT    L+
Sbjct: 158 LAIALLIEKHK-EYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALK 209

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQ 138
           +   I R+Y      +F    LFQ+    +     + FT+E +  A   V  +  ++  +
Sbjct: 210 QCRNIARQY-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSK 262

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
             SL    AL+P+       +SK      A  +  +    R +K  E I +  G + NS 
Sbjct: 263 DGSLMIH-ALIPMWDMCNHENSKITTDFNATLNCCECYALRDFKKAEQIFISYGARTNSD 321

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
             ++ GFV  DN  D   +   ++  DP  +++  +  +    +V  F +  G   E IS
Sbjct: 322 FFVHSGFVYMDNEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG--TEPIS 379

Query: 259 D-MLPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGY 311
           D +L +LR+ +     E+   I S  +  +  +   +E  V + +  +   RL    A Y
Sbjct: 380 DTLLAFLRV-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANY 438

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           P TL ED  +L +  L   K++A QL   EK++L   L+     I
Sbjct: 439 PTTLKEDLQLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482


>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 357

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 93/206 (45%), Gaps = 32/206 (15%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILER 84
            +L+ E  +GK SFW PYI  L       Q+AV + P  W + ++AYL G+     I E 
Sbjct: 117 FFLVKEYLKGKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEI 173

Query: 85  AEGIKREYNELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQK 139
            E +KRE+ +       A  L ++  + D+P  A+T  ++K AF    S        L  
Sbjct: 174 QENVKREFKQ-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSD 224

Query: 140 VSLARRFALVPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGE 185
            +  R  AL+P G          PLL  ++            +V D  +L+   PY+ G 
Sbjct: 225 ATKRRLSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGT 284

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNP 211
            +    G + NS+LL+ YGF+  + P
Sbjct: 285 QVYNNYGLKTNSELLLAYGFILPETP 310


>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
 gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 470

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 90/359 (25%), Positives = 148/359 (41%), Gaps = 47/359 (13%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L   LM E  Q + S W PY+  L  +       ++S + WS++ELA L  S    +I 
Sbjct: 86  VLTSILMTEA-QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIG 137

Query: 83  ERA-----------EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           ++            +G+K    E+        S+   Y +DIP  +       +   A  
Sbjct: 138 KKDAEDMFKSYIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGD 194

Query: 132 SCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
             V      +K  L+    ++PL   L A + +  A L   ++ +++   +P   GE I 
Sbjct: 195 DLVSDDGEDEKTILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIF 250

Query: 189 VWCGPQPNSKLLINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQDK 231
              G  P S LL  YG+V D  + YD       L+V    N          T+D   + +
Sbjct: 251 NDYGQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTR 309

Query: 232 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 291
             +A+R G        VH+  ++ +I D L       + D S ++++++S   I   S  
Sbjct: 310 LELAEREGVYEDSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKL 369

Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
                   LA   +AR   Y  TL EDE +L + +L  +  +A Q+   EKK+L A +Q
Sbjct: 370 TTELAGQVLATLLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428


>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
          Length = 467

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 161/389 (41%), Gaps = 61/389 (15%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L   + L L ++YE  QG+ S W PY   L         + ++P+ WS+ EL  L  S  
Sbjct: 96  LDSWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHM 148

Query: 78  KAEILE-RAEG--------IKREYNELDTVWFMAGS--------LFQQYPYDIPTEAFTF 120
           + +I +  AE         I R   E+    F AG+        +  +    I   AF  
Sbjct: 149 RHKIGKADAENMFQKTLLPIIRSNAEI----FNAGNKTDAELIEIAHRMGSTIMAYAFDL 204

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 180
           E  ++        V      S+     +VP+   +L   ++  A +   ++++ +   RP
Sbjct: 205 ENDEEEEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRP 260

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL--NTEDPQYQD 230
            KAGE I+ + GP PNS+LL  YG+V E +        P+D  +VE+AL  N   P    
Sbjct: 261 IKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIP---- 314

Query: 231 KRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL 282
            +++ Q  G L          V     G      +   P        D  E +++ +  +
Sbjct: 315 GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKGI 374

Query: 283 GPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 338
               P  +    +R  + Q  LA   +A +A YP ++SEDE +L   +L+ + R+A  + 
Sbjct: 375 KKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLK-QDLNQRTRMAIAVR 433

Query: 339 RMEKKMLNACLQVTADMIMLLPDVTVSPC 367
             EKK+L   +  ++  + +  D    P 
Sbjct: 434 LGEKKLLQEAITASSGDVEMTMDDESGPA 462


>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/356 (21%), Positives = 147/356 (41%), Gaps = 46/356 (12%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L+ +L+ EK +   SFW PYI  L             P+ ++ ++L +L GSP   +I +
Sbjct: 215 LSTFLLQEKFRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKD 267

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   ++++YN++  V       F QY          F  F  A +   S +  +  ++  
Sbjct: 268 KLADLQKDYNDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGV 313

Query: 144 RRFALVPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +  A VPL        P L +  YS + +  +   D+ ++         G+ I    G +
Sbjct: 314 KTDAFVPLADMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRK 365

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
            NS+  +NYGFV E N  + + +    +  DP  Q K    + + +   + F +    ++
Sbjct: 366 CNSRFFLNYGFVVEGNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDE 424

Query: 255 EAISDMLPYLRLGYVSDTSEMQSVISSLGP-------ICPVSPCMERAVLDQLADYFKAR 307
            A+ D + ++R   + D ++++ +++             P+    E  +   +    K  
Sbjct: 425 TAVIDFMSHIRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKT 484

Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 363
           L  YP T  +D+ +L    L   +R    L   EK++L    Q +  M  LL +  
Sbjct: 485 LKQYPTTFEQDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540


>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
           florea]
          Length = 483

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/342 (23%), Positives = 145/342 (42%), Gaps = 24/342 (7%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LA+ L+ E+ + + S W PY+  L            + L  +  ++  L GSPT    L
Sbjct: 156 ALAIALLIERHK-ENSKWKPYLDILP-------TTYTTVLYMTAADMIELKGSPTLEAAL 207

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           ++   I R+Y+  + V+    +       D+    FT+E +  A   V +    +     
Sbjct: 208 KQCRNIARQYSYFNKVFQNNNNAVSAILRDV----FTYERYCWAVSTVMTRQNLIPSEDG 263

Query: 143 ARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
           +R   AL+P+       + +      A  +  +    R +K GE I +  GP+ NS   +
Sbjct: 264 SRMIHALIPMWDMCNHENGRITTDFNATSNYCECYALRDFKKGEQIFISYGPRTNSDFFV 323

Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-M 260
           + GFV  +N  D   +   ++  D   +++  +  +    +V  F +  G   E ISD +
Sbjct: 324 HSGFVYMENKQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKLG--TEPISDLL 381

Query: 261 LPYLRLGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPAT 314
           L +LR+ +    +E+   I S  +  +  +   +E  V + +  +   RL    A YP T
Sbjct: 382 LAFLRV-FSMRKAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTT 440

Query: 315 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           L ED  +L +  L   K++  QL   EK++L   L+     I
Sbjct: 441 LKEDLQLL-ETTLPQIKKLTIQLRVTEKRILQGALEYVEQWI 481


>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
          Length = 541

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 106/220 (48%), Gaps = 24/220 (10%)

Query: 8   YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
           ++A+ +T    ++ A L ++L  E  +GK+SFW PY+  L R+        ++PL +S+ 
Sbjct: 73  FSADFITNT--TQHAALCMFLCLEWLKGKESFWWPYLCVLPRE-------FDTPLYFSDE 123

Query: 68  ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           +L +L G   +A  +E  + I RE  E       A S+ Q+  YD  TE +T+E++  A 
Sbjct: 124 DLQFLQGCNLEATEVEARKLIWREEFE------AAVSILQREGYD--TEYYTWELYLWAS 175

Query: 128 VAVQSCVVHLQKVSLARRFA----LVPLGPPLLAYSSKCKAMLAA---VDDAVQLVVDRP 180
               S     + +   R        +P+  PL+   +   A +      D +++++    
Sbjct: 176 TIFTSRSFPGKLMDWDRIIVHEDDTMPILFPLIDSLNHYPATIITWQPSDTSLRIISGVG 235

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
             AG  +    GP+ N +LL+ YGF    NP+D  +++++
Sbjct: 236 VSAGAEVYNNYGPKANEELLMGYGFTLLQNPFDSFLLKSS 275


>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 612

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 76/267 (28%), Positives = 114/267 (42%), Gaps = 40/267 (14%)

Query: 14  TTNKLSELACLALYLMYEK-KQGKKSFWLPYIREL---------DRQRGRGQLAVESPLL 63
           T   L E   + L+L+YEK   G +S W P +  L           +   G L + +P  
Sbjct: 234 TFAGLGEDTLMTLWLVYEKYALGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPAS 292

Query: 64  W-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
           W +E   A L G+P   + ++  E   R++  L    F A  L + +P   PTE +T   
Sbjct: 293 WPAEVTDALLRGAPLLDDAVKARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRR 346

Query: 123 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA-------AV------ 169
           F+ A  A  +  + +Q  ++          PP         A+L        AV      
Sbjct: 347 FRIASEAWNAYGMTVQAETVGGASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLR 406

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DDA+ L + R  +AGE I V  G + N++LL+ YGF   DNPYD    +  L+ E PQ +
Sbjct: 407 DDALHLPIARGVRAGEEIFVSYGAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGE 462

Query: 230 DKRMVAQR-----NGKLSVQVFHVHAG 251
            + + A R       KLS+    V  G
Sbjct: 463 VRDVSALRERVLHRAKLSLSPHSVRCG 489


>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1118

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 158/392 (40%), Gaps = 61/392 (15%)

Query: 5    LFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
            LF    +      L   + L L +MYE   G +S W PYI  L        L  ++P+ W
Sbjct: 729  LFESQGDEDEQQALDSWSALILIMMYEFFLGHQSKWKPYIDVL-------PLTFDTPMFW 781

Query: 65   SETELAYLTGSPTKAEILE-RAEGIKREY--------------------NELDTVWFMAG 103
            SE EL+YL  S T  +I +  AE + R                       +L  +    G
Sbjct: 782  SEEELSYLQASATVNKIGKADAEEMFRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMG 841

Query: 104  SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK 163
            S    Y +D+  E    +     +V  +     +  V++A           +L   ++  
Sbjct: 842  STIMAYAFDLENEEAENDDESDGWVEDREGKSMMGMVAMA----------DILNADAEFN 891

Query: 164  AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRL 215
            A +   D+ + +   R  KAGE I+ + GP PNS+LL  YG++ E +        P+D  
Sbjct: 892  AHVNHGDEELTVTSIRDIKAGEEILNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD-- 949

Query: 216  VVEAALNTEDPQYQDKRMVAQRNGKL------SVQVFHVHAGREKEAISDMLPYLRLGYV 269
             V+ +L +E    QD  ++ +   K+       + V    +G      +   P +  G  
Sbjct: 950  AVQHSLMSELGVPQD--IMTETMDKMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMP 1007

Query: 270  SDTSE-MQSVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
             D  E +++ I  L    G +       +  +   + +  +   + Y  T++EDE +L  
Sbjct: 1008 PDLKEQLKATIKLLQKVDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQ 1067

Query: 325  YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
             +L  ++R+A ++   EKK+L       ++M+
Sbjct: 1068 DSLTRRQRMAVRVRLGEKKLLQEAFDHFSEMV 1099


>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
 gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 629

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/191 (30%), Positives = 80/191 (41%), Gaps = 26/191 (13%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           A  AL+L+ E+K+   S +  YI  L             P L+SE EL YL G+     +
Sbjct: 227 AYFALFLLQEQKKKDASHYKAYIDSLPTDLSSF------PALFSEEELQYLEGTAALKLV 280

Query: 82  LERAEGIKREYNELDTVWFMAGSLFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQK 139
            E+ E IK +Y  +  V              IP     F+FE F+ AF+   S V  + K
Sbjct: 281 QEQKEDIKTDYESISQV--------------IPEFKSEFSFEQFRWAFLCSHSRVFGI-K 325

Query: 140 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA---GESIVVWCGPQPN 196
           V   +   +VPL   L    S  +      DDA      +  K     + I    G + N
Sbjct: 326 VKGVKTSVMVPLADMLNHKHSGQEDSEWVFDDATNCFTVKALKKIQRNQQIHFSYGSKCN 385

Query: 197 SKLLINYGFVD 207
           SKL +NYGFVD
Sbjct: 386 SKLFLNYGFVD 396


>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
 gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
          Length = 464

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 70/340 (20%), Positives = 149/340 (43%), Gaps = 33/340 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L+E   L ++L+ +   G KS W  Y   L R+       +     W+  ++ +L GS T
Sbjct: 139 LNEYDSLGIFLIQQMAMGDKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKT 193

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
               L   E IK ++  L+   F    L  +YP  I    F    ++ A   + S  + L
Sbjct: 194 LNASLYLKEKIKIQFLRLEKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFL 247

Query: 138 QKVSLARRFALVPLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
           Q +   ++ +LVP        P        K +  + +  + +  D+ Y   + I    G
Sbjct: 248 QNL---KKVSLVPYADFMNHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYG 304

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVH 249
            + N +LL+ YGF+ E NP+D + +  +L+ +D  ++ K+      ++  +++  +F+  
Sbjct: 305 QKTNLELLLLYGFILERNPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYK 364

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARL 308
             +E      +  +LR   +S+  E+ S  +S        +  +E+ +   +    +  L
Sbjct: 365 YPKE------LYEFLRFC-ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLL 417

Query: 309 AGYPATLSEDE---AMLTDYNLHPKKRVATQLVRMEKKML 345
             Y   +SE++   ++ +++ +   +++A +  + EKK++
Sbjct: 418 KNYSKKVSEEKILNSLNSNFLISKNQKMALKQSKCEKKII 457


>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
          Length = 465

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 86/394 (21%), Positives = 158/394 (40%), Gaps = 71/394 (18%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L   + L L ++YE  +G  S W  Y   L         + ++P+ WSE EL  L  S  
Sbjct: 92  LDSWSSLILIMIYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHM 144

Query: 78  KAEILE-RAEGI-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEA 117
           + +I +  AE +                    R  +EL  +    GS    Y +D+  + 
Sbjct: 145 RHKIGKADAENMFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDE 204

Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
              E  ++         +            +VP+   +L   ++  A +   ++++ +  
Sbjct: 205 EEEEETEEWVEDRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTS 253

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 223
            RP KAGE I+ + GP PNS+LL  YG+V E +        P+D  +VE+ L        
Sbjct: 254 LRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 311

Query: 224 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 273
                      + +++D  ++ +  G+++         + +    D+   L+    S   
Sbjct: 312 ILKQIRGEFEGEEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLK 367

Query: 274 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 333
            ++ V S   P       + +AVL +  +   AR   YP ++SED+ +L   NL  + R+
Sbjct: 368 GIKKVQSDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARM 424

Query: 334 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 367
           AT +   EKK+L   +   ++ + +  D    P 
Sbjct: 425 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 458


>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 475

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 141/342 (41%), Gaps = 34/342 (9%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL L+ E+    KS    Y   L    G       +PLLWSE E   L G+     +  
Sbjct: 117 LALVLVAERFGSAKSELAGYASSLPEDLG-------TPLLWSEEETRALAGTQVAGTLNS 169

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKV 140
                +  + +L         LF   P   P   FT   F  A  AV+S     +   K+
Sbjct: 170 YLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLPNFVWAVAAVRSRSHPPLEGDKI 223

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQP-N 196
           +LA    LV L     A ++K     + +    Q+ V    R  + GE++ +   P   +
Sbjct: 224 ALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVVEATRAIRKGEALGMDYAPGKLD 280

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
             +L++YG +D  +P     +   L+  D    DK  + +  G      + +    +++ 
Sbjct: 281 GPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKADIVEGAGLRPSMTYSITP--DQQP 338

Query: 257 ISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
             +M+ +LRL  +   D   ++S+  +      VS   E AV   LA+  +A LAGYP T
Sbjct: 339 GEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSEGNEEAVCAMLAEGARAALAGYPTT 393

Query: 315 LSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNACLQVTAD 354
           L +D A L   +     R  A  LVR+ EK+ L+A  +   D
Sbjct: 394 LDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAVARFFED 435


>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
           carolinensis]
          Length = 311

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 9   DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 281
            K  V  R G  +  VF +HA  E    + +L +LR+  +++    + +I         +
Sbjct: 69  MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127

Query: 282 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
           LG    PVS   E     +L  + +AR    L  Y  T+SED+A L   +L     +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183

Query: 337 LVRMEKKMLNACLQVTA 353
           L   EK++L   ++  A
Sbjct: 184 LRLGEKEILEKAIKSAA 200


>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 713

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +LM +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185

Query: 86  EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
             +K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++    
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
              L+P    +L +    K    A    V  +V     AG+ I    GP+ N +L++NYG
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 304

Query: 205 FVDEDNPYDRLVV 217
           F   +NP D  +V
Sbjct: 305 FCLPNNPCDYRIV 317


>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 478

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 89/389 (22%), Positives = 155/389 (39%), Gaps = 72/389 (18%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           E +    L     L L ++YE  QG  S W  Y   L  +        ++ + W   +L 
Sbjct: 88  EDMENEPLDSWGSLILVMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLE 140

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
           YL GS   ++I +           L  +   AG  F Q     P+E+   ++  +    +
Sbjct: 141 YLKGSAVLSKIGKDEADEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSII 199

Query: 131 QSCVVHLQKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 176
            +    L+      +                +VP+   +L   ++  A +   +D + + 
Sbjct: 200 MAYAFDLENEEEPEQEDEEWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVT 258

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQY 228
             RP KAGE I+ + GP PNS+LL  YG+V   +        P+D  +V++ L TE  + 
Sbjct: 259 ALRPIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRL 315

Query: 229 QDK--RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---- 282
            D+  + VA+      ++   V      E  S+       G++   +++Q V + L    
Sbjct: 316 TDEVWKQVAEHVDPEDLEDVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQL 368

Query: 283 -----------GPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
                      G + P       V  C+  + L +L       LA YP T  EDEAML  
Sbjct: 369 KDVLKAIKKVRGDLIPDKRKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLAS 421

Query: 325 YNLHPKKRVATQLVRMEKKMLNACLQVTA 353
            N+  ++++A ++   EK+++   LQV  
Sbjct: 422 GNVTSRQKLAVEVRLGEKRLIKEALQVAG 450


>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
          Length = 703

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +LM +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175

Query: 86  EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
             +K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++    
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
              L+P    +L +    K    A    V  +V     AG+ I    GP+ N +L++NYG
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 294

Query: 205 FVDEDNPYDRLVV 217
           F   +NP D  +V
Sbjct: 295 FCLPNNPCDYRIV 307


>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 431

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 76/162 (46%), Gaps = 15/162 (9%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           + L +S+ EL+YL GS       +R   I+ +Y+ + T       LF  YP     E F+
Sbjct: 138 TTLFFSDDELSYLEGSSLHHFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 191

Query: 120 FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
            + +K A   +  +S VV   K  L     +  + P     + + K  + AVD  +    
Sbjct: 192 LDAWKWALSVIWSRSFVVDEGKRGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 247

Query: 178 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYDRLV 216
             P K GE I V  G      N++LL++YGFV E+NP+D +V
Sbjct: 248 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHDAVV 289


>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 567

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 109/401 (27%), Positives = 159/401 (39%), Gaps = 85/401 (21%)

Query: 20  ELACLALYLMYE-------KKQGKK--SFWLPYIRELDR--QRGRGQLAVES----PL-- 62
           E   LA  L+YE       KK G+   SFW PY+  L       +G    ES    PL  
Sbjct: 161 EFFALATLLLYERGLDESWKKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDA 220

Query: 63  --LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFT 119
             LW+E E+ +L GSPT          ++REY E          L++++P+    E AF 
Sbjct: 221 LWLWAEDEMQWLQGSPTLLSARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFR 274

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG--------------------PPLLAYS 159
            E F  AF  + S  V L   +     ALVP                      P     S
Sbjct: 275 LERFLWAFGVLFSRAVSLPAENGM--LALVPYADLANHSAFCVSFIDARTAAFPYAFRAS 332

Query: 160 SKCK----------------AMLAAVD------DAVQLVV---DRPYKAGESIVVWCGPQ 194
           SK K                  +A  D      DA + VV   DR Y   E + V  G +
Sbjct: 333 SKQKRGQWWQRFLAPNSDDAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQK 392

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ----YQDKR--MVAQRNGKLSVQVFHV 248
            N++LL+ YGFV + NPY+ + V  +L+  +         KR  ++A        + F +
Sbjct: 393 SNAELLLLYGFVSDRNPYNSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPL 452

Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
           +A R    +  +L +  L      +   S +  +    PV+   E A    L    K  L
Sbjct: 453 YADRYPLELMQLLRFASL--TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIAL 510

Query: 309 AGYPATLSEDEAMLTDYN----LHPKKRVATQLVRMEKKML 345
             YP +  ED+A L D +    L  K+R++ +L R EK++L
Sbjct: 511 QAYPTSADEDDAALKDKSMAQLLSRKQRLSVRLRRSEKRIL 551


>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
          Length = 428

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 50/206 (24%), Positives = 92/206 (44%), Gaps = 16/206 (7%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           KL+ +  L L+++ +K  G+ S W P++  L R          +P+    T    L    
Sbjct: 99  KLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTTPVFLRRTVFESLP-KD 150

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
            + E+  R   I+R + +L  +  + G + ++      +  FT+  F  A+ AV +  + 
Sbjct: 151 LREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTWNNFVWAWTAVNTRCIF 208

Query: 137 LQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
            Q  + +  +     AL P     L +  K     A V +  +++  + + A E + +  
Sbjct: 209 AQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISY 267

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
           GP  N +L ++YGFV  DNP D +VV
Sbjct: 268 GPHSNRRLFLDYGFVLPDNPNDVVVV 293


>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
           atroviride IMI 206040]
          Length = 956

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 85/394 (21%), Positives = 154/394 (39%), Gaps = 74/394 (18%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           + L + L+YE  +G +S W PY+  L         + E+P+ WS+ E+  L  S T+++I
Sbjct: 586 STLIIVLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKI 638

Query: 82  LE-RAEGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
            +  AE +                     +   EL  +    GS    Y +D        
Sbjct: 639 GKTNAEEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD-------- 690

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
             F+      +       +   A+    +VP+   +L   ++  A +   DDA+ +   R
Sbjct: 691 --FQNEDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLR 747

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED-- 225
             KAGE I+ + GP PNS+LL  YG+V   +        P+    D L     L++E   
Sbjct: 748 TIKAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLD 807

Query: 226 --------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 277
                    ++++  ++ + + + +      +  +  E   D+   L+         M  
Sbjct: 808 SAREHLDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLK 859

Query: 278 VISSLGPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
            I  + P C V       V    L     A  + YP T+ EDE +L+  NL  +++ A  
Sbjct: 860 AIRKVDPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVT 919

Query: 337 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 370
           +   EK++L     + ++   +  D  +   PAP
Sbjct: 920 VRLGEKRLLQEARVLLSE---IASDAILDDAPAP 950


>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
          Length = 1000

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 87/394 (22%), Positives = 160/394 (40%), Gaps = 71/394 (18%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L   + L L ++YE  QG  S W  Y   L         + ++P+ WSE EL  L  S  
Sbjct: 627 LDSWSSLILIMIYEYLQGDSSQWKSYFDVL-------PSSFDTPMFWSENELDQLQASHM 679

Query: 78  KAEILE-RAEGI-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEA 117
           + +I +  AE +                    R   EL  +    GS    Y +D+  + 
Sbjct: 680 RHKIGKADAEDMFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDE 739

Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
              E  ++     +   +            +VP+   +L   ++  A +   ++++ +  
Sbjct: 740 EEEEETEEWVEDREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTS 788

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 223
            RP KAGE I+ + GP PNS+LL  YG+V E +        P+D  +VE+ L        
Sbjct: 789 LRPIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 846

Query: 224 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 273
                     E+ +++D  ++ +  G+++         + +    D+   L+  ++    
Sbjct: 847 ILEQIRGEFEEEEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIK 905

Query: 274 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 333
           ++Q   S   P       + +AVL +  +   AR   YP ++SED+ +L   NL  + R+
Sbjct: 906 KLQ---SDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARM 959

Query: 334 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 367
           AT +   EKK+L   +   ++ + +  D    P 
Sbjct: 960 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 993


>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
          Length = 963

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 82/365 (22%), Positives = 147/365 (40%), Gaps = 63/365 (17%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           + L + +MYE  +G +S W PYI  L         + E+P+ WS+ EL  L  S T++++
Sbjct: 590 STLIIVMMYEYFKGSESKWKPYIDVL-------PSSFETPMFWSDAELDELQASATRSKV 642

Query: 82  LE---------------RAEG----IKREYNELDTVWFMA--GSLFQQYPYDIPTEAFTF 120
            +               RA        + Y++ D +      GS    Y +D        
Sbjct: 643 GKASAEEMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFD-------- 694

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
             F+      +       +   A+    +VP+   +L   ++  A +   DDA+ +   R
Sbjct: 695 --FQNEDEEDEDETEEWVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALR 751

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQ 229
             KAGE I  + GP PNS+LL  YG+V  + + YD +          V A+L     Q  
Sbjct: 752 TIKAGEEIFNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL- 810

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICP 287
           DK      + +L           E      +    R   + +    +++S++ ++    P
Sbjct: 811 DKARECLDSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVP 870

Query: 288 VSPCMERA-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
            S   +R        +L +  D   +R   YP ++SEDE +L   ++  ++R A  +   
Sbjct: 871 SSVVDKRKRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLG 927

Query: 341 EKKML 345
           EK+++
Sbjct: 928 EKRLI 932


>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
          Length = 489

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 50/264 (18%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
             LAL+L+ EK     SFW PYI  L  +         + L ++  ELA L  SP     
Sbjct: 159 VTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLYFNSEELAELRPSPVFESS 210

Query: 82  LERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT-----EAFTFEIFKQAFVAV-- 130
           L+    I R+Y    N++ T+             D+P      + FTF+ ++ A   V  
Sbjct: 211 LKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNLQDIFTFDNYRWAVSTVMT 257

Query: 131 -QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------PYK 182
            Q+ +V     +L    A +PL          C      +     L ++R        Y+
Sbjct: 258 RQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITTDFNLELNRGECYALQDYR 308

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
             E I ++ G +PNS L ++ GFV  DN YD L +   ++  D     K  +  + G   
Sbjct: 309 RDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSG 368

Query: 243 VQVFHVHAGREKEAISDMLPYLRL 266
           V  F ++ G    ++ ++L ++R+
Sbjct: 369 VTNFSLYKGASPISV-ELLAFIRI 391


>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 451

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 101/227 (44%), Gaps = 22/227 (9%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
             +  +  +A LA  L+  K  G+ S W PYI  L       Q  + + + W+E+EL  +
Sbjct: 97  FISEDVGNIAKLATVLLIHKNLGQDSEWHPYISCLP-----PQAEMHNTIFWNESELEMI 151

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E + +   I++++ E+  V       FQ  P+      FT++ F  A   V S
Sbjct: 152 RQSSVYQETIYQKSQIEKDFLEIKPV-------FQ--PFCQSFGDFTWKDFMHACTLVGS 202

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
                +     +  +L+P     L +    +A++ + DD    ++  DR Y  GE +++ 
Sbjct: 203 -----RAWGSTKGLSLIPFAD-FLNHDGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIR 256

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
            G   N+ L++++GF    N YD++ ++  +   DP    K  + Q+
Sbjct: 257 YGKFSNATLMLDFGFTIPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303


>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 502

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 149/351 (42%), Gaps = 44/351 (12%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------S 65
            +AL+L+ EK + + S W PY+  L            S + W                 S
Sbjct: 125 SVALFLIREKLKKEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRS 177

Query: 66  ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK- 124
           E ELA L G+      L   E ++RE+ +++    +        P  +    + F I + 
Sbjct: 178 EEELAELQGTQLLRTTLGVKEYMQREFAKVEEEILLPHKELFPSPITLDDFLWAFGILRS 237

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL----AYSSKCKAMLAAVDDAVQLVVDRP 180
           +AF  ++      Q + L     L+   P +     AY  K   + +  +    L     
Sbjct: 238 RAFSRLRG-----QNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPIS 291

Query: 181 YKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
            K+GE +++ +   + N++L ++YGF+++    +   +   ++  DP + DK  +A+ NG
Sbjct: 292 VKSGEQVLIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNG 351

Query: 240 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMER 294
                 F +  G        MLPYLRL  +  T    ++S+  +   G +  P+S   E 
Sbjct: 352 SGETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEE 409

Query: 295 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
            +   + D  K+ L+GY  T+ EDE  L   +L+P+  +A  +   EKK+L
Sbjct: 410 LICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVL 459


>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
 gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
          Length = 604

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/231 (25%), Positives = 112/231 (48%), Gaps = 35/231 (15%)

Query: 15  TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           T++ +    L+L+L+ EK +GK SFW PYIR L         +  +P+ ++E+EL  L+ 
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282

Query: 75  SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
           S     + E+A  +K+E    +N+L+   F+   L +          FTF+ F+ A+  +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329

Query: 131 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 181
           ++  ++ +      L+ +      LVP+   L+ +S   KA           ++ V  PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388

Query: 182 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 231
           +  + + +  G + N++L++ +GF   +NP D + +  +   E  PQ  D+
Sbjct: 389 RKYDQVFISYGFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439


>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
          Length = 318

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
             +VP+   +L   ++  A +   +D + +V  RP KAGE I+ + GP PNS+LL  YG+
Sbjct: 71  LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129

Query: 206 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 241
           V   +        P+D  +V++ L  +               DP+ ++D  ++ + +G+ 
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187

Query: 242 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 301
             +       + +E  +++   L+       + ++++    G + P     +      +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240

Query: 302 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
              +  LA YP T  EDEA+L   NL  ++R+A ++   EK++L   LQ+  
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQMDG 292


>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
 gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
          Length = 403

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 82/347 (23%), Positives = 148/347 (42%), Gaps = 41/347 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL+++ E+   + S W PY+  L R       +  + L +S  ++  L GSP+  E L+
Sbjct: 73  LALHVLCERLH-EDSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALK 124

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           +  GI ++Y     V+F            +P   +FTF+ F+ A   V +    ++  S 
Sbjct: 125 QFRGIVKQY-----VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSN 179

Query: 143 ARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVD---RPYKAGESIVVWCGPQPNSK 198
               AL+P+        + C        DD+ + V     +P +AG+ + ++ G + N+ 
Sbjct: 180 ETVKALIPM----WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNAD 235

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
            L + GFV  +   D + ++  ++  D  Y  K  +    G L          R  E IS
Sbjct: 236 RLFHNGFVYTEAEEDWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPIS 294

Query: 259 -DMLPYLRLGYVSDTSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR-- 307
            ++  +LR+ +  +T E++          ++ L  +C     +      +L  +F  R  
Sbjct: 295 PELRIFLRV-FSMNTGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQ 353

Query: 308 --LAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 350
             L  Y  T  EDEA+L+  D  LH   R   +L   E+ +L + L+
Sbjct: 354 LILGQYKTTKQEDEALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398


>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 273

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 15/159 (9%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           + L +S+ EL+YL GS       +R   I+ +Y+ + T       LF  YP     E F+
Sbjct: 124 TTLFFSDDELSYLEGSSLHYFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 177

Query: 120 FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 177
            + +K A   +  +S VV   K  L     +  + P     + + K  + AVD  +    
Sbjct: 178 LDAWKWALSVIWSRSFVVDEGKSGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 233

Query: 178 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYD 213
             P K GE I V  G      N++LL++YGFV E+NP+D
Sbjct: 234 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHD 272


>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
 gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 499

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 28/205 (13%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            YL+ +  +GK S W PYI  L       + A+  P  W+E ++  L G+     I E  
Sbjct: 107 FYLIQQYLKGKSSLWAPYISTLTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQ 164

Query: 86  EGIKREYNELDTVWFMAGS----LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
           + +K EY +   +    GS     + Q  Y+     FT   F+ + +  +S   ++++  
Sbjct: 165 DNVKSEYKQARKILKQEGSPDYRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER-- 222

Query: 142 LARRFALVPLGPPLLAYS--------------SKCKAMLAAVDDAVQLVVDRPYKAGESI 187
                 L+P G  +  +S              ++    L +   A +L+    Y+ G+ +
Sbjct: 223 ------LLPEGAKIDDFSILQPLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQV 276

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPY 212
               G + NS+LL+ YGFV E+N Y
Sbjct: 277 FNNYGKKTNSELLLGYGFVTENNDY 301


>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
 gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
          Length = 460

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/351 (21%), Positives = 155/351 (44%), Gaps = 39/351 (11%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           + +N     + LA+ L+ E  + KKSFW PYI  L  +     L       W   EL ++
Sbjct: 132 INSNGSDNYSDLAIKLLVELFKNKKSFWFPYIGILPEEYDLKLL-----FRWPLKELFFI 186

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
            GS          + +K +Y  ++        +FQ+     P++ F ++ ++ +   + S
Sbjct: 187 KGSRLSKASDYLKKKLKAQYEMVNK------EVFQRNRLLYPSKIFNYQNWEWSMSILLS 240

Query: 133 CVVHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESI 187
             + LQ   KV L     L+   P   ++ S  K  L+   D+ ++VV  D+     + +
Sbjct: 241 RTISLQETKKVVLIPYIDLLNHNPFSSSFISYRKIPLS---DSKEIVVYSDKNCNKFDQL 297

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
            +  G + N +LL  YGF+ E NPYD +++  +++ +D  +++K+     N K     + 
Sbjct: 298 YISYGQKSNLELLNLYGFIAERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYP 357

Query: 248 VHAGREKEAISDMLPYLRL----GYVSDTSEMQSVISSLGPICPVSPC----MERAVLDQ 299
           +   +  +   +M+ ++++      ++D +   + I +      +  C    +E+++   
Sbjct: 358 IFLYKYPD---EMIEFIKICLFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSN 414

Query: 300 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
             DY   R       L E+   ++D      ++++ +   +EKK+LN  L+
Sbjct: 415 YNDYENLR----NIMLKENLLHISD-----NQKISIKYNALEKKILNRFLE 456


>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
 gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
          Length = 484

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 26/312 (8%)

Query: 58  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
           +++PLLWSE EL  L G+    ++L+   G    +    T   +   LF   P   P   
Sbjct: 144 LDTPLLWSEDELQELQGT----QVLQTLGGYLTFFRS--TFQQLQSGLFTSNPAAFPPSI 197

Query: 118 FTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
           FT   F  A  AV+S     +   K++LA    L  L     A +SK     A +    Q
Sbjct: 198 FTLPRFLWAVAAVRSRSHPPLDGPKIALA---PLTELVSHRRAANSKLSVRSAGLFGRGQ 254

Query: 175 LVV---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 230
           ++V    R  + GE + +  GP + +  +L++YG +D  +P     +   +   D    D
Sbjct: 255 VLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPDSDRFIDD 314

Query: 231 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPIC 286
           K  + + N      V+++    +++   +ML +LRL  +  SD   ++S+  +   G + 
Sbjct: 315 KLDILESNDLPQSVVYNLTP--DEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDVWGFMQ 372

Query: 287 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK--KRVATQLVRM-EK 342
            PVS   E AV + L++  +A L GY  T+ +D A L       K  +R A  L+R+ EK
Sbjct: 373 EPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAELRAQGSRAKGSRREAALLIRLGEK 432

Query: 343 KMLNACLQVTAD 354
           + L+A  +   D
Sbjct: 433 EALDAVARFFED 444


>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
           972h-]
 gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
 gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
           pombe]
          Length = 547

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 106/247 (42%), Gaps = 31/247 (12%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L  +L  E  +G +S W  YI  L +          +PL ++E + A+L  +   +   E
Sbjct: 82  LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSL 142
           R    K EY E       A SL     +  PTE FTF+++   A V    C        +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179

Query: 143 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 196
            +     P+  PL+ + + K K  +    D     +VQL+       G  +    GP+ N
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGN 239

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREK 254
            +LL+ YGF   DNP+D + ++ A++ + P    K  + + +   +LS  VF +    +K
Sbjct: 240 EELLMGYGFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDK 299

Query: 255 EAISDML 261
           E    +L
Sbjct: 300 EIFQKIL 306


>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
          Length = 353

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 101/217 (46%), Gaps = 26/217 (11%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           KLS +  +  YL+ E+ + K SFW  YI+ L +       +  +P+ ++  E+ +L    
Sbjct: 102 KLSPMQVICTYLICERSRQKDSFWYNYIKVLPK-------SYSNPVYFTNEEINWLP-RR 153

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
            K ++ +  E I   Y EL  ++ +  S F  +        F +  F+ A+  V +  V+
Sbjct: 154 IKRKVFDECEKINTAYRELKNLFSILESTFVSFK-----GIFEYSAFRWAWCTVNTRSVY 208

Query: 137 LQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIV 188
           + +     +S+ R  +AL P    LL +++  + KA    V    ++      K  + + 
Sbjct: 209 MLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMF 267

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
           ++ GP  N KL I YGFV   N ++  VVE  L+ ED
Sbjct: 268 IYYGPHDNVKLFIEYGFVLPQNQHN--VVE--LDFED 300


>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 470

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 146/361 (40%), Gaps = 45/361 (12%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L   LM E  Q   S W PY+  L  Q       + S + WS++ELA L  S    +I 
Sbjct: 86  ALTSILMAEG-QVDDSKWAPYLAILPEQ-------LNSLVFWSDSELAELQASAVVKKIG 137

Query: 83  ERA-----------EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           ++            +G++    E+        S+   Y +DIP  +       +   A  
Sbjct: 138 KQGAEDMFKTYITPQGLQHSSTEM---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAAD 194

Query: 132 SCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
             V      +K  L+    ++PL   L A + +  A L   ++ +++   +P   GE I 
Sbjct: 195 DLVSDDGEDEKTILS----MIPLADMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIF 250

Query: 189 VWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL----------NTEDPQ-YQDKRMV-- 234
              G  P S LL  YG+V D  + YD   + A L          +   P+  QDK  V  
Sbjct: 251 NDYGQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRL 310

Query: 235 --AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 292
             A+R G        VH+  ++ +I D L       + D S +++++ S   +   S   
Sbjct: 311 DLAEREGVYDESFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLT 370

Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 352
                  LA   +AR   Y  T+ EDE +L + +L  +  +A Q+   EKK+L A ++  
Sbjct: 371 TELAGQVLAILLQARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREA 430

Query: 353 A 353
           A
Sbjct: 431 A 431


>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Vitis
           vinifera]
          Length = 480

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 100/217 (46%), Gaps = 25/217 (11%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L  +++  +A LA+ +  E K G+ S W PYI  L  Q G     + S + WSE EL  +
Sbjct: 105 LLGDEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMI 159

Query: 73  TGSPTKAEILERAEGIKREYNELDTV-WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
             S    E + +   I++++  +  V    + +LF+    DI  + F           + 
Sbjct: 160 QQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLKEF-----------MH 204

Query: 132 SC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIV 188
           +C +V  +     +  +L+P     + +     ++L   +D    +++ DR Y  GE ++
Sbjct: 205 ACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVL 263

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
           +  G  PN+ LL+++GF    N YD++ ++  +   D
Sbjct: 264 IRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 300


>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 232

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 96/211 (45%), Gaps = 11/211 (5%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P+       + +      A  D  +    R +K GE + +  GP+ NS   ++ GFV
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77

Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 265
             DN  D   +   ++  D   +++  +  +    SV  F +  G   E ISD +L +LR
Sbjct: 78  CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135

Query: 266 LGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDE 319
           + +    +E+   + S  +  +  V   +E  V + +  +   RL    A YP TL ED 
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194

Query: 320 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
            +L +  L   K++A QL   EK++L+  L+
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALE 224


>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
 gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
          Length = 467

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 67/265 (25%), Positives = 110/265 (41%), Gaps = 55/265 (20%)

Query: 4   LLFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 63
           LL   A + +   +L     L + LMYE+ +GK S W  Y++ L RQ          P L
Sbjct: 57  LLTTAARDAIERARLGGGLGLTVALMYERSKGKGSKWYRYLKTLPRQE-------SVPFL 109

Query: 64  WSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
           WSE E+   L G+     + E    +K ++ E          L ++ P + P + FTFE 
Sbjct: 110 WSEEEIDGLLLGTELHKALKEDKLLMKEDWEE------NIAPLTKEDPLEFPAQDFTFE- 162

Query: 123 FKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKCKA-----MLAA-------- 168
              +++A +S V     ++     + +VPL      ++ K  A     ML A        
Sbjct: 163 ---SYLAAKSLVSSRSFEIDAEHGYGMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDN 216

Query: 169 ---VDDA---------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
              +DD                +++V+ +   AG  I    G   N+ LL  YGF + +N
Sbjct: 217 GLIIDDGLANGDCREISSDKSVLEMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNN 276

Query: 211 PYD--RLVVEAALNTEDPQYQDKRM 233
           P+D   L ++  L     ++Q KR+
Sbjct: 277 PHDIVNLDMDCVLEVLLSRFQKKRV 301


>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 411

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 14/182 (7%)

Query: 179 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           R   A + +VVW G          +PN +LL+  G + ++N  D L   A L   D  Y 
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 289
            K  V +  G  + + F V+A R       +L YLRL  V+D + +     +      +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280

Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
              E  +L  L    + RLA Y  +  ED  +    +L PK+R+A +L   EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340

Query: 350 QV 351
           + 
Sbjct: 341 EA 342


>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 709

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 68/224 (30%), Positives = 99/224 (44%), Gaps = 24/224 (10%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 82
           A +LM +    +  FW PYIR L      G+  + +PL + E E  L +L G  + A   
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161

Query: 83  ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 135
           ER   I     +R Y  L  + F      + Y +D+   A T  I  +AF A  + S + 
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
            L+   + R   L+PL   + A + K   K    A  D++ LVV     AGE +    GP
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGP 274

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
           + N +L++NYGF   DNP +  VV      + P  Q K    Q 
Sbjct: 275 RNNEQLMMNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQH 318


>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
          Length = 458

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
            K  V  R G  +  VF +H+  E    + +L +LR+  +++          S +  + +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
                 PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334

Query: 341 EKKMLNACLQVTA 353
           EK++L   ++  A
Sbjct: 335 EKEILEKAVKSAA 347


>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
 gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
          Length = 266

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 30/199 (15%)

Query: 24  LALYLMYEKKQGK----KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
           LAL ++ E+ +G+     S W PYI  L          +++  LW +TEL+YL  SP   
Sbjct: 67  LALIVLMERYKGQAKLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYG 121

Query: 80  EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
           +  ER E I  E+ ++     +   LF +          + E FK  +  V S     + 
Sbjct: 122 KTRERLEMITTEFGQVQNALDVWPQLFGK---------VSLEDFKHVYATVFS-----RS 167

Query: 140 VSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +++     LV +  P+L +     +S  K     + +   +  DR Y   + I +  G  
Sbjct: 168 LAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 225

Query: 195 PNSKLLINYGFVDEDNPYD 213
            N++L ++YGF   +NPYD
Sbjct: 226 SNAELALDYGFAVPENPYD 244


>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
 gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
          Length = 425

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 96/237 (40%), Gaps = 33/237 (13%)

Query: 17  KLSELACLALYLMYE----KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-Y 71
            L E + LA +L  +          S +  Y+R L R+ G         L W E ++   
Sbjct: 62  NLQEWSVLAAFLAEQALAIDAGADGSRFATYVRALPRRTG-------GVLDWPEEDVKEL 114

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           L GSP++   +ER   +    +E+             +P   P         + AF  + 
Sbjct: 115 LAGSPSQRAAMERQASVDAAIDEIRA----------SFPQLTPG------ALRWAFDVLF 158

Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           S ++ L     A   ALVP    +L +   C A +     AV L  DR YK GE +    
Sbjct: 159 SRLIRLPNRGGA--LALVPWAD-MLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASY 215

Query: 192 GPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP+P+S+LLI+YGF     +NP D   V   ++  D     K    +R G   V+ F
Sbjct: 216 GPRPSSELLISYGFAPAVGENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272


>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
          Length = 536

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
            K  V  R G  +  VF +H+  E    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
                 PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412

Query: 341 EKKMLNACLQVTA 353
           EK++L   ++  A
Sbjct: 413 EKEILEKAVKSAA 425


>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 524

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 64/214 (29%), Positives = 95/214 (44%), Gaps = 23/214 (10%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELA 70
           L+  + L+    + L+L+YE+ +G+ SFW PYI  L ++    +L    P+LWS+     
Sbjct: 144 LVKKSDLTPWQAMCLHLLYERARGETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRRE 200

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQA 126
           +L GSP   ++ ER   I RE  E   +   AG L     +  P  I   A     +   
Sbjct: 201 WLEGSPM-LDVTERRLAICREDYEA-MLLAGAGRLTPRGNEGEPISITETAVQ---WAAT 255

Query: 127 FVAVQSCVVHLQKVSL------ARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVV 177
            +  +S  ++LQ   L          ALVP    L   SS  +      D       L  
Sbjct: 256 MLLSRSFSLNLQTQKLRPGSFAEDTIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQA 315

Query: 178 DRPYKAGESIVVWCGPQPN-SKLLINYGFVDEDN 210
            R Y  GE +    GP  + S+LL++YGFVDE+N
Sbjct: 316 HRTYSEGEQVFDSYGPSCSPSRLLLDYGFVDEEN 349


>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
           str. Neff]
          Length = 408

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 138/351 (39%), Gaps = 40/351 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL++++E ++   SFW PY   L  +       V  P+ W++ ++  L GSP  A +L 
Sbjct: 48  LALFMLHELRK-PDSFWRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLA 99

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVV 135
           + +  +  + E          + ++YP  + +  +      +++   F+     + S   
Sbjct: 100 QKKQARDWHTE------HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAF 153

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCG 192
               +       +VP+   +    +         DD  Q     V +PY  GE + +   
Sbjct: 154 WHFDLKDTWEPHMVPMADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYC 213

Query: 193 PQPNSKLLINYGFVDEDN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 245
              N +LL  Y  + EDN         D   +      E  +    R + QR   L+ Q 
Sbjct: 214 TDSNFELLKTYAMMVEDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQT 271

Query: 246 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
           + V   + +E   D++  LRL ++  T       S+     PVS   E  V D +A   K
Sbjct: 272 YPV---KSEEFPLDLVQALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVK 326

Query: 306 ARLAGYPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 355
             L+ YP T  ED AML  D  L    R+A    R +K  L     V A+M
Sbjct: 327 ELLSQYPITAQEDAAMLAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377


>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
          Length = 408

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 10/193 (5%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
            K  V  R G  +  VF +H+  E    + +L +LR+  +++          S +  + +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
                 PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284

Query: 341 EKKMLNACLQVTA 353
           EK++L   ++  A
Sbjct: 285 EKEILEKAVKSAA 297


>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 705

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 116/251 (46%), Gaps = 14/251 (5%)

Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
           ++FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
            V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 465

Query: 235 AQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLG 283
             R G    +   VF +H   E    + +L +LR+  +++    + ++         +LG
Sbjct: 466 LARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLG 524

Query: 284 PI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 342
               PVS   E  +   L D     L  Y  T  ED+A+L + +L  + ++A +L   EK
Sbjct: 525 NSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKLRLGEK 584

Query: 343 KMLNACLQVTA 353
           ++L   +Q  A
Sbjct: 585 EILEKAVQSAA 595


>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 232

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 93/211 (44%), Gaps = 11/211 (5%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P+       + +      A  D  +    R +K GE + +  GP+ NS   ++ GFV
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77

Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 265
             +N  D   +   ++  D   +++  +  + G  SV  F +  G   E ISD +L +LR
Sbjct: 78  YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135

Query: 266 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 319
           + +    +E+   + S          C +   +E  V   L    +  +A YP TL ED 
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194

Query: 320 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
            +L +  L   K++A QL   EK++L   L+
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALE 224


>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 101/223 (45%), Gaps = 31/223 (13%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L  +++  +A LA+ +  E K G+ S W PYI  L  Q G     + S + WSE EL  +
Sbjct: 106 LLGDEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMI 160

Query: 73  TGSPTKAEILERAEGIKREYNELDTV-WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
             S    E + +   I++++  +  V    + +LF+    DI  + F           + 
Sbjct: 161 QQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLKEF-----------MH 205

Query: 132 SC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--------AVQLVVDRPYK 182
           +C +V  +     +  +L+P     + +     ++L   +D         ++++ DR Y 
Sbjct: 206 ACALVGSRAWGSTKGLSLIPFAD-FVNHDGFSDSVLLGDEDKQLSESSSTLEVIADRNYA 264

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
            GE +++  G  PN+ LL+++GF    N YD++ ++  +   D
Sbjct: 265 PGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 307


>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
 gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
          Length = 464

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 150/375 (40%), Gaps = 62/375 (16%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L L+YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 140
           R   + +  +EL        +  +  P +I   A  F        A  AV S    +  V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 190
           +     A++    PL+   +      A +++ +          ++V  R  + G +I + 
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LS 242
            GP  N  LL++YGFV  DNP+DR+     L  +    ++ RM+A   R G        S
Sbjct: 266 YGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPAS 321

Query: 243 VQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGP 284
            QV                V  G  +E    +L  LR+ +       E + ++S  + G 
Sbjct: 322 WQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGV 381

Query: 285 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
              VS   E  VL  L          +  T+ EDEA L+D +L    R+A Q    +K++
Sbjct: 382 ESMVSSDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRL 441

Query: 345 LNACLQVTADMIMLL 359
           +   L+     +M L
Sbjct: 442 VVRVLESLKKRLMDL 456


>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
           B]
          Length = 510

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L   +  +++V  +P  AGE I    G  PNS LL  YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319

Query: 207 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 254
           D          E NP D + + A L          R   +  G L V+V F +    +  
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRVDFWLEEADDDT 374

Query: 255 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
                  E   ++L ++RL  +S T +  + + + G +      +E  +L  + D  K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430

Query: 308 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           L  YP T+ EDE++L   +  NL   KR A  +   EK++L   LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476


>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
          Length = 538

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
            K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
                 PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412

Query: 341 EKKMLNACLQVTA 353
           EK++L   ++  A
Sbjct: 413 EKEILEKAVKSAA 425


>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 499

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 78/191 (40%), Gaps = 12/191 (6%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ E  +G+ SFW PYI  L +        +  P  W E ++ +L G+     I E  
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168

Query: 86  EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 140
             IKREY +   V     F     + Q  Y      FT   F+ + +  QS   ++   +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP 195
             AR      +  PL   ++         D     +  QL+    Y+ G+ +    G + 
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKT 288

Query: 196 NSKLLINYGFV 206
           NS+LL+ YGF+
Sbjct: 289 NSELLLAYGFI 299


>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
           C-169]
          Length = 685

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 76/171 (44%), Gaps = 18/171 (10%)

Query: 65  SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 124
           +E E++ L G+P     +E  + I+ +Y     V     +L   YP DI  +  T + F 
Sbjct: 65  TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPVL---QALTAAYPDDITPDLVTEDKFI 121

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 176
            A     S  + ++ V  A R  LVP+         P ++ Y       L A  D+++L 
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRY-----GRLDAATDSLRLR 176

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 225
             R   AGE   +  GP PN KLL+ YGF   DNP+D   +  EA  N  D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227


>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1033

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 90/209 (43%), Gaps = 24/209 (11%)

Query: 8   YAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
           Y  E +   ++      A  L+ EK +G  S W  YI  L +        +  P+L  + 
Sbjct: 585 YDNETIVAGEVDSWDRAAARLLREKAKGSSSAWASYINILPQN-------MTVPILLEDH 637

Query: 68  ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFK 124
           EL  +   P   E+++  + I+  ++ L +V  +AG+ F++Y +    + + AFT  +F 
Sbjct: 638 ELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDLAGADFEEYRWAAMMVHSRAFTLPVFA 696

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
               A    + ++  ++    +    +  P+                 V++V  R  K G
Sbjct: 697 DDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG-------------GKVEIVARRDIKKG 743

Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
           E +    GP+ N  L + YGFV +DNP+D
Sbjct: 744 EELFASFGPRANDNLFLYYGFVLKDNPFD 772


>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 471

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 81/378 (21%), Positives = 143/378 (37%), Gaps = 55/378 (14%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           E +    L     L L ++YE  QG  S W PY   L  +         + + W  ++L 
Sbjct: 86  EDMENEPLDSWGSLILVMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLE 138

Query: 71  YLTGSPTKAEI-LERAEGIKREY----------------------NELDTVWFMAGSLFQ 107
            L GS   ++I  E A+ + R                         EL  +    GS+  
Sbjct: 139 NLKGSAVLSKIGKEEADEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIM 198

Query: 108 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 167
            Y +D+  E    +     ++  +     L  V +A           +L   ++  A + 
Sbjct: 199 AYAFDLDNEEEPEQEEDDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVN 248

Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVV 217
             DD + +   RP  AGE I+ + GP PNS+LL  YG+V   +        P+D  +  V
Sbjct: 249 HGDDELTVTALRPIPAGEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASV 308

Query: 218 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQ 276
              L   D  ++  +          V V    +G  + E     +  +R        +++
Sbjct: 309 SEHLKIGDDVWKQVQEYVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLK 368

Query: 277 SVISSL----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 332
           +V+ ++    G + P     +      +    +  L+ YP +  EDEA+L   +L  ++R
Sbjct: 369 AVLKAIKKINGDLIPDKRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQR 428

Query: 333 VATQLVRMEKKMLNACLQ 350
           +A  +   EKK+L   L+
Sbjct: 429 MAIHVRLGEKKLLKEALE 446


>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 245

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 87/219 (39%), Gaps = 27/219 (12%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P+       + +      A  D  +    R ++ GE I +  GP+ NS   ++ GFV
Sbjct: 31  ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90

Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML----- 261
             DN  D   +   ++  D   +++  +  +    SV  F +  G   E ISDML     
Sbjct: 91  YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148

Query: 262 ----------PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
                      +LR   V D   M          C +   +E  V   L    +  +A Y
Sbjct: 149 VFSMRKAELAHWLRSDKVFDLKHMD---------CALETVVEENVRKFLLTRLQLLIANY 199

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           P TL ED  +L +  L   K++  QL   EKK+L   L+
Sbjct: 200 PTTLKEDLELL-ETTLPQVKKMTVQLRVTEKKILLGALE 237


>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 426

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 83/365 (22%), Positives = 154/365 (42%), Gaps = 42/365 (11%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LAL+LM+E+ +   SF+ P+I  L           + P+ WSE+EL  L G+       
Sbjct: 83  VLALHLMHERSKRSDSFFAPFIASL-------PTTFDLPVFWSESELNELKGTNVLLLTQ 135

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQK 139
              + ++R++  +        ++ + +P     +PT   T E +  A   + S       
Sbjct: 136 LMKQQLQRDFENIHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAF---G 184

Query: 140 VSLARRFALVPLGPPLLAYSSKCKAML-----AAVDDAVQLV---VDRPYKAGESIVVWC 191
           V+  +++  V L P +  ++      +      + D+  Q++   V +   AG ++ +  
Sbjct: 185 VTREKKYLRV-LCPAMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISY 243

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
           G   N+KLL +YGFV ++N    +     +   DP  + K+ V   N     Q +     
Sbjct: 244 GQYSNAKLLFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGT 303

Query: 252 R-EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
             E +    +L  LR+  +++  E++ +         +S   E AV + L +  + +LA 
Sbjct: 304 LFENDVDERLLATLRVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLAN 361

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQL-VRMEKK--------MLNACLQVTADMIMLLPD 361
           +  TL EDEA+L +       R++  + VR+E K         L    QV A  + + P 
Sbjct: 362 FATTLEEDEAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPP 421

Query: 362 VTVSP 366
            T  P
Sbjct: 422 STTRP 426


>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 673

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 96/229 (41%), Gaps = 7/229 (3%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 77  DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
                +S  +   L+P+   +  +    K    A  D V  VV     AG+ I    GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPR 250

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            N +L++NYGF    NP D  +V        P Y  K    Q    L+V
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299


>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
          Length = 442

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/187 (24%), Positives = 76/187 (40%), Gaps = 28/187 (14%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQYP 110
           +PL WS+ E   L GS                YN LD  W M         A  L Q   
Sbjct: 163 TPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQHNV 207

Query: 111 Y-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKAML 166
           + D+P   ++ + FK A+  + +    +   S     R+  + P+         K     
Sbjct: 208 FPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSYTF 267

Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
            A     +L   + +  GE I +    + N++ L+ YGFV E NP+D + + A++  + P
Sbjct: 268 NAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGIAASIGNDQP 327

Query: 227 QYQDKRM 233
            Y+DK +
Sbjct: 328 FYRDKSL 334


>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
          Length = 486

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 82/356 (23%), Positives = 140/356 (39%), Gaps = 35/356 (9%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L ++L+ + ++ + SF+  Y   L             P+ WS+ EL +L GS    +I E
Sbjct: 121 LMMFLLTDMERAETSFFRNYYSTLP------STLSNMPIFWSDEELGWLKGSYIIQQIQE 174

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           R   I+++Y+ +  V             D     F+ + F  A + V S    L  +   
Sbjct: 175 RKAAIRKDYDVICRV-------------DPAFARFSLDRFSWARMIVCSRNFGL-TIDGV 220

Query: 144 RRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
           +  ALVP    L  Y  +  +       DA  +        G  +    G + N + L+N
Sbjct: 221 KTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHRFLLN 280

Query: 203 YGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKE 255
           YGF  EDN        + ++++  L+  D Q + DKR     +G  ++    +       
Sbjct: 281 YGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDA 339

Query: 256 AISDMLPYLRLGYVSDTSEMQS--VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
              +   + RL  V+   E  S  + S      P+S   E   L  L D    +L+ Y  
Sbjct: 340 NTREGFSFARL-IVATEDEFSSMKMKSPAHSSPPISFDNEIRALQYLRDLMTHQLSLYDT 398

Query: 314 TLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 367
           T+ ED  +L    Y L   +  A   +R EK++     Q  AD ++ L  + ++ C
Sbjct: 399 TIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCR-YFQELADKVIQLFSLPLAEC 453


>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 269

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
           Y+ GE I ++ G + N+  L++ GFV  DN    + +  +LN  + Q++ ++ + ++ G 
Sbjct: 74  YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133

Query: 241 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 291
            S   F+V  G     IS ++L + R        LG+      +QS +  L P C   P 
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188

Query: 292 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 341
           +E ++ +++  Y   RL         TL +DEA+L +        L   K +  Q   +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248

Query: 342 KKMLNACLQ 350
           KK+L+  L+
Sbjct: 249 KKILSEALE 257


>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 456

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/231 (24%), Positives = 108/231 (46%), Gaps = 35/231 (15%)

Query: 8   YAAELLTTNKLSEL-----------ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 56
           Y+A++ + N L EL           A LA+ L+ ++K G++S W PYI  L  Q G    
Sbjct: 66  YSAQIASDNLLPELSDLLGDEVGSVAKLAIVLLVDQKVGQESKWAPYISRLP-QLGE--- 121

Query: 57  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
            + S + WS++EL  +  S    E +++   I++++  +  V        + +P    + 
Sbjct: 122 -MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV-------LEHFPQ--ISR 171

Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQ 174
           + TF+ F  A+  V+S     +     +  +L+P     L +    +A++   +D    +
Sbjct: 172 SITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNHDGFSEAVVLNDEDKQVSE 225

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
           +  DR Y   E +++  G   N+ LL+++GF    N +++  VE  +N  D
Sbjct: 226 VAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQ--VEIQINIPD 274


>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 499

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 91/205 (44%), Gaps = 22/205 (10%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L  N++  +A LA+ ++ E K G  S W PYI  L +        + + + W E+EL  +
Sbjct: 111 LLGNEIGNVAKLAVVVLLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMI 165

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E L +   IKRE+  +           + +P  I  +  + + F  A+  V S
Sbjct: 166 RKSSLYEESLNQRSQIKREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS 216

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVW 190
                +        +L+P     L +    +AML   DD    ++V DR +  GE +++ 
Sbjct: 217 -----RAWRSTEGVSLIPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIR 270

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
            G   N+ L++++GF    N +D++
Sbjct: 271 YGKYSNATLMLDFGFALPYNIHDQV 295


>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 31/320 (9%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
           P+ WSE EL++L GS    +I ER   I+++Y+ +  V             D     F+ 
Sbjct: 144 PIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPSFARFSL 190

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 179
           + F  A + V S    L  +   +  ALVP    L  Y  +  +       DA  +    
Sbjct: 191 DRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLG 249

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKR 232
               G  +    G + N + L+NYGF  EDN        + ++++  L+  D Q + DKR
Sbjct: 250 TIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSPADGQLFYDKR 309

Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---PVS 289
                +G  ++    +          +   + RL  +  T E  S +    P     P+S
Sbjct: 310 AYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEEEFSTMKMKSPAHSSPPIS 366

Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNA 347
              E   L+ L +    +L+ Y  T+ ED  +L    Y L   +  A   +R EK++   
Sbjct: 367 FDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCRY 426

Query: 348 CLQVTADMIMLLPDVTVSPC 367
             Q  AD ++ L  + ++ C
Sbjct: 427 -FQELADKVIPLFSLPLAEC 445


>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
          Length = 543

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 89/211 (42%), Gaps = 31/211 (14%)

Query: 21  LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKA 79
           LA  A  L  E     K  +  Y++ L R  G       S L W E+++   L GSP+  
Sbjct: 169 LAETAQNLSTEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLF 221

Query: 80  EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
             LER   +     E+   +        Q+ +DI        +F        S ++ L+ 
Sbjct: 222 SALERRASVAAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE- 264

Query: 140 VSLARRFALVPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
            SL    ALVP    +L +   C+A   L      V L  DR Y+ GE +    G +P+S
Sbjct: 265 -SLGGNLALVPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSS 322

Query: 198 KLLINYGFVDE--DNPYDRLVVEAALNTEDP 226
           +LLI+YGF     DNP D   +   ++ EDP
Sbjct: 323 ELLISYGFAPAVGDNPDDEYALNLQIDEEDP 353


>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
 gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
          Length = 306

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 77/178 (43%), Gaps = 19/178 (10%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
            +++  N + +   +A YL+ E      S W+ YI  L RQ         S L W+  EL
Sbjct: 135 GDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PYSLLYWTRAEL 187

Query: 70  -AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
            AYL  SP +   ++R   +   YN+L         +F ++P   P E +  E F  +F 
Sbjct: 188 DAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVYNIETFLWSFG 241

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAG 184
            + S +V L   S+  R ALVP    +L +S + +  L        +    DR Y+ G
Sbjct: 242 ILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296


>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
          Length = 1078

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 62/321 (19%)

Query: 58  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
           + S L +S+ E   L GSP   +++ +    K  +  L        SLF++  +     A
Sbjct: 130 LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGRL-------KSLFKEQQFS--KCA 180

Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVD---- 170
            T++  K A+  V S  ++ +  +L      F  V L P  L Y +  +   AA D    
Sbjct: 181 MTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAP-FLDYFNHAEDAQAAYDFDYD 239

Query: 171 -DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLV---VEAALNT-- 223
             A+++V  +P K GE I +  G Q  NS LLI+YGF+D+ +     V   VE  LNT  
Sbjct: 240 ESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIP 299

Query: 224 -EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 280
             DPQ  +K  +  +  + +  +++F      E   IS  L Y                 
Sbjct: 300 ASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKISKYLSYKNF-------------- 345

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQL 337
                         ++L  L      ++  YP T+ ED A++   T++    ++   + +
Sbjct: 346 --------------SLLPYLKSLIDMKMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSII 391

Query: 338 VRMEK----KMLNACLQVTAD 354
           +R+++    K + A +QV  D
Sbjct: 392 MRLQEKETLKEIGALIQVKID 412


>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
 gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
          Length = 544

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 93/355 (26%), Positives = 143/355 (40%), Gaps = 43/355 (12%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPT 77
            EL  LAL+L  E+ +G  S W PY+  L          V+ PL W E+E+ + L GSPT
Sbjct: 133 GELVGLALWLCCERAKGSLSDWAPYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPT 187

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
             + + RA   + EY  +          +    Y+  TE      F  A   V +  V L
Sbjct: 188 CEQAVGRAVDAREEYASIRAAIAADADAYPADAYEFLTEL----AFTDALATVLARAVWL 243

Query: 138 QKVSLARRFALVPL--------------GPPLLAYSSKCKAMLAAV-----DDAVQLV-V 177
              ++   +ALVPL               P   A  +  + + AAV     D A + V V
Sbjct: 244 NAANV---YALVPLVDLLPVVGAPPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAV 300

Query: 178 DRPYKAGESIVVWCGP---QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
                A ++  V C     +    L ++ G V+  +  D L    +    D  Y  K+ +
Sbjct: 301 VSANDARQTAPVVCADALGRNAGDLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQI 360

Query: 235 AQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 293
            +  G     Q F V A R       +  Y+R   V + SE+ +V      I  VSP  E
Sbjct: 361 LEGMGYSADAQAFPVFADRMP---LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNE 415

Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 347
             +L  L    +  LA Y  +  E E + L + N+  ++  A +L   EK+++NA
Sbjct: 416 YEILQLLMGDAREMLAEYENSSEEFELLQLKETNISERQMTAAKLRLGEKRLINA 470


>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
 gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
          Length = 459

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 93/219 (42%), Gaps = 22/219 (10%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 84
           A++LMY  K   KSFW PY+  L ++         + + +SE EL  L  S  K   + R
Sbjct: 116 AIFLMY-CKLNDKSFWYPYVSVLPKE-------FTTSIYFSEEELDELQSSKLKEFTIIR 167

Query: 85  AEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVAVQSCVVHLQKVS 141
            +GI+R YN   T     G + +  P    T   + +T E+F  A   V S    L    
Sbjct: 168 KDGIERHYNSTFTRLSNRG-IAEFSPTSTQTLQQKGYTLELFTWALSCVWSRAFSLSDSD 226

Query: 142 LARRFALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNS 197
                 +VPL     A   SK K      D  +        + GE I    G   P  +S
Sbjct: 227 ----GGMVPLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSS 282

Query: 198 KLLINYGFV-DEDNPYDRLVVEAAL-NTEDPQYQDKRMV 234
           ++L++YGFV D   P D + +   + + ++P  Q K+ +
Sbjct: 283 QMLMDYGFVFDHGTPSDNVAISVPIFHPDEPNIQVKQSI 321


>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
          Length = 539

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + ++ GE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 280
            K  V  R G  +  VF +H   +    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
                 PVS   E  +   L D     L  Y  T+ ED+++L   +L  +  +A +L   
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412

Query: 341 EKKMLNACLQVTA 353
           EK++L   ++  A
Sbjct: 413 EKEILERAVRSAA 425


>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
          Length = 495

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/336 (22%), Positives = 136/336 (40%), Gaps = 25/336 (7%)

Query: 37  KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 95
           KSFW PY+  L         + ++ + ++  E+  L GSP   + L     I R+Y+   
Sbjct: 167 KSFWKPYLDALPS-------SYDTVMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFY 219

Query: 96  ----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 151
                 V     +L   + Y+    A +  + +Q  +  Q  +    K  L    AL+PL
Sbjct: 220 SLLQKNVDPALSNLRANFTYNDYRWAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPL 279

Query: 152 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
                    +                 R +  GE + ++ G +  ++  I+ GFVD +N 
Sbjct: 280 WDFCNHQDGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNA 339

Query: 212 YDRLVVEAALNTEDPQYQDK-----RMVAQRNGKLSVQV-FHVHAGREKEAISDMLPYLR 265
           +D L ++  L+  DP    +     ++    + K+S  + F + AG  +     +L +LR
Sbjct: 340 HDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLR 398

Query: 266 LGYVSDTSEMQSVISSLGPICPVSPC-MERAVLDQLADYFKAR----LAGYPATLSEDEA 320
           L  ++  S  + + S          C +E  V D+   + KAR    L  YP T   D  
Sbjct: 399 LFCMTKDSLDRWLQSDNASNLMHEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLK 458

Query: 321 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           ML + +L   +R+   L   EK++L + ++  A  I
Sbjct: 459 MLEE-DLSSHRRMCVLLRLAEKRILLSAIECAAQRI 493


>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
          Length = 509

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/226 (21%), Positives = 86/226 (38%), Gaps = 36/226 (15%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L  + + + + LA  L+ E+  G +S W PYI+ L  +       + + +LW   EL  +
Sbjct: 94  LLDHAVGDTSKLAALLIMEQHLGNESGWAPYIKSLPTKD-----QMHNMVLWDLNELHAV 148

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E +E  E  K+E+  L         LF +        A   +      V    
Sbjct: 149 QNSSIYDEAIEHKEQAKKEFLALKPALDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSV 208

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
            +   QK                               D  +++ DR Y  GE +++  G
Sbjct: 209 LIYDEQK-------------------------------DVCEIIADRNYAVGEQVMIRYG 237

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
              N+ L +N+GF    N YD+ ++   +  +DP Y+ K  + Q++
Sbjct: 238 KYSNATLALNFGFTLARNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283


>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
 gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
          Length = 464

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 153/367 (41%), Gaps = 46/367 (12%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L L+YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 141
           R   + +   EL  +  +   +       + + A  + +   A V+ ++  +H    K+ 
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213

Query: 142 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWCGPQPNSK 198
            A    L+ +       ++  +  L+  A D + +++V  R  + G +I +  GP  N  
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDL 273

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH--- 247
           LL++YGFV  DNP+DR+     L  +    ++ RM+A   R G        S QV     
Sbjct: 274 LLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLKQ 329

Query: 248 -----------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPCM 292
                      V  G  +E    +L  LR+ +       E + ++S  + G    VS   
Sbjct: 330 LGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVSSDN 389

Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 352
           E  VL  L          +  T+ EDEA L+D +L    R+A Q    +K+++   L+  
Sbjct: 390 EERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLESL 449

Query: 353 ADMIMLL 359
              +M L
Sbjct: 450 KKRLMDL 456


>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
          Length = 436

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 63/286 (22%), Positives = 122/286 (42%), Gaps = 35/286 (12%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
           E+  LA +L  EK +G +S + P+I  L             PLLW++ E+  L G+    
Sbjct: 104 EVVALATFLAREKMKGSESSYKPFIDVL-------PWDSLHPLLWTDEEVDLLEGTYAHR 156

Query: 80  EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQS--- 132
           EIL   E ++      + V    G  ++Q+   I TE  T E F    + AF +V S   
Sbjct: 157 EILAFREQVEVATELFEPVLNPKG--WKQFFQTIETEKMTPEEFGFMMRGAFASVLSRAF 214

Query: 133 -CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY---------- 181
              +      L  R  +    P L  ++          D A++   ++ +          
Sbjct: 215 DSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSITFDTALERDNEKGFPVRVADKGKS 270

Query: 182 -KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
            + GE +  + G +PN  +L  YGFV  +       +  +++ +DP +  K  + +  G 
Sbjct: 271 IEEGEELFGFYGDKPNWNMLTTYGFVSPNPKCQETTLSVSIDEKDPYFAQKEEILKARGM 330

Query: 241 LSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 285
           ++V Q+F +    + + +  ++ Y R+  +S+ +++  V ++ G +
Sbjct: 331 VAVEQLFDIR--HDTDPMGPLINYFRIREISNEADLTKVQTNYGEM 374


>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
          Length = 271

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 7/133 (5%)

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 178
           E FK +F  + S +V L   S+  + ALVP    +L +S   +  L        +    D
Sbjct: 2   ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQ 236
           RPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K  + +
Sbjct: 59  RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLK 118

Query: 237 RNGKLSVQVFHVH 249
           + G    Q F + 
Sbjct: 119 KYGLSGSQCFPIR 131


>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
 gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
          Length = 540

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 86/209 (41%), Gaps = 22/209 (10%)

Query: 15  TNKLSELA---CLALYLMYEKKQ-GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           T KL EL+    L + +MYE K    +S W  Y   L        L     + W++ EL+
Sbjct: 126 TGKLMELSQWDSLIICMMYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELS 180

Query: 71  YLT--------GSPTKAEILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAF 118
           +LT        G      +  R      E+NE D +    GS+    F   P  I   +F
Sbjct: 181 FLTPSLVVNRVGKGDAETMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSF 239

Query: 119 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 178
             EI               ++       +++PL   L A + KC A L    D+++++  
Sbjct: 240 DVEIKNDDDENEGDEEFDEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAI 299

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVD 207
           +P K GE +    G  PNS+LL  YG+V+
Sbjct: 300 KPIKKGEQVYNTYGELPNSELLRKYGYVE 328


>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
 gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
          Length = 481

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 81/199 (40%), Gaps = 28/199 (14%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ E  +G+ SFW PYI  L +       A+  P  W E ++AYL G+     I E  
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164

Query: 86  EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
             +K E+ +    L    F A   + Q  Y      FT   F+ + V        L + +
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLV--------LSEPA 216

Query: 142 LARRFALVPLG---------PPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESI 187
             +   L+P G          PL   ++       A D A      QLV    Y+ GE +
Sbjct: 217 KQQMAELLPPGCQLDDFSILQPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQV 276

Query: 188 VVWCGPQPNSKLLINYGFV 206
               G + NS+LL+ YGF+
Sbjct: 277 YNNYGLKTNSELLLAYGFI 295


>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
 gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
          Length = 455

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 82/346 (23%), Positives = 142/346 (41%), Gaps = 36/346 (10%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTK 78
           LALYL++ K            R L  +  R  +A       + + +++ EL    GS   
Sbjct: 77  LALYLLFVKS-----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLY 125

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           A   +  + +  +Y +L        +L  Q+    P + FT E +K A  ++ S  +   
Sbjct: 126 ALTPQLEQRVHDDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFA 179

Query: 139 KVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
               A    + PL   +L +S    +C A      D + ++  + Y+ G+ I ++ G  P
Sbjct: 180 VSETASVRLVAPLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVP 237

Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 255
           N++LL  YGFV  DNP D   +    +   P Y+ K  +    G  S     +     K+
Sbjct: 238 NNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KD 294

Query: 256 AI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYP 312
            + +++L YLR+  + D S +  +   L  G    V+   E  VL  L D   + L G+ 
Sbjct: 295 PLPNNVLRYLRIQRL-DESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFG 353

Query: 313 ATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
             L + EA L   DY        A  +   E+++L    +   D++
Sbjct: 354 IPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVLTRAKKTAEDLL 399


>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
 gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
          Length = 490

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 82/354 (23%), Positives = 148/354 (41%), Gaps = 45/354 (12%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ EK + + S W PY+  L         +  S + WSE ELA L G+   +  L
Sbjct: 123 SVALFLIREKLK-EDSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTL 174

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
                ++RE+ +++    +     Q +P  +     T + F  AF  ++S      +   
Sbjct: 175 GVKSYLRREFLKVEEEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR--- 224

Query: 143 ARRFALVPLG----------------PPLLAYSSKC----KAMLAAVDDAVQLVVDRPYK 182
            +   L+PL                  P +           A L + D    L      K
Sbjct: 225 GQNLVLIPLADLCNFLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSRDLIFSLRSPISLK 284

Query: 183 AGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 241
           AGE +++       N++L ++YGF++  +  +   +   ++  DP + DK  +A+ NG  
Sbjct: 285 AGEQVLIQYNLNLSNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLG 344

Query: 242 SVQVFH-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVL 297
            +  F  V        +   L  + LG  +D+  ++S+  +   G +  PVS   E  + 
Sbjct: 345 EIADFDIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELIC 403

Query: 298 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 351
             + D  K+ L+GY  T+ EDE  L    L+P+  +A  +   EKK+L    ++
Sbjct: 404 RVVRDACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456


>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 463

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 90/211 (42%), Gaps = 18/211 (8%)

Query: 23  CLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
            LA++++ E  ++ KKSFW  Y+  L        +   +P+ +   +L  L GSP     
Sbjct: 163 ALAIFILVESLRKDKKSFWHSYLTTL-------PVTYSTPVYFDVADLEALKGSPAFEAA 215

Query: 82  LERAEGIKREYNELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQA---FVAVQSCVVHL 137
           L+    I R+Y     ++ ++       P   I  + FT+E ++ A    ++ Q+ V   
Sbjct: 216 LKLNRNIARQYAYFKKLFQLSND-----PASVILKDTFTYEYYRWAVSTLMSRQNTVPSS 270

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
              S     AL+PL       S +         +      D  Y A E + ++ G + N+
Sbjct: 271 DNPS-ENVSALIPLWDMFNHRSGRLSTDFVKSSNVCVCYADGDYAADEQVYIFYGVRTNA 329

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
             L++ GFV  DN +D + +   ++  DP Y
Sbjct: 330 DFLVHNGFVYPDNEHDAVKIRLGVSRSDPLY 360


>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 455

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 74/305 (24%), Positives = 128/305 (41%), Gaps = 20/305 (6%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           + + ++E EL    GS   A   +  + ++ +Y +L         L  Q+    P + FT
Sbjct: 107 ASIFFTEDELQVCAGSSLYALTRQLEQRVRDDYRQLLV------PLLSQHRDLFPLDQFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLV 176
            E +K A  ++ S  +    VS      LV     +L +S    +C A      D + ++
Sbjct: 161 IEDYKWALCSIWSRAMDF-AVSGTTSVRLVAPLADMLNHSPDVKQCHAYDPTSGD-LSIL 218

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
             + Y+ G+ + ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K  +  
Sbjct: 219 AAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWA 278

Query: 237 RNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 293
             G  S     +     K+ + +++L YLR+  + D S +  +   L  G    VS   E
Sbjct: 279 LAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGKVSDGNE 334

Query: 294 RAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQV 351
             VL  L D   + L G+   L + EA L   DY        A  +   E+++L    + 
Sbjct: 335 MQVLQFLVDSIGSLLEGFGIPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVLTRAKRT 394

Query: 352 TADMI 356
             D++
Sbjct: 395 AEDLL 399


>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
 gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
          Length = 429

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/206 (23%), Positives = 90/206 (43%), Gaps = 16/206 (7%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           KL+ +  L L+++ +K  G+ S W P++  L R          +P+         L    
Sbjct: 100 KLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTTPVFLRRKVFESLP-KD 151

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
            + E+      I+R + +L  +  + G + ++      +  FT+  F  A+ AV +  + 
Sbjct: 152 LREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTWNNFVWAWTAVNTRCIF 209

Query: 137 LQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
            Q  + +  +     AL P     L +  K     A V +  +++  + + A E + +  
Sbjct: 210 AQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISY 268

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
           GP  N +L ++YGFV  DNP D +VV
Sbjct: 269 GPHSNRRLFLDYGFVLPDNPNDVVVV 294


>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
 gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
          Length = 496

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 83/203 (40%), Gaps = 32/203 (15%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L L+YE  QG+ S W PY+  L           ++P+ WS TEL+ L  S   A++  
Sbjct: 109 LILVLIYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-G 160

Query: 84  RAEG----------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIF 123
           RAE           + R + E   V+F  G           L  +    I   AF  E  
Sbjct: 161 RAEADRMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLEND 217

Query: 124 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
                        ++         +VP+   +L   ++  A +   DDA+     RP +A
Sbjct: 218 DSDNDEADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRA 276

Query: 184 GESIVVWCGPQPNSKLLINYGFV 206
           G+ I+ + GP PN +LL  YG+V
Sbjct: 277 GDEILNYYGPLPNGELLRRYGYV 299


>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
 gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
          Length = 447

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 85/207 (41%), Gaps = 20/207 (9%)

Query: 7   YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
           Y A  +    K S    L L+LMYEK   K SFW PYIR L            +P  ++ 
Sbjct: 113 YVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD-------TFNTPCYFTR 165

Query: 67  TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
            EL +L     + +  E+   IK+ Y      +     + Q +  +       FE FK A
Sbjct: 166 KEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF-WRTVDFESFKWA 220

Query: 127 FVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDR 179
           +  V +  V H +    A+      AL PL   LL +  K +      +     ++ V  
Sbjct: 221 WCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRFNSSSKNYEINVIT 279

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV 206
            Y+ G  + +  GP  N++L + YGFV
Sbjct: 280 EYQKGTQVFINYGPHDNTRLFLEYGFV 306


>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
          Length = 625

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 29  DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 84  SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I    GP+
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 202

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
            N +L++NYGF    NP D  +V        P Y  K
Sbjct: 203 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239


>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 340

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)

Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 214
           LL +SS+ +++       +++V ++  + GE++V+  GP  N  LL++YGFV   NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177

Query: 215 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 257
                         LV +  +++ +DP      ++ + N  G  S Q+  V  G  +   
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235

Query: 258 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
             +L  +R+ +  D  E+  V    + +     P+    ER  +  L       LA +P 
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295

Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
            + ED++ L   ++    R+A Q   ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327


>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 699

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I    GP+
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 276

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
            N +L++NYGF    NP D  +V        P Y  K
Sbjct: 277 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313


>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
           206040]
          Length = 481

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 14/193 (7%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
           L L+ E  +G++SFW PYI+ L +       A+  P  W E E   L G+  +  + +  
Sbjct: 90  LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147

Query: 86  EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           + +KRE+ E   +   +          L  +  Y+     F+   F+ + V  ++    L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207

Query: 138 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
            + VS+     L+PL   G   +A   + +   A    A QL V R ++ G+ I     P
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSP 267

Query: 194 QPNSKLLINYGFV 206
           + N++LL+ YGF+
Sbjct: 268 KTNAELLLGYGFM 280


>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
           1015]
          Length = 668

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 77  DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I    GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 250

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
            N +L++NYGF    NP D  +V        P Y  K
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287


>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
 gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
          Length = 692

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 91/200 (45%), Gaps = 23/200 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA ++++E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVS 141
               +K EY+       +A S+ +   Y +  E++T++IF  A+  +A ++    +    
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIASRAFTSRVLDAY 214

Query: 142 LARRFAL-----VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
           L+   +L       +  PL+ +S+     K    A    ++L V  P   GE +    GP
Sbjct: 215 LSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEVHNNYGP 274

Query: 194 QPNSKLLINYGFVDEDNPYD 213
             N +L+  YGF   DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294


>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
          Length = 440

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 61/237 (25%), Positives = 99/237 (41%), Gaps = 25/237 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +           P+ W E E+  L   P +
Sbjct: 105 SPLLALCTFLVLEKHAGDQSSWKPYLETLPK-------TYTCPVCW-EPEVVNLLPRPLR 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           A+  E+   ++  +      +     LF +   +I    FT+     A+  V +  V+L+
Sbjct: 157 AKAQEQRTRVQEFFTSFRDFFSSLQPLFSEAVENI----FTYSALLWAWCTVNTRAVYLR 212

Query: 139 KVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
              L R F+  P    L P   LL +S   + KA         ++V     +  E + + 
Sbjct: 213 HRQL-RCFSAEPDTCALAPYLDLLNHSPDVQVKAAFNEKTRCYEIVAVSSCRKHEEVFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
            GP  N +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+   H
Sbjct: 272 YGPHDNHRLLLEYGFVSTRNPHACVYVSRDILVKYLPSTDKQM----NKKISILKDH 324


>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 476

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/216 (25%), Positives = 85/216 (39%), Gaps = 44/216 (20%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L L ++YE   G  S W PY   L  +         + + W+E ELA L  S    ++ 
Sbjct: 85  SLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKVG 137

Query: 83  ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
                                      ERA+   +E   L+ +  M   GSL   Y +D+
Sbjct: 138 KESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKMGSLIMAYAFDV 197

Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
                T E+ ++   A +     L K        +VPL   L A + +C A L    D +
Sbjct: 198 EPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCL 249

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
           ++   +P +AGE I    GP P S LL  YG+V ++
Sbjct: 250 EMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285


>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
          Length = 440

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 71/298 (23%), Positives = 119/298 (39%), Gaps = 39/298 (13%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G+KS W PY+  L +           P+   E ++  L   P 
Sbjct: 104 VSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVSLLPEPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           + +  E+   +   Y      +     LF +    I    F +   + A+  + +  +++
Sbjct: 156 RKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTINTRTIYM 211

Query: 138 QKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
            K S  + F+L P    L P   LL +S   + KA       + ++  +   K  E + +
Sbjct: 212 -KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEVFI 270

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
             GP  N +LL+ YGFV  DNP+  + V +A   +     DK    QRN K+S+   H  
Sbjct: 271 CYGPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSILKDH-- 324

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
                    D+L  L  G+   +  + + +  L        C  R +   L D   AR
Sbjct: 325 ---------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDVISAR 370


>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 739

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 29/230 (12%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEIL 82
             +LM +  + ++ FW PYI+ L      G   + +PLL+ E   +LA+L  +   A   
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 132
            R +  K  Y +       A S+ Q         Y +D+   A T  I  +AF A  + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
            +  LQ      R ++  L P + A + K   K    A  D++ LVV    +AG+ +   
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNN 302

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
            GP+ N +L++NYGF   DNP +  VV      + P  Q K    Q   K
Sbjct: 303 YGPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352


>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
          Length = 1234

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 89/210 (42%), Gaps = 17/210 (8%)

Query: 18   LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
            L++   L L+++YEK +   SFW P+   L            + + ++ TEL  L G+  
Sbjct: 809  LNDDTILFLFVIYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNL 860

Query: 78   KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
             AE L+    +K     +  + F    L +QYP   P   F++E F  A     S  + L
Sbjct: 861  FAETLQ----VKEHLQSIRDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL 914

Query: 138  QKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
             K+       LVP+   +  + +    +      D   ++V          I +  G   
Sbjct: 915  -KIDDKITNCLVPMADMINHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQ 973

Query: 196  NSKLLINYGFVDEDNPYDRLVVEAALNTED 225
            N +L + YGFV +DNPYD +++   L  ED
Sbjct: 974  NRELALYYGFVIQDNPYDSMLIGFDLPDED 1003


>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
 gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
          Length = 439

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 79/355 (22%), Positives = 140/355 (39%), Gaps = 42/355 (11%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G++S W PY+  L +           P+   E ++  L   P 
Sbjct: 103 VSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCPVC-LEQDVVQLLPEPL 154

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           + +  E+   +   Y      +    SLF +    I    F +   + A+  + +  +++
Sbjct: 155 RKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYSALEWAWCTINTRTIYM 210

Query: 138 QKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
            K S    F+L P    L P   LL +S   + KA         ++  +   K  E + +
Sbjct: 211 -KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQCKKYEEVFI 269

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
             GP  N +LL+ YGFV  DNP+  + V +    +     DK    Q+N KLS+   H  
Sbjct: 270 CYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDK----QKNAKLSILKEH-- 323

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
                    D+L  L  G+   +  + + +  L        C  RA+   L D   AR  
Sbjct: 324 ---------DLLENLTFGWDGPSWRLLTALKVLSLGGDEFTCWRRAL---LGDVISARNE 371

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 364
                ++     +  + +   + V  Q+ +M++   N   Q+T    + L D+ +
Sbjct: 372 QQALNIT---TKICHFLIEETQHVLLQISQMKRDKENLKEQLTLVEALRLEDLKI 423


>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 677

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 27/203 (13%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ +  QG+  FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 97  FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151

Query: 86  EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 134
           E +K  Y   + V  +  + FQ   +Y +D        I + AF+ ++  +AF  +    
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +    VS+     L+P    L+ +    K    A    V  +V     AG+ I    GP+
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPR 260

Query: 195 PNSKLLINYGFVDEDNPYDRLVV 217
            N +L++NYGF   DNP D  +V
Sbjct: 261 NNEQLMMNYGFCLPDNPCDYRIV 283


>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
          Length = 478

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 85/369 (23%), Positives = 143/369 (38%), Gaps = 51/369 (13%)

Query: 15  TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           +  L     L L ++YE  QG+ S W  Y+  L +       A E+P+ W+  EL  L G
Sbjct: 100 SEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEG 152

Query: 75  SPTKAEILERAEGIK--REY--------------------NELD--TVWFMAGSLFQQYP 110
           +    E + + E  +  RE                     NE D  ++    GS    Y 
Sbjct: 153 TSLTTEKIGKKESDRMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212

Query: 111 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 170
           +D+  E    E  +  ++  +         SL     +VP+   +L  +++  A +   D
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGD 262

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTE 224
                 +     AG  I+ + GP P+S+LL  YG+V  E + YD       +V  AL  E
Sbjct: 263 QLQVTSLRESIPAGSEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEE 322

Query: 225 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS----DMLPYLRLGYVSDTSEMQSVIS 280
               +D     +R  +  ++ F V      E  S       P LR        + ++ + 
Sbjct: 323 LKLSEDTIADIERKLESELEEFFVIERDAGEPSSYGTLTQPPVLREISTELEEQTKAFLK 382

Query: 281 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
           +L    P     E      L    + RL  YP +  +DE++L+   L  + R+A ++   
Sbjct: 383 ALKKRDPKRKRSETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRLG 442

Query: 341 EKKMLNACL 349
           EK++L   L
Sbjct: 443 EKRLLQEAL 451


>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
 gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
          Length = 588

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 67/251 (26%), Positives = 105/251 (41%), Gaps = 32/251 (12%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAY 71
           L E    AL+L+ E+  G+ S W   I  L    G G+ +       +P+ W  E   A 
Sbjct: 231 LGEDTIAALWLIAERALGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDAL 289

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAV 130
           L G+P  A+ +  +E + R++  L    F A     ++  D+ P  A+T + F++A  A 
Sbjct: 290 LGGTPLLADAIAASEKLARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAW 342

Query: 131 QSCVVHLQKVSLARRFALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 181
            S  + +Q          +P         L P ++ YS          D  ++L V R  
Sbjct: 343 NSYGMTVQASPGEPAATCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSV 395

Query: 182 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGK 240
            AGE + V  G + N++LL+ YGF    NPYD + +   L   E       R  A     
Sbjct: 396 HAGEEVFVSYGAKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAG 455

Query: 241 LSVQVFHVHAG 251
           L++    V AG
Sbjct: 456 LTLSPHAVRAG 466


>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
           magnipapillata]
          Length = 819

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 133/333 (39%), Gaps = 49/333 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEIL 82
           L + LM+E  Q K SFW  Y+  +      G      PL W E E      G P   +I+
Sbjct: 186 LLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDII 239

Query: 83  ERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVV 135
              E I+ EY E   ++      + GSL          E ++ E FK+  AFV   S   
Sbjct: 240 VDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFTE 289

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
             +  S+      VP+   +L + S   A L      +Q++  R  K GE +    G   
Sbjct: 290 DEESPSM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKLG 342

Query: 196 NSKLLINYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLSV 243
           N++LL  YG+V+   N YD L++           +     +DP    K  +  R G   V
Sbjct: 343 NTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAEV 402

Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQL 300
             F +          D++ +L++ + SD  E++ ++ +     P S     + +  L + 
Sbjct: 403 DAFFMFDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSKK 461

Query: 301 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 333
            +  K  L      ++ED+  +   N + +K V
Sbjct: 462 TE--KNSLGMTVIDITEDDTEMDIENFNKRKNV 492


>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
          Length = 915

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 88/198 (44%), Gaps = 22/198 (11%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           + L + +M+E  +G +S W PY+  L         + E+P+ WS  EL  L  S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595

Query: 82  LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 128
             +A+  +  + ++  V      +F   Q Y  D   +          ++ F+   +   
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
             +     ++         +VP+   +L   ++  A +   DDA+ +   R  KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713

Query: 189 VWCGPQPNSKLLINYGFV 206
            + GP PNS+LL  YG+V
Sbjct: 714 NYYGPHPNSELLRRYGYV 731


>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
           Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
           Precursor
 gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
           N-methyltransferase [Pisum sativum]
          Length = 489

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)

Query: 40  WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 88
           WL  I  L R+R R         G L    +S + WSE EL  L GS      +   E +
Sbjct: 125 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 184

Query: 89  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 146
           K E  +L+    +        P  +    + F I + +AF  +++  +V +    L    
Sbjct: 185 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 244

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 205
           A V       AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 245 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 302

Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 265
           ++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPYLR
Sbjct: 303 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 360

Query: 266 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 320
           L  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D  
Sbjct: 361 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 420

Query: 321 MLTDYNLHPKKRVATQLVRMEKKML 345
            L + NL  +  +A  +   EK +L
Sbjct: 421 -LKEGNLDSRLAIAVGIREGEKMVL 444


>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
          Length = 440

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 135/325 (41%), Gaps = 24/325 (7%)

Query: 40  WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 88
           WL  I  L R+R R         G L    +S + WSE EL  L GS      +   E +
Sbjct: 77  WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 136

Query: 89  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 146
           K E  +L+    +        P  +    + F I + +AF  +++  +V +    L    
Sbjct: 137 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 196

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 205
           A V       AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 197 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 254

Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 265
           ++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPYLR
Sbjct: 255 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNR--TLPPGLLPYLR 312

Query: 266 LGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 320
           L  +  T    ++S+      G +   VS   E  +   + +  K+ LAGY  T+ +D  
Sbjct: 313 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 372

Query: 321 MLTDYNLHPKKRVATQLVRMEKKML 345
            L + NL  +  +A  +   EK +L
Sbjct: 373 -LKEGNLDSRLAIAVGIREGEKMVL 396


>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
 gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
          Length = 465

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 128/316 (40%), Gaps = 34/316 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           LS     A+++MY K   +KSFW PY+  L         +  + L +S+ EL  L  S  
Sbjct: 114 LSPTNLQAVFIMYSK-LNEKSFWHPYVTVLPE-------SFSTSLYFSDNELDELQASQL 165

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA-FVAVQSCVVH 136
           K   + R +GI+R Y           S F +    +P E     ++ Q  F    SCV  
Sbjct: 166 KEFTIIRKDGIERHYE----------STFSRLSKLVP-EFSNLALYNQELFTWALSCVWS 214

Query: 137 LQKVSLARR-FALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG-- 192
            +  SLA     +VPL     A   SK K +    D  +          GE I    G  
Sbjct: 215 -RAFSLAENDGGMVPLADMFNAEDRSKSKVLPKVTDTTLDYYASDDIAEGEQIFTPYGVY 273

Query: 193 -PQPNSKLLINYGFV-DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
            P  +S++L++YGF+ DE    D + +     + ++P    K+ + + N  ++ +VF + 
Sbjct: 274 KPLSSSQMLMDYGFIFDEGTVSDNVAITVPVFHNDEPNLSTKQEILEENDIIN-EVFLLQ 332

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP---ICPVSPCMERAVLDQLADYFKA 306
                 A  D+L Y R+  +      Q+    L P     P++   E+  L  L +    
Sbjct: 333 KTDPLPA--DLLLYARVKNLIAKECDQAKKHFLSPNTRNTPLNTRNEKVSLRFLENLIHR 390

Query: 307 RLAGYPATLSEDEAML 322
            L  Y   L  D+ +L
Sbjct: 391 YLDSYGTNLESDKNLL 406


>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
          Length = 444

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)

Query: 40  WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 88
           WL  I  L R+R R         G L    +S + WSE EL  L GS      +   E +
Sbjct: 81  WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 140

Query: 89  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 146
           K E  +L+    +        P  +    + F I + +AF  +++  +V +    L    
Sbjct: 141 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 200

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 205
           A V       AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 201 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 258

Query: 206 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 265
           ++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPYLR
Sbjct: 259 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNR--TLPPGLLPYLR 316

Query: 266 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 320
           L  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D  
Sbjct: 317 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 376

Query: 321 MLTDYNLHPKKRVATQLVRMEKKML 345
            L + NL  +  +A  +   EK +L
Sbjct: 377 -LKEGNLDSRLAIAVGIREGEKMVL 400


>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
           heterostrophus C5]
          Length = 476

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 133/333 (39%), Gaps = 54/333 (16%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L L ++YE   G  S W PY   L           ++ + W+E EL  L  S    +I 
Sbjct: 85  SLILVMLYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIG 137

Query: 83  ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
                                      ERA+ + +E    + +  M   GSL   Y +D+
Sbjct: 138 KEGANEVFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDV 197

Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
              A + +   +   A +     L K        +VPL   L A + +C A L    D +
Sbjct: 198 EP-AISDKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGL 249

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQY 228
           ++   +P +AGE I    GP P S LL  YG++ E+   YD + + A L ++    D  +
Sbjct: 250 EMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLW 309

Query: 229 QDKRMVAQRNGKLSVQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 285
            +KR+      ++    + + A      +E++S  L  L    +  + E +  + S G +
Sbjct: 310 HEKRIEYLDEQEIVDTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL 368

Query: 286 CPVSPCMERAVLDQLADYFKARLAGYPATLSED 318
            P +  M     + L    +AR+A YP TL +D
Sbjct: 369 -PKAEKMTGKAAEILYKIVQARIAQYPTTLEQD 400


>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
          Length = 350

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)

Query: 181 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 235
           ++ GE +++  G  + N +L ++YGFV+ +    +  D   +   ++  DP + DK  +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195

Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 288
           + NG  +   F +  G  +     ML +LRL  +  T           SV   L    PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251

Query: 289 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
           S   E A+   + D  ++ L+GY  T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285


>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
          Length = 410

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 21/211 (9%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +L E   L+L+L+ EK + ++S W  +IR +         + ++   WSE  +A L   P
Sbjct: 19  QLCERQLLSLHLLVEKWKAERSRWWRFIRSI-------PPSYDTLENWSEQSVARLQYKP 71

Query: 77  TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
             A    R   +  E+++L  +        W    +  +     +   +F+ E +  A  
Sbjct: 72  FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131

Query: 129 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 182
            V +   H ++ S  + R   V    P+L + +   A +AA     DA+  ++   R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
            GE +++  G   N+ LL +YGFV EDNP D
Sbjct: 192 EGEQVMIHYGNWSNAGLLEHYGFVLEDNPLD 222


>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 484

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 90/236 (38%), Gaps = 75/236 (31%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L+++ L     L L L+YE  +G  S W PY+  L           E+P+ W+  EL  L
Sbjct: 96  LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
               T  ++        RE  E DT   +   + + +P          ++F+ +      
Sbjct: 149 QAGATAGKV-------GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDE 190

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYS---------------------SKCKAMLAAV-- 169
            +V     +LA R     +G  ++AY+                        KAM+  V  
Sbjct: 191 ALV-----ALAHR-----MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPM 240

Query: 170 -----------------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 208
                            D+ + +   RP KAGE I+ + GP PNS+LL  YG+V E
Sbjct: 241 ADILNADAEFNAHVNHGDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296


>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 105/253 (41%), Gaps = 51/253 (20%)

Query: 16  NKLSELAC---LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           ++LSEL+    L L +MYE ++   SFW PY   L R+         +P+ W++ +L  L
Sbjct: 74  SELSELSGWSPLILCMMYEIEK-PDSFWKPYFDVLPRE-------FTTPMFWNQEDLKEL 125

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAV 130
            G+   ++I  + E  +  +NEL+ +        ++YP     +  T E+F    + +  
Sbjct: 126 EGTDIISKI-GKKESEELFHNELEPI-------IKKYPNLFDEQKHTIELFHICGSLIMA 177

Query: 131 QSCVVHLQKVSLARR------------------------FALVPLGPPLLAYSSKCKAML 166
            S    LQK                               ++VP+   L   +    A L
Sbjct: 178 YSFNDELQKAPKENNKEEEKEEEEEEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARL 237

Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR------LVVEAA 220
               D++Q+   +  K GE I    G   N+ LL  YGFVDE N +D       L+VE  
Sbjct: 238 FHEPDSLQMRAIKDIKEGEQIYNTYGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVC 297

Query: 221 LNTEDPQYQDKRM 233
              +D   +++++
Sbjct: 298 CEDQDEALKERKI 310


>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 561

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 121/301 (40%), Gaps = 38/301 (12%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
           P+ WS+ EL++L GS    +I ER   I+++Y+ +                D     F+ 
Sbjct: 227 PIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA-------------DPSFSRFSL 273

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 179
           E F  A + V S    L  +   +  ALVP    L  Y  +  +       D   +    
Sbjct: 274 ERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGFTITSLE 332

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRM 233
              +G  +    G + N + L+NYGF  EDN        + ++V+  L+  D Q     +
Sbjct: 333 SICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQLDPGDGQ-----L 387

Query: 234 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDTSEMQSVISSLGPIC-- 286
           +  +   L     +    R   + SD  P  R G+     ++ T +  S +    P    
Sbjct: 388 LYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAATEDEFSSMKMRSPAHAS 445

Query: 287 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNLHPKKRVATQLVRMEKK 343
            P+S   E A L+ L      +L  YP +L E EA+L   +Y L+  +  A   +R EK+
Sbjct: 446 PPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFFIRGEKQ 505

Query: 344 M 344
           +
Sbjct: 506 V 506


>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/359 (21%), Positives = 141/359 (39%), Gaps = 56/359 (15%)

Query: 27  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
           +L+ E ++ + SFW PYI  L +           P  +   + A L GSPT   ++ + +
Sbjct: 122 HLLQENRR-QNSFWKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRK 174

Query: 87  GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 146
             + EY+ L      A   FQ+Y Y       T+  F           V  + ++++R F
Sbjct: 175 TFQEEYDNLKE----AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSF 212

Query: 147 -----------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGP 193
                       LVPL    + + +          DA    +   R  + GE +    G 
Sbjct: 213 PVYIGENEQQQLLVPLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQ 271

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHA 250
             N    +NYGF    NP ++   +  L+  D  ++ K  +   N   G   V       
Sbjct: 272 WSNKYFFMNYGFASLTNPMNQFDFDICLDRNDRMFKMKVELTGGNICWGNRLVNETDHDT 331

Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYF 304
            R+  A    + + ++  + D  +++  + +     P    +P    +E+A    L D  
Sbjct: 332 FRQSLAT---VRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLL 388

Query: 305 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 363
            + LA + +T+ +D+  L D +    +R    L   EK+++   ++V  DM++ + D T
Sbjct: 389 VSELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446


>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 474

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 149/367 (40%), Gaps = 53/367 (14%)

Query: 20  ELAC-LALY-LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           E+ C LAL  ++  + Q + S W PY+  L  +       ++S + WSE+EL  L  S  
Sbjct: 79  EMPCWLALTAIILTEGQQEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTV 131

Query: 78  KAEILE-RAEGIKREY-------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
             +I    AE +  E+       N    +     S+   Y +DIP         K+    
Sbjct: 132 VNKIGRASAEQLFLEHISPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDD 183

Query: 130 VQS-----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
            +S      +V   +       +++PL   L A +    A L   ++ +++   +P   G
Sbjct: 184 PESPEDGDDLVSDNEEEENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKG 243

Query: 185 ESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDP---------- 226
           E I+   G  P S LL  YG++ D+   YD  V E       A+L+TE P          
Sbjct: 244 EEILNDYGQLPRSDLLRRYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPL 301

Query: 227 ---QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 283
              + + +  +AQR G         H G +  +I D L  L    + D   + ++ +S  
Sbjct: 302 SREKLEQRVELAQREGVYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHA 361

Query: 284 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 343
            +   S      V   L    ++R   Y  T+  D+A+L   NL  +KR+A ++   EK 
Sbjct: 362 SLPSRSKLATSLVGQILTKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKL 421

Query: 344 MLNACLQ 350
           +L   +Q
Sbjct: 422 VLEKAIQ 428


>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
 gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/221 (26%), Positives = 98/221 (44%), Gaps = 35/221 (15%)

Query: 21  LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
            A +   L++ K+QG +S   P+I +L    G        PL WS+ +LA L      A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190

Query: 81  ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 122
           + E+    +RE+  L DT+                 W+  G + +   +  P    T   
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245

Query: 123 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 180
             +    V + VV L + SL +++A+ PL   L  ++S  ++ ++     D+  +V  R 
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           +K GE + +  G Q N  L+  YGF + DNP D  V+   L
Sbjct: 303 FKKGEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343


>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 563

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 125/298 (41%), Gaps = 30/298 (10%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 119
           P+ W+E E+  L GS    ++ ER + I+ +Y          G +   YP   P  +  T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVD 178
            E FK A + V S    L  ++  R  ALVP    L  Y  +  K           +   
Sbjct: 259 LEEFKWARMCVCSRNFGLD-INGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTL 317

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKR 232
                G  +    G + N + L+NYGF  E+N        + + +   L+  DP  Q K 
Sbjct: 318 HRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKA 377

Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVS 289
              + +G    +V  +  G + +A+   L  LR+  V+D +EM +  +  ++  +  P+S
Sbjct: 378 RFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLS 434

Query: 290 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 345
              E A +++L       L  YP TL ED A L +  L P   +R A   V  EK +L
Sbjct: 435 VRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492


>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
 gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
          Length = 688

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)

Query: 31  EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 90
           +K Q + S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I  
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557

Query: 91  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 150
           E+ ++     +   LF +          + E F   +  V S     + +++     LV 
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603

Query: 151 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
           +  P+L + +   A  A +      +   +  DR     + I + CG   N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661

Query: 206 VDEDNPYDRLV 216
              +N YD ++
Sbjct: 662 TVPENRYDEVM 672


>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
          Length = 390

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 14/204 (6%)

Query: 38  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-D 96
           SFW PYI  L        ++   PL W   +L  L GS    E +   + + R+Y  L +
Sbjct: 118 SFWRPYIACL-------PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHN 170

Query: 97  TVWFMAGSLFQQYPY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVP 150
            +   A      +P    +  EAFTFE ++ A   V +    + +      +    AL+P
Sbjct: 171 KLSVRANPSCSCFPLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIP 230

Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
           L   +   +        +  + ++ V   P K G  I +W G + N + L++ GF    +
Sbjct: 231 LWDMINHANHPMSTQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGFFFAGH 290

Query: 211 PYDRLVVEAALNTEDPQYQDKRMV 234
             D + V  +L+  D  Y+ K ++
Sbjct: 291 ANDYVNVPFSLDETDSLYKIKALL 314


>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 813

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 85/202 (42%), Gaps = 23/202 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           +AL++  E+ +G+K+    ++  L R       + ++   WSE EL  LTG+    E + 
Sbjct: 70  VALHVCAERFRGEKATRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMN 122

Query: 84  RAEGIKREYNELDTVW--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
             E  K +Y  L         G   +++  D       +E +  A   + S    L    
Sbjct: 123 LREETKNDYETLTKKMEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPD 175

Query: 142 LARRFALVPLGPPLLAYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQP 195
             R  A+VP    +  +S+K        L A  + V +     YKAGE   I    G   
Sbjct: 176 GKRTRAMVPT-FDIFNHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAA 234

Query: 196 NSKLLINYGFVDEDNPYDRLVV 217
           NSKLL  YGF  +DNPY+ L V
Sbjct: 235 NSKLLTWYGFCIDDNPYEELDV 256


>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 692

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 94/211 (44%), Gaps = 12/211 (5%)

Query: 7   YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
           ++ +++L+  ++ +   LA ++ +E+ +G+ S W PY+  L R        + S L + +
Sbjct: 92  FHCSDILS-ERVRKSIILAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQD 145

Query: 67  TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL-FQQYPYDIPTEAFTFEIFKQ 125
           ++L +L G+            +K EY+   ++    G L  + Y +DI   A+T  I  +
Sbjct: 146 SDLDWLQGTNLYQTHQAYRNTVKEEYDSAISILRDEGCLAVESYSWDIFCWAYTL-IASR 204

Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYK 182
           AF + +    +L      ++     +  PL+  S+     K    A    + L V  P  
Sbjct: 205 AFTS-RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTF 263

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
            GE I    GP  N +L+  YGF   DNP D
Sbjct: 264 TGEEIHNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 339

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 87/202 (43%), Gaps = 13/202 (6%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
           E+  L + +M+E+ +G +S W PY+  L             PL W   E   L G+    
Sbjct: 116 EVQGLVIAVMWERWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYD 169

Query: 80  EILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           ++L RA+       ++  +W  + G    ++P   +P     +E+++ A  AV S    L
Sbjct: 170 KMLGRAQHPSDAPTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFIL 229

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
                 +  A+VP+   L   +      L   +    +Q++  R   AG  +V   G   
Sbjct: 230 GD---DKYQAMVPVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELS 286

Query: 196 NSKLLINYGFVDEDNPYDRLVV 217
           N++LL  YGFV+  N Y+ + V
Sbjct: 287 NAELLRGYGFVERANRYNHIPV 308


>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
 gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
          Length = 510

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 154/386 (39%), Gaps = 70/386 (18%)

Query: 21  LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
            A +A  L++ K+QG +S   P+I +L    G        P+LW E ++A L      A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187

Query: 81  ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
           + E+    ++ Y +L               D  W M  S  +   +  P    T +   +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245

Query: 126 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 158
                A +A  + V+ L   QK   A                   +++A+ PL   L  +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304

Query: 159 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 216
           SS  ++ +A     D+  +V  R +K GE + +  G Q N  L+  YGF + +NP D  V
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYV 364

Query: 217 VEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 274
           +   L   T        R+ A +   L+  +  V   R     S+ L  +R    +D SE
Sbjct: 365 MTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQAVRFLLAAD-SE 422

Query: 275 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-- 332
             + +SS       SP  E  + + +A+  +  L    ++L ED A+L+        R  
Sbjct: 423 AGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLSSTGASAGGRKG 480

Query: 333 -------VATQLVRMEKK-MLNACLQ 350
                  VA    R+EKK +L A LQ
Sbjct: 481 GTAAAAAVAAVAFRVEKKRLLTAVLQ 506


>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
          Length = 510

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/116 (25%), Positives = 54/116 (46%), Gaps = 12/116 (10%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           K+ +    ++ L+YE   G +S W  YI+ L        L+V+ P+LW++ +L  L G+ 
Sbjct: 95  KIKDATATSIALLYEASIGSQSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTS 147

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
            +  + E  E +   YN+     ++   L   +P       F+ + FK+A   V S
Sbjct: 148 IETVVYENKETVDATYNK-----YIKSKLIANHPDVFNEHVFSLDNFKRASCLVSS 198


>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
 gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
          Length = 496

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 89/221 (40%), Gaps = 38/221 (17%)

Query: 9   AAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
             E  T++       L L L+YE  QG  S W PY+  L         A ++P+ WS TE
Sbjct: 98  GGEDGTSSSQDSWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTE 150

Query: 69  LAYLTGSPTKAEI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
           LA L  S    ++  E A+ + R          D V+F  G   Q+   D       FE+
Sbjct: 151 LAELQASALVTKVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFEL 203

Query: 123 FKQAFVAVQSCVVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAM 165
             +   A+ +    L+K   A                     +VP+   +L   ++  A 
Sbjct: 204 AHRMGSAIMAYAFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAY 262

Query: 166 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +    D++     R  KAGE I+ + GP PN +LL  YG+V
Sbjct: 263 INHGADSLTATALRTIKAGEEILNYYGPLPNGELLRRYGYV 303


>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
           FGSC 2508]
 gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 504

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 88/201 (43%), Gaps = 20/201 (9%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            YL+ +  +GK SFW PYI  L       + A+  P  W+E ++  L G+     I E  
Sbjct: 107 FYLIQQYLKGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQ 164

Query: 86  EGIKREYNELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV- 140
             +K EY +   +    G      + Q  Y+     FT   F+ + V  +S   +++++ 
Sbjct: 165 SNVKSEYKQARKILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLL 224

Query: 141 ---SLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
              S    F+++    PL      ++ +     L +   A +L+ +  Y  G+ +    G
Sbjct: 225 PEGSKIDDFSIL---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYG 281

Query: 193 PQPNSKLLINYGFVDEDNPYD 213
            + NS+LL+ YGF+   NP D
Sbjct: 282 FKTNSELLLGYGFI--INPKD 300


>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
 gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
          Length = 447

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 102/247 (41%), Gaps = 57/247 (23%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEIL 82
           L + +MYE+ +GK S W  Y++ L  Q          P LWSE E+   L G+     + 
Sbjct: 53  LTVAVMYERSKGKGSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALK 105

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVS 141
           E    +K ++ E          L ++ P + P + FTFE    +++A +S V     ++ 
Sbjct: 106 EDKLLMKEDWEE------NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEID 155

Query: 142 LARRFALVPLGPPLLAYSSKCKA-----MLAA-------------VDDA----------- 172
               + +VPL      ++ K  A     ML A             +DD            
Sbjct: 156 AEHGYGMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISS 212

Query: 173 ----VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDP 226
               +++V+ +   AG  I    G   N+ LL  YGF + +NP+D   L ++  L     
Sbjct: 213 DKSVLEMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLS 272

Query: 227 QYQDKRM 233
           ++Q KR+
Sbjct: 273 RFQKKRV 279


>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
          Length = 692

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/223 (22%), Positives = 92/223 (41%), Gaps = 36/223 (16%)

Query: 7   YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
           ++ +++L+  ++ +   LA ++ +E+ +G+ S W PY+  L R        + S L + +
Sbjct: 92  FHCSDILS-ERVRKSIILAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQD 145

Query: 67  TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
           ++L +L G+            +K EY+   ++    G L          E+++++IF  A
Sbjct: 146 SDLDWLQGTNLYQTHQAYRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWA 197

Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-------------- 172
           +  + S      +   +R         P L    + + ML  VD +              
Sbjct: 198 YTLIAS------RAFTSRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEA 251

Query: 173 --VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
             + L V  P   GE I    GP  N +L+  YGF   DNP D
Sbjct: 252 TEIGLKVIEPTFTGEEIHNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
          Length = 347

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 4/107 (3%)

Query: 116 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 172
           E   F+ F+ A++ V S CV + L   +    F LVPL   +  +SS C       D A 
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180

Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 219
            +L + RP K GE IV   G    + L   YGF++  NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERIDLTA 227


>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 345

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           DD  + V  + + AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 42  DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101

Query: 230 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 281
            K  V  R G  +  VF +H   E    + +L +LR+  +++    + +I         +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160

Query: 282 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 336
           LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L   +L     +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216

Query: 337 LVRMEKKMLNACLQVTA 353
           L   EK++L   ++  A
Sbjct: 217 LRLGEKEILEKAVKSAA 233


>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
           98AG31]
          Length = 509

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VPL   L A S    A L      + +   +  + GE I       PN+ LL  YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319

Query: 207 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 248
           D++NP+D          RL  E +L+  DPQ Q+     +   K +++V  +        
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378

Query: 249 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
            + RE KE + D L  +    +S   E Q+   S G +    P M   +          R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435

Query: 308 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 343
           L  Y  T+  D+ +L D +L  ++++ +  VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470


>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
          Length = 433

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 83/343 (24%), Positives = 138/343 (40%), Gaps = 48/343 (13%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 82
           L L LMYE+     S W  Y+  +           +S + WS+ EL  L GS    +I  
Sbjct: 83  LILCLMYEQID-PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGR 134

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           E AEG    Y+ L         +F+      P EA++  ++ +    + S   H+     
Sbjct: 135 EEAEG--SYYSILVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDD 186

Query: 143 ---------------ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
                               +VP+   L A S    A L    DA+ +   +   AGE I
Sbjct: 187 SASDASDDDDAAYHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQI 246

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVF 246
                  PN+ LL  YG VDE N  D + + A L   +D          +R   L  ++ 
Sbjct: 247 FNTYNDPPNADLLRRYGHVDEVNLNDNVEISADLIGCKD---------LERVDWLLDRLD 297

Query: 247 HVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
            V+   + E +  D +  +++   +  SE + +  +     P    ++ A   ++ +  +
Sbjct: 298 DVYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQ 353

Query: 306 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 347
            RLA Y +T+ EDE++L    +   +  A  LVR+ EK++L A
Sbjct: 354 MRLAQYSSTIEEDESLLASSTMLTSRSRAALLVRLGEKRILAA 396


>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
           purpuratus]
          Length = 455

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 84/200 (42%), Gaps = 19/200 (9%)

Query: 27  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT--KAEILER 84
           +L+ E+ +GK SFW PYI  L +          +P   S T+ A     PT  ++  + +
Sbjct: 125 FLLTERSKGKSSFWYPYINVLPKD-------FTTPAFGS-TKQADFDVLPTIARSRAINQ 176

Query: 85  AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQSCVVHLQKV 140
            + I+  +     ++      F QY      ++F +  F    +  ++    C     K 
Sbjct: 177 LQDIRAAFESASCLFEDIERTFPQYRIFFSLDSFVWAWFVINSRSVYIEPSGCEAFDPKA 236

Query: 141 SLARRFALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
           S    FAL P    LL +S  ++  A    V +  ++     Y A + + +  GP  N  
Sbjct: 237 S--DDFALAPFLD-LLNHSPGAEVTAGFDPVSNCYRIKTLDSYHAYDQVFIHYGPHDNVN 293

Query: 199 LLINYGFVDEDNPYDRLVVE 218
           LL+ YGFV   NP+D +  E
Sbjct: 294 LLLEYGFVIPSNPHDAVSFE 313


>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
          Length = 469

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 143/371 (38%), Gaps = 65/371 (17%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L     L L ++YE  QG  S W PY   L +Q        ++P+ WS+ EL  L G+  
Sbjct: 91  LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
            AE + + E      +++  +     ++F       PTE     +  +    + +    L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203

Query: 138 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
           +                       +VP+   L A +++  A +   +      +    KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262

Query: 184 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
           G+ I+ + GP P S+LL  YG+V  E + YD + V   L  E         V   +  LS
Sbjct: 263 GDQILNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSSLSLS 313

Query: 243 VQVF-HVHAGREKEAISDMLPYLRLGYVSDTSE--------MQSVISSLGP--------- 284
            + +  V +  + E I D     R     D+ E          +V+  + P         
Sbjct: 314 AEAWKQVESQIDDEEIEDYFVIER-----DSGEPGPDGRFTAPAVLREVSPELVEQLKEF 368

Query: 285 ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 334
           +  V       + D+          +A+  K RLA YP ++  DE +L + +L  ++R+A
Sbjct: 369 LKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRRMA 428

Query: 335 TQLVRMEKKML 345
             +   EKK+L
Sbjct: 429 VVVRLGEKKLL 439


>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 71/344 (20%), Positives = 132/344 (38%), Gaps = 33/344 (9%)

Query: 27  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
           +L+ E ++ K SFW PYI  L +           P  +   + A L GSPT   ++ + +
Sbjct: 123 HLLQENRR-KYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVINQRK 175

Query: 87  GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 146
             K EY  L      A   FQ+Y Y       T++ F +  +   S    +Q     ++ 
Sbjct: 176 VFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEKEQQQ 224

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYG 204
            LVPL    + + +          DA    +   R  + GE +    G   N    +NYG
Sbjct: 225 LLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYG 283

Query: 205 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAISDML 261
           F    NP ++  ++  LN  D  +  K  + + N   G   V        R+  A    +
Sbjct: 284 FASLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLA---TV 340

Query: 262 PYLRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGYPATL 315
            + ++  + D  +++  + +     P       +  +E+A    L       L  + +T+
Sbjct: 341 RFTQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNFASTI 400

Query: 316 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 359
            +DE  L D      ++    L   EK+++   + +   M+ ++
Sbjct: 401 EDDERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444


>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 704

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 90/203 (44%), Gaps = 27/203 (13%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ +  +G + FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 124 FFLIGQYLKGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRM 178

Query: 86  EGIKREYNELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCV 134
              K +Y    T    AG  FQ   QY +D+         + AF+ ++  +A   V+   
Sbjct: 179 GVWKEKYKNGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE--- 233

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +    VS+     L+P    L+ +    K    A    V  VV     +G+ I    GP+
Sbjct: 234 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPR 287

Query: 195 PNSKLLINYGFVDEDNPYDRLVV 217
            N +L++NYGF   DNP D  +V
Sbjct: 288 NNEQLMMNYGFCLPDNPCDYRIV 310


>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 316

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)

Query: 36  KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 95
           KKS W PY+R L R        + + + W   EL  +  S    E +ER E   +E++ +
Sbjct: 38  KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92

Query: 96  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 155
                    LF +            E F  A     S +V  +    +R  +L+P     
Sbjct: 93  KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137

Query: 156 LAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
           L +     ++L      D  +++ DR Y  GE ++V  G   N+ L +N+GF    N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197

Query: 214 R 214
           +
Sbjct: 198 Q 198


>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
          Length = 470

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 21/186 (11%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + +SE EL    G+       +  + IK +Y +L      A  LF Q+P   P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160

Query: 120 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 166
            E           +K A   V S  +       +    L P    +L +SS   +C A  
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219

Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 226
               D + +   + Y+ G+ + ++ GP PN++LL  YGFV  DNP D   +    +   P
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLTTHPMAP 278

Query: 227 QYQDKR 232
            Y+ K+
Sbjct: 279 FYEQKQ 284


>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 797

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           R AL+P+   L   ++ C    +   +A  +  DR Y+AGE +    G   N  LL  YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650

Query: 205 FVDEDNPYDRLVVEAAL 221
           FV  DNP+D+L ++  L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667


>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
 gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
          Length = 394

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 83/186 (44%), Gaps = 20/186 (10%)

Query: 40  WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 99
           W P +R L  +         S L W+  +L  + G      + E    +K +Y+ L    
Sbjct: 77  WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLANRLREYRRSVKVQYDAL---- 125

Query: 100 FMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQK-VSLARRFALVP----LGP 153
           F A  L +Q P   P  AF  +  F++A+    S  + +Q   SL  R  +VP       
Sbjct: 126 FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKVQDPDSLQIREVIVPGVFLCNH 183

Query: 154 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
            L A+S +  ++      A +L + R    GE+I +  G   N+ LL+ YGF  E+NPYD
Sbjct: 184 SLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYGRLDNADLLMFYGFSLENNPYD 242

Query: 214 RLVVEA 219
           R+ + +
Sbjct: 243 RVSLHS 248


>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 75/330 (22%), Positives = 141/330 (42%), Gaps = 19/330 (5%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ E+ +   S W  Y   L ++        +S + WSE EL+ L G+       
Sbjct: 125 AVALFLIRERSR-SNSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTR 176

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKV 140
              + ++ EY  L+    +        P  +    + F I + +AF  +++  +V +   
Sbjct: 177 SVKQYVENEYRRLEEEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFA 236

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKL 199
                 A V       AY  K  A L + D    L      KAG+ + + +   + N++L
Sbjct: 237 DFINHSARVTTEDH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAEL 294

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAIS 258
            ++YGF++ +   +   +   ++  DP + DK  +A+ NG      F + ++      + 
Sbjct: 295 ALDYGFIEPNADRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLL 354

Query: 259 DMLPYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATL 315
             L  + LG  +D   ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+
Sbjct: 355 PYLRLVALG-GTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTI 413

Query: 316 SEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
            ED+  L +  L  +  +A  +   EK++L
Sbjct: 414 EEDQK-LKEAKLDSRHAIAVGIREGEKQLL 442


>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
           occidentalis]
          Length = 381

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/206 (23%), Positives = 84/206 (40%), Gaps = 28/206 (13%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           ++   LS    L ++++ E+  G+KS W PYI  +             P ++   +    
Sbjct: 83  MSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI-------------PDVFDGLQCRKS 129

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
              P +      A+ I R   E   V+      F+    D+      FE F  A+ AV +
Sbjct: 130 VRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----NFETFSWAWSAVNT 179

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 192
             ++++         L P    LL +  K     + V++   +  +  Y+AG  + +  G
Sbjct: 180 RCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNVGYEAGSEVFIGYG 234

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVE 218
              N  L +NYGFV ++NP D + VE
Sbjct: 235 SHDNRTLFLNYGFVLDENPNDCITVE 260


>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
          Length = 456

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/207 (23%), Positives = 91/207 (43%), Gaps = 19/207 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ E+  G++S W PY+  +             P+ W E E+ +L  +P 
Sbjct: 104 VSPLLALCTFLIAERFAGERSQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           + + LE+    K E  EL T      +  Q    D   + +T++  + A+  V +  V++
Sbjct: 156 RQKALEQ----KTEVQELHTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYM 211

Query: 138 QKVSLARRFA---LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVW 190
           +     R  A   +  L P   LL +S + +  A  +      ++  +   +  +   + 
Sbjct: 212 KHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
            GP  N +LL+ YGFV  +NP+  + V
Sbjct: 272 YGPHDNQRLLLEYGFVAANNPHRSVYV 298


>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
 gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
          Length = 454

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 93/226 (41%), Gaps = 41/226 (18%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           KL+    L LY+M   K G  + W PY   L  Q       ++S ++W++ EL  L GS 
Sbjct: 80  KLNSWLQLILYMM---KAGSMTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSM 129

Query: 77  TKAEILERAEGIKREYNE---------------LDTV---WFMAGSLFQQYPYDIPTEAF 118
              +I     G + +Y E                DT    +   G L   Y +D P ++F
Sbjct: 130 IVKKI--GKAGAEEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSF 186

Query: 119 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVV 177
           +    +            L    L +  A+VPL   L A++  C A L A DD    +  
Sbjct: 187 S----EDEEDDEDIEHDDLYNEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTA 240

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 222
            +P K GE +    G  PN   L  YG+V +E   +D  +VE +++
Sbjct: 241 IQPIKKGEQVYNTYGELPNCDFLRRYGYVENEGTEFD--IVEFSMD 284


>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 567

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 74/345 (21%), Positives = 129/345 (37%), Gaps = 45/345 (13%)

Query: 38  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 97
           S W PYI+ L +Q         +   W   E     GSP     +    G  R+Y     
Sbjct: 164 SEWGPYIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY----- 211

Query: 98  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLG 152
             ++   + +     +P  +FT++ F  A   VQS     Q    A        AL+P  
Sbjct: 212 -CYLYSMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFW 266

Query: 153 P--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
                 +  SK  +      + +     + +K GE + ++ GP+ N++LL++ GF  + N
Sbjct: 267 DFCNHSSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTN 326

Query: 211 PYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 268
            +D    E  L     + ++    ++ +R  +  V V         E   +++P+ R+ Y
Sbjct: 327 LHDSYPFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-Y 385

Query: 269 VSDTSEMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGY 311
                E +++                  L P+    ++   E      L    K +LA Y
Sbjct: 386 ALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASY 445

Query: 312 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           P TL EDE  L   N    +R        EKK+L+  ++    +I
Sbjct: 446 PTTLEEDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489


>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
          Length = 736

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/200 (20%), Positives = 79/200 (39%), Gaps = 22/200 (11%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA+ L+ E+ +G +SFW PY+R L  +          P+ ++ +E   +        +  
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQS 313

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           R   +            ++    +Q  +  P E    +      +    C    + +   
Sbjct: 314 RCRMLVN----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNI 363

Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 197
                 PL  P++ +     +    + D      ++QLV  R  + G+++ +  G   N 
Sbjct: 364 PGLGSTPLMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNP 423

Query: 198 KLLINYGFVDEDNPYDRLVV 217
           +LL++YGF   DNP+DR  V
Sbjct: 424 QLLLDYGFTLSDNPHDRFEV 443


>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
 gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
          Length = 504

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/199 (24%), Positives = 83/199 (41%), Gaps = 28/199 (14%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            YL+ +  +GK SFW PYI  L       + A+  P  W+E ++  L G+     I E  
Sbjct: 107 FYLIQQYLKGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQ 164

Query: 86  EGIKREYNELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
             +K EY +   +    G      + Q  Y+     FT   F+ + V  +S   ++++  
Sbjct: 165 NNVKSEYKQARKILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER-- 222

Query: 142 LARRFALVPLGP---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
                 L+P G          PL      ++ +     L +   A +L+ +  Y  G+ +
Sbjct: 223 ------LLPEGTKIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQV 276

Query: 188 VVWCGPQPNSKLLINYGFV 206
               G + NS+LL+ YGF+
Sbjct: 277 FNNYGFKTNSELLLGYGFI 295


>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 475

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/218 (25%), Positives = 83/218 (38%), Gaps = 48/218 (22%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI- 81
            L L ++YE   G  S W PY   L  +         + + WSE ELA L  S   A+I 
Sbjct: 85  SLILVMLYEHLNGDASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIG 137

Query: 82  ------------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPY 111
                                          ++AE ++ E N   T+    GSL   Y +
Sbjct: 138 REGANEAFLGQLVPVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAF 195

Query: 112 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 171
           DI           +  V  +      +  +L +   ++PL   L A + +C A L     
Sbjct: 196 DIEPAT------PRKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQK 247

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
            +++   +P KAGE I    GP P S LL  YG+V E+
Sbjct: 248 YLEMKALKPIKAGEEIFNDYGPLPRSDLLRRYGYVTEN 285


>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
          Length = 1313

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)

Query: 147  ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
            A+VP+   L A      A L    D +Q++  +P   GE I    G  PNS LL  YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119

Query: 207  DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
            D         +P D + + A    E  +   YQD+       G     V  V       +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174

Query: 257  ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 316
            + D +  L    + +  + +   S      P  P ++    + L    + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231

Query: 317  EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 350
            E E ML   N L+ K+R A  +   E+++L+  L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266


>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 308

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 102/238 (42%), Gaps = 35/238 (14%)

Query: 17  KLSELACLALYLMYEK---KQGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AY 71
           +L E + LA +L  +    + G +S  +  Y++ L R+ G         L W E ++   
Sbjct: 63  ELQEWSLLAAFLAEQALDIENGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTL 115

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           L GSP++    ER   +     E+            ++P   P         + AF  + 
Sbjct: 116 LAGSPSQRAAYERQASVDGAIEEIRA----------EFPQLTPG------ALRWAFDVLF 159

Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           S ++ L   +     ALVP    +L +   C A +      V L  DR YK GE +    
Sbjct: 160 SRLIRLP--NRGGELALVPWAD-MLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASY 216

Query: 192 GPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVA-QRNGKLSVQVF 246
           G +P+++LLI+YGF  E  +NP D   +   ++  D +Y D +  A ++ G   V+ F
Sbjct: 217 GQRPSAELLISYGFAPEVGENPDDEYEITLGIDPND-RYADAKAAALEKIGLRPVESF 273


>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 674

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 88/201 (43%), Gaps = 23/201 (11%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ +  +G + FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 94  FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148

Query: 86  EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 136
              K +Y    T    AG     QY +D+         + AF+ ++  +A   V+   + 
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205

Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
              VS+     L+P    L+ +    K    A    V  VV     +G+ I    GP+ N
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNN 259

Query: 197 SKLLINYGFVDEDNPYDRLVV 217
            +L++NYGF   DNP D  +V
Sbjct: 260 EQLMMNYGFCLPDNPCDYRIV 280


>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
 gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
          Length = 517

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/206 (22%), Positives = 89/206 (43%), Gaps = 13/206 (6%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           +++ E+  L + +MYEK +G++S W PY+  +             PL W   E   L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232

Query: 76  PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 133
               +++ + +       ++  +W  +     Q++P  ++P     +++++ A  AV S 
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292

Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 191
              L         A+VP+   L   + +    L   A    + ++  R    GE +V   
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
           G   N++LL  YGFV+  N  + + V
Sbjct: 350 GELSNAELLRGYGFVEARNRNNHVQV 375


>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 502

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 91/414 (21%), Positives = 153/414 (36%), Gaps = 100/414 (24%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           A L L +M+E+ QG  S W  Y+  L         + ++P+ WS  +L  L G+    +I
Sbjct: 79  AGLILCMMWEEAQGASSRWSTYLASLPS-------SFDTPMFWSPDDLEELKGTSVVDKI 131

Query: 82  ---------------------------------------------------LERAEGIKR 90
                                                              +ER EG   
Sbjct: 132 GRDGAEEDYRSKVVPTLQSRPDLFAPEALSRHYSLENYHLMGSRILSRSFSVERWEGHAA 191

Query: 91  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFE-------IFKQAFVAVQSCVVHLQKVSLA 143
           +  E D+         +    D+ TEA T         + + +FV         +     
Sbjct: 192 DKQE-DSASSPVADTGRDEAMDVDTEAVTATAPEAEDGVDEPSFVVDDENDSDDEDEEDP 250

Query: 144 RRFALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
              A+VP+   L A Y S+   +    +D ++++  +P   GE I    G  PNS LL  
Sbjct: 251 ANVAMVPMADMLNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRR 309

Query: 203 YGFV--------DEDNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSV 243
           YG V        D  NP D  +VE     A  +  +   Q     A+R       G   V
Sbjct: 310 YGHVDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDV 367

Query: 244 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 303
            +   +     E   +++P+ RL   S +   ++   S  P   V    + ++L  +A+ 
Sbjct: 368 FILETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANA 419

Query: 304 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
            + RLA YP ++ ED+ +LT+  +L+ K  V  +L   EK++L+  L    + +
Sbjct: 420 LERRLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471


>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
 gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
          Length = 692

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/206 (22%), Positives = 81/206 (39%), Gaps = 35/206 (16%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA ++++E+ +G+ S W PY+  L R          S L + + +L +L G+        
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
               ++ EY+   ++    G L          E++ ++IF  A+  + S      +   +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208

Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 187
           R         P L    + + ML  VD +                + L V  P  +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYD 213
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
          Length = 381

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 99/242 (40%), Gaps = 21/242 (8%)

Query: 16  NKLSE--LACLALYLMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           N LSE  L C  L + +  K+   G  +  L +   +D    R QL   +PL ++  ELA
Sbjct: 44  NSLSERELVCTYLAMHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELA 101

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
            L G+   A   +R    + E      V    G       Y   +   +   F    ++ 
Sbjct: 102 LLKGTNMAAATTDREADWRSECERCRAVLGHWGEHLTWEHYLTASTHLSSRAFPSTLLSP 161

Query: 131 QSCVVHLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGES 186
           +  ++     + +    LVPL   L  A +      ++  D+      +V   P  AG  
Sbjct: 162 EPALI----PTPSSHPVLVPLIDSLNHARAHPVSWSVSPADNGAHTLSIVQHAPVAAGAE 217

Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           ++   GP+PN++L++ YGF   DNP D LV++ +         D+R + +  G L   +F
Sbjct: 218 VLNNYGPKPNAELVLGYGFALPDNPDDTLVLKVS------GAADRREIWRAGGGLQRILF 271

Query: 247 HV 248
            V
Sbjct: 272 DV 273


>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 77/332 (23%), Positives = 143/332 (43%), Gaps = 23/332 (6%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL+L+ E+ +   S W  Y   L ++        +S + WSE EL+ L G+       
Sbjct: 125 AVALFLIRERSR-SDSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTR 176

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQ 138
              + ++ E+  L+    +     + +P  I  + F   F I + +AF  +++  +V + 
Sbjct: 177 SVKQYVQNEFRRLEEEIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIP 234

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 197
              L    A V       AY  K  A L + D    L      KAG+ + + +   + N+
Sbjct: 235 LADLINHSARVTTDDH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNA 292

Query: 198 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEA 256
           +L ++YGF++ +   +   +   ++  DP + DK  +A+ NG      F + +       
Sbjct: 293 ELALDYGFIEPNTDRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPG 352

Query: 257 ISDMLPYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPA 313
           +   L  + LG  +D   ++S+   S  G +  PVS   E  +   + +  K  LAGY  
Sbjct: 353 LLPYLRLVALGG-TDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHT 411

Query: 314 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
           T+ ED+  L +  L  +  +A  +   EK +L
Sbjct: 412 TIEEDQK-LKEAKLDSRHAIAVGIREGEKNLL 442


>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
           23]
          Length = 462

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 112/271 (41%), Gaps = 24/271 (8%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + +++ EL    G+       +  + I+ +Y +L         +  QYP   P + FT
Sbjct: 107 SSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRDLVV------RVLVQYPDLFPLDKFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 176
              +K A  AV S  +  Q    +    L P    +L +SS   +C    A+  D + ++
Sbjct: 161 LHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD-MLNHSSESKQCHVYDASSGD-LSVL 218

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
             + Y+AG+ + +  G  PN +LL  YGF+   NP D   +  A +   P ++ K+ +  
Sbjct: 219 AGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWA 278

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPC 291
             G  S     +        ++D LP     YLR+  + D S++ S+         +S  
Sbjct: 279 LAGLDSTCTISL-------TLTDPLPKNVIRYLRIQRL-DESDLASIALGQAADEKISNS 330

Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAML 322
            E  VL  L +   + L  +   L + E  L
Sbjct: 331 NEVQVLQSLVESIASLLGSFGTRLEKLEEQL 361


>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 648

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/212 (26%), Positives = 82/212 (38%), Gaps = 32/212 (15%)

Query: 38  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA-EILERAEGIKREYNELD 96
           S WLPY+  L +           PL+W+   +  L G  +    ++ER E I+   N   
Sbjct: 146 SHWLPYLASLPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTK 195

Query: 97  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----------- 145
            V    G  F       PT A T +  + A  ++ S      K SL  +           
Sbjct: 196 VVENACGHYF-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWI 248

Query: 146 -FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
             + + L P L  ++ K   +      +  V  +       G  ++   GP+ N  LL N
Sbjct: 249 GLSEICLFPILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSN 308

Query: 203 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
           YGFV E+NP D   V   L  EDP Y  K+ V
Sbjct: 309 YGFVIENNPEDYFKVFLGLQQEDPLYTAKKAV 340


>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
          Length = 1114

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 85/340 (25%), Positives = 134/340 (39%), Gaps = 60/340 (17%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP---LLWSETELAYLTGSPTK- 78
            LAL L+YE+++G KS W P+I           L    P   L WSE ELA L GS    
Sbjct: 100 ALALRLLYERRKGAKSRWGPHI---------ALLPATPPHALLRWSEAELAELAGSDALE 150

Query: 79  -------------AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
                        +EI++++     E +    +   A     ++P+ +  E F++ +   
Sbjct: 151 LANRWRSQVSSDFSEIVDKSRAAVEESDPGKQL-SAAVKASLRFPW-LDLEGFSWAV--- 205

Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY--------SSKCKAMLAAVDDAVQLVV 177
                   ++  + VS++R+ A     PP+ A+                   DDA    V
Sbjct: 206 -------SMIWSRCVSVSRKGA-----PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFV 253

Query: 178 DR---PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
            R     K G+ + +     PN+ LL+ YGF  +   +    + A L+ E P Y+ KR  
Sbjct: 254 LRRTGNAKKGDELKLCYDGLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAA 313

Query: 235 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 294
            +   KL +      A   + A  D LP  RL  ++     ++ +  L  +   S    R
Sbjct: 314 LE---KLGLGATADGAAPFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEATAR 367

Query: 295 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 334
           A    L     A LA Y  +  ED A L D    P+ R+A
Sbjct: 368 AAAGDLVAACDALLAAYRGSEDEDAAALADPATPPRLRLA 407


>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
          Length = 504

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 16/143 (11%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           A +L   +L E    ++ LMYE  +G +S W  YI+ +          ++ P+LW +  +
Sbjct: 73  APILEKYELEEAVATSIALMYETSKGVQSKWYSYIQSM-------PTVIDLPILWDKESI 125

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
            YL G+  +  ++E  E ++ +Y E          + + +P       FT E FK A   
Sbjct: 126 EYLVGTDLEEIVIENIETLEEQYRE------DVEPIIKNHPETFKENIFTLESFKIASTI 179

Query: 130 VQSCVVHLQKVSLARRFALVPLG 152
           V S   ++ +       +LVPL 
Sbjct: 180 VSSRAFNIDQYHGE---SLVPLA 199


>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
 gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
          Length = 527

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 85/214 (39%), Gaps = 34/214 (15%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELA 70
           +L  N       +  +L+ E+ +GK+SFW PYI  L       +L++  PLLW +ET   
Sbjct: 100 MLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSISHPLLWPAETIQE 156

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
            L GSP    I  R +  + ++  L T     F+ G                 E   +  
Sbjct: 157 LLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLSEGD 200

Query: 128 VAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVV 177
           V   S V+  +  SL            LVP    L   SS  +      D   +   L  
Sbjct: 201 VRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEA 260

Query: 178 DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 210
            + Y  G+ +    GP    S+L ++YGFVD++N
Sbjct: 261 HKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294


>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
          Length = 482

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 83/214 (38%), Gaps = 44/214 (20%)

Query: 22  ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           + L L L+YE  Q      G    W PY+  L         A  +P+ WS  EL  L  S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154

Query: 76  PTKAEI-LERAEGIKREY--------------------NELDTVWFMAGSLFQQYPYDIP 114
           P  A++  E A+ + R                       ++  +    GS    Y +D+ 
Sbjct: 155 PAVAKVGRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLD 214

Query: 115 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
            E    E  +  +V  +     +  V +A           +L   ++    +   DD + 
Sbjct: 215 KEEDEDEDGEDGWVEDRDGKALMGMVPMA----------DILNADAEFNVHVNHGDDDLT 264

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 208
           +   RP +AGE I+ + GP PNS+LL  YG+V E
Sbjct: 265 VTALRPIRAGEEILNYYGPHPNSELLRRYGYVTE 298


>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 509

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L   +  +++V  +P KAGE I    G  PNS LL  YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323

Query: 207 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 252
           D            NP D + V A L       + K  + +R      +    VF +    
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381

Query: 253 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
           + E   +++ ++RL  +      ++   S  P     P +++ VL    D  + RL  YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437

Query: 313 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 350
            TL EDEA+        L   KR A  +   EK++L   L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478


>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
 gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
          Length = 402

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 106/219 (48%), Gaps = 35/219 (15%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +S    LAL+L+ ++  G KS W  ++  L DR+ G     ++ PL WS+ +   LT  P
Sbjct: 79  MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132

Query: 77  TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 134
               +L++  +  + +Y++  T  F+A     +Y  D P +A+ +     A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179

Query: 135 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 182
                ++  ++ A      + L P   L+ +S     + C+   +++   + L   R Y 
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           A E I +  GP+ NS LL  YGF   +NP+D + +  AL
Sbjct: 239 ADEEIFLCYGPRSNSVLLCEYGFTVPENPWDDVDISDAL 277


>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
 gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
          Length = 287

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/208 (24%), Positives = 85/208 (40%), Gaps = 24/208 (11%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +L+    LA++LM EK + +KSFW PYI  L  +          P  ++E +   L  S 
Sbjct: 91  RLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE-------YSCPTFFTEDDFRLLPNS- 142

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV- 135
            + +   +     +EY EL   + M   LF         +AF F+ FK A+ A+++  + 
Sbjct: 143 LRGKAKAKKYECHKEYKELAPFFKMLADLFPDQ-----EDAFNFKDFKWAWSAIKTRALD 197

Query: 136 ---------HLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 185
                    HL+         + PL   +  A  +K +         ++   +  Y+   
Sbjct: 198 VPIGRESCRHLRDAEDTPTPTMFPLVDSINHAAQAKIRHRYNEKSRCLESRTETVYRRHA 257

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYD 213
            ++   G   N  LL+ +GFV   NP D
Sbjct: 258 EVMNSYGRADNDNLLLEFGFVVPGNPED 285


>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 363

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 80/337 (23%), Positives = 130/337 (38%), Gaps = 74/337 (21%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPT 77
            EL  LAL+LM E+++G++S W P++  L         A  SP+LW  E +   L  SPT
Sbjct: 31  GELTGLALWLMAERQKGEESRWAPFLECLPE-------ATLSPVLWPEEVQDELLKNSPT 83

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
             E   R   +++E++                                  +A +      
Sbjct: 84  LKECRARRAALQQEWD---------------------------------VIAQRIATGDA 110

Query: 138 QKVSLARRFAL-VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
           ++ S      L + LG P   +      +L A+ D                      +PN
Sbjct: 111 RRFSGGDELKLWITLGSP--GWGGTSDKLLMAIYDG---------------------RPN 147

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
            +L +  G V++DN  D L V   L   D  +  K+ + +  G   VQ F +   R    
Sbjct: 148 GELAMATGRVEDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR---M 204

Query: 257 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP---- 312
            + +L YLRL  ++D + +  V  S      ++P  E  VL  L    + RL  Y     
Sbjct: 205 PTQLLAYLRLARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGMHM 262

Query: 313 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
            +  ED  +L    L  ++R+A +L + EK +L   L
Sbjct: 263 GSAEEDVKLLQRPGLTAQERLAARLRKAEKAILQGTL 299


>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 492

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 60/221 (27%), Positives = 94/221 (42%), Gaps = 25/221 (11%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L    D +++V  +P K GE I    G  PN++LL  YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304

Query: 207 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
           D          NP D + ++A L            V     K  +  + +  G E   I 
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLII-SAVSSTPEAVKDDEAKERID-WWLEEGGEDVFIL 362

Query: 259 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
           D        M+ +++L  ++     ++   S     P    +E  + D L    + RLA 
Sbjct: 363 DYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEKRLAE 418

Query: 311 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 350
           YP T+  D+A+LT D  L+ K  +  +L   EK++L+  LQ
Sbjct: 419 YPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457


>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
 gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/216 (25%), Positives = 83/216 (38%), Gaps = 44/216 (20%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L L ++YE   G  S W PY   L  +         + + W+E ELA L  S    +I 
Sbjct: 85  SLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKIG 137

Query: 83  ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
                                      E+A+   +E      +  M   GSL   Y +D+
Sbjct: 138 KESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNLELMHKMGSLIMAYAFDV 197

Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
                T E+ ++   A +     L K        +VPL   L A + +C A L    D +
Sbjct: 198 EPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCL 249

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
           ++   +P +AGE I    GP P S LL  YG+V ++
Sbjct: 250 EMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285


>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
          Length = 468

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 80/199 (40%), Gaps = 29/199 (14%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ E  + K S+W PYI  L +       A+  P +W E ++  L  +     + E  
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161

Query: 86  EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
             IK+EY      L  V F     + Q  Y      FT   F+ + +  Q    H+    
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217

Query: 142 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
                 L P G          PLL       +S+ +  L  VD   QL+ +  Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQV 272

Query: 188 VVWCGPQPNSKLLINYGFV 206
               G + NS+LL+ YGF+
Sbjct: 273 FNNYGLKSNSELLLGYGFI 291


>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
          Length = 441

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 26/235 (11%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPT 77
           S L  L ++L+ E+ +G+ S W PYI  L         +   P  +++T +A L +G   
Sbjct: 105 SSLVALCVFLVCERHRGEASDWFPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRR 157

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           +AE  E+ EG++  Y  +   +FM+      +P   P E  T+E  + A+ ++ +  V +
Sbjct: 158 RAE--EQREGLQHLY-AVHQDFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFM 211

Query: 138 QK-----VSLARRFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
            +     +S    +AL P    LL +    + KA         ++      +      + 
Sbjct: 212 DRPSSSFLSGPDNYALAPF-LDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFIN 270

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKL 241
            G   N +LL+ YGFV   NP+  + VE  L  E    D    +K    + NG L
Sbjct: 271 YGSHDNQRLLLEYGFVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325


>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
 gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
          Length = 456

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 89/207 (42%), Gaps = 19/207 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ E+  G  S W PY+  +             P+ W E E+ +L  +P 
Sbjct: 104 VSPLLALCTFLIAERFAGDCSQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           + + LE+    K E  EL T      S  Q    D   + +T++  + A+  V +  V++
Sbjct: 156 RKKALEQ----KTEVQELHTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYM 211

Query: 138 QKVSLARRFA---LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVW 190
           +     R  A   +  L P   LL +S + +  A  +      ++  +   +  +   + 
Sbjct: 212 KHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
            GP  N +LL+ YGFV  +NP+  + V
Sbjct: 272 YGPHDNQRLLLEYGFVAANNPHRSVYV 298


>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
 gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
          Length = 472

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 69/329 (20%), Positives = 133/329 (40%), Gaps = 67/329 (20%)

Query: 59  ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 118
           ++ L + E E+ YL GSP   +I+   E   + Y++L    F    + +       +   
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220

Query: 119 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 170
            ++ F+ A   + +  +++       S  ++  L P+ PP++ Y +      A +D    
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280

Query: 171 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGF----VDEDNPYDRLVVE-- 218
             +V +   +  K GE I V     +CG    S LL++YG+    +D+ +  + L+ E  
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYLPNQIDDKSCVNVLMEELL 336

Query: 219 AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 273
             +N +DP   DK     +++  ++ KL +                              
Sbjct: 337 ETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------------- 367

Query: 274 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYN-LHPK 330
            M S+   L  I       + ++L+ L      ++  YP T+ +D+  L   +YN L  +
Sbjct: 368 -MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEYNQLSAR 426

Query: 331 KRVATQLVRMEKKMLNAC---LQVTADMI 356
            ++A  L   EK++L+     LQ   D I
Sbjct: 427 SKLAFNLAFQEKQILSNVYTKLQENIDTI 455


>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
          Length = 486

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 15/126 (11%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
           +L +N++  +  LA  L+ EKK G+KS W+PYI  L +        + S + W E EL+ 
Sbjct: 105 VLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSM 159

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           +  S    E +++   I+++++      F+A +  Q  P  I TE    E F  A+   +
Sbjct: 160 IRCSAVHQETVKQKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALGE 211

Query: 132 S--CVV 135
              C+V
Sbjct: 212 KVLCIV 217


>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 502

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L   +  +++V  +P +AGE I    G  PNS LL  YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316

Query: 207 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 252
           D            NP D + V A L     ++  +Y  +  V     +    VF +    
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374

Query: 253 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
           + E   +++ + RL ++S+   +++   S  P   V P     VL    D   ARL  YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430

Query: 313 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 350
            ++ EDE +L+       +L+ K  V  +L   EK++L   L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471


>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 538

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 95/365 (26%), Positives = 146/365 (40%), Gaps = 64/365 (17%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            LAL L++EK  G  S W  YIR L R       A+++PL WS  ELA L G+    ++L
Sbjct: 157 ALALQLLHEKSLGDDSRWAAYIRCLPRVE-----ALDAPLFWSSEELAELAGT----QLL 207

Query: 83  ERAEG----IKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
             A G    ++  +  L    F    +LF     D    AF+   F  AF  ++S    L
Sbjct: 208 ANAAGYDSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWAFGVLRSRA--L 265

Query: 138 QKVSLARRFALVPLGPPLLAYSSKC-------KAMLAAV---------DDAVQLVVDRP- 180
             V      AL+P G  +  +   C          +AAV           +V L V++  
Sbjct: 266 PPVDQGESIALIP-GIDMANHDGLCSQTWQLNNGGIAAVFGGRGGADGGGSVLLRVEKTK 324

Query: 181 ---YKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEA-ALNTEDPQYQDKRMVA 235
               K GE I    GP   +S+  ++YGFVD        V+   ++  +D    DK  V 
Sbjct: 325 AGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFCSRPGYVLGPLSIPEDDVNAFDKMDVL 384

Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS----------EMQSVISSLGPI 285
              G      F + A   ++   +M+ ++RL  + +            E  ++IS     
Sbjct: 385 SVAGLKESPAFTIRA--FEDPPPEMVVFMRLLNLKNDDAFLLEAIFRQECWALISD---- 438

Query: 286 CPVSPCMERAVLDQLADYF-KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 344
            PVSP       D  AD   +  L  Y   + +D  +  D +  P+ R+A ++   EK+ 
Sbjct: 439 -PVSP-------DNEADAGCEEALGAYATKIEDDRGVADDADASPRLRLAARVRMGEKQA 490

Query: 345 LNACL 349
           L   L
Sbjct: 491 LEEVL 495


>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
 gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
          Length = 536

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 85/214 (39%), Gaps = 34/214 (15%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELA 70
           +L  N       +  +L+ E+ +GK+SFW PYI  L       +L++  PLLW +ET   
Sbjct: 100 MLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAETIQE 156

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
            L GSP    I  R +  + ++  L T     F+ G                 E   +  
Sbjct: 157 LLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLSEGD 200

Query: 128 VAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVV 177
           V   S V+  +  SL            LVP    L   SS  +      D   +   L  
Sbjct: 201 VRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEA 260

Query: 178 DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 210
            + Y  G+ +    GP    S+L ++YGFVD++N
Sbjct: 261 HKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294


>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 524

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 100/240 (41%), Gaps = 46/240 (19%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW----SETELAYLTGSPT 77
           + LAL+LM E+++G  SFW  Y+R L          V++PL W    +E E   L G  T
Sbjct: 83  SVLALHLMAERRKGDGSFWKQYLRTLPDD-------VDTPLRWLVEQAEEEFRLLDG--T 133

Query: 78  KAEILER--AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
              +L R     +++++ E          L + +P  +     TFE +  A  ++ S   
Sbjct: 134 MVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG--GVTFEDYLWAMSSIWSRSF 185

Query: 136 HLQK----VSLARRFALVPL------GPP-------LLAYSSKCKAMLAAVDD------A 172
             Q+     S   R A+VP+       P        ++ + ++   +   + +       
Sbjct: 186 DYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGGLSMGIGEPGRARGT 245

Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
           +++   R Y A E   +  G   N+KLL +YGFV   NPY  L     +   DP +  K+
Sbjct: 246 LRVSAGRDYAAREQFFILYGRYSNAKLLYSYGFVLASNPYGGLDYWVRVPQTDPGFAWKQ 305


>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 437

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 13/135 (9%)

Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYS 159
           +  QYP   P + FT E +K A   V S  +       + + L   FA       +L +S
Sbjct: 146 MLGQYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHS 199

Query: 160 SKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
           S+ K   +  A    + ++  + Y+AG+ + +  GP PN++LL  YGFV   NP D   +
Sbjct: 200 SEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVPGNPNDSYDL 259

Query: 218 EAALNTEDPQYQDKR 232
             A +   P ++ K+
Sbjct: 260 VLATHPMAPFFKQKQ 274


>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
           2508]
 gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 459

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 82/177 (46%), Gaps = 13/177 (7%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S +L++E +L    G+       +  + I+ ++  L         LF Q+P   P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 175
            E +K A   V S  +      LA   ++  L P   +L ++S+ K   +       + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGNLSV 217

Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
           +  + Y+AG+ + +  GP PNS+LL  YGFV   NP D   +  + + + P ++ K+
Sbjct: 218 LAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274


>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
 gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 459

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 13/177 (7%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S +L++E +L    G+       +  + I+ ++  L         LF Q+P   P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 175
            E +K A   V S  +      LA   ++  L P   +L ++S+ K   +       + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSV 217

Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
              + Y+AG+ + +  GP PNS+LL  YGFV   NP D   +  + + + P ++ K+
Sbjct: 218 FAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274


>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
 gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
          Length = 469

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 75/365 (20%), Positives = 139/365 (38%), Gaps = 53/365 (14%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L     L L ++YE  QG  S W PY   L +Q        ++P+ WS+ EL  L G+  
Sbjct: 91  LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
            AE + + E      +++  +     ++F       PTE     +  +    + +    L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203

Query: 138 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
           +                       +VP+   L A +++  A +   +      +    +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262

Query: 184 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTE------------------ 224
           G+ ++ + GP P S+LL  YG+V  E + YD + V   L  E                  
Sbjct: 263 GDQVLNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKEVIVSCLSLSAEAWKQVES 322

Query: 225 ---DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 281
              D + +D  ++ + +G+             +E   +++  L+        E    +  
Sbjct: 323 QIDDEEIEDYFVIERDSGEPGPDGRFTAPAVLREVSPELVEQLK--------EFLKAVKK 374

Query: 282 LGPICPVSPCMERAVLDQL-ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 340
           L             + D + A+  K RLA YP ++  DE +L + +L  ++R+A  +   
Sbjct: 375 LDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRRMAVVVRLG 434

Query: 341 EKKML 345
           EKK+L
Sbjct: 435 EKKLL 439


>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 441

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 90/392 (22%), Positives = 155/392 (39%), Gaps = 56/392 (14%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 71
           L   N+L +   +AL+LM E+ +   SF+ P+I+ L  Q        + P+ W++++ A 
Sbjct: 74  LQDKNELDQDEIVALFLMIERFKSSDSFFEPFIQSLPSQ-------FDLPIFWNDSDFAE 126

Query: 72  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 131
           L G  T   +L +   I R+  E D   F A  +     Y+      T EI    +    
Sbjct: 127 LEG--TNVALLAK---IMRKQIEAD---FQAIHIPLLRAYEERLNLRTSEISISDYEWAL 178

Query: 132 SCVVHLQKVSLARRFALVPLGP------PLLAYSSKCKAMLAAVD---------DAVQLV 176
           S +         R F +   G       P L   +    +   +D         D +   
Sbjct: 179 SII-------WTRAFGITRYGEYLRVLCPALDMFNHSVLVQEPLDEFIKYDHMKDVLAHC 231

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMV 234
           V     A +   +  G   ++KLL +YGFV  +E N ++ + +   +   DP ++ K+ +
Sbjct: 232 VVMETSANDPFYISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAI 291

Query: 235 AQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC 291
            + N     Q +        + + +  L   R+  +S  +  E +    S      VS  
Sbjct: 292 LEGNAATRDQTYDFRGTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVR 347

Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACL 349
            E AV   + D  + RLA +P +L +D   L +  ++   R    + VRME KK+L +  
Sbjct: 348 NELAVYAAIHDVCEKRLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVC 407

Query: 350 QVTADMIMLL--------PDVTVSPCPAPYAP 373
           ++  +   LL        PDVT    P    P
Sbjct: 408 RLMKEWRNLLENDSNIYPPDVTRQQQPQLSMP 439


>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
 gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
          Length = 448

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 103/245 (42%), Gaps = 41/245 (16%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +L+ +  +  +L+Y+K  G+ SFW PY+  L  +          P+ + E +  YL  S 
Sbjct: 100 QLTTIQAITTFLIYQKHIGETSFWKPYLDILPNE-------YTHPVYFGEEDFLYLPHS- 151

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCV 134
            +A I  + +   + Y EL   +     L   +        FTF+ ++ A+  V  +S  
Sbjct: 152 LRANIKAKKQECIKSYEELKPFFPSLEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLY 206

Query: 135 VHLQKVSLARRF--------ALVPLGPPLLAYSSKCKAML------AAVDDAVQLVVDRP 180
           V  +  ++ R          +LVP+   LL +S   +  L         D    +  +  
Sbjct: 207 VDDKGSTVLRNLDKSGLGVTSLVPM-VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDD 265

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQ 229
           YK G+ ++       N  LL+NYGFV  DN  D +          ++E      EDP+++
Sbjct: 266 YKRGDQVLFCYRRADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFR 325

Query: 230 DKRMV 234
            ++++
Sbjct: 326 RRKVL 330


>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
          Length = 449

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 77/356 (21%), Positives = 139/356 (39%), Gaps = 26/356 (7%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
            ++    +  +  + LYL+ E+     S W P+++ L  +           L +SE ++ 
Sbjct: 88  HMIENTTIGRITAICLYLISERADSS-SHWKPWLQSLPPRFFHA-------LSYSEDDML 139

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIPTEAFTFEIFK 124
           +   S  K     + + +++EY +           F         P ++  E FT+E F+
Sbjct: 140 HFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVTREDFTYEAFE 199

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPL---GP---PLLAYSSKCKAMLAAVDDAVQLVVD 178
            A+  V +  +    +    R   VPL   GP     +  +S  K    A +        
Sbjct: 200 WAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDAQEHRCVFSAL 259

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
                   I +  G   N +LL N GF+ ++N  + ++++  L+     +   R    + 
Sbjct: 260 HKVAKNSPISIGVGMSSNMELLANRGFMMQNNGNNFVLMKFQLDRNSDMHASARESMMKQ 319

Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 298
             LS  + +V   R  E    +L  LR+  +S   E  S   +L    PV+   E     
Sbjct: 320 LNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGKALA--TPVTLENEWRAYR 374

Query: 299 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
            L     + LA YP T+ EDE +LT        R A  L+R E+K++   ++  A+
Sbjct: 375 LLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRREEKLIYESIKTWAN 429


>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
          Length = 437

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 90/354 (25%), Positives = 151/354 (42%), Gaps = 51/354 (14%)

Query: 24  LALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           LA+ L+YEK  +G KS W  +I  L R          + L +   EL  L GS       
Sbjct: 97  LAIALLYEKFVRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQ 149

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           +  E +  +Y  L     +   LF+  P     D+  E F+ E +K A   + S      
Sbjct: 150 QMEEKVAHDYARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFG 204

Query: 139 KVSLARRF--ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGP 193
            V +A++   A+VP+   +L +  + + M    D + Q   LV  + + AG  + +  GP
Sbjct: 205 DVPVAKQSFKAMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGP 262

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHA 250
             N KLL  YGFV   NP+D   VE  L  ++     +Q+K  +   NG         HA
Sbjct: 263 LSNHKLLALYGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHA 313

Query: 251 GREKEAISD-----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADY 303
               E ++D     +L   R+  +      E + + +       +S   E+  L +L   
Sbjct: 314 TNPFELVADESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYT 373

Query: 304 FKARLAGYPATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 351
            +  L  +P ++ ED+ +L      TD NL+  +R+A  + R +K +L+  + +
Sbjct: 374 LEKMLESFPTSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425


>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
 gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
          Length = 493

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 82/358 (22%), Positives = 144/358 (40%), Gaps = 56/358 (15%)

Query: 24  LALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           LAL L+ E+ + K S W PY+  L DR          +PL ++  ++  L  +      L
Sbjct: 151 LALALIMERFRAK-SDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPAL 201

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQ 138
           +  + I R+Y  +          F Q   D   + FT+++F+ A   V +      V+L 
Sbjct: 202 KLCKHIARQYGFIRR--------FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLA 253

Query: 139 KV-SLARRFALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESI 187
           +   +    AL+PL        P  A  ++C A     A ++ ++  + R     A   I
Sbjct: 254 EFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPI 313

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
            +  G + +++ L++ GFV   NP+  +     L    P Y+++  + +  G  +   F 
Sbjct: 314 FIVYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFS 373

Query: 248 VHAGREKEAISDMLPY----LRLGYVSDTS------------EMQSVISSLGPICPVSPC 291
               RE  A +   P     + L  VS  +              +  + +   + P   C
Sbjct: 374 FGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELC 433

Query: 292 --MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
              ER     LA   K  L  YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 434 ARTER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486


>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 90/416 (21%), Positives = 165/416 (39%), Gaps = 83/416 (19%)

Query: 1   MFHLLFYYAAELLTTNKLSELAC-----------LALYLMYEKKQGKKSFWLPYIRELDR 49
           +FHL  ++   ++ T++L+++             L + ++YE   G++S W  Y + L  
Sbjct: 62  LFHLP-HHIVLMVKTSRLNQILADDLKNLGPWLSLVVVMIYEYSLGEQSNWKQYFQVLPS 120

Query: 50  QRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEGIKREYNEL----- 95
           +        ++ + WSE E + L  S           + +I E+   + R + +L     
Sbjct: 121 K-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLPLVRAHPDLFPPID 173

Query: 96  -------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
                  DT       +    GSL   Y +DI          +  ++        L K  
Sbjct: 174 GVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYLTDDEE--QLPK-- 229

Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
                 +VPL   L A + +  A L   + A+ +   +P KAG+ I    G  P S LL 
Sbjct: 230 -----GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIFNDYGELPRSDLLR 284

Query: 202 NYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
            YG+V  DN     VVE  L          N ED +Y   +++ Q   ++    + +   
Sbjct: 285 RYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQLKLLDQL--EILEDGYCILRP 341

Query: 252 REKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
             ++ + D+LP     L      D+ E+Q ++S      P+    E  +   L D  +++
Sbjct: 342 SPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGAREARI---LLDAAQSK 398

Query: 308 LAGYPATLSEDEAMLTDY-------NLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           +  Y  T+ ED+ +L  +           ++ +A Q+   EK++L A L +  D +
Sbjct: 399 MGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQALLMMLQDFL 454


>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
           [Ostreococcus tauri]
          Length = 446

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 86/347 (24%), Positives = 138/347 (39%), Gaps = 53/347 (15%)

Query: 35  GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 94
           G++S + P++R L  +        ++ + W E EL  L GS   A      +  + EY+ 
Sbjct: 85  GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137

Query: 95  LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 146
           LD         F     D P EAF        TFE F  A   V S  + L+  S +   
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTS--A 185

Query: 147 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
            ++ +  P+L     A S K      A  +AV++     +K    +      +P+   L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245

Query: 202 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDM 260
            YGF+ E NP + + V   L+  D   + K  + +R+G   + + F       +    D+
Sbjct: 246 QYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKV---RGLDYDL 302

Query: 261 LPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 315
           L   R+    D SE+    S     S   +   +    +AVL +        L GY  TL
Sbjct: 303 LAAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLITSLDGYGTTL 358

Query: 316 SEDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 357
            ED + +  +N      PKKR     L+RM +K     L  +AD + 
Sbjct: 359 GEDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402


>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
           8797]
          Length = 496

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 30/205 (14%)

Query: 24  LALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TG 74
           L + L++E K  G+KS W PY++ L ++    QL     + W++ EL  L         G
Sbjct: 90  LIIALLFEWKVVGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVG 144

Query: 75  SPTKAEILERAEGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFK 124
           +    E+ E    I  K    E D+      W    +  S+   Y +D+  + +  E  K
Sbjct: 145 ADKAKEMFENVVDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--K 200

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
           +     +      + V   +   ++PL   L + + KC A L    + +++   +  K G
Sbjct: 201 EGGTDEEEDDNESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKG 258

Query: 185 ESIVVWCGPQPNSKLLINYGFVDED 209
           E I    G  PNS++L  YG+++ D
Sbjct: 259 EQIYNIYGDHPNSEILRRYGYIEPD 283


>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 476

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 95/402 (23%), Positives = 163/402 (40%), Gaps = 58/402 (14%)

Query: 10  AELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 64
           AELL  NK    A      L + +MYE    K SFW PY+  L          +++P+ W
Sbjct: 82  AELLLENKKEACALVGWMPLVVAMMYEI-TNKDSFWRPYLDLLPE-------TLDTPMFW 133

Query: 65  SETELAYLTGSPTKAEI-LERAEGIKRE----------------------YNELDTVWFM 101
           ++ +L  L G+ T + +  E AE I  E                      Y+ + +V  M
Sbjct: 134 NDDDLELLEGTSTLSHLGKEDAETIFTEQIVPFMKLHPTHFDLKVHNMALYHRVASV-IM 192

Query: 102 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 161
           A S  +    D   E    E       A   C    ++  +  + A+VPL   +L + + 
Sbjct: 193 AYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRM-EKIAMVPLAD-MLDHKTG 250

Query: 162 CK-AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
           C  A L      + +    P  AG  +    G   NS+LL  YGF+D+ N ++ + +   
Sbjct: 251 CNNARLFYGKTTLAMSCIEPCAAGHELYNTYGDLSNSELLRKYGFIDDVNEHNSVDIPVE 310

Query: 221 L---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----REKEAISDML-----PYLRLGY 268
           +     E   + ++ M A       +  FH+ A     +E EA   +L         L  
Sbjct: 311 MLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQELEASIALLFQSPKQVRALRA 370

Query: 269 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 328
           + D  E++S +++L     V+ C  R V + L  + + R   Y  T  EDE  L + +L 
Sbjct: 371 LDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRAEEYTTTREEDEERLKESDLT 424

Query: 329 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 370
            ++++A ++   E+ +L+  +    + +   P    +  PAP
Sbjct: 425 HRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEPAP 466


>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
 gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
          Length = 511

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 67/317 (21%), Positives = 125/317 (39%), Gaps = 34/317 (10%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA  L+ EK +G  S W PYI  L  +         + L ++  ++  L G+   +  L 
Sbjct: 159 LANALVVEKSRGADSIWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALR 211

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   I R+Y +L    +   S  +            +E+++ A   V +   +L    +A
Sbjct: 212 QCRMIARKYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIA 270

Query: 144 RR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
            +        AL+P          K  +   +    ++       KAG    ++ G +PN
Sbjct: 271 TKDDGNSPISALIPCWDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPN 330

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-- 254
           + LL++ GFVD +N  D + +   L+  D        +A++  +L  ++   H G  +  
Sbjct: 331 ADLLVHNGFVDPNNNKDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVL 383

Query: 255 ---EAIS-DMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYF 304
              E IS  +L ++R+  +S        S+++  +  L   C +   +E          F
Sbjct: 384 PAPEYISGQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRF 443

Query: 305 KARLAGYPATLSEDEAM 321
           K  L    ATL E + +
Sbjct: 444 KLLLGVLEATLREADEL 460


>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 88/207 (42%), Gaps = 30/207 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L LM EKK GK S W  Y+  L +           PL W   +L  L G+     ++E
Sbjct: 83  LILALMTEKKLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIE 132

Query: 84  RAEGIKR----------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 133
           +  G K           ++N +  + F+  +   + P++  +    + ++  A V+  S 
Sbjct: 133 KMNGCKAMPDRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSF 190

Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 191
            +   +       A+VP+   L   +      L   A   A++++       GE ++   
Sbjct: 191 TIGEDRFQ-----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSY 245

Query: 192 GPQPNSKLLINYGFVDED-NPYDRLVV 217
           G  PNS+LL  YGFV+ D NP+D L V
Sbjct: 246 GDLPNSELLRRYGFVETDPNPHDCLEV 272


>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
          Length = 1753

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 121/302 (40%), Gaps = 18/302 (5%)

Query: 62   LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 121
            + ++E EL    GS       +  + ++ ++ +L         L  Q+    P + FT E
Sbjct: 993  IFFAEDELQVCEGSSLHTLTTQLEQRVQDDFRQLLV------QLLSQHRDLFPLDQFTIE 1046

Query: 122  IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLVVD 178
             +K A   + S  +            + PL   +L +S    +C A      D + ++  
Sbjct: 1047 DYKWALCTIWSRAMDFAVSDTTSVRLVAPLAD-MLNHSLDVKQCHAYDPTSGD-LSILAA 1104

Query: 179  RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
            + Y+ G+ I ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K  +    
Sbjct: 1105 KDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALA 1164

Query: 239  GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAV 296
            G  S     + A  +     ++L YLR   + D +++  +   L  G    V+   E  V
Sbjct: 1165 GLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVNDGNEIQV 1221

Query: 297  LDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTAD 354
            L  L D   + L G+   L + EA L    Y        A Q+   E+ +L    +   D
Sbjct: 1222 LQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRAKKTAED 1281

Query: 355  MI 356
            M+
Sbjct: 1282 ML 1283


>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
 gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
          Length = 236

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)

Query: 36  KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 95
           + S W PYI  L    G     +++  LW +TEL+YL  SP   +  ER E I  E+ ++
Sbjct: 74  QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128

Query: 96  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 155
                +   LF +          + E F   +  V S     + +++     LV +  P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172

Query: 156 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
           L + +   A  A +      +   +  DR     + I + CG   N++L ++YGF     
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229

Query: 211 PYDRLV 216
           P +RL+
Sbjct: 230 PENRLI 235


>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
 gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
 gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
 gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
          Length = 491

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 72/355 (20%), Positives = 136/355 (38%), Gaps = 54/355 (15%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG---------RGQLAVESPLLWSE 66
           N LS    L++ L+YE  + KKSFW PY+  + R            +  L VE  +  +E
Sbjct: 92  NSLSSTQILSVCLLYEMSKEKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATE 151

Query: 67  TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
              A       +A  L +   +K ++      W  A +        +P ++         
Sbjct: 152 KATAKCQSEWKEAGSLMKELELKPKFRSFQ-AWLWASATISSRTLHVPWDS--------- 201

Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ--- 174
                  V  L        ++  P GP          L+  +   +      ++ V    
Sbjct: 202 -AGCLCPVGDLFNYDAPGDYSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYC 260

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKR 232
           L   R Y+ GE +++  G   N +LL +YGF+ E+N  D++ +  E +L +    +    
Sbjct: 261 LYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWPKDS 320

Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 292
           +   ++GKLS                 ++  LRL  +  +   +SV+  +     +S   
Sbjct: 321 LYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQISVKN 365

Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
           E  V+  +++   + L   P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 366 EILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415


>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 227

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 11/116 (9%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           K  E   L+++L+ E   G  SFW PYI  L R       +  + L +S  EL  LTGS 
Sbjct: 76  KSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVLYFSVDELQLLTGSS 128

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
              E L+    I R+Y     + F    L +  PY    + FT+++++ A  AV +
Sbjct: 129 VLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMT 180


>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 6/140 (4%)

Query: 82  LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
           LE  E +++   E    W    + F+    D+P E +T+        +  +      K  
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168

Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
              R AL+P+          CK   +A  D   +V DR YK GE + +      N  +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226

Query: 202 NYGFVDEDNPYDRLVVEAAL 221
            YGF+ +++  D + ++ A+
Sbjct: 227 EYGFIPDESLDDDVYIDDAV 246


>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 466

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 9/175 (5%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + ++E EL    G+       +  + I+ +Y +L         +    P   P   FT
Sbjct: 107 SSIFFTEDELEVCAGTSLYTITKQLKQRIEDDYKDL------IARVLGPRPDLFPLNKFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA--MLAAVDDAVQLVV 177
              +K A   V S  +  +    +    L P    +L +SS+ K   +  A    + ++ 
Sbjct: 161 IHHYKWALCTVWSRAMDFELYDGSSMRLLAPFAD-MLNHSSESKQCHVYDASTGNLSILA 219

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
            + Y+AG+ + +  G  PNS+LL  YGFV  DNP D   +  A +   P ++ K+
Sbjct: 220 GKDYEAGDQVYIHYGSIPNSRLLRLYGFVIPDNPNDSYDLVLATHPMAPFFEQKQ 274


>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
          Length = 823

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 77/333 (23%), Positives = 130/333 (39%), Gaps = 80/333 (24%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N+  E + + LYL    K  +K F  PYI  L             PL +SE EL  L G+
Sbjct: 78  NRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT-------TFSCPLSYSENELKMLKGT 129

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
               ++L   E  K    +L   +    +L  QYP    T    F+ F Q  V       
Sbjct: 130 ----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP----TRFQQFDDFYQRLVWAH---- 174

Query: 136 HLQKVSLARRFALV---PLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKA---- 183
              +V  +R F ++   P+G      P   +S+         +  V  V +R  +     
Sbjct: 175 ---QVFWSRAFLVIYPDPIGDVASLIPFADFSNH------NTETKVTYVSNRQTQTFSLQ 225

Query: 184 GESIVVWCGPQ--------PNSKLLINYGFVDEDNPYDRLVV-----EAALNTEDPQYQD 230
               V+ CG Q        PN K+L+ YGFV  +NPYD +++     E     +  + ++
Sbjct: 226 TNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDEVLLRINFKERHFEKQVEESEE 285

Query: 231 KRM--------------------VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 270
            +M                    + Q   +  V  F  +  REKE  +D+L  LR+  +S
Sbjct: 286 SKMEVENKENERMEVEEEDNEDEITQILKREGVDRFDYYLTREKELPTDLLRVLRIVNLS 345

Query: 271 --DTSEMQSVISSLGPICPVSPC-MERAVLDQL 300
             + ++    +  L  + P++     R++++Q+
Sbjct: 346 LVEANQYSQALLDLSYVSPINEIKATRSLMEQI 378


>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 440

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 97/233 (41%), Gaps = 23/233 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK   +KS W PY+  L          + S  +  E ++  L   P 
Sbjct: 104 ISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSCPVCLEQKIVNLFPEPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
           + +  E+ + ++  +      +F    LF   P D+ +  F ++ FK A+  + +  V  
Sbjct: 156 RRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQAFKWAWCTINTRTVYM 211

Query: 136 -HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
            H Q+   +R      L P   LL +  + + KA         ++           + + 
Sbjct: 212 KHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYEITTVTQCHHYNEVFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            GP  N +LL+ YGFV  DNP+  + V      ++   +DK    QR  KLS+
Sbjct: 272 YGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDK----QRPKKLSI 320


>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 403

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           A LA+ LM +   G  + W  Y   L         AV+S ++WS+ EL  L GS  +   
Sbjct: 47  ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99

Query: 82  LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 140
           + R +  KREY+ L    F A  L +  P      EA++F++F+ A+  V          
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 185
            +AR F L  L         +C A+L  +D      DA + VV+R   A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184


>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
 gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
          Length = 427

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 80/343 (23%), Positives = 139/343 (40%), Gaps = 46/343 (13%)

Query: 15  TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           +N+ +E A LA+ L  E+++G  S + P++                  L+  T       
Sbjct: 79  SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121

Query: 75  SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 130
           S    E L    AE + R Y+E D V  W  A   F+ +P     + FT   F++A   V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180

Query: 131 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
           ++    + +    R   LVPL   L+  +S     +  VDD   + VD  ++AG+ +   
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239

Query: 191 CGPQPNSKLLINYG----FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL--SVQ 244
            G   +++    +G    + +E+N  D +            + D+  + +  G    +  
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARDVITF---------TFPDEVHLKEEIGSCGPAED 290

Query: 245 VFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPIC--PVSPCMERAVLDQL 300
           +     G    A ++++  LRL   + T  SEM+     L  +   P+S   E AV D L
Sbjct: 291 IGFTRDG----ASAELMCALRLVSANATEWSEMRKPNFDLQSLKNRPLSEESEVAVYDAL 346

Query: 301 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 343
                  L  YP +  +DE +L    L   +R A ++   EK+
Sbjct: 347 FATLTDLLNSYPYSDVDDEHLLRGDRLADDERRAVKIRLREKR 389


>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 714

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 86/224 (38%), Gaps = 46/224 (20%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           A +L+ +   G KS+W PYI  L         Q   E+ LLW       L G+  KA   
Sbjct: 128 AFFLLEQLVLGDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFA 180

Query: 83  ERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQ 138
             A   K  Y        + G   L Q    +    A+T+E F+ A      +S    + 
Sbjct: 181 AEAARWKEMY--------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVL 232

Query: 139 KVSLARRFALVP------------LGP----------PLLAYSSK---CKAMLAAVDDAV 173
             +L    AL+             LG           PL+  S+     K    A    V
Sbjct: 233 DATLPADKALLQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFV 292

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
            L V  PY++G+ I    GP+ N  LL+ YGF   DNP+D +V+
Sbjct: 293 GLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336


>gi|426197159|gb|EKV47086.1| hypothetical protein AGABI2DRAFT_203917 [Agaricus bisporus var.
           bisporus H97]
          Length = 491

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 60/225 (26%), Positives = 97/225 (43%), Gaps = 33/225 (14%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L    D +++V  +P ++GE I    G  PN++LL  YG V
Sbjct: 244 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIRSGEQIWNTYGDLPNAELLRRYGHV 303

Query: 207 D--------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
           D          NP D + ++A L     +  P+      V     K  +  + +  G E 
Sbjct: 304 DFLSLPSEGHGNPGDVVEIKADLIISAVSSIPE-----AVKDDEAKERID-WWLEEGGED 357

Query: 255 EAISD--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
             I D        M+ +++L  +      ++   S     P    +E  + D L    + 
Sbjct: 358 IFILDYEYDLPPVMISFVKLLLLPQADWEKAREKS----KPPKSKLEGILYDILISTLEK 413

Query: 307 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 350
           RLA YP T+  D+A+LT  N  P  +    +VR+ EK++L+  LQ
Sbjct: 414 RLAEYPTTIETDKALLT--NDTPLNKKNAIIVRLGEKEILHGILQ 456


>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 513

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
           A+VP+   L    +   A L   D+ V +++     KAGE I    G  PNS LL  YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330

Query: 206 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
           +D            NP D       LVVEAA      + QD+  V     +    VF V 
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
            G + E   +M+   RL  +   +E +    + G +    P M+  +     D  ++RL 
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442

Query: 310 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 350
            YP ++ EDE +L D + L   +++A  +   EK++L   L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484


>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
 gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
          Length = 523

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 54/247 (21%), Positives = 99/247 (40%), Gaps = 61/247 (24%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N++ +   + ++L+ E+ +G+ SFW PY+  L    G       +PL + + EL  L G+
Sbjct: 91  NEIDDRLLVMIFLIIERARGRASFWAPYLEMLPSGFG-------TPLWFEDEELMELDGT 143

Query: 76  P----TKAEI--------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
                TKA++              L R +  + E+ E   +W  A  +F     +IP  A
Sbjct: 144 TLFEATKAQVFFPSTFVSTCMSLYLFRPDDRELEFQEF--LW--ANCIFWTRALNIPCPA 199

Query: 118 -----------------FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 160
                                +    F++  +     + VS      LVP G     ++ 
Sbjct: 200 SFVTSSSPEVAKDDGNRLVIYVLPHPFISCSA-----KDVSTIWIEGLVP-GIDFCNHTR 253

Query: 161 KCKAML---------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
           +   +          + V  ++ L+ D  +  G  +++  G + N +LL  YGFV+EDN 
Sbjct: 254 RASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINYGDKGNEELLFLYGFVEEDNS 313

Query: 212 YDRLVVE 218
            D ++V 
Sbjct: 314 NDYVMVH 320


>gi|358380690|gb|EHK18367.1| hypothetical protein TRIVIDRAFT_47382 [Trichoderma virens Gv29-8]
          Length = 479

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 49/207 (23%), Positives = 85/207 (41%), Gaps = 32/207 (15%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
           L+L+ E  +G++SFW PYI+ L +       A+  P  W E E   L G+  +  + +  
Sbjct: 91  LFLIKELLRGQESFWYPYIQALPQPEDFDDWAL--PPFWPEEEAELLEGTNVEIGLEKIR 148

Query: 86  EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-------CVVHLQ 138
           E + RE+ +   +   +    Q+   D  ++  T E+++ A+    S        +   Q
Sbjct: 149 EDLGREFRDARNLLIAS----QKDAEDDFSDHLTRELYQWAYCIFSSRSFRPSLVLSEEQ 204

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAV-------------------DDAVQLVVDR 179
           + SL    ++      L  +      M   V                     AVQL V R
Sbjct: 205 QQSLPDGVSVNDFSVLLPLFDIGNHDMTVHVRWDLAAGDEAAAGAGVRGSGAAVQLKVGR 264

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV 206
            +K G+ I     P+ N++LL+ YGF+
Sbjct: 265 EHKPGQQIFNNYSPKTNAELLLGYGFM 291


>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
 gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
          Length = 353

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 37/72 (51%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
             ++PL       S+K    +  V++  Q++ ++P K GE I    G   N  LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230

Query: 206 VDEDNPYDRLVV 217
           ++ DNP+D  + 
Sbjct: 231 IEFDNPHDHFIT 242


>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
          Length = 476

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 82/216 (37%), Gaps = 44/216 (20%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L L ++YE   G  S W PY   L           ++ + W+E ELA L  S    +I 
Sbjct: 85  SLILIMLYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIG 137

Query: 83  ---------------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDI 113
                                      ERA+   +E    + +  M   GSL   Y +D+
Sbjct: 138 KEGANEVFIEQLLPVIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDV 197

Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
              A + +   +   A +     L K        +VPL   L A   +C A L    D +
Sbjct: 198 EP-AISDKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGL 249

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 209
           ++   +P +AG+ I    GP P S LL  YG++ ++
Sbjct: 250 EMKALKPIQAGDEIFNDYGPLPRSDLLRRYGYITDN 285


>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
 gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
          Length = 442

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 79/195 (40%), Gaps = 20/195 (10%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ E  +GK SFW PY+  L          +  P  W E ++AYL  +     I E  
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170

Query: 86  EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 137
             +K E+ +    L    F   + +    Y      FT   F+ + +   +   H+    
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230

Query: 138 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWC 191
            Q V L   F+++    PLL  ++     + + D     DA  LV    Y  G  +    
Sbjct: 231 PQSVEL-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNY 286

Query: 192 GPQPNSKLLINYGFV 206
           G + NS+LL+ YGF+
Sbjct: 287 GLKTNSELLLGYGFI 301


>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
           NZE10]
          Length = 445

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 140/360 (38%), Gaps = 50/360 (13%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI- 81
            L + +  E   G +S W PY   L        ++ ++ + WS+ EL +L GS    +I 
Sbjct: 85  SLIVVMALEYLDGSQSRWKPYFDVL-------PVSFDNLMFWSDRELRHLEGSTVVGKIG 137

Query: 82  ---------------LERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
                          +ER    K   NE L  +    GS    Y +D+ T +   +   +
Sbjct: 138 KEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMAYGFDLETSSDQAKNDGE 197

Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 185
            +         L K        +VPL   L A + +  A L   DD V +   +P KAGE
Sbjct: 198 EWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFYEDDKVVMKTIKPVKAGE 250

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDN---------PYDRLVVEAALNTEDPQYQDKRMVAQ 236
            +    G  P + LL  YG++  DN         P D +   A L T+D    ++   A+
Sbjct: 251 ELYNDFGSLPRADLLRRYGYL-TDNYAQYDVVEIPADLIKERAGLRTQD--VDERWQYAE 307

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
             G L        A  E+    + L  L        +E + V +      P      + +
Sbjct: 308 EQGVLDDGYDVSRASSEEGQFPEELCVLLNLLALPRAEFEKVKNKDKIPKPDLTTNAKKL 367

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-VATQLVRMEKKMLNACLQVTADM 355
           L  +  Y   R A YP  + +   M +D +L+ ++R +A  +++ EK++L   +   +++
Sbjct: 368 LRTILVY---RYAAYPGNVDQ---MHSDVSLNDRRRKMAIVVIQGEKQVLQEAVDAISEI 421


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 11/164 (6%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + ++E EL   TG+   A   +    I+ +Y  L         L  Q+    P   FT
Sbjct: 107 SSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKALVV------RLLIQHRDLFPLSKFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVV 177
            E +K A   V S  +            L P    +L +SS  +   A   +   + ++ 
Sbjct: 161 IEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFAD-MLNHSSDVRQCHAYDPLSGNLSILA 219

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEA 219
            + YKAG+ + ++ G  PN++LL  YGF+   NP D   LV+E 
Sbjct: 220 GKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPNDNYELVLET 263


>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
          Length = 353

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 113/271 (41%), Gaps = 28/271 (10%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +L+ +  L L+L+ EK +G  S W  +I  L        ++  +P+      LA L  + 
Sbjct: 97  QLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTPVFLGSKLLARLPETM 149

Query: 77  T-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 134
             KAE   +   I+R +  L  +  +  +L         +E FT+ +F  A+ AV + C+
Sbjct: 150 CRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTWHLFVWAWTAVNTRCI 205

Query: 135 VHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVV 189
               K      F       L P L   +   KA +    +    ++V +  Y+  + + +
Sbjct: 206 --FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFEIVTNNNYEPNDQVFI 263

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 249
             G   N KLL+ YGFV  DNP D + +     T+   Y   ++ +Q+N  +      + 
Sbjct: 264 SYGSHDNKKLLLEYGFVLADNPNDVVAI-----TKGHLY---KLNSQQNDTVLYFATKLS 315

Query: 250 AGREKEAISDMLPYLRLGYVSDTSEMQSVIS 280
              EK+ ISD   +   G   +   +  V+S
Sbjct: 316 FLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346


>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1188

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 22/202 (10%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
           PL W++ EL +L G+     I ER   ++ ++  +  V      L ++ P   P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298

Query: 121 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKC------------KAMLA 167
           E +  AF    S    L+  V+         +G P+     +C             A + 
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358

Query: 168 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
              D  +V+       + GE +    GP+ N +LL+ YGF   +N  D + ++  +   D
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQLTVGN-D 417

Query: 226 PQYQDKRMVAQRNGKLSVQVFH 247
           P  + K  + + +G       H
Sbjct: 418 PDGEAKLAILRWHGLSLTHFLH 439


>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 487

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 78/339 (23%), Positives = 134/339 (39%), Gaps = 50/339 (14%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + +++ EL    GS   A        I+ +Y  L         L  Q+    P + FT
Sbjct: 125 SSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRALVV------RLLVQHQDLFPLDKFT 178

Query: 120 FEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDA 172
            E +K A   V S  +       + + L   FA       +L +SS+ +   A   +   
Sbjct: 179 IEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA------DMLNHSSEVRQCHAYDPLSGN 232

Query: 173 VQLVVDRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 227
           + ++  + Y+AG+      + ++ G  PN++LL  YGFV   NP D   +    +   P 
Sbjct: 233 LTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLYGFVMPGNPNDSYDLVLETHPMAPF 292

Query: 228 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV----SDTSEMQSVISSLG 283
           ++ KR +    G  S     +        ++D LP   LGY+    SD S++ S+     
Sbjct: 293 FEQKRKLWDLAGFDSTSTISI-------TLTDPLPKNVLGYLRIQRSDESDLASIARQR- 344

Query: 284 PICP----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---VATQ 336
            I P    +S   E  VL  L + F   L  +   L   E  L +  ++P +     A  
Sbjct: 345 -IDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE-GVYPSRGNAWAAAH 402

Query: 337 LVRMEKKMLNACLQVTADMIMLLPDVTVS-----PCPAP 370
           +   E+++L    +   DM+  +   + +     P PAP
Sbjct: 403 VSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPAPAP 441


>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 406

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 76/316 (24%), Positives = 133/316 (42%), Gaps = 28/316 (8%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL--LWSETELAYLTGS 75
           LS    LALY+++ + +G+   +       +RQ     L  E  L   +++ EL    GS
Sbjct: 71  LSVEDTLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYTLSMYFTDEELRVCAGS 124

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
                       +  +Y +L T  FM    LF       P + F+F+ +K A  ++ S  
Sbjct: 125 SLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKFSFQHYKWALSSIWSRG 177

Query: 135 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCG 192
           +    +S      L+     +L ++S  K   A       L V   R Y+ G+ + ++ G
Sbjct: 178 MDF-TISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVLACRDYEVGDQVFIYYG 236

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHA 250
              NS+LL  YGFV  DNP D   +    ++  P Y+ K    QR  KL+   ++  +  
Sbjct: 237 NVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMAPLYEQK----QRLWKLAGLDEISTIPL 292

Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLG--PICPVSPCMERAVLDQLADYFKARL 308
             +      +L YLR+  + D S++ ++   +       +S   E  +L  L+   +A L
Sbjct: 293 SLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIATESYTKISDENESQILLFLSQSIEALL 351

Query: 309 AGYPATLSEDEAMLTD 324
            G+  +L + E  L +
Sbjct: 352 EGFEISLEKLETQLAE 367


>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 452

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 47/198 (23%), Positives = 81/198 (40%), Gaps = 27/198 (13%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L + + +AL L+YE   G KS +  YI+ L R    GQ   + PL WS  E   L  S T
Sbjct: 137 LGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHPLFWSTAEQGVLAKSST 193

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           +       + +  +Y      W  + +L +     +  ++F    F+ A   V S     
Sbjct: 194 RNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLSDFEWAVAVVLSRSFFA 247

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDAVQLVVDRPYKAGESIV 188
           +      R A      PLL  +++ +          +       ++++ DR    G+ IV
Sbjct: 248 EN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKGLKVIADRDTDKGQEIV 298

Query: 189 VWCGPQPNSKLLINYGFV 206
           +  GP+   + L ++GFV
Sbjct: 299 ISYGPKSGIEFLEDHGFV 316


>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 446

 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 47/198 (23%), Positives = 81/198 (40%), Gaps = 27/198 (13%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L + + +AL L+YE   G KS +  YI+ L R    GQ   + PL WS  E   L  S T
Sbjct: 131 LGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHPLFWSTAEQGVLAKSST 187

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           +       + +  +Y      W  + +L +     +  ++F    F+ A   V S     
Sbjct: 188 RNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLSDFEWAVAVVLSRSFFA 241

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDAVQLVVDRPYKAGESIV 188
           +      R A      PLL  +++ +          +       ++++ DR    G+ IV
Sbjct: 242 EN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKGLKVIADRDTDKGQEIV 292

Query: 189 VWCGPQPNSKLLINYGFV 206
           +  GP+   + L ++GFV
Sbjct: 293 ISYGPKSGIEFLEDHGFV 310


>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
 gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
          Length = 529

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 68/274 (24%), Positives = 118/274 (43%), Gaps = 49/274 (17%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L+    LAL L+ E+++G++S +  Y++ L          + SPL W+  EL+ + G+  
Sbjct: 136 LARWTTLALTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT-- 186

Query: 78  KAEILERAEG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQ 131
             ++L+ A G    ++  Y  L T       +F ++   +D+   AF+ + F+ AF  ++
Sbjct: 187 --QLLDNAAGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILR 237

Query: 132 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQL 175
           S              ALVP G  L+ +SS  +A       +  AV         DD V  
Sbjct: 238 S---RTMAPCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSA 293

Query: 176 VV--DRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
            V  DR     E + V   P+  ++   +++GFVD   P     +  ++  +DP   DK 
Sbjct: 294 RVECDRALNVNEPLYVNYNPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKL 353

Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 266
            V    G      F + A  + +   D+  +LRL
Sbjct: 354 DVLDVCGLGETPTFTLRAYSDPD--PDLRTFLRL 385


>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
          Length = 417

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 74/309 (23%), Positives = 133/309 (43%), Gaps = 37/309 (11%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 84
           A+YLMY     K S+W  Y   L +Q         + + + + E+  L  S  +     R
Sbjct: 93  AVYLMY-SLLNKDSYWHQYTSILPKQ-------FTTSIYFDQDEMKELQLSKLRYFTESR 144

Query: 85  AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
             GI+R YN    ++    SL  ++      + +TFE+FK A   + S    L     + 
Sbjct: 145 LSGIERHYN---VIFKKLSSLNDEFK----KKEYTFELFKWALSCIWSRAFSLS----SD 193

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQP---NSKL 199
              +VPL     A   K K+ +     A QL+    +  + GE +    G      N+++
Sbjct: 194 DGGMVPLADMFNAIE-KAKSKVRPDSRADQLIYYASKDIERGEQVFTPYGVYKTIGNAQM 252

Query: 200 LINYGFVDEDNPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 257
           L++YGF   D+P +   ++  L+  ++D  Y D ++       + V+ F++   +  +  
Sbjct: 253 LMDYGFA-FDDPSEGDTIQLTLDNFSDDELYIDTKIDLLEQLDI-VREFNL---KRNQLP 307

Query: 258 SDMLPYLRLGYVSDTSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 313
            ++L Y R+  + + +E+Q       +      PVS   E+  L  L++Y    L  Y  
Sbjct: 308 QELLIYARVKNLKE-NELQLAKEHYRNDDNRNKPVSRRNEKTALRYLSNYLSRYLDSYET 366

Query: 314 TLSEDEAML 322
           TLS+D  +L
Sbjct: 367 TLSDDLELL 375


>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 724

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 63/239 (26%), Positives = 92/239 (38%), Gaps = 44/239 (18%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           +AL L+YE++ G  S   PY+  L        +   +PL WSE +L  L       E+  
Sbjct: 152 MALNLLYERQLGPASKLAPYVAALP-------VDFSTPLSWSEAQLQALCYPQLIREVAT 204

Query: 84  RAEGIKREYNEL------------DTVWFMAGSLFQQY--PYDIPTEAFTFEIFKQAFVA 129
           + EG+KR + EL            D +W +     + +  PY  PT     + F      
Sbjct: 205 QREGLKRLHAELAVSTPGTPITEQDLIWALQAVRSRAFSGPYAGPTWRSRLKTFGALGAL 264

Query: 130 VQSCVVHLQKVSLARRFALVPL-GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKA 183
             + +     ++ A   AL  L    +L+   K  AM   VD       VQ  V+  Y A
Sbjct: 265 AAASITVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFA 324

Query: 184 -------------GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV----EAALNTED 225
                        GE + +  G Q N  LL  YGFV+   P+D   +     AAL   D
Sbjct: 325 DRFSVRCQSYFSKGEQVFISYGKQSNDSLLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383


>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 595

 Score = 46.6 bits (109), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 56/231 (24%), Positives = 105/231 (45%), Gaps = 27/231 (11%)

Query: 24  LALYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           + LYL+ +  Q  +      + W  YIR L R        +  P +W+E E   L G+  
Sbjct: 201 IMLYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSL 253

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           +A +  +   + +E++ L  V        + +P+         E+  + +V V +     
Sbjct: 254 EAALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RS 305

Query: 138 QKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVVDR--PYKAGESIVVWCG- 192
           + + L R   A+VP G  ++ +SSK  A     D D V L++    P ++GE + +  G 
Sbjct: 306 RCLELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGD 364

Query: 193 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
            +P S++L +YGF+D +N  D+L +      +DP  + K  ++     L++
Sbjct: 365 AKPASEMLFSYGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415


>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
           24927]
          Length = 611

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 130/313 (41%), Gaps = 43/313 (13%)

Query: 22  ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           AC  L    L+ E+ Q    FW PYIR L +         ++PL +++ E+  L G+   
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159

Query: 79  A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 130
           A ++L R +    E+       F+ G      ++Y +D+   A   +T   F    V + 
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217

Query: 131 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 188
             S +     +S    F  L+PL   +L +    K +      +  L+       G  + 
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276

Query: 189 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 247
              GP+ N +LL+ YGFV  +NP D L ++  ++   P+ Q  ++  QR  K +  +VFH
Sbjct: 277 NNYGPKGNEELLMGYGFVIPENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFH 333

Query: 248 VHAGREKEAISDMLPYLRLGY-----------VSDTSEMQSVISSLGPICPVSPCMERAV 296
           +    +    +  +P L   +           V++ +E+  + +      P+S   E AV
Sbjct: 334 LTKSADSGQKTSTVPALESDWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAV 393

Query: 297 LDQLADYFKARLA 309
              L    K +LA
Sbjct: 394 ALGLKAAIKQKLA 406


>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 481

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 111/277 (40%), Gaps = 24/277 (8%)

Query: 62  LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 121
           + +S+ E+    GS       +    I  +Y +L T   M      ++    P   F  E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168

Query: 122 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 178
            +K A   V S  +    VS      L+     +L +SS   +C A      D + ++  
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
           + Y  G+ + ++ GP PN++LL  YGFV  +NP+D   +    +   P Y+ K  + +  
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERLWKLA 286

Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPICPVSPCM 292
           G  +     + A       +D LP   L Y+      +S++ ++      G    +S   
Sbjct: 287 GLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADEKISDDS 339

Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 329
           E  +L  L D   A L G+   L    A L   +++P
Sbjct: 340 ETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376


>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
          Length = 496

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 88/205 (42%), Gaps = 20/205 (9%)

Query: 24  LALYLMYEKKQ-GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------G 74
           L + ++YE +   ++S W PY +  ++      L     + W + EL  L         G
Sbjct: 90  LIICILYEMEVLQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIG 144

Query: 75  SPTKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAV 130
                E+ ER  + IK+   E  T      S F  + Y    I + +F  E+   +    
Sbjct: 145 KKEAKEMHERIIKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNEN 204

Query: 131 QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
           +      +++   R   +++PL   L A +SKC A L    + +++V  R  +  E +  
Sbjct: 205 EEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYN 264

Query: 190 WCGPQPNSKLLINYGFVDED-NPYD 213
             G  PNS+LL  YG+V+ D + YD
Sbjct: 265 IYGEHPNSELLRRYGYVEWDGSKYD 289


>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
          Length = 403

 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 14/151 (9%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
           P+ W   EL +L GS    +I ER   IK +Y  +  +W          P  I  +  T 
Sbjct: 141 PIFWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTL 188

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDR 179
           E FK A + V S    +  V+ AR  A+VP    L  +  +  K        A  +   +
Sbjct: 189 EEFKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQ 247

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
               G  I    G + N + L+NYGF  EDN
Sbjct: 248 KISVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278


>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
 gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 36/72 (50%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
             ++PL       S+K    +  V+   Q++ ++P K GE I    G   N  LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230

Query: 206 VDEDNPYDRLVV 217
           ++ DNP+D  + 
Sbjct: 231 IEFDNPHDHFIT 242


>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 503

 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 151/359 (42%), Gaps = 48/359 (13%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           +AL+L++E+ +   + W  ++  +         A + PL WSE EL+ L G+   + +  
Sbjct: 126 IALFLLHERSK-PDAAWQGFLDSI-------PAAPDVPLFWSEEELSQLEGTQLLSSVQG 177

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
             +  + +Y EL+        LF  +    P ++   + F  A   V+S V         
Sbjct: 178 YRQFFEAKYAELEE------QLFAPHREAFPPKSHQLDDFLWAVATVRSRV---HSPLDG 228

Query: 144 RRFALVPLGPPLLAY----SSKCKAMLAA--VDDAVQLVVD--RPYKAGESIVVWCGP-- 193
              ALVPL   L+ +     ++ +  LA      A  LVV+  R Y  GE + +  G   
Sbjct: 229 EDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQALVVEAQRDYAEGEVVTMDFGAPL 287

Query: 194 ------QPNSKLLINYGFVDEDNPY-DRLVVEA------ALNTEDPQYQDKRMVAQRNGK 240
                 + +S++L++YG +D D P  D  VV+       AL  +D  Y DK  + + NG 
Sbjct: 288 TEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFILSLALPEDDKYYDDKADILELNGL 347

Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG---PICPVSPCMERAVL 297
                F + A  E          L      D   ++ +  +      + PVS   ERAV 
Sbjct: 348 SEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEPLFRNEAWGHMLAPVSEANERAVY 407

Query: 298 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ-LVRM-EKKMLNACLQVTAD 354
           + + +  +A L GY  ++ +D   L D    P  R+    LVR+ EK+ L+A L    D
Sbjct: 408 ESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRLEKAILVRLGEKETLDATLAFFED 464


>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 442

 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 94/207 (45%), Gaps = 19/207 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ E+   ++S W PY+  L         +   P+ W E+E+  L  +P 
Sbjct: 104 VSPLLALCTFLIAERVARERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
           + + LE+   +K  + E    W    SL   +  +I T+ +T+   + A+  V +  V  
Sbjct: 156 RQKALEQQTEVKELHTE---SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYM 211

Query: 136 -HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVW 190
            H ++  L+ +  +  + P   LL +S   +   A  ++    ++  +   +  +   + 
Sbjct: 212 KHPRRHGLSAQQDVYAMAPYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
            GP  N +LL+ YGF+  +NP+  + V
Sbjct: 272 YGPHDNQRLLLEYGFIAANNPHRSVYV 298


>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
 gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
          Length = 484

 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    +++++   +P   GE +    G  PNS+LL  YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272

Query: 207 D-EDNPYD 213
           + E + YD
Sbjct: 273 EWEGSKYD 280


>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
 gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
          Length = 486

 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 118
           S   W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F 
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369

Query: 119 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 171
             + E  K  +  V S     + +++     LV +  P+L +     +S  K     + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
              +  DR Y   + I +  G   N++L ++YGF   +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464


>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 497

 Score = 46.2 bits (108), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 27/202 (13%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            L L +++E  +G  S W PY+  L  +        E+P+ WS  ELA L  SP  A + 
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155

Query: 83  ERAEG---IKRE----YNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 125
            RAEG   I+        E + ++F AG           L  +    I   AF  E    
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215

Query: 126 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
           A     +      +    R    +VP+   +L   ++  A +   ++A+  +  R   AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274

Query: 185 ESIVVWCGPQPNSKLLINYGFV 206
           E I+ + GP PN +L   YG+ 
Sbjct: 275 EEILNYYGPLPNGQLCRRYGYT 296


>gi|219125216|ref|XP_002182882.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405676|gb|EEC45618.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 488

 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 63/161 (39%), Gaps = 17/161 (10%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
           P  WSE+EL  L GSP    + +  EGI+++YN +   W       + +    P+    F
Sbjct: 138 PRQWSESELKALRGSPLLIRVRKAREGIQKDYNLIREAW------KEIHSSHSPSTTANF 191

Query: 121 ---EIFKQAFVAVQSCVVH-----LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV--- 169
              E F  A   V S           K        ++PL      +  +C     +    
Sbjct: 192 PKLEEFSSAMAVVSSRAFSGMAGFETKGGAVNDTTMIPLLDLCNHHRGRCVTKNVSYRFK 251

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
           D  V +        G+++ +  G Q N++L +NYGF   DN
Sbjct: 252 DGTVMVKAVTDIAIGDTLKITYGAQGNAQLFLNYGFCVADN 292


>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
 gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
          Length = 480

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 51/214 (23%), Positives = 89/214 (41%), Gaps = 19/214 (8%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS------ETELAY 71
           LS    L+ Y+ +EK++G  SFW P+I  L  +     LA   PL+W         EL  
Sbjct: 143 LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA---PLVWKVLKVDHYEELLK 198

Query: 72  LTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFT----FEIFKQA 126
           L  + TK  + +  +  + +YN + D +      +      +  T+A       E++  +
Sbjct: 199 LLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNERSNDLTDAIRHLVPIELYLWS 258

Query: 127 FVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 183
           ++ + S  +++   Q  + A  F + P     L +S   +  L       Q+     Y  
Sbjct: 259 WMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQCGLKIDGTGFQVYTTCSYNP 317

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
            E + +  GP  N  LL  YGF   +N ++ L V
Sbjct: 318 DEQLFLSYGPHSNEFLLCEYGFTLPENKWNDLDV 351


>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 640

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 81/199 (40%), Gaps = 24/199 (12%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAE 80
            LA+ L+YE  +  KS W  +I  L +        ++S + WS  E   L   P   K +
Sbjct: 252 ALAMALLYESYE-PKSMWREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQ 303

Query: 81  ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQK 139
           ILER   +++ YN           L   +P+      +++E+FK A++ V S  +     
Sbjct: 304 ILER--HLQQLYNA------TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTG 355

Query: 140 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQ 194
                +  L PL   L     +    L    + V      L   R  K GE +V   G  
Sbjct: 356 PDTLPQIMLAPLVDLLHHDPVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGEL 415

Query: 195 PNSKLLINYGFVDEDNPYD 213
           PN +LL+ +G     NPY+
Sbjct: 416 PNHQLLLRFGLAMPRNPYE 434


>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 753

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 81/189 (42%), Gaps = 21/189 (11%)

Query: 24  LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           L +Y+++++K  G+ SF+ PY + L       +     P+ W+  EL  L GS    +I 
Sbjct: 177 LMIYILWDRKVNGETSFFAPYYKILP------ETLRNMPIFWTREELDALEGSYLLLQIA 230

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           +RAE IK +Y  + ++    G +             T E F+ A + V S    L  ++ 
Sbjct: 231 DRAEAIKEDYISICSIAPEFGDI------------ATLEEFQWARMIVCSRNFGLL-ING 277

Query: 143 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
            R  ALVP    L     +  K   +    +  +   +    GE +    G + N + L+
Sbjct: 278 HRTSALVPHADMLNHLRPRETKWTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLL 337

Query: 202 NYGFVDEDN 210
           NYGF  E N
Sbjct: 338 NYGFCVERN 346


>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
          Length = 482

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 36/74 (48%)

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           R AL+PL   +  YS     M    D A+ L V   +  G+      GP  N  LL  YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274

Query: 205 FVDEDNPYDRLVVE 218
           FV++DNP D  V+E
Sbjct: 275 FVEQDNPSDTYVLE 288


>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
          Length = 255

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 92/212 (43%), Gaps = 30/212 (14%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +LS    L ++L+ E+ +G  SFW PY+  L         +    L W+  E+  L    
Sbjct: 50  RLSPTETLVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF- 101

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
           TK    +     +  +N L   +     L +Q P      AFT+++FK A+ +V +  V+
Sbjct: 102 TKRRACDLRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVY 157

Query: 137 L---QKVSLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKA 183
           +   Q   L+     + AL P    LL ++   +   A  DD+        L   +PY  
Sbjct: 158 MSQPQNSVLSPDEEDKSALAPFL-DLLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY-- 213

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
            + + +  GP  N KLL+ YGF    NP++ +
Sbjct: 214 -DQVFINYGPHSNEKLLLEYGFTLPCNPHNNI 244


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 105/270 (38%), Gaps = 21/270 (7%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + ++E EL    G+       +  + I+ ++  L         LF Q+    P + F+
Sbjct: 116 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 169

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 177
            E +K A   V S  +  Q         L P    +L +SS+ K   +       + ++ 
Sbjct: 170 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 228

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
            + Y+ G+ + +  G  PNS+LL  YGFV   NP D   +  + + + P Y+ K  +   
Sbjct: 229 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 288

Query: 238 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCM 292
            G  S     +        ++D LP     YLR+     +      + +      VS   
Sbjct: 289 AGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSDSN 341

Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAML 322
           E  +L  L + F   L G+   L + E  L
Sbjct: 342 EVEILQFLVESFGHLLGGFGTPLEKLEEQL 371


>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
 gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
          Length = 607

 Score = 45.8 bits (107), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 93/224 (41%), Gaps = 40/224 (17%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +L+  A LAL++++E     +S    Y+  L    G+   +V  PLLW+ T++A L G+P
Sbjct: 145 ELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQVATLRGTP 201

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
           T   +L RA+ +   +  L                 +P E F +     A  +V S    
Sbjct: 202 THGRVLRRAKFVSDAHAAL---------FGSGGGGGVPLEKFAW-----ALSSVLSRAAS 247

Query: 137 LQKVSLARRFALVPLGPPLLAYSS-KCKAMLAAV--------------DDAVQLVVDRPY 181
             ++     +A +P G  LL +        L+AV              D  V  V D P 
Sbjct: 248 GDRMP----YAFLP-GVDLLNHGGVDANCELSAVKLAPGGNEENVTWGDVEVTCVKDTP- 301

Query: 182 KAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTE 224
            AGE + +  G +  N +LL  YGF    N +DR  +E  L  +
Sbjct: 302 -AGEQLTISYGDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344


>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
 gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
          Length = 504

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 82/369 (22%), Positives = 143/369 (38%), Gaps = 67/369 (18%)

Query: 24  LALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           LAL L+ E+ + K S W PY+  L DR          +PL ++  ++  L  +      L
Sbjct: 151 LALALIMERFRAK-SDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPAL 201

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCV 134
           +  + I R+Y  +          F Q   D   + FT+++F+            AV + +
Sbjct: 202 KLCKHIARQYGFIRR--------FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVM 253

Query: 135 VHLQKV--------SLARRFALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVD 178
               KV         +    AL+PL        P  A  ++C A     A ++ ++  + 
Sbjct: 254 TRQNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLT 313

Query: 179 RPYK--AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
           R     A   I +  G + +++ L++ GFV   NP+  +     L    P Y+++  + +
Sbjct: 314 REVSDIASVPIFIVYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLE 373

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPY----LRLGYVSDTS------------EMQSVIS 280
             G  +   F     RE  A +   P     + L  VS  +              +  + 
Sbjct: 374 LLGMPTTGTFSFGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLR 433

Query: 281 SLGPICPVSPC--MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 338
           +   + P   C   ER     LA   K  L  YP T+ +DEA+L   N H  +R+  +  
Sbjct: 434 TYQALLPAELCARTER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYR 488

Query: 339 RMEKKMLNA 347
             EK++L +
Sbjct: 489 LGEKQILRS 497


>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 691

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 15/193 (7%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
            +L+ +  +G + FW PYI  L +    G L   +PL +   +L +L G+ + A   E+ 
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168

Query: 86  EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 140
           E + +E     + EL    F      ++Y +++   A T  + +     V + VV   ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 200
                  L+P    +L +    K    A +  V  VV     AGE +    GP+ N +L+
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLM 284

Query: 201 INYGFVDEDNPYD 213
           +NYGF  ++NP D
Sbjct: 285 MNYGFCLQNNPCD 297


>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
 gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
          Length = 588

 Score = 45.8 bits (107), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 66/271 (24%), Positives = 112/271 (41%), Gaps = 46/271 (16%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           +AL L+ E+++G    +  Y+  L +        ++SPL WS  EL  + G+    ++L+
Sbjct: 128 VALLLLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLD 176

Query: 84  RAEG----IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHL 137
            A G    ++  Y EL         +F +Y      + AF    F+ AF  ++S  +  L
Sbjct: 177 NAAGYDAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPL 230

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYK 182
              ++A    LVP G  L+ +SS   A                      A  +  DR Y 
Sbjct: 231 DGANVA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYD 285

Query: 183 AGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 241
            G  I V   P+  +SK  ++YGF+D  NP     +  ++  +D    DK  V +  G  
Sbjct: 286 EGAEIFVNYDPEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLP 345

Query: 242 SVQVFHVHAGREKEAISDMLPYLRLGYVSDT 272
               F +    + +   ++  +LRL +  DT
Sbjct: 346 EAPTFTLRPYSDPD--RELRTFLRLLHCKDT 374


>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
          Length = 420

 Score = 45.8 bits (107), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 27/212 (12%)

Query: 42  PYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE---- 94
           PYIR L   D+ R        +PL +S+TEL  L GS      L+R    + E+ +    
Sbjct: 117 PYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSEWEQCQKT 168

Query: 95  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 154
           + TV    G  F    Y   +   +   F    ++    +V     S    + ++  G  
Sbjct: 169 IATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYPVLLPGID 223

Query: 155 LLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 208
            L +S     S   ++  + D + + LV+ +   AG  ++   GP+PN++L++ YGF   
Sbjct: 224 SLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELILGYGFSLP 283

Query: 209 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
           +NP D +V++   N+     Q K  V  RN +
Sbjct: 284 ENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 63/271 (23%), Positives = 111/271 (40%), Gaps = 23/271 (8%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + ++E EL    G+       +  + I+ ++  L         LF Q+    P + F+
Sbjct: 107 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 177
            E +K A   V S  +  Q         L P    +L +SS+ K   +       + ++ 
Sbjct: 161 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 219

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
            + Y+ G+ + +  G  PNS+LL  YGFV   NP D   +  + + + P Y+ K  +   
Sbjct: 220 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 279

Query: 238 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSLGPICPVSPC 291
            G  S     +        ++D LP     YLR+   +D S++ ++ + +      VS  
Sbjct: 280 AGLDSTSTIPL-------TLTDPLPKNVLRYLRI-QRADASDLAAMALQNAKADEKVSDS 331

Query: 292 MERAVLDQLADYFKARLAGYPATLSEDEAML 322
            E  +L  L + F   L G+   L + E  L
Sbjct: 332 NEVEILQFLVESFGHLLGGFGTPLEKLEEQL 362


>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
 gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
          Length = 593

 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 73/351 (20%), Positives = 134/351 (38%), Gaps = 44/351 (12%)

Query: 28  LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 87
           L+ EK +G  S W P+I  L  +         + L ++  ++  L G+   +  +     
Sbjct: 251 LIMEKLRGADSPWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRV 303

Query: 88  IKREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVH 136
           I R Y  +         D+V     +LF  Y   Y++   A +    +Q  V  Q     
Sbjct: 304 IARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSD 363

Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
              + ++   AL+P        S K  +        ++      YK+GE   ++ G + N
Sbjct: 364 AANLPIS---ALIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSN 420

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAG 251
           +  L++ GFVD  NP D + +   L+  D   + + ++      +R  +L V     H  
Sbjct: 421 ADRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHIS 480

Query: 252 REKEAISDMLPYLRLGYVSDT------SEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
            E      +L ++R+  +S        S+++  +  L   C +   +E      L    K
Sbjct: 481 GE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLK 534

Query: 306 ARLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
             L    ATL E       EA+    +      +  Q  R+E+++L+  LQ
Sbjct: 535 LLLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585


>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 622

 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
            A+VPL   L A +    A L    D +++   R  K GE I    G  PNS LL  YG 
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404

Query: 206 VDEDNPYDRLVVEAALNT 223
           VD+ N +D  VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420


>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
 gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
          Length = 418

 Score = 45.4 bits (106), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 22/172 (12%)

Query: 64  WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 120
           W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F   + 
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 175
           E  K  +  V S     + +++     LV +  P+L +     +S  K     + +   +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311

Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 227
             DR Y   + I +  G   N++L ++YGF   +NPYD    E  L T+ P+
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD----ETELLTQFPE 359


>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
          Length = 353

 Score = 45.4 bits (106), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
           ++ L+R FA    G   L +S              Q+V ++ +K G+S+ +  G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241

Query: 199 LLINYGFVDEDNPYDRLVV 217
           LL++YGF+  DNP D  V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260


>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
          Length = 1233

 Score = 45.4 bits (106), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 77/334 (23%), Positives = 127/334 (38%), Gaps = 86/334 (25%)

Query: 19   SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPT 77
            SEL  LAL+LM E+ +                      A  +P+LW + E    L GSP 
Sbjct: 913  SELVGLALWLMQERAK----------------------ATLTPILWPDEERQQLLRGSPV 950

Query: 78   KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-H 136
              E   R + +++E+ ++  +        + YP  +  E        QAF+   S V+ H
Sbjct: 951  LEEARTREQALRQEWQDIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVLAH 1002

Query: 137  LQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
               +  A+ FAL+PL   L    S   A+L      +AV +V  R    G+ + ++C   
Sbjct: 1003 AAYLPKAQCFALLPLVGGLCRTGSSSGALLDYDLEREAVTVVAQR--TPGQEVALYC--- 1057

Query: 195  PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG---KLSVQVFHVHAG 251
                                L + A+L   D  Y  KR + +  G   K    +F     
Sbjct: 1058 --------------------LFMAASLVAADRLYTTKREILEELGLGVKAEFPIFE---- 1093

Query: 252  REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 311
             ++ A   ++ +          E  ++I         SP  E  +L  L    + R+  Y
Sbjct: 1094 -DRLATQQLINF----------EQDTII---------SPENEYEILQLLMGDLRDRIQAY 1133

Query: 312  PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
                 +D   L   +L P++R+A QL   EK++L
Sbjct: 1134 ATEFDDDIKDLQRTDLTPRQRLAAQLRLGEKRIL 1167


>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
          Length = 355

 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 80/202 (39%), Gaps = 21/202 (10%)

Query: 18  LSELACLALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 74
           LSE   + +Y    MY  K  K+SFW PY   +          + SP+ WSE EL  L G
Sbjct: 129 LSEKNKVLMYFLLQMYNPK--KESFWKPYFDIM-------PTNLTSPIFWSEDELQELAG 179

Query: 75  SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
           S          + ++  Y+EL         +F+        +AFT + +  A     S V
Sbjct: 180 SEVSNMARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRV 233

Query: 135 VHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWC 191
           + L + +        +PL   +    S+ K  +       A  +  DR    G  +    
Sbjct: 234 IQLNRQTGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESY 293

Query: 192 GPQPNSKLLINYGFVDEDNPYD 213
           G + N + L+  GFV EDNP D
Sbjct: 294 GNKSNYEYLLYNGFVMEDNPND 315


>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
          Length = 452

 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 77/192 (40%), Gaps = 36/192 (18%)

Query: 58  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 117
           V+ PL W + +   L G       ++      R Y+++    F A + F      +  EA
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHG---ARFYSQVYKHLFGANNQF------VNAEA 193

Query: 118 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY---SSKCKAMLAAVDDAVQ 174
           F + I         S ++        + FAL+P           S  C+  L + D+ VQ
Sbjct: 194 FFWAI---------SILMSRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQ 243

Query: 175 ---------LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL-- 221
                    +   R Y+ GE + +  G   N +LL NYGF   +NPYD   L + AAL  
Sbjct: 244 DFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQ 303

Query: 222 -NTEDPQYQDKR 232
            N  DP +  KR
Sbjct: 304 PNEADPAFAQKR 315


>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 719

 Score = 45.1 bits (105), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 60/248 (24%), Positives = 107/248 (43%), Gaps = 24/248 (9%)

Query: 7   YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
           Y     L T  L     LALY+M E  + ++SF+ PY+R L           ES LL  +
Sbjct: 83  YGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQD 142

Query: 67  TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
            +L   T + ++ ++L     + RE     T+ F++ S    YP     + +TFE+F  A
Sbjct: 143 HKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDFA 188

Query: 127 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKA 183
           +  +Q+     +  S     ALVP    L   + + K       +    +    +  Y  
Sbjct: 189 WRTIQARAFGKRLKS----SALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNNRYPR 244

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGK 240
              ++   G + N  LL++YGF   DN +D   V  +L   + + P  + ++   + +G+
Sbjct: 245 NSEVLNSYGRRANDNLLLDYGFAMLDNEWDAAEVICSLPPSHDQSPLDRRRKACLRASGQ 304

Query: 241 LSVQVFHV 248
            +V++  V
Sbjct: 305 HTVRILRV 312


>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
 gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
          Length = 525

 Score = 45.1 bits (105), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 90/444 (20%), Positives = 162/444 (36%), Gaps = 128/444 (28%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           + +LT   +      A+ L+YE   G+KS W  YI  L        L V+ P+LW +   
Sbjct: 111 SNILTKYTMERNIATAIALIYEASIGEKSKWYGYISSL-------PLKVDIPILWDKESQ 163

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
             L G+  +  I +    I   Y ++     +   L + +P     E F+ EIF      
Sbjct: 164 QLLNGTVMEDVIQDDNILINHAYADI-----VESLLIKNHP-----EYFSKEIFSFENFK 213

Query: 130 VQSCVVHLQKVSLARRF--ALVPLG----------------------------------- 152
           + + +V  +   +      +LVPL                                    
Sbjct: 214 IANSIVSSRAFCIDSYHGDSLVPLADIFNHKTGRENVHIESNGDVCNKCGSIKTCKHRKV 273

Query: 153 PPLLAYSSK---------------------------CKAMLAAVDDAVQLVVDRPYKAGE 185
            PL+  S+K                           C  +    D+ + + V +  KA +
Sbjct: 274 TPLITKSAKSYKKLTNKKKMELIEKQKQQQINDEENCGDIAEEDDEHLYIKVVKAVKANQ 333

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----------NTEDPQYQDKRM- 233
            +    G   N+ LL  YGF++ DNP D L VE +L           N  D     KR+ 
Sbjct: 334 EVYNTYGDHSNATLLSKYGFIEMDNPCDNLPVEKSLVDTNLISLCKENGFDSNELSKRIS 393

Query: 234 -------VAQRNGKLSVQVFHVHAGREKEA-----------ISDMLPYLRLGYVSDTSEM 275
                  +  RN   S+++    +GR  +A           +S+   +L++       ++
Sbjct: 394 FYASLFDIDSRNTH-SIEI----SGRLDDALVCSVGIALAPLSEFEGWLKMS----EHKL 444

Query: 276 QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY--NLHPKK 331
           +     L    I   +  +++A++  L +    +L+ YP TL +D+  L +   N   +K
Sbjct: 445 EKYFEKLEAEDIVKQNAQVKKAIVQILNN----KLSNYPTTLEQDQNKLKELKENEENRK 500

Query: 332 RVATQLVRMEKKMLNACLQVTADM 355
            ++T L   EKK++   ++   D+
Sbjct: 501 IISTSLNICEKKLIYKSIKYYEDL 524


>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 495

 Score = 45.1 bits (105), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 88/383 (22%), Positives = 153/383 (39%), Gaps = 53/383 (13%)

Query: 12  LLTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           +L   +LS+   +A +L+YE  +  + S WLPY+  L       QL     + +   E+ 
Sbjct: 113 ILDDFRLSDAEAMAFWLIYELTRPERASPWLPYLESL--PASIKQLT----MFYDPFEMK 166

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQ 125
            L  SP       R   ++ ++ +          + +  P      + P E  T + F  
Sbjct: 167 RLQASPVAEFTSRRTVKMRNKFGKYRE------QISKHRPAHLAEIEFPVELITVDDFLW 220

Query: 126 AFVAVQSC------VVHLQKVSLARRFALVPLG-----PPLLAYSSKCKAMLAAVDDAVQ 174
           A +AVQ        V H       R   LVPL       P    + +C   L +     +
Sbjct: 221 A-MAVQFTRLITVQVKHPADGEWERTKCLVPLADLLNTAPADQINVECATNLDSTH--FE 277

Query: 175 LVVDRPYKAGESIVV-WCGPQ--PNSKLLINYGFVDEDNPYDRLVV------EAALNTED 225
               RP   G+ ++  + G +   N +L+++YG    +NP D + +      E A+  + 
Sbjct: 278 CATIRPVAEGQELLTPYGGAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDS 337

Query: 226 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSL 282
                  M   R  +L + V        KE    +L + R+ YVS   D S+++ V+  +
Sbjct: 338 KMRLLMAMSLDRFDRLQLPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELM 392

Query: 283 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-R 339
                ++P  ER  L+ L       +  Y  T+ EDE ML + +    P       +V R
Sbjct: 393 KEHRAINPSNERRALELLLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLR 452

Query: 340 M-EKKMLNACLQVTADMIMLLPD 361
           + EK++L++  Q+    I  LP+
Sbjct: 453 LGEKRILSSLWQLLDSAIEALPE 475


>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
          Length = 502

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 51/275 (18%)

Query: 32  KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 91
           +K    S WLPY+  L +          +PL +S  EL  L+ SP   E L     + R+
Sbjct: 167 QKLSPDSSWLPYLDALPQ-------TFSTPLYFSALELRKLSPSPAYEESLIMYRNVARQ 219

Query: 92  YNELDTVWFMAG----------------------SLFQQYPYDIPTEAFTFEIFKQAFVA 129
           +     V+F+A                        LF   P+ +    FTF++++ A   
Sbjct: 220 F-----VYFLAAVQRSERSRSAKKDKNHAAVGMEPLFLNAPFTVSN--FTFDLYRWAVAC 272

Query: 130 VQSCVVHL----QKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRP 180
           V + +  +     K S  +  A VP   PLL      +        +   D   +   + 
Sbjct: 273 VTTRINFIPSQYAKDSNGQPVA-VPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKATKD 331

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMV---AQ 236
           YKAG+ + ++ G + N +  ++ GFV D +N  D   ++      D Q + +  +   A 
Sbjct: 332 YKAGDEVTIFYGIRTNRQFFLHNGFVPDGENKNDTYKLKIGFPRGDKQVRARLKLMHDAG 391

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 271
            N +  V VF V+A      +S +L + R+  V +
Sbjct: 392 FNAESRVFVFEVNASERPVPLS-LLDFARVFLVEN 425


>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 664

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 68/145 (46%), Gaps = 17/145 (11%)

Query: 9   AAELLTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 67
           A + +T+ +++  A +AL+L++E   Q +KS W P++  L R        VE+PLLW+  
Sbjct: 185 ALKTITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPR 237

Query: 68  ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
           ELA L GS     ++   + + + +       F    L Q++P   P E F  E +  A 
Sbjct: 238 ELAQLEGS----NLIGFRDAVLKGWTTQRDALF--PKLTQKFPSLFPEEHFRTERWAWAM 291

Query: 128 VAVQSCVVHLQKVSLARRFALVPLG 152
             V S       V + R  A+ P G
Sbjct: 292 AIVWSRAA---DVPVPRPEAIFPSG 313


>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 966

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 78/206 (37%), Gaps = 41/206 (19%)

Query: 38  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 97
           S W PY   L R           P+L S  +LA  TG+    E++     + R+Y  L  
Sbjct: 652 SLWQPYFDLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQH 704

Query: 98  VWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL 151
           +       F+  P   IP   FTF  F  A   V +     C  +     + +   L+PL
Sbjct: 705 I-------FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPL 757

Query: 152 --------GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 201
                   G     + +K  C   +A+ D             GE I +  G + N ++L+
Sbjct: 758 FDMFNHKPGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLL 807

Query: 202 NYGFVDEDN-PYDRLVVEAALNTEDP 226
             GFVD  N  YD + +  ++   DP
Sbjct: 808 YSGFVDPTNIEYDHIKLSVSIPQSDP 833


>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
          Length = 527

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 159 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           S  CK + +A+  +VQ   DR YK GE + V  GP  N  LL  YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251

Query: 219 AAL 221
             +
Sbjct: 252 EVI 254


>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
          Length = 424

 Score = 45.1 bits (105), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 71/302 (23%), Positives = 118/302 (39%), Gaps = 56/302 (18%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTK 78
            LAL+L+ E     +SF+ PY            LAV    ES   W++ EL  L      
Sbjct: 90  VLALFLVRELLLADRSFFHPY------------LAVLPYPESVQDWTQAELGELHD---- 133

Query: 79  AEILER-AEGIKREYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
               ER  +   R  +E+D  +       Q +YP + P   +TF+ FK A+  +Q+    
Sbjct: 134 ----ERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA---- 185

Query: 137 LQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR---PYKAGESIVVW 190
               +  RR    ALVP    L   +   K      D+ +  +       +  G  +   
Sbjct: 186 ---RTFGRRLPWTALVPFADCLNHTNVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNS 242

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
            G + N +LL++YGF   DN +D + VE   +   P          R  KL      V  
Sbjct: 243 YGRRSNFQLLLDYGFALPDNEWDYVDVEIGKDRAGP----------RGRKLRFMKRVVRI 292

Query: 251 GREKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYF 304
            R+  ++ ++ P   L  ++D       SE  + +S    +C     +   +++ +AD+ 
Sbjct: 293 DRQS-SLDELFPPSFLAGLADPVPDEEQSEAAAELSERTALCDALEWLRSILIETIADWG 351

Query: 305 KA 306
            A
Sbjct: 352 TA 353


>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 503

 Score = 45.1 bits (105), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L      +++V  +P  +GE I    G  PNS LL  YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314

Query: 207 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 250
           D           NP D + V A   LN  + + Q   +  +       G   V VF    
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVFT--- 371

Query: 251 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 310
            R+ E  S+++  +RL  +  T   ++      P   V       +L  +      RL  
Sbjct: 372 -RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427

Query: 311 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
           YP ++ +DEA+L    L   KR A  +   EK +L   L 
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466


>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
 gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
          Length = 509

 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 82/353 (23%), Positives = 146/353 (41%), Gaps = 46/353 (13%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
            +AL L++E+  G+KS W  Y+  L  Q       +++PL WS  E+A LTG+    ++L
Sbjct: 134 AVALQLLHERSIGEKSRWAAYVNALPAQ-------LDAPLFWSAEEVATLTGT----QLL 182

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVS 141
           + A G   +     T   +  S F   P   P++AF    F  AF  ++S C      V 
Sbjct: 183 DAAAGY--DSYARGTWARLKESAFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVD 237

Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV------------ 189
                ALVP     +A  S   +    +++     V    K+G S+++            
Sbjct: 238 QGADIALVP--GLDMANHSGLSSQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAK 295

Query: 190 -------WCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKL 241
                  +   + +++L ++YGF D        V+   A+   DP   DK  V +  G  
Sbjct: 296 GAEVFMNYGQRKIDNQLALDYGFTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLR 355

Query: 242 SVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAV 296
               F + A  + E   ++  ++RL  +   D   ++++    + G I  PVS   E+  
Sbjct: 356 EAPSFVLRAFEDPE--PELRVFMRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEA 413

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 349
              + +  +  L GY   + +D  +  D  +  + R+A ++   EK+ L   L
Sbjct: 414 CGTMINGCEEALRGYATRVEDDRRVAEDPGVGHRLRLAARVRMGEKQALADAL 466


>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
           domestica]
          Length = 437

 Score = 44.7 bits (104), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 88/207 (42%), Gaps = 19/207 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G KS W PY+  L +          + L+  E E+  L   P 
Sbjct: 105 ISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTCLVCLEPEVVRLLPRPL 156

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
           + +  E+   +++ +      +    SLF +   D+    F +  F  A+  + +  V  
Sbjct: 157 QMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYHAFLWAWCTINTRTVYM 212

Query: 136 -HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--QLVVDRPYKAGESIVVW 190
            H QK  L+    +  L P   LL +S +     A  ++    ++      K  E + + 
Sbjct: 213 KHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFIC 272

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
            GP  N +LL+ YGFV  +NP+  + +
Sbjct: 273 YGPHDNHRLLLEYGFVASNNPHSAVYI 299


>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
          Length = 371

 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           R AL+P+       S  C    +   +   +  DR Y+A E +    G   N  LL  YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235

Query: 205 FVDEDNPYDRLVVEAAL 221
           F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252


>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
 gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
          Length = 499

 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 3/85 (3%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L + ++KC A L   +D++++      KAGE +    G  PN+++L  YG+V
Sbjct: 229 SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQVYNIYGNHPNAEILRRYGYV 288

Query: 207 D-EDNPYD--RLVVEAALNTEDPQY 228
           + E + YD   L +E  + T   QY
Sbjct: 289 EWEGSKYDFGELPLEVIIETLHEQY 313


>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 465

 Score = 44.7 bits (104), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 50/213 (23%), Positives = 81/213 (38%), Gaps = 31/213 (14%)

Query: 14  TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 73
           T  + S L  L   L  E+  G KS +  Y R L R       A      W++ E +YL 
Sbjct: 89  TKTEASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLA 142

Query: 74  GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 133
           G+  +  + + A   K E+        +   +F+++  +      +FE F +A   V S 
Sbjct: 143 GTEVETSLRDEAAAAKNEWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSSR 191

Query: 134 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-----------AVQLVVDRPYK 182
              L   S      LVP+       +      +   D             V++  ++  +
Sbjct: 192 AFTL---SPNAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVR 248

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
            G+ I    G   N+KLL +YGF   DNP D +
Sbjct: 249 RGDEIFNTYGFHGNAKLLNSYGFTQNDNPADEV 281


>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
          Length = 440

 Score = 44.7 bits (104), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 56/244 (22%), Positives = 98/244 (40%), Gaps = 32/244 (13%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       +   P+   E E+  L   P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSVWKPYLDILPK-------SYTCPVCL-EPEVVNLLPEPLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           A+  E+   +++ +      +     LF++    +    F++     A+  V +  V+L+
Sbjct: 157 AKAEEQRMSVQQFFASSRDFFSSLQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR 212

Query: 139 KVSLARRFALVPLGPP---------LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESI 187
                RR   + L P          LL +S   + KA         ++     Y+  + +
Sbjct: 213 ----TRRRDCLSLEPDTCALAPYLDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEV 268

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLS 242
            +  GP  N +LL+ YGFV   NP+      R ++   L + D Q   K  + + +G L 
Sbjct: 269 FICYGPHDNHRLLLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLE 328

Query: 243 VQVF 246
              F
Sbjct: 329 NLTF 332


>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 693

 Score = 44.7 bits (104), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 84/194 (43%), Gaps = 11/194 (5%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL++ +++ + K S W PY+  L R        + S L +   +L +L G+        
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
               +K EY+   ++    G L  + Y +D+   A+T  I  +AF + +   V+L +   
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220

Query: 143 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
            ++     +  PL+  S+     K    A    + L V  P  + E I    GP  N +L
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQL 280

Query: 200 LINYGFVDEDNPYD 213
           +  YGF   DNP D
Sbjct: 281 MTTYGFCIVDNPCD 294


>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 419

 Score = 44.7 bits (104), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 93/213 (43%), Gaps = 32/213 (15%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           L +   +AL++ ++K       W PYI  L R      L ++  LL     L Y      
Sbjct: 86  LKQHPSIALFIAWQKIHPTPE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY------ 135

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVH 136
             +I E A   K + ++LDT +    +      Y+ IP +     IFK A++ V +  + 
Sbjct: 136 --DIQEIA---KNQQSKLDTDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCIT 185

Query: 137 LQKVSLAR-------RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGE 185
           +   ++++       +  ++ L P L   +    A ++A  D V+    +    PYK G 
Sbjct: 186 MNTNAISKPQLSHIHQQPIITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGS 245

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
            + +  GP  N+ LL  YGF    NP++ +V++
Sbjct: 246 QVFINYGPHDNNFLLAEYGFAILKNPFNHVVLD 278


>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
 gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
          Length = 619

 Score = 44.7 bits (104), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 88/210 (41%), Gaps = 19/210 (9%)

Query: 21  LACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
           L    ++L+YE   + +KS   PY+  L R+         + L + E E+A L  +    
Sbjct: 106 LIVFYMFLIYELHVEKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYK 158

Query: 80  EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
            +    + +K+ Y E    + M      +YP     + F++E F  AF AV S V  ++ 
Sbjct: 159 SVQSIRQNLKQIY-ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEY 212

Query: 140 -VSLARRFALVP-LGPPLLAYSSKCKAMLA---AVDDAVQLVVDRPYKAGESIVVWCGPQ 194
                    +VP L P +   + K  A +      D    L      K+G+ +    G +
Sbjct: 213 PAENGEGVEIVPTLLPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAK 272

Query: 195 PNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
            N   L++YGFV  +N  D L V+  ++ +
Sbjct: 273 SNDSFLLSYGFVIPNNSEDTLYVQFGISDD 302


>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
          Length = 478

 Score = 44.7 bits (104), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 2/67 (2%)

Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 214
           L  +   CK + +A+  +VQ   DR YK GE + V  GP  N  LL  YGF+ + N +D 
Sbjct: 187 LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDE 244

Query: 215 LVVEAAL 221
           + ++  +
Sbjct: 245 VYLDEVI 251


>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 486

 Score = 44.7 bits (104), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 88/388 (22%), Positives = 152/388 (39%), Gaps = 70/388 (18%)

Query: 13  LTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           L +  L EL     L L +MYE   G +S W  Y + L R+        ++ + W+ +EL
Sbjct: 81  LLSQDLEELGPWLSLMLVMMYEYLLGDQSTWAAYFKVLPRK-------FDTLMFWTPSEL 133

Query: 70  AYLTGSP---------TKAEILERAEGIKREYNEL----------------DTVWFMA-- 102
             L GS              ILE    I R +  L                  +  +A  
Sbjct: 134 LELQGSAVIDKIGRQGADESILEMIAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHT 193

Query: 103 -GSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS 159
            GSL   Y +DI  P +       +  ++  +      ++  L++   +VPL   L A +
Sbjct: 194 MGSLIMAYAFDIEKPEDEDEEGDGEGGYMTDE------EEEQLSK--GMVPLADLLNADA 245

Query: 160 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVE 218
            +  A L   ++A+ +   +P   GE I    G  P + LL  YG+V ++  PYD + V 
Sbjct: 246 DRNNARLFQDENALVMKAIKPIAKGEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVS 305

Query: 219 -------AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVS 270
                  A L+  DP+ Q           L          +E + ++D+LP  L +   +
Sbjct: 306 LDVICKAAGLSDSDPEKQPPLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRT 365

Query: 271 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT------- 323
            T   + +        P  P    A    LA   + + A Y  T+++D+ +L+       
Sbjct: 366 LTLSPEQLAQQRSKNKPPKPAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLNSSEV 425

Query: 324 -----DYNLHPKKRVATQLVRMEKKMLN 346
                D + H ++++A Q+   EK++L+
Sbjct: 426 SSGFVDESTH-RRKMAVQVRIGEKEILH 452


>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 706

 Score = 44.7 bits (104), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 55/206 (26%), Positives = 87/206 (42%), Gaps = 22/206 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 74
           + E   LA +LM +   G +SFW PYI+ L  D Q  R +      L W E T L  L  
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179

Query: 75  SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 127
              +    +   G++  +E+   +T    W  F+  S        I + AF+ E+ K   
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233

Query: 128 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 187
               + V  L+  S+     LVPL   +  +    +   A   + + L+V +    GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYD 213
               GP+ N +L++NYGF    N  D
Sbjct: 288 PNNYGPRSNERLMMNYGFCIRGNVCD 313


>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 684

 Score = 44.3 bits (103), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 84/194 (43%), Gaps = 11/194 (5%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LA ++ +++ + K S W PY+  L R    G+L   S L +   +L +L  +        
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165

Query: 84  RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
             + +K EY+   ++    G  L + Y ++I   A+T  I  +AF + +    ++ K   
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223

Query: 143 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 199
            R+     +  PL+  S+     K    A    + L V  P  A E I    GP  N +L
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQL 283

Query: 200 LINYGFVDEDNPYD 213
           +  YGF   DNP D
Sbjct: 284 MATYGFCIVDNPCD 297


>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
          Length = 457

 Score = 44.3 bits (103), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 113/297 (38%), Gaps = 39/297 (13%)

Query: 12  LLTTNKLSELACLALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
            L    L  L C    L   + EK  G+KS W PY+  L +       A   P    E +
Sbjct: 112 FLIGQNLELLECFQFLLVNGIAEKHAGQKSPWKPYLDVLPK-------AYTCPAC-LEPD 163

Query: 69  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
           +  L   P + +  E+   I+  +      +     LF +   +I    F F   + A+ 
Sbjct: 164 IINLLPKPLQKKAQEQKMLIQELFQSSRAFFSSLQPLFAEDTGNI----FNFSALQWAWC 219

Query: 129 AVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRP 180
            V +  +++ K      F+L P    L P   LL +S   + KA       + ++  D  
Sbjct: 220 TVNTRTIYM-KHPHRECFSLEPDVYALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQ 278

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 240
            K  + +++  GP  N +LL+ YGFV  DNP+  + V A    +     DK    QR  K
Sbjct: 279 CKKYQEVLICYGPHDNQRLLLEYGFVATDNPHSSVYVSADTLLKYFSSLDK----QREAK 334

Query: 241 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 297
           +S+   H           D L  L  G+   +  + + +  L        C  R +L
Sbjct: 335 VSILKDH-----------DFLENLTFGWEGPSWRLLTALKVLSLAADEFACWRRILL 380


>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
          Length = 749

 Score = 44.3 bits (103), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 87/227 (38%), Gaps = 23/227 (10%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
           LYL   K   K   W PYI  L           + PL ++  EL  + G+   A + E+ 
Sbjct: 42  LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92

Query: 86  EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 145
                    +  V +    L QQ+P     ++F  ++FK+   A QS       V   + 
Sbjct: 93  NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKL 199
           F  V    P   +S+ C           Q         +   K GE I      + N KL
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 204

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           L+ YGFV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 205 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251


>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 791

 Score = 44.3 bits (103), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 87/227 (38%), Gaps = 23/227 (10%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
           LYL   K   K   W PYI  L           + PL ++  EL  + G+   A + E+ 
Sbjct: 84  LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 134

Query: 86  EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 145
                    +  V +    L QQ+P     ++F  ++FK+   A QS       V   + 
Sbjct: 135 NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 186

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKL 199
           F  V    P   +S+ C           Q         +   K GE I      + N KL
Sbjct: 187 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 246

Query: 200 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           L+ YGFV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 247 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
          Length = 508

 Score = 44.3 bits (103), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 63/227 (27%), Positives = 92/227 (40%), Gaps = 37/227 (16%)

Query: 148 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 207
           +VP+   L A      A L   +  +++V  +P KAGE I    G  PNS LL  YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317

Query: 208 ----------EDNPYD------RLVVEAALNTEDPQYQDK----RMVAQRNGKLSVQVFH 247
                       NP D       L V AA      + QDK      VA  +       F 
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDD------TFV 371

Query: 248 VHAGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
           +  G + +   +++ + RL ++  D  E     S L       P ++  VL    D    
Sbjct: 372 I--GTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSR 424

Query: 307 RLAGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 350
           R+  Y  T+ +DEA+L   N  P    K+ A  +   EK++L+  LQ
Sbjct: 425 RINEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471


>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 791

 Score = 44.3 bits (103), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 86/212 (40%), Gaps = 25/212 (11%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
           LYL   K   K   W PYI  L +         + PL ++  EL  + G+     + E+ 
Sbjct: 84  LYLAINKTNPKCFHW-PYINVLPK-------TYDCPLSYTIDELNIMKGTKLYVAV-EKI 134

Query: 86  EGIKREYNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
                    +  V +    L QQ+P Y  P +    ++FK+   A QS       V   +
Sbjct: 135 NAFL-----MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQ 185

Query: 145 RFALVPLGPPLLAYSSKC-KAMLAAVDDAVQLVV-----DRPYKAGESIVVWCGPQPNSK 198
            F  V    P   +S+ C +A +  + +           +   K GE I      + N K
Sbjct: 186 PFGEVGSLIPFCDFSNHCTQAKVTYISNTRTETFSFQTNEEVVKPGEQIFNNYRIRSNEK 245

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 230
           LL+ YGFV+E+NP D L++      +D QY +
Sbjct: 246 LLLGYGFVEENNPCDNLLLRIYFEVDDNQYNE 277


>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
 gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
          Length = 508

 Score = 44.3 bits (103), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 8/160 (5%)

Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTE 224
            A   A+ L  DRPY AG+ + V  GP+ + +LL++YGF      NP+    +  A++ +
Sbjct: 242 GATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDRQ 301

Query: 225 -DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSVISS 281
            DP    K     R+G  S   F +      E +   L +L  R     +T E+ SV+  
Sbjct: 302 GDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASVLFE 361

Query: 282 LGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 319
            G   P+    +  V  L  L++   A L  YP ++  D+
Sbjct: 362 SGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400


>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
          Length = 320

 Score = 44.3 bits (103), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 30/183 (16%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           C+ L LM E      SF+ PY+  L          V++P+ WS+ E   L GSP     +
Sbjct: 73  CMLLLLMCESVN-SSSFFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAV 124

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 142
           +    + R + E      M   +F +YP   P   F++E ++ A+  ++S          
Sbjct: 125 KLRHELARSFQE------MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG------ 172

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLL 200
              + L+PL   +  +     A     D +  L+  R Y       VW   G + ++ LL
Sbjct: 173 --NYTLMPLIDLMNHHPDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLL 224

Query: 201 INY 203
           +NY
Sbjct: 225 LNY 227


>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
           206040]
          Length = 378

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 72/171 (42%), Gaps = 24/171 (14%)

Query: 56  LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 115
             V  P++W   EL +L     +  + +R +  + ++++           F +   DI  
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149

Query: 116 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 170
           E +T+     A++ V +   + +     K     R AL+P+        + C+   +   
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           +   +V DR YK GE + +      N   L+ YGFV ++NP D + ++  +
Sbjct: 203 EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYIDDVI 253


>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 436

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 48/210 (22%), Positives = 78/210 (37%), Gaps = 19/210 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 74
           LS    L +Y+  E ++GK SFW P+   LD         +  PL+W   ++ +L  L  
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172

Query: 75  SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 134
            P +    +       +YN +        +   Q   D        + F  A++ + S  
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224

Query: 135 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           +++      S +  F + P     L +S      L       Q+     Y   E + +  
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           GP  N  LL  YGF   DN ++ L V   L
Sbjct: 284 GPHSNDFLLCEYGFTISDNKWNDLDVTEYL 313


>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
          Length = 658

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 16/92 (17%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           +E  CLALY+ Y K QG KSFW PY+  +++          +   W   +L  L      
Sbjct: 98  AEFNCLALYIFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLI 148

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYP 110
            E +          +E+D  W+    L  +YP
Sbjct: 149 DEFMYIQ-------SEMDKSWYKFKGLMNKYP 173


>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Entamoeba nuttalli P19]
          Length = 791

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 87/228 (38%), Gaps = 25/228 (10%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 85
           LYL   K   K   W PYI  L           + PL ++  EL  + G+   A + E+ 
Sbjct: 84  LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 134

Query: 86  EGIKREYNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 144
                    +  V +    L QQ+P Y  P +    ++FK+   A QS       V   +
Sbjct: 135 NAFL-----MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQ 185

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSK 198
            F  V    P   +S+ C           Q         +   K GE I      + N K
Sbjct: 186 PFGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEALVKPGEQIFNNYRIRSNEK 245

Query: 199 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           LL+ YGFV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 246 LLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 451

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 38/70 (54%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           +D + LV  + + +G  +    GP PNS+LL+ +GF   DNP++ + + A +    P + 
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288

Query: 230 DKRMVAQRNG 239
           +K  +   NG
Sbjct: 289 EKNKIMLDNG 298


>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
          Length = 1658

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 71/308 (23%), Positives = 122/308 (39%), Gaps = 25/308 (8%)

Query: 18   LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
            L++   L L+++YEK +   SFW P+   L            + + ++ TEL  L G+  
Sbjct: 1255 LNDDTILFLFVIYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT-- 1304

Query: 78   KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
               + E    IK     +  + F    L  QYP   P   FT E F  A     S  + L
Sbjct: 1305 --NLFEETIQIKEHLESIRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL 1360

Query: 138  QKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
             K+       LVP+   +  +      +      +D  +++      A   I +  G   
Sbjct: 1361 -KIDGRIVNCLVPMADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQ 1419

Query: 196  NSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREK 254
            + +L + YGFV  +N YD + +   +  ED P+ ++++        L+V   ++H     
Sbjct: 1420 SWELALYYGFVISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN-- 1477

Query: 255  EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 314
               S +L  LR+  +++  E    +    PI   S   E  VL  L       L  + +T
Sbjct: 1478 -IPSKLLASLRVALLAE-DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSST 1532

Query: 315  LSEDEAML 322
              ED+ +L
Sbjct: 1533 CDEDQQLL 1540


>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
 gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
          Length = 491

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 54/245 (22%), Positives = 93/245 (37%), Gaps = 60/245 (24%)

Query: 22  ACLALYL--MYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETEL-AYLTGSPT 77
            CLAL +  MYE+ +G  S W  Y++ L DR+          PL+W   E    L G+  
Sbjct: 79  GCLALAVAVMYERARGTDSPWDAYLQLLPDRE--------SVPLVWPADEAECLLAGTEL 130

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
              + +  E +  ++ E      ++G L      D+  + F+ E +  A   V S    +
Sbjct: 131 DKIVKQDREFLCEDWKECIEPLLLSGEL------DVDPDDFSLEKYFSAKTLVSSRSFQI 184

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML------------------------------- 166
                   F +VPL   L  + + C+ +                                
Sbjct: 185 DSY---HGFGMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEDADDDQSDASADDESTIE 240

Query: 167 -------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 219
                   + D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ 
Sbjct: 241 NPTSSSPGSKDEDLEMIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDL 300

Query: 220 ALNTE 224
           AL T+
Sbjct: 301 ALVTK 305


>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
           gorilla]
          Length = 440

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 96/240 (40%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G++S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGRRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
           reading frame 18 (C21orf18)) [Danio rerio]
          Length = 440

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 101/242 (41%), Gaps = 26/242 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ E+  G+ S W PYI  L +           PL + +  +  L  S  
Sbjct: 105 ISPLLALCCFLISERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQ 157

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV- 135
           K +  ++ E  +  ++   T +     LF Q     PTE  F+ +  + A+ +V +  V 
Sbjct: 158 K-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVY 211

Query: 136 --HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
             H Q   L+R   +  L P   LL +  + + +A         ++      K  +   +
Sbjct: 212 MEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFI 271

Query: 190 WCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
             GP  N +LL+ YGFV   NP+     D   ++  L+ +D Q ++K +  + N  L   
Sbjct: 272 NYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNL 331

Query: 245 VF 246
            F
Sbjct: 332 TF 333


>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
 gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
          Length = 1335

 Score = 43.9 bits (102), Expect = 0.12,   Method: Composition-based stats.
 Identities = 52/203 (25%), Positives = 85/203 (41%), Gaps = 19/203 (9%)

Query: 18   LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
            L++   L L+++YEK +   +FW P+   L            + + +S TEL  L G+  
Sbjct: 874  LNDETILFLFVIYEK-ENPNTFWRPFYDTLPSY-------FTTSIHYSSTELLELEGTNL 925

Query: 78   KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
             AE L     +K++        F    L  QYP   P   F++E F  A   + S  + L
Sbjct: 926  FAETL----AVKQQLQAFRDYLF--PELSNQYPDIFPESVFSWENFLWARSLLDSRAIQL 979

Query: 138  QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQ 194
             K+    +  LVP+   ++ + +  +      D   +  ++V      A   I +  G  
Sbjct: 980  -KIDGKIKSCLVPMAD-MINHHTNAQISERHFDQDSNCFRMVSSCNIPANNQIFLHYGAL 1037

Query: 195  PNSKLLINYGFVDEDNPYDRLVV 217
             NS L + YGFV  +N YD   V
Sbjct: 1038 QNSDLALYYGFVIPNNIYDSFHV 1060


>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
          Length = 464

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 57/260 (21%), Positives = 95/260 (36%), Gaps = 69/260 (26%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L ++L+ E+++G  S+W PYI +L  QR R       PL ++E EL  L  SP   E   
Sbjct: 227 LRMFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFV 279

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           +   + R+Y           + ++QY           E+ + A +   S   H + +   
Sbjct: 280 QLRNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQR 321

Query: 144 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR------------------------ 179
           RR   +P+    L Y   C A  A      Q+VV                          
Sbjct: 322 RRAQAMPVRYNELTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAF 381

Query: 180 --------------------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 219
                               P   GE +++  G + +     N  FV  D+P D+ ++  
Sbjct: 382 GGASSFYDTQTRSLECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILL 441

Query: 220 ALNTEDPQYQDKRMVAQRNG 239
           A+  +DP ++ K  + Q  G
Sbjct: 442 AVGKQDPLFKSKSTILQALG 461


>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
           Y34]
          Length = 373

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 24/163 (14%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 115
           P +W + EL  L   PT A +    E  + +YN E +TV     S+    FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161

Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 175
             F +E      V+   C       S   R ALVPL          C+  ++ + +   +
Sbjct: 162 RTFLYE------VSETECY------SWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207

Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
             DR Y+AGE + +  G   N  LL  YGF+   N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250


>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
           anatinus]
          Length = 499

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 54/226 (23%), Positives = 92/226 (40%), Gaps = 19/226 (8%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK+ G +S W PY+  L +       A   P+      L+ L   P 
Sbjct: 104 VSPLLALCTFLIAEKQAGARSLWQPYLGVLPQ-------AYTCPVGLDAAVLSLLP-QPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
                E+   + RE       +F   SL   +  D+    FT +    A+  V +  V  
Sbjct: 156 GRRAREQRTAV-RELFAASRAFF--SSLQPLFSEDV-ERVFTLDALGWAWCTVNTRTVYM 211

Query: 136 -HLQKVSLARRFALVPLGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
            H Q+   +    +  L P   LL +S  ++ +A         ++      +  E +++ 
Sbjct: 212 EHAQRDCFSAEADIYALAPYLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
            GP  N +LL+ YGFV  +NP+  +VV   +        DK+M  +
Sbjct: 272 YGPHDNRRLLLEYGFVCSNNPHSNVVVSPDVLVRHLPSGDKQMTKK 317


>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
 gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
          Length = 440

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 101/242 (41%), Gaps = 26/242 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ E+  G+ S W PYI  L +           PL + +  +  L  S  
Sbjct: 105 ISPLLALCCFLISERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQ 157

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV- 135
           K +  ++ E  +  ++   T +     LF Q     PTE  F+ +  + A+ +V +  V 
Sbjct: 158 K-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVY 211

Query: 136 --HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
             H Q   L+R   +  L P   LL +  + + +A         ++      K  +   +
Sbjct: 212 MEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFI 271

Query: 190 WCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 244
             GP  N +LL+ YGFV   NP+     D   ++  L+ +D Q ++K +  + N  L   
Sbjct: 272 NYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNL 331

Query: 245 VF 246
            F
Sbjct: 332 TF 333


>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
          Length = 424

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 42/208 (20%), Positives = 89/208 (42%), Gaps = 24/208 (11%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           LS    ++L+L  E  +GK+S+W P+I+ L          + SP LW       + G   
Sbjct: 114 LSSFQIMSLFLELESSRGKESWWDPFIQMLPTIND----FLTSPFLWQ------IQG--- 160

Query: 78  KAEILERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
           K E++E+     ++     +N  ++ +    +L +   ++   +    + F   ++ + S
Sbjct: 161 KYELIEKLPKSTQKHSLKMFNRFESDFKAVKTLLE--THNASKDIINHDKFVLYWMCINS 218

Query: 133 CVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
             +++   QK + +  F + P     + +S+  +  L        ++    YK  + + +
Sbjct: 219 RCLYMEIPQKKTTSDNFTMAPY-VDFINHSTNDQCKLKIDRTGFHVITTSNYKENDELYL 277

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVV 217
             GP  N  LL  YGF   +N ++ L +
Sbjct: 278 SYGPHSNEFLLCEYGFHLSNNEWNDLDI 305


>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
          Length = 689

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 12/90 (13%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETE 68
           EL   +  ++  CLA++LM E  QG++SFW PYI  ++          ES LL  W + E
Sbjct: 115 ELFMKHADADQLCLAVFLMNEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEE 164

Query: 69  LAYLTGSPTKAEILERAEGIKREYNELDTV 98
           +  L       +  E  + I+ E+N+L  +
Sbjct: 165 IKLLNDFEIYQQAKEYRDDIEDEWNKLSKI 194


>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
          Length = 422

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 22/130 (16%)

Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
           + P +L Y    K     V  +++  + RP KAGE   +  G  P S L+  YGF+  DN
Sbjct: 302 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 356

Query: 211 PYDR--LVVEAALNTEDP----------QYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 258
           PYD   L ++ +++ ED            +  +RM+  R  +  +Q     + ++   + 
Sbjct: 357 PYDVIPLDLDTSVDEEDSSSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVH 411

Query: 259 DMLPYLRLGY 268
               YLRLG+
Sbjct: 412 CYFVYLRLGH 421


>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
 gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
          Length = 327

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 54/223 (24%), Positives = 90/223 (40%), Gaps = 41/223 (18%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           +L+    L+LYL+ EK +GK SFW PYI+ L         +  +P  +S  E+  L    
Sbjct: 76  RLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTSYTTPSYFSTAEVDALPALV 128

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
            +A +  R + ++  Y  L T       LF  +        FT + ++ A+  V +  V+
Sbjct: 129 REATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTVFTLKSYRWAWATVYTRSVY 182

Query: 137 LQ-------KVSLARRFALVPL-----GPPLLA------YSSKCKAMLAAVDDAVQLVVD 178
            +         S    +AL P        PL+        SSKC           ++  +
Sbjct: 183 KRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVSSKC----------YEVKTE 232

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
              +    + +   P  N +LL+ YGFV   NP+  +   AA+
Sbjct: 233 GACRKYRQVFINYDPYDNGRLLMEYGFVMPRNPHSVVTFTAAV 275


>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 400

 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 88/204 (43%), Gaps = 23/204 (11%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 83
           A+  + E     +SFW PY+ EL         AV +   W++ ELA +  +    E++E 
Sbjct: 49  AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           + + I  +Y  +   +    +  Q +   IP+E    E +++A   V S      +  L 
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153

Query: 144 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
                +P     L +S      + C        D  +L+    Y  GE + +  G + +S
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSS 212

Query: 198 KLLINYGFVDEDNPYDRLVVEAAL 221
           +LL +YGF+ E+N  D + + A+L
Sbjct: 213 QLLASYGFMYENNAEDTVDITASL 236


>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 354

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 80  SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 140 EWDGSKYD 147


>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
 gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
          Length = 689

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 49/225 (21%), Positives = 90/225 (40%), Gaps = 38/225 (16%)

Query: 6   FYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 65
           F+Y+  L  + ++ +   LA ++++E+ +G+ S W PY+  L R        + S L + 
Sbjct: 88  FHYSDIL--SQRVRKSIILAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQ 140

Query: 66  ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 125
           + +L +L G+            +K EY+   ++    G L          E++ ++IF  
Sbjct: 141 DNDLEWLQGTNLYQTHQAYRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCW 192

Query: 126 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA------------- 172
           A+  + S      +   +R         P L    + + ML  VD +             
Sbjct: 193 AYTLIAS------RAFTSRVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAE 246

Query: 173 ---VQLVVDRPYKAGESIVVWCGPQPNSK-LLINYGFVDEDNPYD 213
              + L V  P  +GE +    G   N + ++  YGF   DNP D
Sbjct: 247 ATEIGLKVIEPTFSGEEVHNNYGSLNNQQSVMTTYGFCIVDNPCD 291


>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
 gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
          Length = 573

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)

Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 213
           LL + +  K      +D V  V     K GE +    G + N  LL++YGFV + NPYD 
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288

Query: 214 --------RLVVEAALNTE 224
                   + +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307


>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
 gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
 gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
          Length = 416

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 81  SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 132

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 133 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 188

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 189 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 248

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 249 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308


>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
           troglodytes]
 gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
 gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
 gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
 gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
          Length = 307

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 84/211 (39%), Gaps = 19/211 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
           GP  N +L + YGFV   NP+  + V    N
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSRGWN 303


>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
          Length = 643

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 78/369 (21%), Positives = 145/369 (39%), Gaps = 53/369 (14%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           K+S     A+ ++Y     + S+W  Y+  L +Q         + + ++  E+  L  SP
Sbjct: 282 KISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ-------FTTTVYFTREEMTQLQASP 333

Query: 77  TKAEILERAEGIKREYN-------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 123
                  R  G+ R Y+             E D+      S       +   + +T + F
Sbjct: 334 VHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDSTKTKTKSQLDAMK-EFKDDRYTLDQF 392

Query: 124 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS----SKCKAMLAAVDDAVQLVVDR 179
           K A   V S    L +        +VPL     A +    SK    ++A   ++     +
Sbjct: 393 KWALGCVWSRAFSLSE----EDGGMVPLADMFNADTVISRSKVHPKISASSPSLVYTASQ 448

Query: 180 PYKAGESIVVWCG---PQPNSKLLINYGFVDEDNPY--DRLVVEAALNTEDPQYQDKRMV 234
             +AGE I    G      + ++L++YGF+ ED       +V  A +   +P Y  KR +
Sbjct: 449 DIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRHL 508

Query: 235 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCM 292
            Q NG +  + F +    + +   ++  + R+  ++  ++ +  +   S      ++P  
Sbjct: 509 MQSNG-IESEEFTI---TKNKLAKELFLFARIKSINKKESDQASAHFMSTQRHSMLNPRN 564

Query: 293 ERAVLDQLADYFKARLAGYPATLSEDEAML-------TDYNLHPKKRVAT----QLVRME 341
           E+A L  L++     L  Y  T+ +D  +L       T+ N H      T    +L  ME
Sbjct: 565 EKAALRLLSNLISRHLDAYQTTIDQDNQILKEIEKDKTNTN-HSSVTFNTINAIKLRLME 623

Query: 342 KKMLNACLQ 350
           K +LN+ L+
Sbjct: 624 KNILNSFLK 632


>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
          Length = 903

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 15/99 (15%)

Query: 24  LALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           L LY+++EK   G  S W P+   L            SPLL+  TEL  L G+    E  
Sbjct: 469 LLLYVLHEKLVAGSASRWAPFFETL-------PATYNSPLLFHVTELLELEGTRLIDETF 521

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 121
           E  +G++  +  L       G L + YP   PT+AFT+E
Sbjct: 522 EIKDGLRVLHESL-------GPLAEAYPALFPTDAFTYE 553


>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
 gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
 gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
          Length = 307

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 84/211 (39%), Gaps = 19/211 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
           GP  N +L + YGFV   NP+  + V    N
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSRGWN 303


>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 984

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 46/212 (21%), Positives = 86/212 (40%), Gaps = 34/212 (16%)

Query: 22  ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 81
           A L  YL++++   + +FW PY+R L +   R            + ++ +L G+     +
Sbjct: 84  AILCAYLIHQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAM 131

Query: 82  LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVV 135
            E+ + I+  ++       +  +L   +P   P + FT++ F   F A       Q+ V 
Sbjct: 132 QEKQQQIRESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQ 185

Query: 136 HLQKVSLARR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGE 185
                + A    + L+ +   LL         Y  K    L      ++ V +   + G 
Sbjct: 186 QPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGT 245

Query: 186 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
                 GP+ N +LL+ YGF  EDN  D +++
Sbjct: 246 EAFNNYGPKGNEELLMGYGFCIEDNEQDYVMI 277


>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 518

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 91/211 (43%), Gaps = 25/211 (11%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 229
           D+   +   +P K G+ I    G + N  LL+ YGF    N YD   +   +N    Q  
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309

Query: 230 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 270
           +   +++V Q               +  K+++     +   +K  I+ D++ YLRL  + 
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369

Query: 271 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 328
                +++ V+ SL    PVSP  E  VL          L+ +  T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425

Query: 329 PKKRVATQLVRMEKKMLNACLQVTADMIMLL 359
            K R A      +K++L   + +    ++LL
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILL 456


>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
          Length = 471

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 45/181 (24%), Positives = 81/181 (44%), Gaps = 17/181 (9%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S +L++E +L    G+       +  + I+ ++  L         LF Q+P   P + FT
Sbjct: 115 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 168

Query: 120 FE----IFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDD 171
            E     +K A   V S  +      LA   ++  L P   +L ++S+ K   +      
Sbjct: 169 VEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSG 225

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
            + +   + Y+AG+ + +  GP PNS+LL  YGFV   NP D   +  + + + P ++ K
Sbjct: 226 TLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQK 285

Query: 232 R 232
           +
Sbjct: 286 Q 286


>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
          Length = 515

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S +   KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
          Length = 199

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 53/205 (25%), Positives = 90/205 (43%), Gaps = 30/205 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L ++L+ E+ +G  SFW PY+  L         +    L W+  E+  L    TK    +
Sbjct: 1   LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKV 140
                +  +N L   +     L +Q P      AFT+++FK A+ +V +  V++   Q  
Sbjct: 53  LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNS 108

Query: 141 SLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ------LVVDRPYKAGESIVVW 190
            L+     + AL P    LL ++   +   A  DD+ +      L   +PY   + + + 
Sbjct: 109 VLSPDEEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFIN 163

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
            GP  N KLL+ YGF    NP++ +
Sbjct: 164 YGPHSNEKLLLEYGFTLPCNPHNNI 188


>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 68/162 (41%), Gaps = 25/162 (15%)

Query: 163 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
           K+ L  V+  VQL+   P KAGE I ++ G    +  L  +GF D DNP D +  E  L 
Sbjct: 74  KSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELDL- 128

Query: 223 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 282
           +E  + Q K M          +V+      ++    D LP  RL            +  +
Sbjct: 129 SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAMLRI 168

Query: 283 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 324
             +  +S   E+ V   + +   A  AGYP  L ED + L +
Sbjct: 169 LHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEEDISRLEE 210


>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
          Length = 515

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
 gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
           Full=SET domain-containing protein 7
 gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
 gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
 gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
 gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
 gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
 gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
 gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 492

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 158 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
           +S    A +   ++ V+LV  R  KAG+ I +  G   N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303


>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
           troglodytes]
 gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
 gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
 gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
 gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
 gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
          Length = 401

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 10/117 (8%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           AE L   +L     L + +M+E+  G+ S W  Y   L     RG+  +  P+ W+  +L
Sbjct: 40  AETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSAQL 94

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
            +L G+     + E AE ++ ++NE      +   L   +P   P    T E + +A
Sbjct: 95  EHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146


>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
 gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
 gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
 gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloropl [Ectocarpus
           siliculosus]
          Length = 460

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 48/195 (24%), Positives = 79/195 (40%), Gaps = 25/195 (12%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
            E   +AL +++E+ QG+ S   P+I  L ++   G+L  E PL W E +L     S T+
Sbjct: 87  GETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLFWGEADLTLADASSTR 141

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
                   G   + +E D  W ++ + F ++P   P + F    F+ A     S    + 
Sbjct: 142 G-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDFRWAVGVALSRSFFVD 194

Query: 139 K-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
                   V  A   +L  +  P    +     +  +   AV L   + Y+ GE   V  
Sbjct: 195 GELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLRAGKNYEEGEEFFVSY 248

Query: 192 GPQPNSKLLINYGFV 206
           GP+  +  L   GFV
Sbjct: 249 GPKGAAGYLEENGFV 263


>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
          Length = 417

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
          Length = 565

 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 236
           G  I +  GP  N +LL  YGFV E N +DRL       EAA+  E   +  +R  +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
               L+   +  H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTD 324
              +AD  +  L  +  +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501


>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 391

 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 10/120 (8%)

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
           ++FK  +  V S   H +       F  L P    +    S     +       ++  +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 230
            Y AGE ++   G  PN KLL++YGF+         D+D   D  +++   NT   Q QD
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSNTTRDQLQD 283


>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 493

 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 25/203 (12%)

Query: 22  ACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
           ACL  L L++E+  G+ S +  Y+  L +           PL W+E E+  L G  T AE
Sbjct: 133 ACLTVLRLLHERGLGESSPFHSYLSVLPQDH-------RLPLEWTEAEVGLLQG--TSAE 183

Query: 81  ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 140
            L  A  +  ++    +V         Q+P        T   F +    V+S     +  
Sbjct: 184 PLVGAGSLDSQFEAFQSV-------VAQHPTVWEPSVCTKAAFAKGVNWVRS-----RGF 231

Query: 141 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSK 198
           ++     ++P G  +  +    +++    D     V+   +P KAGE +    G   N++
Sbjct: 232 TVMGDPHMIP-GADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQ 290

Query: 199 LLINYGFVDEDNPYDRLVVEAAL 221
           LL +YGFV   N +D +++   L
Sbjct: 291 LLNSYGFVLPGNSFDTVLIPTQL 313


>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 41/166 (24%), Positives = 66/166 (39%), Gaps = 27/166 (16%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 233
           + Y+ GE + +  G   N +LL NYGF   +NPYD + +   +     N  DP +  KR 
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306

Query: 234 VAQR-NGKLSVQVFHVHAGREKE----------------AISDMLPYLRLGYVSDTSEMQ 276
           + Q   G  S  +  + + R                   A  + L  +     S +    
Sbjct: 307 LLQSATGSHSTDIPALRSLRFNHDGQLAPNAEHWLEILLATPEELSEIITQAASQSGAAD 366

Query: 277 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
           S IS   P+      ++  V  ++     ARL  + +TL ED+A L
Sbjct: 367 STISLALPMS-----LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407


>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
          Length = 440

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 56/224 (25%), Positives = 92/224 (41%), Gaps = 21/224 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           LS L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P 
Sbjct: 104 LSPLLALCTFLVAEKHAGDRSVWKPYLEVLPK-------AYTCPVC-LEPEVVDLLPKPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           KA+  E+   ++  +      +     LF +    I    F++  F  A+  V +  V++
Sbjct: 156 KAKAREQRTRLQAFFTSSRDFFSSLRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYM 211

Query: 138 QKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVV 189
            K    R F+  P    L P   LL +S   + +A         ++      +  E + +
Sbjct: 212 -KPRRRRCFSAEPDTYALAPYLDLLNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFI 270

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 233
             GP  N +LL+ YGFV   NP+  + V   +  +     DK+M
Sbjct: 271 CYGPHDNQRLLLEYGFVSIHNPHACVYVSKDILVKYLPSTDKQM 314


>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Metaseiulus occidentalis]
          Length = 278

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 37/182 (20%), Positives = 83/182 (45%), Gaps = 12/182 (6%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
           R YK  E + ++ G + N++ +++ GFV ++N +D L ++  L+  D  ++ KR + ++ 
Sbjct: 96  REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155

Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--------SDTSEMQSVISSLGPICPVSP 290
              +  VF +    + + +  ++P + L  V        SD +   + + +  P     P
Sbjct: 156 KIPTSDVFELKKAPDGDGV--LVPKVLLHLVHILQWKAPSDGTTSGTDVGA-DPSDATDP 212

Query: 291 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML-NACL 349
              +     L    +  +   P ++ E   +L D     + ++A +    E++ML NAC 
Sbjct: 213 VRTKKAKTFLHVRCQLLMKALPRSVEELTEILNDPTTSLESKLAIRYRLSEQRMLTNACN 272

Query: 350 QV 351
           ++
Sbjct: 273 KI 274


>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 514

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 154/373 (41%), Gaps = 67/373 (17%)

Query: 22  ACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
           A LA +++ E    G KS + P++R L R        V+S + W+E EL+ L+GS     
Sbjct: 130 AILAAHVLREAFDAGSKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVV 180

Query: 81  ILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVV 135
           +  RA     + EY+ LD       +L +++P     E    +TF+ F  A   + S  +
Sbjct: 181 VFTRAIKAQWRMEYDALDVP-----TLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAI 235

Query: 136 HLQKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
            L   S       + +  PLL     A   K +    A  +AV++     ++    +   
Sbjct: 236 DLSTESA--EAPTIRVLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFN 293

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 250
              +P+   L+ YGF+ E NP +   VEA +   D         + R+ K  +   H   
Sbjct: 294 YDTKPSQYFLLQYGFIPETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLD 344

Query: 251 GREKE------AIS-DMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-L 297
            +++       +I  D+L   R+    D +EM    S     S   +   +    +AV L
Sbjct: 345 PKKRNFEWKPRSIDYDLLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLL 403

Query: 298 DQLADYFKARLAGYPATLSEDEAMLT----DYNLHP----KKRVATQLVRMEKKMLNACL 349
             LA +    L  Y  TL+ED   +     + N  P    +KR A  L+RM +K +   L
Sbjct: 404 KSLASF----LESYTTTLAEDNEYVARVDDESNDEPLPGKRKRFAV-LLRMREKQI---L 455

Query: 350 QVTADMIML-LPD 361
             +AD +   LPD
Sbjct: 456 LASADALFKELPD 468


>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
           24927]
          Length = 468

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 23/215 (10%)

Query: 60  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           S + +++ EL    GS       +  + I+ +Y  L         LF QY        FT
Sbjct: 107 SSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL------VERLFGQYLDIFSLGKFT 160

Query: 120 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 176
            E +K A   V S  +   +        L P    +L +SS   KC     +  D + ++
Sbjct: 161 IEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-MLNHSSDVKKCHVYDTSSGD-LSIL 218

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 236
             + Y+ G+ + +  G  PN++LL  YGFV  +NP D   +      E P ++ K+ +  
Sbjct: 219 AGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWV 278

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 266
             G  SV    +       +++D LP     YLR+
Sbjct: 279 SAGLDSVSTISL-------SLNDPLPKSVLQYLRI 306


>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
 gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
          Length = 1339

 Score = 42.7 bits (99), Expect = 0.28,   Method: Composition-based stats.
 Identities = 48/200 (24%), Positives = 84/200 (42%), Gaps = 17/200 (8%)

Query: 18   LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
            L++   L L+++YEK     SFW P+   L            + + +S TEL  L G+  
Sbjct: 828  LNDDTILFLFVIYEKGNAN-SFWRPFYDTLPSY-------FTTSIHYSATELLELEGT-- 877

Query: 78   KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
               + E     K++ N      F    L +QYP   P   F++E F  A   + S  + L
Sbjct: 878  --NLFEETLHTKQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL 933

Query: 138  QKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
             K+  + +  LVP+   +  +++    +        + +++      A   I +  G   
Sbjct: 934  -KIDGSIKSCLVPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQ 992

Query: 196  NSKLLINYGFVDEDNPYDRL 215
            N +L + YGF+  +N YD L
Sbjct: 993  NWELALYYGFIIPNNIYDSL 1012


>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 306

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 13/117 (11%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           A++L  +++     L + +MYE   GK+SFW  Y+ EL ++          PL W+E E 
Sbjct: 33  ADILEQHRIRGGLGLIIAIMYELSIGKESFWHGYLEELHKRE-------YLPLFWAEQER 85

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
           + L G+  +    E  E  + ++            L +Q+   +  ++FT E F+ A
Sbjct: 86  SLLQGTEAEHRPQEDEELTQEDFET------HVPPLVEQHADRLRADSFTLESFRVA 136


>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
          Length = 553

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
            D+ V +++    KAGE I    G   NS LL  YGF  EDNP+D + +   L  E  Q 
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHDVVDLSEEL-VEYGQK 377

Query: 229 QDKRM 233
             KR+
Sbjct: 378 NSKRI 382


>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 467

 Score = 42.7 bits (99), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 78/192 (40%), Gaps = 13/192 (6%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L+L+ E  +  KSFW PYIR L +  Q  + Q A+     W + E   L G+  +  I +
Sbjct: 95  LFLIKEYLKRDKSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDK 152

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE--IFKQAFVAVQSCVVHLQKVS 141
               ++R+  E   +  + G     +   + TE + +   IF          +   Q+ S
Sbjct: 153 IRNDVRRDLQEAQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRS 212

Query: 142 LARRFALVPLGPPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVVWCGPQ 194
           L R   +      L  +      M   +    DD  Q   L V R +  G+ +      +
Sbjct: 213 LPRGVTMDDFSVLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMK 272

Query: 195 PNSKLLINYGFV 206
            N++LL+ YGF+
Sbjct: 273 TNAELLLGYGFM 284


>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
 gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
          Length = 372

 Score = 42.7 bits (99), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           R A++P+          C+A  A+  +    + DR Y+AGE + +  G      LL  YG
Sbjct: 179 RLAILPVADLFNHADVGCEAQFAS--ENYSFIADRTYRAGEELYISYGTHSTDFLLAEYG 236

Query: 205 FVDEDNPYDRLVVEAAL 221
           FV  +N +D + ++ A+
Sbjct: 237 FVPAENRWDVVCLDEAI 253


>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
          Length = 732

 Score = 42.7 bits (99), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 15/120 (12%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L ++L+ E ++ ++SFW PYIR L +       + ++ L ++  E   L GSP    +LE
Sbjct: 146 LVIHLIQETQKQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE 194

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
             E I    N L    F+    F + P  + T  FT+E F+Q  +A    +V L K  LA
Sbjct: 195 --ESINTYRNTLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250


>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
 gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
          Length = 393

 Score = 42.7 bits (99), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 27/192 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           LAL ++ E+ +G+ + W PYI  L +        +++   W +TEL+YL  SP   +  E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKVSL 142
           R E I  E+ ++    F    L Q             +++ Q F  V    + H+     
Sbjct: 221 RLEMITTEFGQVQND-FCTCVLEQ-----------ALDVWPQLFGKVSLEDLKHVYATVF 268

Query: 143 ARRFALVPLGP---PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 194
           +R  A+        P+L +     +S  K     + +   +  DR Y   + I +  G  
Sbjct: 269 SRSLAIGEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 328

Query: 195 PNSKLLINYGFV 206
            N++L ++YGF 
Sbjct: 329 SNAELALDYGFT 340


>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
          Length = 440

 Score = 42.7 bits (99), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 54/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L+
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 139 ---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
              +  L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRHRECLSAELDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
          Length = 440

 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 54/240 (22%), Positives = 95/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  + +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S +   KA       + ++     ++  E + +  
Sbjct: 213 PRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
 gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
          Length = 195

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTED 225
           V  +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 14  VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72


>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
          Length = 514

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 61/240 (25%), Positives = 96/240 (40%), Gaps = 24/240 (10%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N ++    L L+ M E+      F  PY   L      G       L +    LA L G+
Sbjct: 190 NNITSETMLLLWSMRERYNLGSKF-KPYFDTLPANFNTG-------LSFGIDALAALEGT 241

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
               EI++  + ++++Y+EL  +      L   +P     +  T++ F  A     S  +
Sbjct: 242 LLFDEIIQARQHLRQQYDELFPL------LCTNFPEIFRKDVCTWDDFLWACELWYSNSM 295

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCG 192
            +   S      LVP+   L    S        VD+A + +   + RP  AGE   +  G
Sbjct: 296 MIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYG 355

Query: 193 PQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
             P S LL  YGF+   DNPYD + ++      D    D+ + AQ +   S Q  H+  G
Sbjct: 356 KHPGSHLLTFYGFLPRGDNPYDVIPLDL-----DTSADDEDITAQSSATTS-QTTHMVRG 409


>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 440

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 94/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q+  L+       L P   LL +S   + KA         ++     ++  E + +  
Sbjct: 213 PRQQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
          Length = 256

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 94/230 (40%), Gaps = 23/230 (10%)

Query: 21  LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 80
           L  L  +L+ EK  G +S W PY+  L +       A   P+   E ++  L   P KA+
Sbjct: 10  LCTLCTFLVSEKHAGDQSLWKPYLDILPK-------AYTCPVC-LEPKVVNLFPEPLKAK 61

Query: 81  ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HL 137
             E+   ++  ++     +     LF +   +I    F++     A+  V +  V   H 
Sbjct: 62  AEEQRARVQGFFSSSRDFFSSLQPLFSEAVENI----FSYSALLWAWCTVNTRAVYMKHG 117

Query: 138 QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 193
           Q+   +       L P   LL +S   + KA         ++      +  E + +  GP
Sbjct: 118 QRKCFSPEPDTYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFICYGP 177

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
             N +LL+ YGFV   NP+  + V A L  +     DK+M    N K+S+
Sbjct: 178 HDNQRLLLEYGFVSIQNPHACVYVSADLLVKYLPSTDKQM----NKKISI 223


>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 356

 Score = 42.4 bits (98), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 41/168 (24%), Positives = 69/168 (41%), Gaps = 22/168 (13%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           P  W       L G  T   +LE+ +    R++  L +           YPY +P+E + 
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150

Query: 120 FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 176
              F    V+ ++     Q+  L     R A++P+       S  CK    A  ++  +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
            DR Y  G+ +    G   N  LL  YGF+ ++N  DR   +  +++E
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253


>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
 gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 421

 Score = 42.4 bits (98), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312


>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 480

 Score = 42.4 bits (98), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 85/193 (44%), Gaps = 20/193 (10%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
            +VPL   L A + +  A L   D A+ +   +P K G+ I    G  P S LL  YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSDLLRRYGYV 289

Query: 207 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 256
             DN     VVE  L          N E  +Y   +++ +   ++    + +     +++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILEDGYCILRPSAEDS 346

Query: 257 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 312
           ++D+LP     L      +  E+Q + S   P  P+    E  +   L D  K++L+ Y 
Sbjct: 347 LTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI---LLDSVKSKLSQYG 403

Query: 313 ATLSEDEAMLTDY 325
            T+ +D+A+L  +
Sbjct: 404 TTVEQDKAILQQF 416


>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
 gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
          Length = 377

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 22/171 (12%)

Query: 56  LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 115
           LA   PL WS   L      P +A  L RA+  K   +     W    + F         
Sbjct: 104 LATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP-------- 148

Query: 116 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVD 170
            A   + F+ A++   +   + +    AR     R  L P+   L  +++     +A   
Sbjct: 149 -ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAFTP 206

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
            +  +  DR Y  GE +++  G   N  LL+ YGFV E N +D + ++ A+
Sbjct: 207 ASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257


>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
 gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
          Length = 471

 Score = 42.0 bits (97), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 52/230 (22%), Positives = 97/230 (42%), Gaps = 36/230 (15%)

Query: 25  ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 83
           A++L+ +   G++SFW PYI+ L +       A+  PLLW E++L +L G+  +  +  +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF---------------- 127
           + + +KR        W  A    Q+Y +D P++ FT E+   A+                
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPD 218

Query: 128 -VAVQSCVVHLQKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
              ++    HL K  +        L P L  L ++ +        D  + +  +   K G
Sbjct: 219 VANIKPEFQHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKNIELKPG 278

Query: 185 ESIVVWCGPQP----NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 230
           +   +    +     N+ LL ++GF+  DN    LV+ +  +   P + D
Sbjct: 279 DPFYIAYDKETQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHLD 328


>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
          Length = 319

 Score = 42.0 bits (97), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 7/77 (9%)

Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
           + P +L Y    K     V  +++  + RP KAGE   +  G  P S L+  YGF+  DN
Sbjct: 166 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 220

Query: 211 PYDR--LVVEAALNTED 225
           PYD   L ++ +++ ED
Sbjct: 221 PYDVIPLDLDTSVDEED 237


>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
 gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
          Length = 278

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 147 ALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           A+VPL   +L +S  S+C A+  +  +  +++V RP + GE I +  G   N  L I YG
Sbjct: 186 AIVPL-IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYG 244

Query: 205 FVDEDNPYDRLVVEAALNT 223
           F  +DN  D+  VE +L +
Sbjct: 245 FYLKDNICDK--VEISLGS 261


>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
          Length = 462

 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 28/46 (60%)

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 220
           LV D P  AG+ +    GP+ N + L+ YGF+  +NP D +V++ A
Sbjct: 225 LVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270


>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
           rotundata]
          Length = 277

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 4/91 (4%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTE-DPQYQDKRMVA 235
           R +K G+ I +  GP+PNS   ++ GFV  D+ +D  +  V + L +  DP   ++R + 
Sbjct: 80  RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139

Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 266
           ++        F V++GRE    S +L Y+R+
Sbjct: 140 KKLHLQPWSEFVVNSGREPIPGS-VLAYMRV 169


>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
          Length = 495

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 73/281 (25%), Positives = 105/281 (37%), Gaps = 46/281 (16%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L + ++YE  +G+ S W PY + L            + + W++ ELA L  S    +I  
Sbjct: 98  LIVTIIYEYLRGESSPWHPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGR 150

Query: 84  R-AEG------------------IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 124
           R AE                   +     +L  +  MAGSL   Y +DI  +    +   
Sbjct: 151 RQAEEEWQNTIIPTMADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDN 210

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
               A  +     +         +VP    L A + K  A L    D + +   +P  AG
Sbjct: 211 DKDGADSADDEFEEDDEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAG 270

Query: 185 ESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 243
           E I    GP P S LL  YG+V D    YD +     L  E         VA ++ K   
Sbjct: 271 EQIFNDYGPLPRSDLLRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKD 321

Query: 244 QVFHVHAGREKEAISDMLPYLRLGYV-----SDTSEMQSVI 279
           QV+     RE+E   D L  L  GY       DT  +Q V+
Sbjct: 322 QVW-----REREQQLDELGVLDDGYAITRPEYDTQGLQDVL 357


>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
           Y34]
          Length = 464

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 3/129 (2%)

Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 164
           L  Q+    P E FT E +K A   V S  +            L P    +L +S   K 
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204

Query: 165 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 222
             A  +    + ++  + Y+AG+ + ++ GP  NS+LL  YGFV   N  D   +  A +
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSNDNYDLVLATH 264

Query: 223 TEDPQYQDK 231
            E P +  K
Sbjct: 265 PEAPFFARK 273


>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
          Length = 440

 Score = 42.0 bits (97), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 83/207 (40%), Gaps = 19/207 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ E   G KS W PY+  L +          + L+  E ++  L   P 
Sbjct: 104 ISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TCLVCLEPQVVRLLPKPL 155

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 135
           K +  E+   ++  +      +    SLF +    I    F +  F  A+  + +  V  
Sbjct: 156 KIKAQEQKTQVQELFVSSRGFFSSLQSLFTEDVKHI----FHYHAFLWAWCTINTRTVYM 211

Query: 136 -HLQKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
            H QK  L+    +  L P   LL +S   +  A         ++      K  E + + 
Sbjct: 212 KHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYEIRTTSSCKKYEELFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVV 217
            GP  N +LL+ YGFV  +NP+  + V
Sbjct: 272 YGPHDNHRLLLEYGFVAINNPHSAVYV 298


>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
          Length = 437

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 225
           +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355


>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 79/168 (47%), Gaps = 15/168 (8%)

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
           G + +++ L+  GFV   NP++ + +   ++  D     +  + +     S  + H+   
Sbjct: 63  GKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLILHITG- 121

Query: 252 REKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVLDQLAD 302
            +  ++SD + + R+ +V D+ ++ + +S            P+CP  P  ++A+   L  
Sbjct: 122 -DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA-FLIM 178

Query: 303 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 350
            F+  ++ Y   +SEDE      NL P +R   +L   E ++L +C++
Sbjct: 179 RFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224


>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
 gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
          Length = 453

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 48/209 (22%), Positives = 84/209 (40%), Gaps = 24/209 (11%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           ++L+    LA++LM EK + +KSFW PYI  L  +          P  ++E +   L  S
Sbjct: 99  SQLTPSQALAVFLMCEKCRREKSFWRPYIDILPEE-------YTCPAFFTEEDFRLLPNS 151

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--- 132
             + +   +     +E+ EL   + M   LF         +AF F+ FK A+ A+++   
Sbjct: 152 -LRGKAKAKKYECHKEFMELAPFFKMLADLFPDQ-----EDAFNFKDFKWAWSAIKTRAF 205

Query: 133 -------CVVHLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 184
                      L+         + PL   +  A  +K +         ++   +  Y+  
Sbjct: 206 DVPLGGETCYRLRDSEDTSNPTMFPLVDSINHAAQAKIRHRYNEKRRCLESRTETVYRRH 265

Query: 185 ESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
             ++   G   N  LL+ +GFV   NP D
Sbjct: 266 AEVMNSYGRADNDNLLLEFGFVVPGNPAD 294


>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
          Length = 510

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 34/66 (51%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           ++VPL   L A + KC A L      ++++  R   +GE +    G   NS+LL  YG+V
Sbjct: 238 SMVPLADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYV 297

Query: 207 DEDNPY 212
           + D  Y
Sbjct: 298 EWDGSY 303


>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
 gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
          Length = 504

 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319


>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
           n-methyltransferase i [Nannochloropsis gaditana CCMP526]
          Length = 385

 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 11/121 (9%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           ++E   +AL L+ E+  G  SFW PYI  L          V   L+W E +LA L  SP 
Sbjct: 192 INEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----EVNPTLVWPEGDLALLEASPL 246

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
            A        +  E+  L+  +  A S      +D     FTFE +  AF+ + S  + +
Sbjct: 247 VAATRSLKRKLAAEFALLEEQYMRARSDV----FD--PSVFTFEAYLWAFINIFSRAIRV 300

Query: 138 Q 138
           +
Sbjct: 301 K 301


>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
 gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
          Length = 1709

 Score = 42.0 bits (97), Expect = 0.54,   Method: Composition-based stats.
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 19/88 (21%)

Query: 19   SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
            ++   LALYLMYEK +G++SFW PY  EL+++         + L WS  ELA    S   
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063

Query: 79   AEILERAEGIKREYNELDTVWFMAGSLF 106
                     I +E N+ + ++F+  S+ 
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082


>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
          Length = 479

 Score = 42.0 bits (97), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 83/199 (41%), Gaps = 7/199 (3%)

Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
           + ++  + Y+ G+ I ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K 
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298

Query: 233 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSP 290
            +    G  S     + A  +     ++L YLR   + D +++  +   L  G    V+ 
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVND 355

Query: 291 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 348
             E  VL  L D   + L G+   L + EA L    Y        A Q+   E+ +L   
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415

Query: 349 LQVTADMIMLLPDVTVSPC 367
            +   DM+  +     + C
Sbjct: 416 KKTAEDMLAAVTRPLATQC 434


>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 454

 Score = 41.6 bits (96), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 57/263 (21%), Positives = 108/263 (41%), Gaps = 26/263 (9%)

Query: 59  ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 118
            S + +++ EL    GS         A  I+ +Y +L         LF ++    P++ F
Sbjct: 106 SSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARLFGRHRDVFPSDKF 159

Query: 119 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQL 175
           T + +K A   V S  +   K+       L+     +L +S    +C        + + +
Sbjct: 160 TIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQCHVYDPQSGN-LSI 217

Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 235
           +  + Y+ G+ + +  GP PN++L   YGFV   NP D   +  + +   P ++ K  + 
Sbjct: 218 LAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPNDSYDLVLSTHPMAPFFEQKHKLW 277

Query: 236 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPIC--PV 288
              G  S     +        ++D LP     YLR+  +++T ++ +V +    +    +
Sbjct: 278 IAAGLDSTSTVSL-------TLTDPLPRSVLRYLRIQRLNET-DLAAVGTRQSDVAFEKI 329

Query: 289 SPCMERAVLDQLADYFKARLAGY 311
           S   E  VL  L +   A L G+
Sbjct: 330 SDSNETEVLTFLVESISALLDGF 352


>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
 gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
          Length = 474

 Score = 41.6 bits (96), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 88/234 (37%), Gaps = 49/234 (20%)

Query: 23  CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI- 81
            L L ++YE   G  S W PY   L  +         + + WSE ELA L  S    +I 
Sbjct: 85  SLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWSEDELAELQASAVLNKIG 137

Query: 82  ------------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPY 111
                                          +RAE ++ E N L  +    GSL   Y +
Sbjct: 138 KEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNVL--LMHKMGSLIMAYAF 195

Query: 112 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 171
           D+   A + +   +   A +     L K        ++PL   L A +    A L   + 
Sbjct: 196 DVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADMLNADADCNNARLFYEEK 247

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTE 224
            +++   +P +AGE +    GP P S LL  YG+V D    YD + +   L TE
Sbjct: 248 YLEMKALKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYDVVEINMDLVTE 301


>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
           FGSC 2508]
 gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 503

 Score = 41.6 bits (96), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319


>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
          Length = 494

 Score = 41.6 bits (96), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A +SKC A L     +++++  R  +  E +    G  PNS++L  YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279

Query: 207 DED-NPYD 213
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
 gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
          Length = 573

 Score = 41.6 bits (96), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 28/54 (51%)

Query: 160 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
           SK +  L   DD V +V  R    GE I +  GP PN+ LL   GF   DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388


>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
          Length = 533

 Score = 41.6 bits (96), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 53/229 (23%), Positives = 97/229 (42%), Gaps = 36/229 (15%)

Query: 20  ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 79
           E A L+L  +YE      SFW P+ +   R+       ++    W +  L  L  +  + 
Sbjct: 146 EEAKLSLLFLYEYFD-PDSFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRD 197

Query: 80  EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
            I +    I+ EY++LD       +L  ++P     + F+++ FK AF+ + S  + +  
Sbjct: 198 AIRQLEALIEYEYDQLDL-----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-S 251

Query: 140 VSLARRFALVPL-------GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRP 180
           V+ A    ++P        G   +A+S   +A  A+       DD+V+      +  +  
Sbjct: 252 VNNAPCTVMIPFVDFFNHNGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNET 311

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN---TEDP 226
           +  GE + +      N  LL++YGF    N +D  +V    +   T DP
Sbjct: 312 FLPGEQMFLNYKAHSNEVLLLHYGFALPHNEHDTFLVRLHFDREKTNDP 360


>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3-like [Glycine max]
          Length = 449

 Score = 41.6 bits (96), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 221
           D  +++V +   K  + +++  G   N   L++YGFV   NPYD   L  + AL      
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291

Query: 222 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 274
                  N   P    + +++Q N         V  G ++     +L  LR+   ++   
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351

Query: 275 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
           MQ    S++ SL    P+    E AV   L       L  +P  + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403


>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 508

 Score = 41.6 bits (96), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 64/236 (27%), Positives = 94/236 (39%), Gaps = 30/236 (12%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           A+VP+   L A      A L   + A+++   R  + G+ I    G  PNS LL  YG V
Sbjct: 261 AMVPMADLLNARYGSSNAKLFYEEHALEMRTTRFIRRGDQIWNTYGDPPNSDLLRRYGHV 320

Query: 207 DE--------DNPYDRLVVEAALNTEDPQYQDKRM-VAQR------NGKLSVQVFHVHAG 251
           D          NP D + V A L T+        + VA+R       G   V V      
Sbjct: 321 DLVPLAQGGLGNPADVVEVRADLVTDVVSASGSSIPVAERIDWYLEMGGDDVFVLETDLD 380

Query: 252 REKEAIS--DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 309
             +  I    +L  L   +   T E +          P  P M+ AV+  L +    RL 
Sbjct: 381 IPEPLIVLVRLLQLLEPDW-EKTREKEK---------PPKPKMDGAVIGVLVEVLHRRLK 430

Query: 310 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 365
            YP T+ EDEA+L   +     +    +VRM +K++   LQ T + +  L   +V 
Sbjct: 431 EYPTTIDEDEALLYKEDTLSINKKNAIIVRMGEKII---LQRTLESLQRLASQSVG 483


>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
          Length = 376

 Score = 41.6 bits (96), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 83/194 (42%), Gaps = 31/194 (15%)

Query: 34  QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 93
           + K+S+W PY   L        +   S LL      ++L  S  K E +++ + I  +Y 
Sbjct: 51  RDKQSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY- 102

Query: 94  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 153
               V  +     +Q P DI     T E FK A++ V +  +H+       +   + L P
Sbjct: 103 ----VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAP 153

Query: 154 PL--LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYG 204
            L  L ++++ K     ++    +   R        YK GE + +  GP  N  +L  YG
Sbjct: 154 MLDFLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYG 208

Query: 205 FVDEDNPYDRLVVE 218
           FV  +N Y+ ++++
Sbjct: 209 FVLNENIYNFVLLD 222


>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
          Length = 415

 Score = 41.6 bits (96), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 23/236 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P K
Sbjct: 81  SPLLALCTFLVSEKHAGDQSLWKPYLEILPK-------AYTCPVC-LEPEVVNLFPKPLK 132

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
           A+  E+   ++  ++     +     LF +    I    F++     A+  V +  V   
Sbjct: 133 AKAEEQRARVQGFFSSSRDFFSSLQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVK 188

Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           H Q+   +       L P   LL +S +   KA         ++      +  E + +  
Sbjct: 189 HRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY 248

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
           GP  N +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+   H
Sbjct: 249 GPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISILKDH 300


>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 41.6 bits (96), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)

Query: 57  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
           A  + L +    +  L G+    E+++  E ++++YNEL    F A  L   +P   P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312

Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 168
            +++E F  A     S  + +       R  LVP        L P +L Y       + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367

Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 225
             D+++  + RP +AGE   +  G    S L+  YGF+ E DN  D + ++     +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425


>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 41.6 bits (96), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 80/184 (43%), Gaps = 31/184 (16%)

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 236
           G  I +  GP  N +LL  YGFV E N +DRL       E  +  E   +  +R  +VA 
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
               L+ + +  H GR   A   ++  LR+ ++++  E  + +   GP   +    E  V
Sbjct: 419 YGLHLAGRCWICHDGRPPPA---LVALLRV-HLAEAEEFDT-MERKGPFASLGAGTEARV 473

Query: 297 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 341
           +  +AD  +  L  +  +L EDE +L             D N  P    KR+A  L+RM 
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532

Query: 342 KKML 345
            K +
Sbjct: 533 MKRI 536


>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Galdieria sulphuraria]
          Length = 546

 Score = 41.6 bits (96), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 70/307 (22%), Positives = 118/307 (38%), Gaps = 64/307 (20%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           + LS+   +A  L+ E  + + SFW P+I  L              L+WS  E  YL  S
Sbjct: 137 SDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDELHHF-----LVWSTAETQYLESS 191

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQ--YP-YDIPTEAFTFEIFKQAFVAVQS 132
            T  +IL   E     + EL+T       LF +  YP YD+  + FT   F  A   VQS
Sbjct: 192 STFEDILSLRETASLVFEELNT------ELFPKFLYPQYDV--KYFTLPYFTWALSIVQS 243

Query: 133 CVVH-----------------------LQKVSLARRF------ALVPLGPPLLAYSSKCK 163
             ++                       + + S  R++      +L+ +GP    ++ + +
Sbjct: 244 FGLYDIMDSCPLVIVPGLEWLTYKYSLITEESFFRQYFHISNVSLIRVGP---FFTQERR 300

Query: 164 AMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--PYDRLV 216
             + A +D      V LV +      ++   W          ++ G +DE+         
Sbjct: 301 LKITASEDLKVGEPVSLVYEGNVSLIDTFCRWGWK-------LDLGALDEEQLLKMGSYE 353

Query: 217 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-- 274
           +  A+ T D  + DK  +        +Q F +     KE +  +LP+LRL  + D     
Sbjct: 354 ISFAVTTTDQFFDDKEDILDAQRLELLQTFELRYDMSKELLQRILPFLRLICLKDKDSFI 413

Query: 275 MQSVISS 281
           ++SV  S
Sbjct: 414 LESVFRS 420


>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
          Length = 458

 Score = 41.6 bits (96), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 35/60 (58%)

Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 232
           + ++  + Y+A + + ++ GP PNS+LL  YGFV  DNP D   +  + +   P Y+ K+
Sbjct: 216 LSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHPLAPFYEQKQ 275


>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 41.6 bits (96), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)

Query: 57  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
           A  + L +    +  L G+    E+++  E ++++YNEL    F A  L   +P   P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312

Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 168
            +++E F  A     S  + +       R  LVP        L P +L Y       + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367

Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 225
             D+++  + RP +AGE   +  G    S L+  YGF+ E DN  D + ++     +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425


>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
          Length = 712

 Score = 41.6 bits (96), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 238
           RP +AG+ + +  GP PN KLL  YGFV   NP+D  +V   L   +   + +++ A   
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501

Query: 239 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 279
             L ++    H+ ++      +L  LRL  V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537


>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
          Length = 370

 Score = 41.2 bits (95), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 95/226 (42%), Gaps = 35/226 (15%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           ++++L +E +Q   SFW PY++ L        L  +  LL      +Y+T    +A+I++
Sbjct: 86  VSIFLFFESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQ 134

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVH 136
             E ++R          +    FQ Y  P+     D       F+ F + + AV S + +
Sbjct: 135 MVESVRR----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICY 184

Query: 137 LQKVSLARRFALVPLGPPLL--AYSSKCKAMLAAVDDAVQLV-VDRPYKAGESIVVWCGP 193
           L         ALVPLG      A  +    + A  +    +    R +  G  + V  G 
Sbjct: 185 LPDDIAG---ALVPLGDIFNHEAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYGA 241

Query: 194 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
             N++L++ YGF   DNP+D L        E  ++ + R+V  R G
Sbjct: 242 LSNTELMMYYGFTLNDNPWDTLSFYPHELDESIKFYE-RVVLDREG 286


>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
           melanoleuca]
          Length = 440

 Score = 41.2 bits (95), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 23/236 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSLWKPYLEILPK-------AYTCPVC-LEPEVVNLFPKPLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
           A+  E+   ++  ++     +     LF +    I    F++     A+  V +  V   
Sbjct: 157 AKAEEQRARVQGFFSSSRDFFSSLQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVK 212

Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           H Q+   +       L P   LL +S +   KA         ++      +  E + +  
Sbjct: 213 HRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
           GP  N +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+   H
Sbjct: 273 GPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISILKDH 324


>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
           6054]
 gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 453

 Score = 41.2 bits (95), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 54/238 (22%), Positives = 92/238 (38%), Gaps = 42/238 (17%)

Query: 7   YYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS- 65
           Y + EL     L+    L+LYL +E+++   SFW P++  L            +PL+W  
Sbjct: 111 YKSLELQDLLALTSFQLLSLYLCFERQRIHSSFWKPFLEMLPDISDFSL----NPLIWQV 166

Query: 66  ------ETELAYLTGSPTKAEILERAEGIKREYNE------------LDTVWFMAGSLFQ 107
                 E  + +L  S  +     RAE +   + E            LD +     S  +
Sbjct: 167 LQVDQWEELIQFLPESAKR-----RAEDVYERFLEDYVVVRALVSRILDDLKLSESSADE 221

Query: 108 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKA 164
             P D         +F  A++ + S  +++   Q  + A  F + P     L +S   + 
Sbjct: 222 YIPVD---------LFLWAWMCINSRCLYMTIPQGKTNADNFTMAPY-VDFLNHSCNDEC 271

Query: 165 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 221
            +        +    PY  G+ + +  GP  N  LL  YGFV   DN ++ L + A +
Sbjct: 272 SILIDTTGFHVRTTTPYMPGDQLFLSYGPHCNEFLLCEYGFVIPHDNKWNDLDISAYI 329


>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
          Length = 591

 Score = 41.2 bits (95), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 90/214 (42%), Gaps = 33/214 (15%)

Query: 20  ELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           ELA L L ++YEK  QG  SFW P I  L    G       +   WSE EL  L     K
Sbjct: 91  ELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDESLK 142

Query: 79  AEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           AE +     +++ Y   L  +    G +F         + +T+E F+ A + V+S     
Sbjct: 143 AEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRTFG- 194

Query: 138 QKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGESIVV 189
                  RF    ++VP    L   + +        +     + D      ++ GE   +
Sbjct: 195 -------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEEAFM 247

Query: 190 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 223
             GP+ N++LL++YGF  + N Y+ + +   +NT
Sbjct: 248 SYGPRSNAELLLHYGFALQSNRYEAVELNFRINT 281


>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
           Pb03]
          Length = 488

 Score = 41.2 bits (95), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 82/390 (21%), Positives = 143/390 (36%), Gaps = 61/390 (15%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           +    L +  CL L ++YE  QG  S W PY + L           ++ + W++ EL  L
Sbjct: 84  INERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTD-------FDTLMFWTDAELLEL 136

Query: 73  TGSPTKAEILERA----------------------EGIKREYNELD------TVWFMAGS 104
            GS     I + A                       G    YN  D      ++    GS
Sbjct: 137 KGSAVLGRIGKSAAEEVFLRDLLPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGS 196

Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 164
           L   Y +D+  +        +        V   ++  L +   ++PL   L A + +  A
Sbjct: 197 LIMSYAFDVENDE------AEEVEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNA 248

Query: 165 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE----- 218
            L   D  + +   +  + GE I    G  P ++LL  YG+V D    YD   V      
Sbjct: 249 RLFQEDGYLAMKSIKSIRKGEEIFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTIC 308

Query: 219 --AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
             A L +  P   + R+    + ++    + +        +++ LP   L  ++      
Sbjct: 309 RVAGLKSSTPGPDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPL 368

Query: 277 SVISSLGPICPV-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHP--- 329
              + L     V  P +  A    L +  +  L  YP T+++D+ +L    +Y       
Sbjct: 369 EQFNQLKQKSKVPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPI 428

Query: 330 ---KKRVATQLVRMEKKMLNACLQVTADMI 356
              + ++A Q+ + EK++LNA L    D I
Sbjct: 429 SAGRLKMALQVRKGEKEILNAVLSELEDFI 458


>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 41.2 bits (95), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 66/137 (48%), Gaps = 9/137 (6%)

Query: 192 GPQPNSKLLINYGFVDEDNPYDRL--VVEAALNTEDPQYQDKR--MVAQRNGKLSVQVFH 247
           GP  N +LL  YGFV E+N +DRL    +        ++  +R  +VA     L+ + + 
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429

Query: 248 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 307
            H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V+  +AD  +  
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484

Query: 308 LAGYPATLSEDEAMLTD 324
           L  +  +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501


>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
 gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
          Length = 537

 Score = 41.2 bits (95), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 51/229 (22%), Positives = 91/229 (39%), Gaps = 28/229 (12%)

Query: 17  KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 76
           K+  + C   +   E+ +G+KS W  YI  L +         ++PL +++ EL  L  + 
Sbjct: 74  KMHTIVCT--FFALERLKGEKSQWAKYIEYLPK-------TFDTPLYFTDDELKSLEHTN 124

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 136
                 +R    K E+         A  L      D P + F++ ++  A     S    
Sbjct: 125 IFYGCNDRKRIWKEEHA-------TAAKLL-----DNP-DNFSWNMYLWAATVFSSRCFS 171

Query: 137 LQKVSLARRFALVPLGPPL---LAYSSKCKAMLAAV---DDAVQLVVDRPYKAGESIVVW 190
              +         P+  PL   L +  +C  +   V     AVQLV  +P  +G  +   
Sbjct: 172 SALLGEEDTDDAAPILIPLVDSLNHKPRCPIIWNKVTKESHAVQLVSVKPISSGGQVYNN 231

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 239
            GP+ N +LL+ YGF   +N ++   +  +L+      + KR +   +G
Sbjct: 232 YGPKGNEELLMGYGFCLPNNEFETFALRLSLDKAVYNSEKKRSILASHG 280


>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
          Length = 439

 Score = 41.2 bits (95), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 85/201 (42%), Gaps = 19/201 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L  +P K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
            +  E+   ++  ++     +     LF +    I    F++   + A+  V +  V+++
Sbjct: 157 TKAWEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMK 212

Query: 139 K---VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWC 191
           +   + L+       L P   LL +S   +   A  ++     +    + G+   + +  
Sbjct: 213 RPPQLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY 212
           GP  N +LL+ YGFV   NP+
Sbjct: 273 GPHDNHRLLLEYGFVSVSNPH 293


>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 494

 Score = 41.2 bits (95), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L + +  C A L    + +++   +P +AGE +    G  PNS++L  YG+V
Sbjct: 221 SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEHPNSEILRRYGYV 280

Query: 207 D 207
           +
Sbjct: 281 E 281


>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 498

 Score = 41.2 bits (95), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 91/205 (44%), Gaps = 18/205 (8%)

Query: 35  GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 94
           G  + W  YIR +         ++  P  ++E EL  L G+  +  +  +   +++E+  
Sbjct: 129 GISNPWTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFER 181

Query: 95  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---L 151
           L           Q+Y +D  T   TF+ +K      +S VV L +       A+VP   +
Sbjct: 182 LRQS-TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDM 236

Query: 152 GPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFV-DE 208
                  S K +   ++ +DA+ QL   R   +GE + +  G + P S+++ +YGFV +E
Sbjct: 237 ANHASEDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENE 296

Query: 209 DNPYDRLVVEAALNTEDPQYQDKRM 233
                ++ ++  +  +DP    K+M
Sbjct: 297 RTDAKQIFLDLEIPDDDPLRMAKQM 321


>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 488

 Score = 41.2 bits (95), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 83/390 (21%), Positives = 144/390 (36%), Gaps = 61/390 (15%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           +    L +  CL L ++YE  QG  S W PY + L           ++ + W++ EL  L
Sbjct: 84  INERDLGQWLCLILVMIYEYLQGAASPWAPYFKVLPTD-------FDTLMFWTDAELLEL 136

Query: 73  TGS--------PTKAEILERA--------------EGIKREYNELD------TVWFMAGS 104
            GS         T  E+  R                G    YN  D      ++    GS
Sbjct: 137 KGSAVLGRIGKSTAEEVFLRDLLPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGS 196

Query: 105 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 164
           L   Y +D+  +        +        V   ++  L +   ++PL   L A + +  A
Sbjct: 197 LIMSYAFDVENDE------AEEVEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNA 248

Query: 165 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE----- 218
            L   D  + +   +  + GE I    G  P ++LL  YG+V D    YD   V      
Sbjct: 249 RLFQEDGYLSMKSIKSIRKGEEIFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTIC 308

Query: 219 --AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 276
             A L +  P   + R+    + ++    + +        +++ LP   L  ++      
Sbjct: 309 RVAGLKSSTPGPDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPL 368

Query: 277 SVISSLGPICPV-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHP--- 329
              + L     V  P +  A    L +  +  L  YP T+++D+ +L    +Y       
Sbjct: 369 EQFNQLKQKSKVPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPI 428

Query: 330 ---KKRVATQLVRMEKKMLNACLQVTADMI 356
              + ++A Q+ + EK++LNA L    D I
Sbjct: 429 SAGRLKMALQVRKGEKEILNAVLSELEDFI 458


>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
           UAMH 10762]
          Length = 633

 Score = 41.2 bits (95), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 55/230 (23%), Positives = 92/230 (40%), Gaps = 27/230 (11%)

Query: 26  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILER 84
            YLM +    ++SFW PY+  L            +PL + +  +LA+L G+     +L R
Sbjct: 87  FYLMTQYLNKEQSFWKPYLDVLPSPS-----EFSTPLWFDAPADLAWLDGTDVLHTMLAR 141

Query: 85  AEGIKREYNELDTVWFMAGSLFQQYPYDI---PTEAFTFEIFKQAFVAVQS---CVVHLQ 138
            E   + Y     V   +G     Y +D+       FT   F    +  Q+     VH  
Sbjct: 142 REVYAQYYQSGLKVLSESGIDVTLYTWDLFRWAITTFTSRSFTSRVLLPQNRKYWPVHRT 201

Query: 139 KVSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKA 183
             +  R+  L+ +              P L + +    A +    DA Q  +   +P +A
Sbjct: 202 STNGRRQTVLLDMSHSPAEDLDFSVLFPGLDSGNHDPNAQVDWSFDANQFSIALVQPIEA 261

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 233
           G  +    GP+ N +LL+ YGF   +NP D +++      E  Q + KR+
Sbjct: 262 GAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLLTLKAPPEALQVELKRI 311


>gi|145485580|ref|XP_001428798.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395886|emb|CAK61400.1| unnamed protein product [Paramecium tetraurelia]
          Length = 331

 Score = 41.2 bits (95), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
            LVP G  +L +  KC+     +   +  +  +  +  E I  + G   NS LL  YGF 
Sbjct: 186 CLVPFGD-MLNHHDKCQTKQKIIGTDLVFITTKQIQENEEIYNFFGEHGNSFLLCWYGFT 244

Query: 207 DEDNPYDRL 215
            ++N YD+L
Sbjct: 245 YDNNIYDKL 253


>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
          Length = 396

 Score = 40.8 bits (94), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 25/111 (22%), Positives = 50/111 (45%), Gaps = 12/111 (10%)

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
           ++++   + AL P+   L      C+ +         +  DR YK GE + +  G   N 
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251

Query: 198 KLLINYGFVDEDNPYDRLVVEAA----LNTEDPQYQDKR------MVAQRN 238
            L++ YGF  E+N +D + ++      ++T   ++ D R      ++ +RN
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEEMSTARKKWLDGRDFLGKYLIDERN 302


>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
           DL-1]
          Length = 499

 Score = 40.8 bits (94), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 87/214 (40%), Gaps = 28/214 (13%)

Query: 11  ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 70
           E+L T  L++   L L L YE   G++S W  Y+  L  +         S + WS  EL 
Sbjct: 77  EVLET--LNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFWSSEELE 127

Query: 71  YLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQQYPYDI 113
            L  S     I  E+AE              G K+  EY  +D  + +  S+   Y +D+
Sbjct: 128 KLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIMSYSFDV 186

Query: 114 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 173
                  E  +              K     + ++VPL   L + ++   A L+  +DA+
Sbjct: 187 DDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLSYENDAL 245

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 207
            +   +  K GE I    G  PNS++L  YG+V+
Sbjct: 246 VMTATKDIKKGEQIYNIYGELPNSEILRKYGYVE 279


>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
          Length = 440

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 56/236 (23%), Positives = 96/236 (40%), Gaps = 23/236 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
           A+  E+   ++  ++     +     LF +    I    F++     A+  V +  V   
Sbjct: 157 AKAEEQRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVK 212

Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWC 191
           H Q+   +       L P   LL +S + +   A  ++    ++      +  E + +  
Sbjct: 213 HRQRQCFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
           GP  N +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+   H
Sbjct: 273 GPHDNQRLLLEYGFVSIHNPHACVYVSEDILVKYLPTTDKQM----NKKISILKDH 324


>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 377

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 143 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 202
           A R ALVP+        + C A      D   +  DR Y  G+ I +  G   N  LL  
Sbjct: 177 ADRLALVPIADLFNHADTGCGASFTP--DGFVVSTDRKYHVGQEIYISYGTHTNDLLLAE 234

Query: 203 YGFVDEDNPYDRLVVE 218
           YGFV   N +D+  ++
Sbjct: 235 YGFVPMANRWDKTCLD 250


>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
 gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
          Length = 536

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 96/240 (40%), Gaps = 24/240 (10%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N ++    L L+ M E+      F  PY   L      G       L +    LA L G+
Sbjct: 221 NNITSETMLLLWSMRERYNLGSKF-KPYFDTLPANFNTG-------LSFGIDALAALEGT 272

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
               EI++  + ++++Y+EL  +      L   +P     +  T++ F  A     S  +
Sbjct: 273 LLFDEIIQARQHLRQQYDELFPL------LCTNFPEMFRKDVCTWDDFLWACELWYSNSM 326

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCG 192
            +   S      LVP+   L    S        VD+A + +   + RP  AGE   +  G
Sbjct: 327 MIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYG 386

Query: 193 PQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 251
             P S L+  YGF+   DNPYD + ++      D    D+ + AQ +   S Q  H+  G
Sbjct: 387 KHPGSHLVTFYGFLPRGDNPYDVIPLDL-----DTSVDDEDIAAQSSATTS-QTTHMVRG 440


>gi|145354661|ref|XP_001421597.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581835|gb|ABO99890.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 341

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 73/320 (22%), Positives = 126/320 (39%), Gaps = 30/320 (9%)

Query: 49  RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG-----IKREYN-ELDTVWFMA 102
           R+R +G ++  +P + S  E        T      R EG       REY  + +  W  A
Sbjct: 2   RERAKGGVSAYAPFVESLYEHTPARAVETSRAARARLEGHAAAETMREYERDAEDGWRAA 61

Query: 103 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 162
              F+ +P       FT   F++A   V++     +     R  ALVP+   LL  +   
Sbjct: 62  RRTFETFPSIFSVHEFTRAAFEEALAIVRANSFEARSEDGTRARALVPMAHLLLHDTGSE 121

Query: 163 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG----FVDEDNPYDRLVVE 218
              +  VD    + VD  ++ G+ +    G   +++    +G    +  E N  ++  ++
Sbjct: 122 VPCVKIVDGVFVINVD-EHEEGDELSCSHGDYSDAETFARFGVSAFYSAEKNARNK--IK 178

Query: 219 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 278
            A  +E   Y  K +   R G +    F   AG    A  + +  LRL   ++T E  ++
Sbjct: 179 FAFPSE--IYSMKSL--DRCGSVENIAF-TDAG----ATEEFMCALRLASANET-EWAAI 228

Query: 279 ISSLGPI-----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKK 331
             S   +      P+S   E AV + L       L  YP++ +EDE +L    L   P +
Sbjct: 229 SKSKASVRALRKKPLSEESEIAVYEALFATLTELLNSYPSSDNEDERLLQSRTLQSAPDE 288

Query: 332 RVATQLVRMEKKMLNACLQV 351
             A  +   EK++  + L  
Sbjct: 289 ERAVTIRLREKRLALSSLNA 308


>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
 gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 486

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 170 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           D+A Q   +VV R   AG+ +++  G   N +LL +YGFV +DNP+D   ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298


>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
 gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
 gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
          Length = 451

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)

Query: 174 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 227
            L  D  Y+AGE I +  G   N+KLL+ YGF    NP D   VE  + T       DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333

Query: 228 YQDKRMVAQR-----NGKLSVQVFHV 248
            +  R+  ++     + +L  Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359


>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
          Length = 440

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 94/240 (39%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q   L+       L P   LL +S   + KA       + ++     ++  E + +  
Sbjct: 213 PRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
 gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
          Length = 508

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 85/196 (43%), Gaps = 21/196 (10%)

Query: 24  LALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 82
           L L L+YE K + +KS W  Y   L         A+   + W+E EL +L  S     I 
Sbjct: 89  LILCLLYEWKGKKEKSKWWKYFNVLPSSN-----AMNGLMYWNEQELEHLRPSLVLDRIG 143

Query: 83  ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE---IFKQAFVAVQSCVVHLQK 139
           +++   K  Y+++ T+  +  S F +   ++  E F +    I   +F         L +
Sbjct: 144 KKS--AKNMYHKVLTL--VKESKFPEVLCNVEWEDFVYAASVIMAYSFDVENGESQTLNE 199

Query: 140 VSLARRF--------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
               +          +++PL   L + + +C A L   D  +++   +P K GE +    
Sbjct: 200 EDDDQDEEENTGYIKSMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIY 259

Query: 192 GPQPNSKLLINYGFVD 207
           G  PN+++L  YG+V+
Sbjct: 260 GNHPNAEILRRYGYVE 275


>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
 gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
          Length = 494

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 50/216 (23%), Positives = 95/216 (43%), Gaps = 32/216 (14%)

Query: 17  KLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-- 73
           +L     L L ++YE K  G++SFW  Y++ L + +    L     + WS  +LA L   
Sbjct: 84  ELGHWEGLILCMVYEIKVMGQQSFWWNYLQVLPKSQDLNTL-----VYWSADQLAALEPS 138

Query: 74  ------GSPTKAE----ILERAEGIKREY-NELDTVWF----MAGSLFQQYPYDIPTEAF 118
                 G+    E    IL+  +    E+ +++  + F       S+   Y +D+  +  
Sbjct: 139 LVVGRLGADESQEMYRQILKYIQNFGPEFQSKIGQLTFEEFVHVASVIMSYSFDVDLKGE 198

Query: 119 TFEIFKQAFVAVQ----SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
             E  +      +    S V H + +      ++VPL   L A + +  A L    ++++
Sbjct: 199 DDEDDEDEDEGEEEEGESNVAHDKYMK-----SMVPLADTLNADTKQFNAHLVYDKESLK 253

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 210
           +V  +P K G+ +  + G  PN+++L  YG+V+ D 
Sbjct: 254 MVSVKPIKMGQQVYNFYGEHPNAEILRRYGYVEWDG 289


>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
           chloroplastic-like, partial [Cucumis sativus]
          Length = 923

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 73/175 (41%), Gaps = 22/175 (12%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L  N++  +A LA+ ++ E K G  S W PYI  L +        + + + W E+EL  +
Sbjct: 768 LLGNEIGNVAKLAVVVLLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMI 822

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 132
             S    E L +   IKRE+  +           + +P  I  +  + + F  A+  V S
Sbjct: 823 RKSSLYEESLNQRSQIKREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS 873

Query: 133 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 185
                +        +L+P     L +    +AML   DD    ++V DR +  GE
Sbjct: 874 -----RAWRSTEGVSLIPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922


>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
           206040]
          Length = 463

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 69/310 (22%), Positives = 123/310 (39%), Gaps = 57/310 (18%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTK 78
           LA+YL++ +           +RE   +  R  +A       S + ++E EL    G+   
Sbjct: 77  LAVYLLFVR-----------LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLY 125

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
               +  E I+ +Y        +   +F Q+P  +P    + + +K A   V S  +   
Sbjct: 126 TITKQLEERIEDDYR------VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDF- 178

Query: 139 KVSLARRFALVPLGPPL---------LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESI 187
                    ++P G PL         + +S + K   A    +  L V   + Y+ G+ I
Sbjct: 179 ---------VLPNGKPLRVLAPFADMINHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQI 229

Query: 188 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
            +  G  PN++LL  YGFV  +NP D   +  + +   P Y+ K+ +    G  S     
Sbjct: 230 YISYGSIPNNRLLRLYGFVIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIP 289

Query: 248 VHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLA 301
           +        + D LP     YLR+  + D S++ ++ +  L     +S   E  +L  L 
Sbjct: 290 L-------TLIDPLPKSVLRYLRIQRL-DASDLAAIALQKLDTNEKISNSKEVEILQFLV 341

Query: 302 DYFKARLAGY 311
           +   A L G+
Sbjct: 342 ESISALLDGF 351


>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
          Length = 397

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G +S W  Y+  L +       +   P+   E E+  L  SP 
Sbjct: 61  VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 112

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           KA+  E+   ++  +      +     LF + P D     F++  F  A+  V +  V+L
Sbjct: 113 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 168

Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
           +   +  L+       L P   LL +S   + KA         ++      +  + + + 
Sbjct: 169 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 228

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
            GP  N +LL+ YGFV   NP+  + V A      L   D Q   K  + + +G
Sbjct: 229 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 282


>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
          Length = 378

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G +S W  Y+  L +       +   P+   E E+  L  SP 
Sbjct: 42  VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 93

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           KA+  E+   ++  +      +     LF + P D     F++  F  A+  V +  V+L
Sbjct: 94  KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 149

Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
           +   +  L+       L P   LL +S   + KA         ++      +  + + + 
Sbjct: 150 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 209

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
            GP  N +LL+ YGFV   NP+  + V A      L   D Q   K  + + +G
Sbjct: 210 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 263


>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 229

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 8   YAAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 50
           +A  LL TNK  +     LA +L++E  +G++SFW PY+  L RQ
Sbjct: 64  FAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108


>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
           [Brachypodium distachyon]
          Length = 495

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
           D+ +++++ R   AGE +    G   N+ LL  YGF + DNPYD + ++  L T+
Sbjct: 251 DEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFTELDNPYDIVNIDLTLVTK 305


>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
           strain 10D]
          Length = 576

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 155 LLAYSSKCKAMLAA--VDDAVQLVV-DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
           L  +SS+ ++ +A     DA  L + +R   AG+ + +  G   N +LL  YGFV+EDNP
Sbjct: 411 LFNHSSRVQSKVAYEYFYDAFSLSISNRDTHAGDQVFISYGTLTNDELLALYGFVEEDNP 470

Query: 212 YD 213
           +D
Sbjct: 471 HD 472


>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
          Length = 447

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 228
           +A  L   R Y+ GE +++  G   N +LL +YGF+ E+N  D++ +  E +L +    +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272

Query: 229 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 288
               +   ++GKLS                 ++  LRL  +  +   +SV+  +     +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317

Query: 289 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 347
           S   E  V+  +++   + L   P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371


>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
 gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
          Length = 177

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 35/64 (54%), Gaps = 2/64 (3%)

Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 214
            L  S  CK  + AV +++++   R  KAG    +  G  PN  LL  YGFV E+NP+D 
Sbjct: 54  FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111

Query: 215 LVVE 218
           + VE
Sbjct: 112 IPVE 115


>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 372

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           R A++P+          C+A  A+  +    + DR Y+ GE + +  G      LL  YG
Sbjct: 179 RLAILPVADLFNHADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYG 236

Query: 205 FVDEDNPYDRLVVEAAL 221
           FV  +N +D + ++ A+
Sbjct: 237 FVPTENCWDVVCLDEAI 253


>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
 gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
          Length = 593

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 70/158 (44%), Gaps = 14/158 (8%)

Query: 65  SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 124
           S+ ++A L G+P   + + RA     E  E  +  F   SL   YP     E F++E + 
Sbjct: 276 SQEDVALLEGTPLHGDAV-RARQHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYL 332

Query: 125 QAFVAVQSCVVHLQKVSLARRFALVP-LG-----P-PLLAYSSKCKAMLAAVDDAVQLVV 177
            A     S  + +Q  S   R  L P LG     P P + + SK       +    ++  
Sbjct: 333 WAAELWYSYGIQVQFASGDIRTCLAPYLGLMNHHPLPHVVHFSKVDPETGCL----RVRA 388

Query: 178 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
            RP +AG  + +  GP  N+KLL+ YGF   DNP D +
Sbjct: 389 FRPCEAGNQLFLSYGPYSNAKLLLFYGFAVRDNPADEV 426


>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
 gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
          Length = 539

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 63/312 (20%), Positives = 119/312 (38%), Gaps = 36/312 (11%)

Query: 28  LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 87
           L+ EK +G  S W P+I  L  +         + L ++  ++  L G+   +  +     
Sbjct: 195 LIIEKVRGAASNWQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRV 247

Query: 88  IKREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVH 136
           I R Y  +         D+V     +LF +Y   Y++   A +    +Q  V  Q     
Sbjct: 248 IARIYASMYKCAYMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATD 306

Query: 137 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 196
              V  +   AL+P          K  +        ++ +    +KAGE   ++ G + N
Sbjct: 307 SDGVRNSPMSALIPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCN 366

Query: 197 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAG 251
           +  L+++GF+D +N  D + +   L+  D   + + ++      +R  +L V     H  
Sbjct: 367 ADRLVHHGFLDMNNLKDYVHIRLGLSPTDALAEQRALLLSELNIERKAELRVLPAPEHIS 426

Query: 252 REKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 305
            E      +L ++R+  +S        S+++  +  L   C +   +E      L    K
Sbjct: 427 GE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLK 480

Query: 306 ARLAGYPATLSE 317
             L    ATL E
Sbjct: 481 LLLGVLDATLKE 492


>gi|71656153|ref|XP_816628.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70881769|gb|EAN94777.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 82/187 (43%), Gaps = 31/187 (16%)

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 236
           G  I +  GP  N +LL  YGFV E N ++RL       E A+  E   +  +R  +VA 
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVVEGNEHERLPFPFDFPEGAVGDE---WDGRRAALVAT 418

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
               L+   +  H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473

Query: 297 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 341
           +  +AD  +  L  +  +L EDE +L             D N  P    KR+A  L+R+ 
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHAGDDGNTQPLSCNKRLAI-LLRVG 532

Query: 342 KKMLNAC 348
            K +  C
Sbjct: 533 MKRIAHC 539


>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 25/43 (58%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           + YK GE +++  G  PN +LL+ YGFVD+ N  D    E  L
Sbjct: 350 KSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLL 392


>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
 gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
          Length = 685

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
            ++ P+   L A      A L      +++   +P  AGE I       PNS LL  YG 
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427

Query: 206 VDEDNPYD------RLVVEAALN 222
           VDE N  D      +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450


>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 661

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 70
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 105
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552


>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 471

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)

Query: 167 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 216
           A  +  V++V +   K  + +++  G   N   L++YGFV E NPYD +           
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311

Query: 217 VEAALNTEDPQYQD-----KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 271
              A     P++        ++++Q N    +    V  G  +     +L  +R+    +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371

Query: 272 TSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 322
             E++      + SL  I P+    E AV   +       L+ +P  + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426


>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
          Length = 699

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 170 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           D+A Q   +VV R   AG+ +++  G   N +LL +YGFV +DNP+D   ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450


>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
 gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
          Length = 573

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 50/238 (21%), Positives = 94/238 (39%), Gaps = 38/238 (15%)

Query: 28  LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 87
           L+ EK +G  S W P+I  L  +         + L ++  ++  L G+   +  +     
Sbjct: 203 LIIEKVRGAASAWQPFIDTLPTR-------YSTVLYFTVDQMQRLRGTSACSAAMRHCLV 255

Query: 88  IKREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQ-SCVV 135
           I R Y  +         D V     +LF +Y   Y++   A +    +Q  V  + S V 
Sbjct: 256 IARLYASMYKCAYIQPGDNVMAAKANLFTEYGLCYELYRWAVSTVTTRQNLVPRELSTVG 315

Query: 136 HLQKV----------------SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LV 176
            + +V                + AR   +  L P     + +C  + +  D A Q     
Sbjct: 316 EVDQVCQLGGFEGTEIKRDAETGARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECT 375

Query: 177 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
               +KAGE   ++ G + N+  L+++GF+D  N  D + +   L+  DP  + + ++
Sbjct: 376 AQEAFKAGEQFFIYYGDRSNADRLVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433


>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 415

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 139 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 198
           K++ A  +A+ P         S C+    A      ++ DR Y+AGE + V  GP  N  
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247

Query: 199 LLINYGFVDEDNPYD 213
           LL+ YGF+ + N  D
Sbjct: 248 LLVEYGFLLDANSND 262


>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 468

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 50/117 (42%), Gaps = 10/117 (8%)

Query: 10  AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           A+ L   +L     L + +M E+  G +S W  Y   L   RG   L    P+ W+E  L
Sbjct: 87  AKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEARL 141

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 126
             L G+     + E AE ++ +Y+E      +   L   +P     E  TFE + +A
Sbjct: 142 EALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193


>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 457

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 29/48 (60%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
           DDA+ + V +  KAGE +    G   N+KLL +YGF   DNP D++ +
Sbjct: 232 DDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNPADKVTI 279


>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
 gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
          Length = 263

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 33/63 (52%)

Query: 162 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           C   +   D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL
Sbjct: 15  CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74

Query: 222 NTE 224
            T+
Sbjct: 75  VTK 77


>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 514

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 57/230 (24%), Positives = 95/230 (41%), Gaps = 20/230 (8%)

Query: 16  NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 75
           N +  +  LAL LMYE+ +   S W  ++R L          +ES L W++ EL  +   
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183

Query: 76  PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 135
             K    ER   ++  Y  + T           Y  D+    FT E F  A V  Q+  +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 195
           H    S     +L+P+   ++ +  +  A++ A    + +      KAGE I +      
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITI-DYEMS 292

Query: 196 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 245
           +  LL+ YGFV+     D   +  +  ++D  Y  +  + +  G LS QV
Sbjct: 293 SHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYPRRLRLLEGRG-LSRQV 341


>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 492

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 90/242 (37%), Gaps = 46/242 (19%)

Query: 13  LTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 69
           L +  L EL     L L ++YE    ++S W PY R             ++ + WS  EL
Sbjct: 81  LLSQDLGELGPWLSLMLVMIYEYLLREQSAWAPYYRIFPEN-------FDTLMFWSPAEL 133

Query: 70  AYLTGSPTKAEILERA-----------------------EGIKREYNELDTVWFMA---- 102
             L GS    +I  +                        +G+     E  T   +     
Sbjct: 134 QELQGSAIVDKIGRQGAEESILQMIAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHV 193

Query: 103 -GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 161
            GSL   Y +DI       +   +      +     Q         +VPL   L A + +
Sbjct: 194 MGSLIMAYAFDIEKVNDEDDEDNEGEDGYMTDEEEDQSSK-----GMVPLADILNADADR 248

Query: 162 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAA 220
             A L   +D++ +   +P  AG+ I    G  P S LL  YG+V ++  PYD  V+EA+
Sbjct: 249 NNARLFQEEDSLVMKAIKPIAAGDEIFNDYGELPRSDLLRRYGYVTDNYAPYD--VIEAS 306

Query: 221 LN 222
           L+
Sbjct: 307 LD 308


>gi|156054286|ref|XP_001593069.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980]
 gi|154703771|gb|EDO03510.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 418

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 51/231 (22%), Positives = 95/231 (41%), Gaps = 24/231 (10%)

Query: 8   YAAELLTTNKLSELACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 66
           +  E L T K  +   +   +LM +  + +KS W  YIR L +      L +  P+ W E
Sbjct: 74  FPIEFLETLKQDDPNIIGHFFLMQQYLKCEKSPWWQYIRLLPQPGDPKSLGI--PIWWPE 131

Query: 67  TELAYLTGSPT----------------KAEILERAEGIKREYNELDTVWFMAGSLFQQYP 110
            +  +L G+                  K  +L R     +EY+ +   W  A ++F    
Sbjct: 132 EDQKFLAGTNAGPPLQKREQMWRDQWKKGVVLLRELPNHKEYSYILYQW--AATIFDSRS 189

Query: 111 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 170
           +  P+     E   ++   +   + H++    +  + LV +G        + K  L++  
Sbjct: 190 FR-PSLTICPEALSESSKEMDLNLDHVRNDRFSILYPLVDIGNHNGINQVEWKKDLSS-- 246

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           ++  LV       G+ I  + G + NS+LL+ YGF+  ++  +R VV   L
Sbjct: 247 NSFDLVHSAGVSEGDQIYNYYGNKSNSELLLGYGFILPNDIVNRNVVNLKL 297


>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 50/206 (24%), Positives = 82/206 (39%), Gaps = 25/206 (12%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------P 76
           L + +M E   GK S W  Y+  +      G     S L W + ++  L G+        
Sbjct: 110 LVIAVMCEMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYE 165

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVV 135
              E L+     +  +  + T++F    LF +    +  EA         A VA  S  +
Sbjct: 166 NDDETLDLPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTL 222

Query: 136 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWC 191
              ++      A+VP    +L ++  C+A +    D     +Q++  R  K GE +    
Sbjct: 223 GDDEIQ-----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTY 276

Query: 192 GPQPNSKLLINYGFVDEDNPYDRLVV 217
           GP  N++LL  YGFV   NP+    V
Sbjct: 277 GPLRNAELLRRYGFVLPRNPHGGTTV 302


>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 441

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 29/46 (63%)

Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           + LV+  P   G  ++   GP+PN++L++ YGF   +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295


>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
 gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
          Length = 439

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G +S W  Y+  L +       +   P+   E E+  L  SP 
Sbjct: 103 VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 154

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           KA+  E+   ++  +      +     LF + P D     F++  F  A+  V +  V+L
Sbjct: 155 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 210

Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
           +   +  L+       L P   LL +S   + KA         ++      +  + + + 
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
            GP  N +LL+ YGFV   NP+  + V A      L   D Q   K  + + +G
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 324


>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
          Length = 482

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 48/246 (19%)

Query: 35  GKKSF----WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 90
           G ++F    W  YIR L R        +  P +W+  E   L G+  +A +  +   +  
Sbjct: 111 GTRAFASTPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSS 163

Query: 91  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 150
           EY++L      A +L   +   + +E+ T E +  A    +S  + L +       A+VP
Sbjct: 164 EYDKLCE---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP 214

Query: 151 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFV 206
            G  +  +S    A      D   +++ RP     AG  I +  G  +P +++L +YGF+
Sbjct: 215 -GLDMANHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFI 273

Query: 207 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 266
           D+D+    L +      +DP              L    FH++ G          P +RL
Sbjct: 274 DKDSTVKELTLHLEALPDDP--------------LGRAKFHIYKGP---------PTVRL 310

Query: 267 GYVSDT 272
             ++D 
Sbjct: 311 SIINDN 316


>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
 gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
          Length = 395

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 37/79 (46%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 229 ALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 288

Query: 207 DEDNPYDRLVVEAALNTED 225
           D++N  D + +   L+  D
Sbjct: 289 DDNNLKDYVNIRVGLSLTD 307


>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
 gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
          Length = 439

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 95/234 (40%), Gaps = 24/234 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           +S L  L  +L+ EK  G +S W  Y+  L +       +   P+   E E+  L  SP 
Sbjct: 103 VSPLLALCTFLVSEKHAGCRSLWKSYLDILPK-------SYTCPVCL-EPEVVDLLPSPL 154

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
           KA+  E+   ++  +      +     LF + P D     F++  F  A+  V +  V+L
Sbjct: 155 KAKAEEQRARVQDLFTSARGFFSTLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYL 210

Query: 138 Q---KVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 190
           +   +  L+       L P   LL +S   + KA         ++      +  + + + 
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAA-----LNTEDPQYQDKRMVAQRNG 239
            GP  N +LL+ YGFV   NP+  + V A      L   D Q   K  + + +G
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHG 324


>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
 gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
          Length = 440

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 85/201 (42%), Gaps = 19/201 (9%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L  +P K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
            +  E+   +   ++     +     LF +    I    F++   + A+ AV +  V+++
Sbjct: 157 TKAWEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMK 212

Query: 139 K---VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWC 191
           +   + L+       L P   LL +S   +   A  ++     +    + G+   + +  
Sbjct: 213 RPPLLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY 212
           GP  N +LL+ YGFV   NP+
Sbjct: 273 GPHDNHRLLLEYGFVCVSNPH 293


>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
          Length = 377

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 39/164 (23%), Positives = 71/164 (43%), Gaps = 25/164 (15%)

Query: 61  PLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 119
           P+LW   EL  L   P ++++ LER E       E    W      F++   D+P + +T
Sbjct: 107 PMLWPR-ELKQLL--PLESQVTLERRE------KEFQDNW----DDFKEAFPDVPRDDYT 153

Query: 120 FEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 174
           +     A++ V +   + +     K     R AL+P+       +  C+   +  +    
Sbjct: 154 Y-----AWLVVNTRTFYHETPETLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYH 207

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           +V DR YK GE + +      N   L+ YGF+ ++N  D + ++
Sbjct: 208 VVADRAYKKGEELFISYSSHSNDYNLLEYGFIPDENSLDDVYID 251


>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
          Length = 577

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 87/222 (39%), Gaps = 58/222 (26%)

Query: 42  PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 100
           P+I  L   R  G     SP  W+E E + +    T A++      I    N+L   W+ 
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157

Query: 101 MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 144
           +   L   FQ Y Y    + F            FE F    V+  S   +L   ++  +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217

Query: 145 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 183
            F      P L++ + +C+     ML  + D                  +    ++  K 
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 225
           G+ I    GP+ N +LL  YGFV ++N YD+ V+  AL  +D
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL--ALRLQD 311


>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
 gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
          Length = 524

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 98/228 (42%), Gaps = 32/228 (14%)

Query: 146 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 205
            A+VP+   L A      A L   ++ ++++  RP K GE I    G  PN++LL  YG 
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335

Query: 206 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 246
           VD        + NP D       L+V  A     L+T+D   +    + +    +    F
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGDDVFDLYF 395

Query: 247 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 306
            +      E    ++  +RL  + D  E + +        P  P M+   L  L +  + 
Sbjct: 396 DL------EIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445

Query: 307 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 353
           RL  YP ++ +DE +L T  +L+ +  +  +L   EKK+L+  L  TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491


>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
            V+L+  R   +GE I +  G   N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317


>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
 gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
          Length = 2474

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           ++ L+       G+ +    GP+PNS+L+++YGF  +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255


>gi|453088140|gb|EMF16181.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 307

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 45/185 (24%), Positives = 70/185 (37%), Gaps = 24/185 (12%)

Query: 61  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
           PL WS     +L  S    + LER     RE  E D  + ++         +   + +  
Sbjct: 112 PLFWSPELCHHLPES--VKQPLERM----REDYEKDLTYMLS--------LNCDDQTWKE 157

Query: 121 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 179
           E FK  +  V S   H +   L   F  L P    +    +     ++      ++V DR
Sbjct: 158 EDFKYYWAIVNSRCFHFKPPGLKPGFMVLCPFIDYMNHGPTGTGVKVSQSPKGYEVVADR 217

Query: 180 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 230
            Y+    I+   G  PN KLL++YGF          D+D   D +++ A       Q QD
Sbjct: 218 DYEPNTEILATYGSHPNDKLLVHYGFCLSYKPNEPSDDDIRLDHILLPAMSANTKSQLQD 277

Query: 231 KRMVA 235
             M+ 
Sbjct: 278 VGMLG 282


>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
 gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
          Length = 430

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 85/202 (42%), Gaps = 22/202 (10%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
           LS    LA+YL+ EK++G  SFW P+I   D      +L++ +P++W   ++      P 
Sbjct: 117 LSSFQLLAIYLVLEKERGAASFWKPFI---DMLPSIEELSL-APVVWKVLQV------PH 166

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DIPT-EAFTFEIFKQAFVAVQSCV 134
             ++        R++ E      +     + Y    D+P+  AF    F  A++ + S  
Sbjct: 167 CDDLWRMLSRSARKHAE-----SVVARFEKDYAVVCDLPSVPAFERSSFLWAWMCINSRC 221

Query: 135 VHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           +++   Q    +  F + P     L +S++ +  +        ++    YK  E +    
Sbjct: 222 LYMSMPQAKDTSDNFTMAPY-VDFLNHSNEDQCGIKIDPHGFHVLTSSAYKPQEELYFSY 280

Query: 192 GPQPNSKLLINYGFVDEDNPYD 213
           GP  N  LL  YGF    N ++
Sbjct: 281 GPHSNEFLLCEYGFTLPHNKWN 302


>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
 gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
          Length = 656

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 129
           A L GSP  AE  +    +   +      +    SL + YP       F++E +  A   
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAFE---SLLKAYPDYFQPHWFSWESYLWAAEL 220

Query: 130 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 186
             S  + +Q  +   R  LVP    +  +        + VD A   +++   RP   G  
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280

Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
           + +  GP PNSKLL+ YGF   DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309


>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
          Length = 440

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 98/237 (41%), Gaps = 25/237 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P +
Sbjct: 105 SPLLALCTFLVSEKHAGDQSVWKPYLEILPK-------AYTCPVC-LEPEVVNLFPKPLR 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 138
           A+  E+   ++  ++     +     LF +    I    F++     A+  V +  V++ 
Sbjct: 157 AKAEEQRARVREFFSSSRGFFSSLQPLFSEAVGSI----FSYRALLWAWCTVNTRAVYV- 211

Query: 139 KVSLARRFALVP----LGP--PLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVW 190
           K    R F+  P    L P   LL +S   +   A  ++    ++      +  E + + 
Sbjct: 212 KPRRRRCFSAEPDTCALAPYLDLLNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFIC 271

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 247
            GP  N +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+   H
Sbjct: 272 YGPHDNQRLLLEYGFVSIHNPHACVYVSEDILVKYLPSTDKQM----NKKISILKDH 324


>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
          Length = 371

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)

Query: 145 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 204
           R   +P           CK   +A+  +VQ   DR Y  GE + V  GP  N  LL  YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235

Query: 205 FVDEDNPYDRLVVEAAL 221
           F+ + N +D + ++  +
Sbjct: 236 FILDTNRWDEVYLDEVI 252


>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
          Length = 440

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 55/240 (22%), Positives = 93/240 (38%), Gaps = 24/240 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L     K
Sbjct: 105 SPLLALCTFLVSEKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPISLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL- 137
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V+L 
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 138 --QKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             Q   L+       L P   LL +S   + KA         ++     ++  E + +  
Sbjct: 213 PRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 246
           GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 273 GPHDNHRLFLEYGFVSGHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
          Length = 1916

 Score = 40.0 bits (92), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 216
           DA  +   R Y AG+ +    G + N++L+ NYGF++  NP+D  V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339


>gi|392587357|gb|EIW76691.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 430

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 30/51 (58%)

Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           + +  + +V    +  G+ I    GP+PNS+L++ YGF   +NP D +V++
Sbjct: 230 STETTISIVSHDAFNEGQEIYNNYGPKPNSELILGYGFSLPNNPDDTIVLQ 280


>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
 gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
           Y34]
 gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
           P131]
          Length = 390

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 2/84 (2%)

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 197
           +++    R  L P+        + C   +A  D+   +  DR Y AGE +++  G   N 
Sbjct: 173 ERLGKEDRMVLQPVADLFNHADAGC--AVAFNDEDFTIRADRDYDAGEEVLICYGNHSND 230

Query: 198 KLLINYGFVDEDNPYDRLVVEAAL 221
            LL  YGFV   N +D + ++ A+
Sbjct: 231 FLLAEYGFVLAANRWDEVCIDDAI 254


>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 79/180 (43%), Gaps = 23/180 (12%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQ 236
           + YK GE +++  G   N +LL +YGF+ ++NP D+  +  E  +       +D   + Q
Sbjct: 257 KNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQ 316

Query: 237 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 296
            NGK S                 +L  LRL + +  S+ +SV   +     +S   E  V
Sbjct: 317 -NGKPSFA---------------LLSALRL-WATPASQRRSVGHLVYSGTQLSSENEIFV 359

Query: 297 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 356
           ++ +A      L   P ++ ED  +L+      + ++A Q     K++L  C+    ++I
Sbjct: 360 MEWIAKSCHVVLENLPTSVEEDSLLLS----MERWKLAVQWRLRHKRILVDCISRCTEII 415


>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
          Length = 761

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 17/101 (16%)

Query: 11  ELLTTNKLS--ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 68
           EL   N+ S  E   L  YL  EKK+G+KSF+ PY++ +         +  + + WS+ E
Sbjct: 86  ELFDENETSDAEFNILIFYLFNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEE 137

Query: 69  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 109
           L Y+       E       I RE  +L  +W  A  +F ++
Sbjct: 138 LNYIEDPIILDEF-----AIVRE--DLKDLWNQAKEIFNEF 171


>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
 gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
          Length = 544

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 37/79 (46%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P          K  +  AAV   ++       +AGE   ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378

Query: 207 DEDNPYDRLVVEAALNTED 225
           D +N  D + +   L+  D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397


>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
 gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
          Length = 634

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 49/226 (21%), Positives = 97/226 (42%), Gaps = 23/226 (10%)

Query: 3   HLLFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 62
           H+LF  +       K+ +     L L+ ++ +GK+S W  YI  L      G  ++ +PL
Sbjct: 82  HVLFVDSRLQPCQGKIPDHILTYLLLIEQRNKGKESPWHAYIACLP-----GAESMTTPL 136

Query: 63  LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 122
            + + ++A+L G+       ER     +++ +   +   AG        D  +  +   I
Sbjct: 137 WFDDEDMAFLAGTSLAPAAKERKSLYYQQWEQALGIMKDAGVALAD-EVDFESLLWAATI 195

Query: 123 F-KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDD-AVQL 175
           F  +AF++      H            VPL  P++     + S+K +     +   +++L
Sbjct: 196 FTSRAFISTHILPDH----------ETVPLLFPIVDILNHSVSAKVEWEFQPLASFSLKL 245

Query: 176 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           +    + AG+ +     P+ N +LL+ YGF  E NP ++  ++ A 
Sbjct: 246 LEGDTFTAGQELFNNYAPKQNDELLLGYGFCLEHNPIEQFPLKLAF 291


>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 265

 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 239
           YK GE +    G + N++LL+ YGF   DN ++ + +        P  +Q         G
Sbjct: 18  YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77

Query: 240 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 289
            +         GR +    D+L    L +   T   SE+ +   +L   C ++       
Sbjct: 78  SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129

Query: 290 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 322
                   PC   ER     L  +  A LA +P ++ EDE  L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172


>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
 gi|194701488|gb|ACF84828.1| unknown [Zea mays]
 gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
          Length = 495

 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 55/247 (22%), Positives = 92/247 (37%), Gaps = 62/247 (25%)

Query: 22  ACLALYL--MYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETEL-AYLTGSPT 77
            CLAL +  MYE+ QG  S W  Y++ L           ES PL+W   E    L G+  
Sbjct: 81  GCLALTVAVMYERAQGADSPWDAYLQLLPD--------CESVPLVWPAGEAECLLAGTEL 132

Query: 78  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 137
              + +  E +  ++ E      ++G L      D+  + F+ E +  A   V S    +
Sbjct: 133 DKIVKQDKEFLCEDWKECIEPLMLSGEL------DVDPDDFSLEKYLSAKTLVSSRSFQI 186

Query: 138 QKVSLARRFALVPLGPPLLAYSSKCKAML------------------------------- 166
                +    +VPL   L  + + C+ +                                
Sbjct: 187 DSYHGS---GMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEEEEDDRSDASADDKPTTK 242

Query: 167 ---------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
                     A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + +
Sbjct: 243 NPTSSPPGSRANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNI 302

Query: 218 EAALNTE 224
           + AL T+
Sbjct: 303 DLALVTK 309


>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
 gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
          Length = 555

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 27/184 (14%)

Query: 60  SPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAGS--LFQQYPYDIPT 115
           S + ++E EL    G+   T  + L+R+  I+ +Y  L  V  +A S  LF       P 
Sbjct: 194 SSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYRTL-VVRVLAQSRDLF-------PL 243

Query: 116 EAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCK--AMLAA 168
           + F+ E +K A   V S  +         + L   FA       +L +SS+ +   +  A
Sbjct: 244 DKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPFA------DMLNHSSEVEPCHIYDA 297

Query: 169 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 228
               + ++  + Y+AG+   ++ G  PNS+LL  YGFV   NP D   +  + +   P +
Sbjct: 298 SSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYGFVMPGNPNDSYDLVISTHPSAPFF 357

Query: 229 QDKR 232
           + K+
Sbjct: 358 ERKQ 361


>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
          Length = 352

 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 37/205 (18%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L L+ E+ +   SFW PYI  L             P+ +   ++  L  +P   ++ +
Sbjct: 3   LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 54

Query: 84  RA----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
           R     E  K    +L TV  +    + Q   D+ + +  +     A  A  S    L  
Sbjct: 55  RCRFLLEFEKEVQQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH- 105

Query: 140 VSLARRFALVPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVW 190
                    VP+  PL+     +++   + +    + ++D +V+++ ++  K  E+I + 
Sbjct: 106 -------GEVPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLN 158

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
            G  PN   L++YGFV   NPYD++
Sbjct: 159 YGCYPNDFFLLDYGFVITQNPYDQV 183


>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
 gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
          Length = 537

 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 36/79 (45%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378

Query: 207 DEDNPYDRLVVEAALNTED 225
           D+ N  D + +   L+  D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397


>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
          Length = 606

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 173 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 221
           +Q     P K G  + +  GP  N++LL+ YG+ ++DNPY    +E  L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488


>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
          Length = 546

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 70
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 105
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437


>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
 gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
          Length = 323

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 24/35 (68%)

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           GE +V+  G + N +LL  +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233


>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
 gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
          Length = 548

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 36/79 (45%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 323 ALIPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV 382

Query: 207 DEDNPYDRLVVEAALNTED 225
           D +N  D + +   L+  D
Sbjct: 383 DVNNLKDYVNIRVGLSPTD 401


>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
          Length = 492

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 132/337 (39%), Gaps = 48/337 (14%)

Query: 37  KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-EG 87
           +S W PY   L  +       ++S + WS  ELA L         G     EI  ++   
Sbjct: 104 ESKWAPYFNVLPTK-------LDSLVFWSPEELAELQASAVLKKVGKDKAEEIFHQSISK 156

Query: 88  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH-LQKVSLARRF 146
           +  E  ++D ++    S    Y +DIP      E   +        V    QK SLA   
Sbjct: 157 VTPEGTDVD-IFHRVASTIMAYAFDIPD----IEQEDEEGANEDDLVDDDEQKTSLA--- 208

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
            ++PL   +L   +   A L    + +++    P K GE I+   G  P S LL  YG+V
Sbjct: 209 -MIPLAD-MLNADADNNARLHYDGEELEMRTINPIKTGEEILNDYGQLPRSDLLRRYGYV 266

Query: 207 -DEDNPYDRLVVEAALNT-EDPQYQD---KRMVAQRNGKLSVQVFHV------------- 248
            D+   +D  V E + +T  D  YQD   +  V  R G++  ++                
Sbjct: 267 TDKYATFD--VAEISTSTITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAHDVG 324

Query: 249 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 308
           HA  E   ISD L  L    +     + ++ SS       S      V   L    + R 
Sbjct: 325 HATEEWPCISDELVALVYLLLVGEETLAAIQSSKMSFPSRSKMETELVGKALQRILERRE 384

Query: 309 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 345
             Y  T+ EDE +L   N   + ++A Q VRM +K++
Sbjct: 385 REYATTVVEDENLLQSGNHSNRVKMAIQ-VRMGEKVV 420


>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
          Length = 549

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 69/151 (45%), Gaps = 14/151 (9%)

Query: 69  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 128
           +  L G+    EI++  + +  +Y+EL    F A  L   +P   P E +T+E F  A  
Sbjct: 260 ITMLDGTLLLEEIMQARQHLHAQYDEL----FPA--LCNNFPDIFPPELYTWEKFLWACE 313

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVD---DAVQLVVDRPYKA 183
              S  + +       R  L+PL   L    S C  ++    VD   ++++  + RP ++
Sbjct: 314 LWYSNSMKIMYSDGKLRTCLIPLAGFL--NHSLCPHVMHYGKVDPATNSLKFCLSRPCRS 371

Query: 184 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 213
           GE   +  G   +S L+  YGF+ + DN YD
Sbjct: 372 GEECCLSYGNFSSSHLITFYGFLPQGDNSYD 402


>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 544

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 70
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400

Query: 71  YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 105
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435


>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
 gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
          Length = 1017

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 17/52 (32%), Positives = 30/52 (57%)

Query: 183 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 234
           AGE   ++ G + N++ L+N GFVD DN  D + +   L+  DP  + + ++
Sbjct: 838 AGEQFFIYYGDRTNTEFLVNNGFVDPDNRNDYVNIRLGLSPTDPLAEKRAII 889


>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
          Length = 440

 Score = 39.3 bits (90), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 53/233 (22%), Positives = 93/233 (39%), Gaps = 24/233 (10%)

Query: 19  SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 78
           S L  L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P K
Sbjct: 105 SPLLALCTFLVSEKHAGDQSPWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLK 156

Query: 79  AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV--- 135
           A+  E+   ++  +      +     LF +    I    F++     A+  V +  V   
Sbjct: 157 AKAEEQRAHVQEFFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLR 212

Query: 136 HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWC 191
           H ++  L+       L P   LL +S   +   A  ++    ++     ++  E + +  
Sbjct: 213 HRRRECLSAEPDTCALAPYLDLLNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEVFICY 272

Query: 192 GPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNG 239
           G   N +LL+ YGFV   NP+      R ++   L + D Q   K  + + +G
Sbjct: 273 GHHDNQRLLLEYGFVSIQNPHACVYVSREILVKYLPSTDKQMNKKISILKDHG 325


>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
 gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
 gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
          Length = 491

 Score = 39.3 bits (90), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 37/205 (18%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L L+ E+ +   SFW PYI  L             P+ +   ++  L  +P   ++ +
Sbjct: 142 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 193

Query: 84  RA----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
           R     E  K    +L TV  +    + Q   D+ + +  +     A  A  S    L  
Sbjct: 194 RCRFLLEFEKEVQQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH- 244

Query: 140 VSLARRFALVPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVW 190
                    VP+  PL+     +++   + +    + ++D +V+++ ++  K  E+I + 
Sbjct: 245 -------GEVPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLN 297

Query: 191 CGPQPNSKLLINYGFVDEDNPYDRL 215
            G  PN   L++YGFV   NPYD++
Sbjct: 298 YGCYPNDFFLLDYGFVITQNPYDQV 322


>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
           GS115]
 gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
           GS115]
          Length = 538

 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           ++VP    L A ++ C A L    + + +   +P K GE +    G  PNS++L  YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313

Query: 207 D 207
           +
Sbjct: 314 E 314


>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
           B]
          Length = 457

 Score = 39.3 bits (90), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           D AV L++  P   G  ++   GP+PN++L++ YGF    NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323


>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 429

 Score = 39.3 bits (90), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 17/44 (38%), Positives = 28/44 (63%)

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           LV+  P   G  ++   GP+PNS+L++ YGF   +NP D +V++
Sbjct: 245 LVIHTPTPRGSELLNNYGPKPNSELILGYGFSLPNNPDDTIVLK 288


>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 466

 Score = 39.3 bits (90), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 90/214 (42%), Gaps = 30/214 (14%)

Query: 17  KLSELACLALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVESPLLWS-ETEL 69
           +L+     ALY+  EKK+ + SFW      LP + ELD           +P++W  E+E 
Sbjct: 144 QLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF----------APIVWEVESE- 192

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFV 128
             LTGS   A+  E      R + +  +V F    +   ++     +E      F  A++
Sbjct: 193 --LTGSKA-ADFFELLPRSSRNHAKKVSVRFNEDYTAVSEFLTAAKSEPLNKMEFLWAWM 249

Query: 129 AVQSCVVHL---QKVSLARRFALVPLGPPLLAY--SSKCKAMLAAVDDAVQLVVDRPYKA 183
            + S  +++      + A  F L P     L +    KC   + +   +V   VD  + A
Sbjct: 250 CINSRCLYMSFPSSKAEADNFTLAPY-VDFLNHDCDEKCAIKIDSRGFSVISCVD--HAA 306

Query: 184 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
           G+ ++   GP  N  LL  Y F  E N ++ L V
Sbjct: 307 GQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340


>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
          Length = 447

 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 74/380 (19%), Positives = 152/380 (40%), Gaps = 65/380 (17%)

Query: 13  LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 72
           L  + +     L + + +E  QG  S W  Y + L +Q         S + WS+ EL+ L
Sbjct: 76  LENDNIGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLMFWSKEELSLL 128

Query: 73  TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAV 130
            G+    +I    E I+ E+  +  +     ++F     DI    +T ++FK+  + +  
Sbjct: 129 KGTTVVDKI--GLEDIENEFERVRDIVKQNENVFG----DIAN--YTLDLFKRMGSLILS 180

Query: 131 QSCVVHLQKV---------------------SLARRFALVPLGPPLLAYSSKCKAMLAAV 169
           +S  V   K                      +     A+VP+   L + +    A     
Sbjct: 181 RSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRTDSVNAHTEYE 240

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-----ED-----NPYD------ 213
           ++ ++++  +  KAG+ I       PN+ L+  YG VD     +D     N  D      
Sbjct: 241 ENCLRMISLQDIKAGDQIFNTYNDPPNADLIRRYGHVDYSPLSQDPDFMGNKNDVVELPA 300

Query: 214 RLVVEAALNTEDPQYQDKRM--VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 271
            +++E AL      ++++R+  +    G+ S ++ H       + + ++L    L +   
Sbjct: 301 DILLELALPDAKESHKERRVEFLLDECGEDSFELTH------DDLVPELLKICVLLFTES 354

Query: 272 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 331
            +E ++   S     P +    +   + L    K R+  Y +TL +D + L + +  P+ 
Sbjct: 355 EAEFKTREKSRK--LPKASGFTKGKAEFLIKAIKQRMEQYGSTLEDDISKLDNKDSLPEN 412

Query: 332 RVATQLVRM-EKKMLNACLQ 350
                +V + E+++LN  ++
Sbjct: 413 NFKALVVTVGERRILNKAIE 432


>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
 gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
          Length = 562

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 29/59 (49%)

Query: 155 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
            L + +  K      D  +  V     K+GE +    G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282


>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
 gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
          Length = 558

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 20/143 (13%)

Query: 80  EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 139
           EI++  E ++ +Y+EL        +L   YP   P E +T+E F  A     S  + ++ 
Sbjct: 283 EIMQAKEHLRVQYDEL------VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKF 336

Query: 140 VSLARRFALVP--------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           +    R  L+P        L P ++ Y       + ++ + ++  + RP + GE   +  
Sbjct: 337 LDGKLRTCLIPIAGFLNHSLHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSY 391

Query: 192 GPQPNSKLLINYGFVDE-DNPYD 213
           G    + L+  YGF+ + DN YD
Sbjct: 392 GNFSGAHLITFYGFLPQGDNRYD 414


>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
          Length = 327

 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 25/145 (17%)

Query: 18  LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 77
            S+   L + LM+E+ +G+ SF+ PY+  L R         ++ L W+E  L++L     
Sbjct: 143 FSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNWNEEGLSWLQDEGL 195

Query: 78  KAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TEAFTFEIFKQAFVA 129
             E+  R   +   Y  L      AG   LF + P D         + ++ E F+ A++ 
Sbjct: 196 SLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGADPYSLENFRFAWLT 255

Query: 130 VQSCVVHLQKVSLARRF---ALVPL 151
           +Q+        +  RR    AL+PL
Sbjct: 256 IQA-------RAFGRRLPYSALIPL 273


>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 490

 Score = 38.9 bits (89), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 45/193 (23%), Positives = 83/193 (43%), Gaps = 27/193 (13%)

Query: 27  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
           ++   +K G  + W  YIR +         +V  P  +S  E   L G+  +  +  +  
Sbjct: 115 FVNKRQKIGISNPWTEYIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLG 167

Query: 87  GIKREYNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
            +++E++ L     +  W       Q++ +D  T  FTF+ +K      +S VV L +  
Sbjct: 168 SLEKEFDHLRQATEEIPWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG 221

Query: 142 LARRFALVP---LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PN 196
                A+VP   +       S K K       +AV QL   +  + GE + +  G + P 
Sbjct: 222 ----HAIVPCVDMANHACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPA 277

Query: 197 SKLLINYGFVDED 209
           S+++ +YGFV+ +
Sbjct: 278 SEMVFSYGFVENE 290


>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
           2508]
 gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 533

 Score = 38.9 bits (89), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 84/205 (40%), Gaps = 30/205 (14%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 82
           L L LM+E  QG  S W PY+  L  Q        ++P+ W+E ELA L  S   A++  
Sbjct: 131 LILILMHEYLQGSSSNWSPYLSILPHQ-------FDTPMFWTEAELAELQASALVAKVGK 183

Query: 83  ERAEGIKRE-----YNELDTVWFMAGSLFQQ-----------YPYDIPTEAFTFEIFKQA 126
           + A+ + R        E + V++ AG+   Q           +       A+ F++ K+ 
Sbjct: 184 DEADKMIRTKIVKVVQENEDVFYPAGTPKTQRLDEGELLKLGHRMGSAIMAYAFDLAKEE 243

Query: 127 FVAVQSCV-----VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 181
                        V  +   +     +VP+   +L   +   A +   +  +     R  
Sbjct: 244 DDDEDEEEEEDGWVEDKIGGMNDTMGMVPMA-DMLNADAVFNAHINHGEACLTATSLREI 302

Query: 182 KAGESIVVWCGPQPNSKLLINYGFV 206
           K GE I+ + GP  +++LL  YG+V
Sbjct: 303 KEGEEILNYYGPLSSAELLRRYGYV 327


>gi|242210759|ref|XP_002471221.1| predicted protein [Postia placenta Mad-698-R]
 gi|220729780|gb|EED83649.1| predicted protein [Postia placenta Mad-698-R]
          Length = 256

 Score = 38.9 bits (89), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 33/64 (51%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 231
           ++ LV+   + AG  +    GP+PN++L++ YGF    NP D +V++    +     Q  
Sbjct: 77  SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLKLGGASAAQHAQHN 136

Query: 232 RMVA 235
             VA
Sbjct: 137 NAVA 140


>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 1 [Galdieria
           sulphuraria]
          Length = 487

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 47/331 (14%)

Query: 38  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LD 96
           S W PYI  L      G +       WS +ELA L   P   E+      I + Y E L 
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEV-----KINQYYREALY 214

Query: 97  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 155
           T  F + S     P  +  +     +F  A   VQS    +  V   + +AL+P+   L 
Sbjct: 215 TRVFESLS----SPVRVWLQNEKENVFFWALDMVQSRAFGIPDVG-NKTYALLPMMDMLN 269

Query: 156 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 215
              +S+   +  ++ +  ++        G  I +  GP  N  LL  YGF+  +NP D  
Sbjct: 270 HRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYF 329

Query: 216 VVEAALNTEDPQYQDKRMVAQ------------------RNGKLSVQVFHVHAGREKEAI 257
            V+         Y+ +   AQ                   NGK +  ++H H   E + I
Sbjct: 330 QVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGK-TFHLYHDHYDDEIDII 388

Query: 258 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA--VLDQLADYFKARLAGYPATL 315
                 LR+   S T   Q   +    +   +  +E    V   +    K  L     ++
Sbjct: 389 ------LRVFMASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSV 442

Query: 316 SEDEAMLTDYN-LHPKKRVATQLVRMEKKML 345
            EDE +L + + L  K ++A Q  R+EKK +
Sbjct: 443 EEDEQLLKNKDQLSTKLQLAIQF-RLEKKYI 472


>gi|403331500|gb|EJY64700.1| hypothetical protein OXYTRI_15263 [Oxytricha trifallax]
          Length = 741

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 36/64 (56%), Gaps = 4/64 (6%)

Query: 181 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN----TEDPQYQDKRMVAQ 236
           YK G+ +    G + N  LL+NYGF   +N Y+ L     +N    +E  + ++K++ A+
Sbjct: 440 YKKGQQVFHCYGRRTNRFLLLNYGFCMNNNKYNSLSFRVWINFDWQSERQKDEEKKLKAE 499

Query: 237 RNGK 240
           ++GK
Sbjct: 500 KDGK 503


>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
 gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
          Length = 459

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 53/208 (25%), Positives = 86/208 (41%), Gaps = 36/208 (17%)

Query: 26  LYLMYEKKQGKKSFWLPYIR------ELDRQRGRGQLAVESPLLWSETELAY---LTGSP 76
           L LMYE      S W PY++      +LD+           P+ W+E E+      TG P
Sbjct: 107 LALMYEYT-NPNSRWRPYLQLVPDFSQLDQ-----------PMFWTEDEIERDLCNTGIP 154

Query: 77  TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 134
             +        +K EY  L      A    +++ +    E  +FE++K+  AF+   S  
Sbjct: 155 EASS--SDLTKMKLEYTSL------ALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFF 206

Query: 135 VHLQKVSLARRFALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
             +         + +PL  P+   L + +K  A L    D +++V  R   AGE +    
Sbjct: 207 EPVNGREDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMVTTRTVAAGEEVFNTF 266

Query: 192 GPQPNSKLLINYGFVDE--DNPYDRLVV 217
           G   N +LL  YGF +   +N YD + +
Sbjct: 267 GQLANWQLLHMYGFAEAWPENIYDTVDI 294


>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
          Length = 653

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 8/92 (8%)

Query: 4   LLFYYAAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 63
            +F   A +    +  +   L ++L+YE ++G +SFW PY   +D     G L    P  
Sbjct: 88  FIFDNHASIFKATEDRDYLVLLVFLIYEHQKGTRSFWHPYFEAID----PGLL----PCF 139

Query: 64  WSETELAYLTGSPTKAEILERAEGIKREYNEL 95
           WS+  +  L  S  K +I +  +  + +++ L
Sbjct: 140 WSDQTIEELADSELKDQIRQERDNYEEDWDML 171


>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
 gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
          Length = 658

 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)

Query: 170 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 221
           D+A Q   +VV RPY+ GE +++  G   N +LL  YGFV E N +D   ++ AL     
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371

Query: 222 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 248
                   P           NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402


>gi|242210761|ref|XP_002471222.1| predicted protein [Postia placenta Mad-698-R]
 gi|220729781|gb|EED83650.1| predicted protein [Postia placenta Mad-698-R]
          Length = 264

 Score = 38.5 bits (88), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 29/47 (61%)

Query: 172 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 218
           ++ LV+   + AG  +    GP+PN++L++ YGF    NP D +V++
Sbjct: 216 SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLK 262


>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 579

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 28/47 (59%)

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
           +A  L  D+   +G+ + +  GP+ N +LL  YGFV+ +NP D  V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407


>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 429

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 237
           + Y+ GE +++  G   N +L+  YGFVD DN  D    E  ++     Y   KR +   
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305

Query: 238 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 286
             +LS  +   H     A  E   ISD      +L  LR   V  T E  +    +    
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362

Query: 287 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 346
                 ER   D +    ++RLA YP T+ EDE  L   N    +R A Q    +K++L 
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420

Query: 347 ACLQV 351
              ++
Sbjct: 421 DASRI 425


>gi|407920105|gb|EKG13323.1| hypothetical protein MPH_09605 [Macrophomina phaseolina MS6]
          Length = 574

 Score = 38.5 bits (88), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 48/219 (21%), Positives = 84/219 (38%), Gaps = 32/219 (14%)

Query: 12  LLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE----- 66
           LL  N LS +A +   L+     G+KS W PYI  L +        + +P+ ++E     
Sbjct: 76  LLPNNVLSNIALIKELLL-----GEKSLWAPYINCLPKSE-----QLNTPIYFAEEMTQE 125

Query: 67  ------TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 120
                  + A+L G+        R E  + E+    +V    G         I TE +T+
Sbjct: 126 AINGRRNDTAWLLGTNLDKSWRPRKEQWEEEWKNAVSVLKRQG---------IATEGYTW 176

Query: 121 EIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 178
           + +  A      +S +        + ++A++     LL +    K      +   Q + +
Sbjct: 177 DAYAWAATIFTSRSFISDPGLSKESSQYAVLMPVIDLLNHRFPTKVAWFFNEGNFQFITE 236

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
            P   G  I    G + N +LL  YGF   +N  D + +
Sbjct: 237 EPVPKGHEIFNNYGGKGNEELLNGYGFCIPNNHCDEVAI 275


>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
 gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
          Length = 495

 Score = 38.5 bits (88), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 32/63 (50%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           +++PL   L A + +C A L      +++   +P KA E I    G   N +LL  YG+V
Sbjct: 220 SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQIYNTYGNHANFELLRRYGYV 279

Query: 207 DED 209
           + D
Sbjct: 280 EVD 282


>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
           7435]
          Length = 846

 Score = 38.1 bits (87), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query: 147 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 206
           ++VP    L A ++ C A L    + + +   +P K GE +    G  PNS++L  YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313

Query: 207 D 207
           +
Sbjct: 314 E 314


>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
          Length = 339

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 99  ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155


>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
 gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
          Length = 391

 Score = 38.1 bits (87), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)

Query: 175 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 217
           +  DR YKAGE + V  G   N  LL+ YGF+ + N     P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250


>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
 gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
          Length = 490

 Score = 38.1 bits (87), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 46/204 (22%), Positives = 82/204 (40%), Gaps = 35/204 (17%)

Query: 24  LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 83
           L L L+ E+ +   SFW PYI  L             P+ +   ++  L  +P   ++ +
Sbjct: 141 LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPLLHQVNK 192

Query: 84  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 143
           R   +     E+           QQ  + +P+    F  + Q   +  S +      + +
Sbjct: 193 RCRFLLEFEKEI-----------QQKLHTVPSVDHPF--YGQDVNS--SSLGWAMSAASS 237

Query: 144 RRFALVPLGPPLLAYSSKC------------KAMLAAVDDAVQLVVDRPYKAGESIVVWC 191
           R F L    P LL     C            +  + ++D +V++V ++  +   SI +  
Sbjct: 238 RAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGSVNSLDMSVKVVAEKKIEQNASITLNY 297

Query: 192 GPQPNSKLLINYGFVDEDNPYDRL 215
           G  PN   L++YGFV   NPYD++
Sbjct: 298 GCHPNDFFLLDYGFVITPNPYDQV 321


>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
          Length = 495

 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
           D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309


>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
          Length = 498

 Score = 38.1 bits (87), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 57  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
           A+  P  +SE EL  L GS  +  +  +   +++E+  L           ++Y +D  T 
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201

Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 172
             TF  +K      +S +V L +       A+VP +     A     KA+     D    
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257

Query: 173 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 208
           +QL   R  +A E + +  G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294


>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
          Length = 495

 Score = 38.1 bits (87), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
           D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309


>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
           10762]
          Length = 381

 Score = 38.1 bits (87), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 171 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 211
           +  +++ +R Y AGE ++   G   N KLL++YGF+ E  P
Sbjct: 204 NGYEMLANRDYDAGEEVLFTYGAHSNDKLLVHYGFICESPP 244


>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
          Length = 289

 Score = 38.1 bits (87), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 168 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 224
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 47  ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103


>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
          Length = 480

 Score = 38.1 bits (87), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 57  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
           A+  P  +SE EL  L GS  +  +  +   +++E+  L           ++Y +D  T 
Sbjct: 125 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 183

Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 172
             TF  +K      +S +V L +       A+VP +     A     KA+     D    
Sbjct: 184 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 239

Query: 173 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 208
           +QL   R  +A E + +  G + + S+L+ +YGF+DE
Sbjct: 240 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 276


>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 473

 Score = 38.1 bits (87), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 57  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 116
           A+  P  +SE EL  L GS  +  +  +   +++E+  L           ++Y +D  T 
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201

Query: 117 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 172
             TF  +K      +S +V L +       A+VP +     A     KA+     D    
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257

Query: 173 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 208
           +QL   R  +A E + +  G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294


>gi|428174289|gb|EKX43186.1| hypothetical protein GUITHDRAFT_110913 [Guillardia theta CCMP2712]
          Length = 437

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 62/268 (23%), Positives = 108/268 (40%), Gaps = 38/268 (14%)

Query: 21  LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----- 75
           +  LA++L+ E  Q K SFW PY+  L +        V  P+ +S+     L        
Sbjct: 165 IGALAVFLLLES-QNKSSFWRPYLCSLPKH-------VPLPMFYSKERRQQLKEQLPEDQ 216

Query: 76  --PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 133
                A +  R + +   Y +L  V      LF +YP     E F++E F  A   + S 
Sbjct: 217 RVKFDALVEARRDVVDLHYMQLLPV------LFLKYPTLFSPEVFSYEKFAWAISIIMSR 270

Query: 134 VVHLQKVSLA-----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGES 186
                    A     R   +  L P     +     + A  D   ++ ++  +    GE 
Sbjct: 271 TWGKTYFDSALGPRGRNITVHTLAPAADMPNHDSSGLEANRDPRGRMTLNAQKNLSVGEQ 330

Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE-DPQY--QDKRMVAQRNG-KLS 242
             +  G + +++ L +YGFV  +N + R   +  + +E DP     D  + A R   + S
Sbjct: 331 FFISYGSKCDAEFLAHYGFVPFNNSHKR--CKGRVYSEGDPNMGGHDSNVTAWRKKWRKS 388

Query: 243 VQVFHV---HAGRE-KEAISDMLPYLRL 266
           + ++H     A RE K+A+ ++L   R+
Sbjct: 389 LSLYHALVRKARREGKDALKNLLESSRV 416


>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 490

 Score = 38.1 bits (87), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 27/193 (13%)

Query: 27  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 86
           ++   +K G  + W  YIR +         +V  P  +S  E   L G+  +  +  +  
Sbjct: 115 FVNKRQKIGISNPWTEYIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLG 167

Query: 87  GIKREYNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 141
            +++E++ L     +  W       Q++ +D  T  FTF+ +K      +S VV L +  
Sbjct: 168 SLEKEFDHLRQATEEIPWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG 221

Query: 142 LARRFALVP---LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PN 196
                A+VP   +       S K +       +AV QL   +  + GE + +  G + P 
Sbjct: 222 ----HAIVPCVDMANHACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPA 277

Query: 197 SKLLINYGFVDED 209
           S+++ +YGFV+ +
Sbjct: 278 SEMVFSYGFVENE 290


>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 471

 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 25/48 (52%)

Query: 166 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
           L     AV L   R    GE +++  G   N  L ++YGF+ EDNPYD
Sbjct: 202 LEGGGGAVGLFARRAITEGEPLLLSYGQLSNDFLFMDYGFIVEDNPYD 249


>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 466

 Score = 37.7 bits (86), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 85/211 (40%), Gaps = 24/211 (11%)

Query: 17  KLSELACLALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVESPLLWS-ETEL 69
           +L+     ALY+  EKK+ + SFW      LP + ELD           +P++W  E+E 
Sbjct: 144 QLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF----------APIVWEVESE- 192

Query: 70  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFV 128
             LTGS   A+  E      R + +   V F    +   ++     +E      F  A++
Sbjct: 193 --LTGSKA-ADFFELLPRSSRNHAKKVLVRFNEDYTAVSEFLTAAKSEPLNKMEFLWAWM 249

Query: 129 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR--PYKAGES 186
            + S  +++   S         L P +   +  C    A   D+   +V     + AG+ 
Sbjct: 250 CINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFLVISCVDHAAGQE 309

Query: 187 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 217
           ++   GP  N  LL  Y F  E N ++ L V
Sbjct: 310 LLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340


>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
 gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
          Length = 429

 Score = 37.7 bits (86), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 30/59 (50%)

Query: 179 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 237
           R YK GE I +  GP PN  L + YGF  E N  D + ++  +  +    + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322


>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
 gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
          Length = 534

 Score = 37.7 bits (86), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 170 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 213
           D+ V +V+D+  KAG  I    GP  N  LL  +GF  E NP+D
Sbjct: 307 DECVDIVLDKDVKAGVEIFNSYGPLSNVFLLSRHGFAVEGNPHD 350


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.135    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,616,468,379
Number of Sequences: 23463169
Number of extensions: 222431939
Number of successful extensions: 563042
Number of sequences better than 100.0: 766
Number of HSP's better than 100.0 without gapping: 355
Number of HSP's successfully gapped in prelim test: 411
Number of HSP's that attempted gapping in prelim test: 561820
Number of HSP's gapped (non-prelim): 1001
length of query: 376
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 232
effective length of database: 8,980,499,031
effective search space: 2083475775192
effective search space used: 2083475775192
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)