BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018962
         (348 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Vitis vinifera]
          Length = 509

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/348 (87%), Positives = 323/348 (92%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 162 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 221

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 222 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 281

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 282 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 341

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+SDMLPYLRLGY
Sbjct: 342 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 401

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 402 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 461

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL 
Sbjct: 462 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509


>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Vitis vinifera]
          Length = 515

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/348 (87%), Positives = 323/348 (92%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 347

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+SDMLPYLRLGY
Sbjct: 348 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 407

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 408 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 467

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL 
Sbjct: 468 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515


>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
          Length = 428

 Score =  643 bits (1659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/347 (88%), Positives = 323/347 (93%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 81  MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 140

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 141 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 200

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 201 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 260

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+SDMLPYLRLGY
Sbjct: 261 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 320

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 321 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 380

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL
Sbjct: 381 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLL 427


>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
 gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
          Length = 502

 Score =  629 bits (1622), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 300/347 (86%), Positives = 320/347 (92%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GI
Sbjct: 155 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGI 214

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREY ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 215 KREYEELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 274

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAYSS CKAML AVD AV+LVVDRPYKAGE IVVWCGPQPNSKLL+NYGFVDE
Sbjct: 275 VPLGPPLLAYSSNCKAMLTAVDGAVELVVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDE 334

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+ VEAALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLGY
Sbjct: 335 DNPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLGY 394

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEMQSVISS GP+CPVSPCME+AVLDQL  YF+ RLAGY  ++SEDE ML D NL+
Sbjct: 395 VSDPSEMQSVISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNLN 454

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKKRVATQLVR+EKKML ACLQ T D+I  LPD T+ PCPAPYAPLL
Sbjct: 455 PKKRVATQLVRLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLL 501


>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 521

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 298/347 (85%), Positives = 322/347 (92%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE EL YL+GSPTK E+LERAEGI
Sbjct: 174 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEDELDYLSGSPTKKEVLERAEGI 233

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K+EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 234 KKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFVAVQSCVVHLQKVSLARRFAL 293

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAGESI VWCGPQPNSKLL+NYGFVDE
Sbjct: 294 VPLGPPLLAYRSNCKAMLTAVDGAVELVVDRPYKAGESIAVWCGPQPNSKLLLNYGFVDE 353

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DN YDRLVVEAALNTEDPQYQDKRMVAQRNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGY
Sbjct: 354 DNRYDRLVVEAALNTEDPQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGY 413

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           V+  SEMQSVISS GP+CPVSPCMERA+L+Q+ADYFK RLAGYP TLSEDE +L D NL+
Sbjct: 414 VTHPSEMQSVISSQGPVCPVSPCMERAMLEQVADYFKRRLAGYPTTLSEDEFLLADGNLN 473

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKKRVATQLVR+EKK+L++CL+VT D I  LPD TVSPCPAPYAPLL
Sbjct: 474 PKKRVATQLVRLEKKLLHSCLEVTIDFINQLPDHTVSPCPAPYAPLL 520


>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 1 [Glycine max]
 gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 2 [Glycine max]
          Length = 502

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 288/347 (82%), Positives = 317/347 (91%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLW ++EL YL+GSP K E+++R E I
Sbjct: 155 MYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWLKSELDYLSGSPIKDEVIQREEAI 214

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A+QSCVVHLQKVSLARRFAL
Sbjct: 215 RKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAAIQSCVVHLQKVSLARRFAL 274

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVVWCGPQPNSKLLINYGFVDE
Sbjct: 275 VPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVVWCGPQPNSKLLINYGFVDE 334

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           +N  DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGY
Sbjct: 335 NNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGY 394

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEM+SVISS GP+CPVSPCMERA LDQLADYFKARLAGYP TL+EDE+MLTD NL+
Sbjct: 395 VSDPSEMESVISSQGPVCPVSPCMERAALDQLADYFKARLAGYPTTLAEDESMLTDDNLN 454

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKKRVATQ VR+EKKML+ACLQ T D I  LPD T+SPCPAPYAPLL
Sbjct: 455 PKKRVATQYVRLEKKMLHACLQATTDFINQLPDHTISPCPAPYAPLL 501


>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Glycine max]
          Length = 503

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 288/348 (82%), Positives = 316/348 (90%), Gaps = 1/348 (0%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLLWS++EL YL+GSP K E+++R E I
Sbjct: 155 MYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLWSKSELDYLSGSPIKDEVIQREEAI 214

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIFKQAF A+QSCVVHLQKVSLARRFAL
Sbjct: 215 RKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIFKQAFAAIQSCVVHLQKVSLARRFAL 274

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL+Y S CKAML AVD AV+L VDRPYKAG+ IVVWCGPQPNSKLLINYGFVDE
Sbjct: 275 VPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKAGDPIVVWCGPQPNSKLLINYGFVDE 334

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           +N  DRL+VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGY
Sbjct: 335 NNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGY 394

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD-YNL 299
           VSD SEMQSVISS GP+CPVSPCMERA LDQLADYFKARLAGYP  L+EDE+MLTD  NL
Sbjct: 395 VSDPSEMQSVISSQGPVCPVSPCMERAALDQLADYFKARLAGYPTILAEDESMLTDGGNL 454

Query: 300 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           +PKKRVATQ VR+EKKML+ACLQ T D I  LPD T+SPCPAPYAPLL
Sbjct: 455 NPKKRVATQYVRLEKKMLHACLQATIDFINQLPDHTISPCPAPYAPLL 502


>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
 gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
          Length = 501

 Score =  606 bits (1563), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 284/347 (81%), Positives = 314/347 (90%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K EI++R EGI
Sbjct: 154 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKDEIVKRIEGI 213

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ VSLARRFAL
Sbjct: 214 RKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQNVSLARRFAL 273

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KLL NYGFVDE
Sbjct: 274 VPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKLLTNYGFVDE 333

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DN  DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+ G+E+EA+SDM+PY+RLGY
Sbjct: 334 DNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSDMIPYMRLGY 393

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF  RLA YP TL+EDE+MLTD +L+
Sbjct: 394 VSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDESMLTDGSLN 453

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PK+RVATQLVR+EKKML+ACLQ   D+I  LPD +VSPCPAPYAP L
Sbjct: 454 PKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPAPYAPSL 500


>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
 gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
 gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
          Length = 514

 Score =  599 bits (1544), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 281/348 (80%), Positives = 310/348 (89%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 167 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 226

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFAL
Sbjct: 227 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFAL 286

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDE
Sbjct: 287 VPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDE 346

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY
Sbjct: 347 DNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGY 406

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D +L 
Sbjct: 407 MSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLS 466

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 467 PRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514


>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score =  597 bits (1539), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 280/348 (80%), Positives = 309/348 (88%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 169 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 228

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFAL
Sbjct: 229 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFAL 288

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDE
Sbjct: 289 VPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDE 348

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY
Sbjct: 349 DNPYDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGY 408

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD SEMQSVISS GP+C +SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D +L 
Sbjct: 409 MSDPSEMQSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLS 468

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 469 PRKRVATRLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516


>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
          Length = 514

 Score =  596 bits (1537), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 280/348 (80%), Positives = 309/348 (88%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 167 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 226

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFAL
Sbjct: 227 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFAL 286

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDE
Sbjct: 287 VPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDE 346

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR++VEAALNTE PQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY
Sbjct: 347 DNPYDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGY 406

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D +L 
Sbjct: 407 MSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLS 466

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 467 PRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514


>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
          Length = 505

 Score =  570 bits (1469), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 265/347 (76%), Positives = 302/347 (87%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI
Sbjct: 158 MYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGI 217

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 218 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 277

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DE
Sbjct: 278 VPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDE 337

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY
Sbjct: 338 DNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGY 397

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA YP TL ED+AML D NL 
Sbjct: 398 ISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLE 457

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPAP+AP L
Sbjct: 458 PKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 504


>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
          Length = 536

 Score =  570 bits (1468), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 265/348 (76%), Positives = 302/348 (86%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI
Sbjct: 189 MYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGI 248

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 249 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 308

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DE
Sbjct: 309 VPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDE 368

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY
Sbjct: 369 DNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGY 428

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA YP TL ED+AML D NL 
Sbjct: 429 ISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLE 488

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPAP+AP L 
Sbjct: 489 PKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPELK 536


>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
 gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
          Length = 509

 Score =  569 bits (1467), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 265/347 (76%), Positives = 302/347 (87%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQG+ SFW PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI
Sbjct: 162 MYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGI 221

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 222 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 281

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DE
Sbjct: 282 VPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDE 341

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY
Sbjct: 342 DNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGY 401

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA YP TL ED+AML D NL 
Sbjct: 402 ISDPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLE 461

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPAP+AP L
Sbjct: 462 PKKEVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 508


>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 512

 Score =  569 bits (1466), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 264/347 (76%), Positives = 303/347 (87%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I
Sbjct: 165 MYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAI 224

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 225 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDE
Sbjct: 285 VPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDE 344

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY
Sbjct: 345 DNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGY 404

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +S+  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP TL EDEAML D NL 
Sbjct: 405 ISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLE 464

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKK VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPAPYAP L
Sbjct: 465 PKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511


>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 516

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 266/348 (76%), Positives = 299/348 (85%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGI
Sbjct: 169 MYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGI 228

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 229 RREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 288

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGFVDE
Sbjct: 289 VPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDE 348

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q F V  G+EKE IS+MLPYLRLGY
Sbjct: 349 DNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKETISEMLPYLRLGY 408

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD  EMQ ++SS G  CPVSPC ERAVLDQL  Y K+RLAGYP TL EDEAML D NL 
Sbjct: 409 ISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTTLDEDEAMLADGNLE 468

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPA YAP L 
Sbjct: 469 PKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTVSPCPALYAPNLK 516


>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
 gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
          Length = 512

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 261/347 (75%), Positives = 303/347 (87%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I
Sbjct: 165 MYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAI 224

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 225 RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDE
Sbjct: 285 VPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDE 344

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY
Sbjct: 345 DNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGY 404

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD  EMQS++SS G  CP+SPC ERAVLDQL  Y ++RLAGYP TL EDEAML D +L 
Sbjct: 405 ISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGSLE 464

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PKK VAT+LVR+EKKM++ACLQ T + I  LPD TVSPCPAPYAP L
Sbjct: 465 PKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTVSPCPAPYAPEL 511


>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 523

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 261/348 (75%), Positives = 298/348 (85%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGI
Sbjct: 176 MYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGI 235

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 236 KKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 295

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDE
Sbjct: 296 VPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDE 355

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V  G+EK+ IS+MLPYLRLGY
Sbjct: 356 DNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGY 415

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +SD  EMQ ++SS G  CPVSPC ERAVLDQL  Y K+RLAGYP  L EDEAML D +L 
Sbjct: 416 ISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGSLE 475

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           PKK VAT+LVR+EKKML+ CL+   + I  LPD TVSPCPA YAP L 
Sbjct: 476 PKKEVATRLVRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523


>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
          Length = 507

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 262/347 (75%), Positives = 300/347 (86%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS  EL Y TGSP K  +LER  GI
Sbjct: 160 MYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKEIMLERNSGI 219

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ V+LARRFAL
Sbjct: 220 KREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQNVNLARRFAL 279

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++LL+NYGFVDE
Sbjct: 280 VPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARLLLNYGFVDE 339

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++ GREKEA+ DMLPYLRL Y
Sbjct: 340 DNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLDMLPYLRLAY 399

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           VSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLAGYP T SEDE +L D   +
Sbjct: 400 VSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLAGYPTTASEDEIVLADPTTN 459

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           PK++VATQLV +EKKMLN+CL    ++I  LPD+ V+PCP+PY+P+L
Sbjct: 460 PKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPSPYSPIL 506


>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
          Length = 537

 Score =  520 bits (1338), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 249/316 (78%), Positives = 269/316 (85%), Gaps = 18/316 (5%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGI
Sbjct: 167 MYEKKQGKKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGI 226

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---------- 110
           KREYNELDTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ          
Sbjct: 227 KREYNELDTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLD 286

Query: 111 --------KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 162
                    V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVW
Sbjct: 287 CYASSCTQNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW 346

Query: 163 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 222
           CGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  
Sbjct: 347 CGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRV 406

Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
           G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+G
Sbjct: 407 GKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSG 466

Query: 283 YPATLSEDEAMLTDYN 298
           YP T  ED+A+    N
Sbjct: 467 YPTTPKEDDALEASCN 482


>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
 gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
          Length = 432

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 233/351 (66%), Positives = 279/351 (79%), Gaps = 4/351 (1%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  EL  Y TGS  K  +LER EG
Sbjct: 81  MYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELDEYFTGSRMKEVVLERLEG 140

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           IKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFVAVQSCVVHLQ VSL RRFA
Sbjct: 141 IKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVAVQSCVVHLQGVSLPRRFA 200

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           LVPLGPPLLAY S CKAML A  D V+L VDR YK GE I+VWCGPQPN++LL+NYGFVD
Sbjct: 201 LVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQILVWCGPQPNTRLLLNYGFVD 260

Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
            DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F +  GREKEA+ +MLPYLRLG
Sbjct: 261 PDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYLRLG 320

Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--Y 297
           +VSD   M+SV S+ GP CPVS C ERAVLDQLA YF+ R+A Y  T+ ED A+L D   
Sbjct: 321 HVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDRALLEDGSS 380

Query: 298 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-PCPAPYAPLL 347
           +++PK+RVATQL+ +EK++L+  L V       LPD +V+ PC   + P L
Sbjct: 381 DINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAPPCCGDFVPKL 431


>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
 gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
          Length = 432

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 233/351 (66%), Positives = 279/351 (79%), Gaps = 4/351 (1%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  EL  Y TGS  K  +LER EG
Sbjct: 81  MYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELDEYFTGSRMKEVVLERLEG 140

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           IKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQAFVAVQSCVVHLQ VSL RRFA
Sbjct: 141 IKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVAVQSCVVHLQGVSLPRRFA 200

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           LVPLGPPLLAY S CKAML A  D V+L VDR YK GE I+VWCGPQPN++LL+NYGFVD
Sbjct: 201 LVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQILVWCGPQPNTRLLLNYGFVD 260

Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
            DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F +  GREKEA+ +MLPYLRLG
Sbjct: 261 PDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNFQIFKGREKEAVLEMLPYLRLG 320

Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--Y 297
           +VSD   M+SV S+ GP CPVS C ERAVLDQLA YF+ R+A Y  T+ ED A+L D   
Sbjct: 321 HVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQERIAKYKTTIDEDRALLEDCSS 380

Query: 298 NLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-PCPAPYAPLL 347
           +++PK+RVATQL+ +EK++L+  L V       LPD +V+ PC   + P L
Sbjct: 381 DINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSVAPPCCGDFVPKL 431


>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 523

 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 229/348 (65%), Positives = 272/348 (78%), Gaps = 4/348 (1%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  EL  Y TGS  K  +LER  G
Sbjct: 178 MYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTGSTMKEVVLERLAG 237

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           IKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCVVHLQ VSLARRFA
Sbjct: 238 IKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCVVHLQGVSLARRFA 297

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           LVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VWCGPQPNSKLL+NYGFVD
Sbjct: 298 LVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQPNSKLLLNYGFVD 357

Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
           EDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F ++ G+E EA+ DMLPY+RL 
Sbjct: 358 EDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQIYKGKEMEAVLDMLPYMRLA 417

Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 299
           +++D  EM++V  + GP+CPVS C ERAVL+QL  YF+ RLAGY ++ + +     D   
Sbjct: 418 HLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRLAGYKSSHATEGG---DAKK 474

Query: 300 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
           + KKRVA +L+ +EK +L   L    ++I  LPD  +SPC  PY P L
Sbjct: 475 NAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCIGPYLPNL 522


>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 427

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 224/348 (64%), Positives = 265/348 (76%), Gaps = 4/348 (1%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  EL  Y TGS  K  +LER  G
Sbjct: 82  MYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPEELNEYFTGSTMKEVVLERLAG 141

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           IKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCVVHLQ VSLARRFA
Sbjct: 142 IKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCVVHLQGVSLARRFA 201

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           LVPLGPPLLAY S CKAML AV D VQL VD  YK G+ I VWCGPQPNSKLL+NYGFVD
Sbjct: 202 LVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDPIAVWCGPQPNSKLLLNYGFVD 261

Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
           EDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F ++ G+E EA+ DMLPY+RLG
Sbjct: 262 EDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTFQIYKGKEMEAVRDMLPYMRLG 321

Query: 240 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 299
           +++D  E+++V  +  P+C VS C ERAVL+Q+  +F+ RLAGY    S D     D   
Sbjct: 322 HLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFERRLAGYK---SSDTTKAVDAKK 378

Query: 300 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 347
             K+ VA +L+ +EK +L   L    ++I  LPD  +SPC  PY P L
Sbjct: 379 DAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISPCIGPYLPNL 426


>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 425

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 199/255 (78%), Positives = 229/255 (89%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQG+ S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGI
Sbjct: 171 MYEKKQGRDSLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGI 230

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K+EYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFAL
Sbjct: 231 KKEYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFAL 290

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           VPLGPPLL Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDE
Sbjct: 291 VPLGPPLLTYKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDE 350

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
           DNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V  G+EK+ IS+MLPYLRLGY
Sbjct: 351 DNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGY 410

Query: 241 VSDTSEMQSVISSLG 255
           +SD  EMQ ++SS G
Sbjct: 411 ISDPDEMQCILSSEG 425


>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
 gi|194693232|gb|ACF80700.1| unknown [Zea mays]
 gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
 gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 252

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 173/242 (71%), Positives = 206/242 (85%)

Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
           ++  QKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69

Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
           Q NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKE 129

Query: 226 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
           K+ +++MLPYLRLGY+S+  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPT 189

Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 345
           TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPAPYAP
Sbjct: 190 TLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAP 249

Query: 346 LL 347
            L
Sbjct: 250 EL 251


>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
 gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 252

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 166/240 (69%), Positives = 203/240 (84%)

Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69

Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
           Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129

Query: 226 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
           K+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189

Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 345
           TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 190 TLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249


>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 281

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 166/269 (61%), Positives = 203/269 (75%), Gaps = 29/269 (10%)

Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69

Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
           Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129

Query: 226 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
           K+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189

Query: 286 TLSEDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKM 316
           TL+EDEAM                             L D +L PK+ VAT+LVR+EKKM
Sbjct: 190 TLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKM 249

Query: 317 LNACLQVTADMIMLLPDVTVSPCPAPYAP 345
           L+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 250 LHACLQATNEFITDLPDHTVSPCPAQYAP 278


>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 165/322 (51%), Positives = 222/322 (68%), Gaps = 9/322 (2%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYEKK G++S W  +I+ELDR +GRGQ+  +SPLLW E ++  YL GSP  AEI ER +G
Sbjct: 99  MYEKKNGRQSVWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKG 158

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           I++EY ELDTVWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFA
Sbjct: 159 IEKEYAELDTVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFA 218

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           LVPLGPPLL+YSS  KAML    +A  VQL VDR Y  GE I  WCGPQPN +LL+NYG 
Sbjct: 219 LVPLGPPLLSYSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAWCGPQPNRRLLLNYGI 278

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
           V ++NP+D++ +   L   DP +Q KR V Q+N   + Q F +   R+K     +LPYLR
Sbjct: 279 VTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYLR 336

Query: 238 LGYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
           L + +D + ++  +++L   C  P+SP  ER VL QLA + + RL  Y  T  EDE ++ 
Sbjct: 337 LAHCTDAASLK--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVIIR 394

Query: 296 DYNLHPKKRVATQLVRMEKKML 317
                P+++VA +L+R+EK +L
Sbjct: 395 STTAGPRQKVAARLLRIEKAIL 416


>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
          Length = 435

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 153/321 (47%), Positives = 211/321 (65%), Gaps = 10/321 (3%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGI 60
           YEKK+GK+  W  +I+ELDR +GRG    +SPLLW E + A  L GSP   EI  R +GI
Sbjct: 91  YEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLWDEGQAAELLAGSPVVGEIEARLQGI 150

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++EY ELDTVW++AGSLF + P+  PTE F+F +F+QAF AVQS VVHLQ V+L +RFAL
Sbjct: 151 RKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVFRQAFTAVQSSVVHLQGVALGKRFAL 210

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VP+GPPLL YSS  KAML    ++  V+L VDR Y+ GE+++ WCGPQPNS+LLINYG V
Sbjct: 211 VPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAYQPGEAVLAWCGPQPNSRLLINYGIV 270

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           DE NPYD+L +   + ++DP Y+ KR      G  + Q F + A     A   +LPYLRL
Sbjct: 271 DESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLSTQQTFQLQAAASLPA--QLLPYLRL 328

Query: 239 GYVSDTSEMQSVI--SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
            + +  ++++ V      GP+ P +   E  VL+QL  + + R + Y  T+ EDEA++ D
Sbjct: 329 VHSTREADVEGVKWEEEAGPVAPEN---ELTVLNQLITHLRLRQSRYRTTIEEDEAIIAD 385

Query: 297 YNLHPKKRVATQLVRMEKKML 317
               P+  VA +L+++EK +L
Sbjct: 386 PAKGPRPTVAARLLKIEKGIL 406


>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
 gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
          Length = 348

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 130/160 (81%), Positives = 148/160 (92%)

Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 248
           ++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248

Query: 249 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 308
           SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308

Query: 309 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 348
           LVR+EKK+LNACLQ TAD+I  LPD++VSPCPAPYAP+L 
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPILK 348



 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 103/110 (93%), Positives = 107/110 (97%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+LERA+GI
Sbjct: 81  MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLERADGI 140

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
           KREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQ
Sbjct: 141 KREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQ 190


>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
 gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
          Length = 488

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 151/321 (47%), Positives = 200/321 (62%), Gaps = 9/321 (2%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYEKK  KKSFW PYI+ELD+Q+ RG  A ESPLLW + EL + L GSP    + +R  G
Sbjct: 127 MYEKKLKKKSFWYPYIKELDKQQARGPQAAESPLLWGDQELDSLLKGSPLLPAVRQRQAG 186

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           I++EY  LDTVWFMAGSLF +YP+D+PTE F+FE+F+QAF  VQ+ +VHLQ V +A+RFA
Sbjct: 187 IRKEYEALDTVWFMAGSLFNKYPFDLPTETFSFELFQQAFAVVQASIVHLQGVPIAKRFA 246

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           LVPLGPPL+AYSS  K M+   +D  +V+LVV  P +AG  +  WCGPQPNS+LL+NYG 
Sbjct: 247 LVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVSGPVEAGRPVAAWCGPQPNSRLLLNYGV 306

Query: 178 VDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 235
           VDE NP+D+L       L T DP +  KR V    G  + Q F V   R       +LPY
Sbjct: 307 VDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLSEAGLATQQSFDVSVARPLP--PQLLPY 364

Query: 236 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
           + L   +   ++ SV  S          +E A L  L  Y + R A Y   L  D  ++ 
Sbjct: 365 MMLALATTPEQVASV--SFSDTAGHDRELEAAALAALMAYVQRRTAAYAHPLWRDLEIIN 422

Query: 296 DYNLHPKKRVATQLVRMEKKM 316
           D +  P+++VA +L ++EK +
Sbjct: 423 DPSSTPRQKVAARLTKIEKSI 443


>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
          Length = 130

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 91/110 (82%), Positives = 100/110 (90%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL YL+GSP K E++ R E I
Sbjct: 19  MYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELDYLSGSPLKDEVVARDEAI 78

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
           +REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 79  RREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQ 128


>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 638

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 99/325 (30%), Positives = 152/325 (46%), Gaps = 27/325 (8%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPTKAEILERAEG 59
           MYE+  G+KS W PY++              SP+LW+ E +   L GSP   E+ +R+  
Sbjct: 161 MYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPALEEVQQRSAA 213

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           ++ EY +L        S F + P   P E F+ E FK AF  + S  V+L    L   FA
Sbjct: 214 LEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADL---FA 263

Query: 120 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQPNSKLLINYG 176
           LVP    L  + +  +A L  +  D AV   VDR YK GE +    G  + N+ LLI YG
Sbjct: 264 LVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNADLLITYG 322

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
           FVDE+N  D L +E  L   D     K+ + Q+    S Q F ++  R     + +L Y+
Sbjct: 323 FVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP---TQLLTYM 379

Query: 237 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
           RL  + D +    ++     +  +    E   L  L    + +L  Y   + ++  +L +
Sbjct: 380 RLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGGVDDEIRLLKN 437

Query: 297 YNLHPKKRVATQLVRMEKKMLNACL 321
             +  ++RVA QL   EKK+L + +
Sbjct: 438 KKISQRERVAAQLRLCEKKILTSTM 462


>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Ectocarpus siliculosus]
          Length = 521

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 98/338 (28%), Positives = 154/338 (45%), Gaps = 26/338 (7%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++EK +GK+SFW  YI  L          V    LW+E +LA L GSP  A        +
Sbjct: 185 VHEKAKGKESFWSSYIGVLPTVE-----EVYPTYLWAEEDLALLEGSPVIAATESMRRKL 239

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + EY  ++        L  ++P  +P E  T+E F+ AF  + S  + L  +S     AL
Sbjct: 240 EVEYATVEN------DLLDKFPEILPREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVAL 293

Query: 121 VPLG------PPLLAY-SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
           VP        P   +Y  ++ + +  +  D V +  DR YK  E + +  GP+ NS LL+
Sbjct: 294 VPYADLFNHNPFANSYIDARQQGLFFSKTDEVVVYADRSYKKMEQVYISYGPKGNSDLLL 353

Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 233
            YGF  + NPY+ + V  +L+  D  Y+ K+      G    + F ++  R  +   ++L
Sbjct: 354 LYGFSLDRNPYNSVDVTVSLDENDELYERKKAFLSEAGLPPTKAFPLYNDRYPD---ELL 410

Query: 234 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
            YLRL  ++        +  L      +   E  VLD L +  KA +AGYP T  +D  +
Sbjct: 411 QYLRLIQLNTDQLRGRTLEDLSFEKKQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKL 470

Query: 294 LTD----YNLHPKKRVATQLVRMEKKMLNACL-QVTAD 326
           + D      L   +R+A +  R EK +L   +  VT D
Sbjct: 471 MNDPGMFRALSKTQRMAVKHRRQEKVILRRTIAAVTKD 508


>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Galdieria sulphuraria]
          Length = 490

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 97/332 (29%), Positives = 162/332 (48%), Gaps = 39/332 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEG 59
           +YEK +G  SF+ PY+  L        L   +PL LWS  +L  L GSPT +   +  + 
Sbjct: 174 LYEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDK 227

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           + REY       ++  ++  Q P +  ++   F+ F+ AF  + S  +       ++R A
Sbjct: 228 LLREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIA 277

Query: 120 LVPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
           LVP    LL +S  C A +          V +AV + VDR Y+  E + V  GP+ N +L
Sbjct: 278 LVPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQEL 335

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 231
           L+ YGF  E NP+D + +   L+  DP Y +K  + +  GK  +Q F ++  R      +
Sbjct: 336 LLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYMDRYP---VE 392

Query: 232 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
           M  +LR   +   +++Q+   ++     VS   E + LD+L +Y   +L  YP +L +DE
Sbjct: 393 MAEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRKYPTSLEDDE 447

Query: 292 AMLTDY----NLHPKKRVATQLVRMEKKMLNA 319
            ++ D      L   +R+A +    EK++L+A
Sbjct: 448 KIIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479


>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 575

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 159/356 (44%), Gaps = 52/356 (14%)

Query: 1   MYEKK-QGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 56
           +YE+  +G++S W PY+    ++D         V     W + +LA+L GSP  A     
Sbjct: 179 IYERNVRGEESPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSL 230

Query: 57  AEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
              ++REY+ L     + G   L  +YP   P EAF F+ ++ AF  + S  + L+ +  
Sbjct: 231 QMKLRREYDAL-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQ 285

Query: 115 ARRFALVPLGPPLLAYSSKCKA----------MLAAVDDAVQLVVDRPYKAGESIVVWCG 164
               ALVP    L+ +S   +A          +  + D+ V L  DR Y+  E I +  G
Sbjct: 286 GETLALVPYAD-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYG 344

Query: 165 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQR 209
           P+ N++LL+ YGF  E NP++ + V  ++                  DP  ++K    ++
Sbjct: 345 PKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQ 404

Query: 210 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 269
            G+ +   F  +A R      +ML YLRL  ++        ++       +S   E AVL
Sbjct: 405 VGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAVL 461

Query: 270 DQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 321
             +      +L+ YP +  +D A++ D +L       +R+A +  R EK++L   +
Sbjct: 462 TSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517


>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
          Length = 648

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 91/355 (25%), Positives = 153/355 (43%), Gaps = 49/355 (13%)

Query: 6   QGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           +G+ SF+  Y+    E+D         V     W + +LA+L GSP  A        ++R
Sbjct: 264 KGEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRR 315

Query: 63  EYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           EY++L     + G   L  ++P   P E +TFE ++ AF  + S  + L+ + +  R A+
Sbjct: 316 EYDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAM 370

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA----------VQLVVDRPYKAGESIVVWCGPQPNSK 170
           VP    L+ +S+  +A + A +            V L  DR Y+  E + +  G + N++
Sbjct: 371 VPYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAE 429

Query: 171 LLINYGFVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVAQRNGKLSVQV 217
           LL+ YGF  E NPY+ + V  ++               EDP   +K       G+     
Sbjct: 430 LLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVD 489

Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 277
           F  +A R      +ML YLRL  ++        +S       +S   E +VL  +    K
Sbjct: 490 FPCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEASVLRSVVAAVK 546

Query: 278 ARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQVTADMI 328
            +L  +P T  +D A++ D  +       +R+A +  R EK++L   L      I
Sbjct: 547 YQLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAALEKQI 601


>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
 gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
          Length = 434

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 84/321 (26%), Positives = 140/321 (43%), Gaps = 38/321 (11%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPT 87
           V     W + +LA+L GSP  A        ++REY+ L     + G   L  ++P   P 
Sbjct: 123 VNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-----LGGQDGLIAKFPDRFPA 177

Query: 88  EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD----- 142
           E FT+E +  AF  + S  + L+ + +  R A+VP    L+ +S+   A + A +     
Sbjct: 178 EHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYAD-LINHSAFSGAFIDARESGDWL 236

Query: 143 -----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE- 196
                + V L  DR Y+  E + +  G + N++LL+ YGF  E NPY+ + V  ++    
Sbjct: 237 FKNGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGFALERNPYNSVDVTVSIAPRT 296

Query: 197 ------------DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
                       DP  Q+K       G+     F  +A R      +ML +LRL  ++  
Sbjct: 297 AALAAANEGIEVDPLAQEKVEFLASVGRDQTVDFPCYADRYP---VEMLEFLRLMMMTPE 353

Query: 245 SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--- 301
                 ++       +SP  E AVL  + +  K +L  YP +  +D  ++ D  L     
Sbjct: 354 DTRGKPLADFDYSRTISPANEAAVLSSVVEAVKYQLNLYPQSEEDDANIIKDKALFRLLS 413

Query: 302 -KKRVATQLVRMEKKMLNACL 321
             +R+A +  R EK++L   L
Sbjct: 414 YNQRMAVRHRRNEKRLLKRTL 434


>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
          Length = 516

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 144/329 (43%), Gaps = 34/329 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E+ +G +SFW  YI  L      G     +   W   ELAYL GSP  +        +
Sbjct: 176 ILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEGSPAASATASMMAKL 230

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E+         A  L      D   E FTFE ++ AF  + S  + L+        A+
Sbjct: 231 RAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRAIRLKASRAGELLAM 279

Query: 121 VPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           VP        P  +     + +  A      +D V L  DR YK  E + +  GP+ N+ 
Sbjct: 280 VPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKFEQVFISYGPKSNAD 339

Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSVQVFHVHAGREKEAI 229
           LL+ YGF  + NP++ + +    + +D  Y  K   A+  G+ +S   F ++A R  +  
Sbjct: 340 LLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSSAAFPLYADRFPD-- 397

Query: 230 SDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 288
            +++ +LR+   + D    + +      +  +S   E AVLD + D   A +A YPA   
Sbjct: 398 -ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRDACDAAVAAYPAKSG 456

Query: 289 EDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           +D   + D  L   +R+A +LV  EK++L
Sbjct: 457 DD---VPDAFLSRNQRMAKRLVNTEKRIL 482


>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
 gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
          Length = 481

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/324 (29%), Positives = 145/324 (44%), Gaps = 27/324 (8%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT-GSPTKAEILERAEG 59
           MYE+++ + S W PY++           +  SPLLW + E   L  GS   A++ ++   
Sbjct: 126 MYEQERSQDSPWYPYLKVF-------PASTLSPLLWEQEEQEELLRGSSALAKVKDQLTS 178

Query: 60  IKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
           +++ ++ L DT+         +   D P E FTF  FK AF  V S  V+L    L   F
Sbjct: 179 LRQTFDALKDTL---------KDNKDFPMEKFTFSAFKAAFSVVLSRAVYLPSAEL---F 226

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
           ALVP G  +   SS+        +  V+L VD+ YK G+ +   +     ++  LI YGF
Sbjct: 227 ALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSADFLIRYGF 286

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
           +DE +  D + +E  L + D     KR + Q  G    Q F V+  R     + +L Y R
Sbjct: 287 LDESDENDFIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPVYLNRFP---TQLLTYTR 343

Query: 238 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
           L  + D+     +      I  V    E   L  L    + +L  +  T+ +D   L   
Sbjct: 344 LARIQDSGLFAKITFEKDLI--VCQTNEYETLMLLMADCRTKLLSFSDTMEDDMQTLKRK 401

Query: 298 NLHPKKRVATQLVRMEKKMLNACL 321
           NL  K+RVA QL   EK++L   +
Sbjct: 402 NLSYKQRVAAQLRLKEKRILTDTM 425


>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
 gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 101/216 (46%), Gaps = 21/216 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+  EL  YL  S  +   +ER   +   YN+
Sbjct: 161 KSSRWSNYISALPRQ-------PYSLLYWTRAELDTYLEASQIRERAIERITNVTGTYND 213

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L         +F +YP+  P E F  E FK +F  + S +V L   S+  R ALVP    
Sbjct: 214 LRL------RIFSKYPHLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD- 264

Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           +L +SS+ +  L        V    DRPY+ GE + +  G + N +LL++YGFV  +  N
Sbjct: 265 MLNHSSEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGRKSNGELLLSYGFVPREGTN 324

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           P D + +  +L   D  Y++K    +++G    Q F
Sbjct: 325 PSDSVELSLSLKKSDKCYKEKLEALKKHGLSVSQCF 360


>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 504

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+ TEL  YL  S  +   +ER   +   Y +
Sbjct: 162 KSSRWYNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L +       +F ++P+  P E F  E FK +F  + S +V L   S+  RFALVP    
Sbjct: 215 LRS------RIFSKHPHLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265

Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           +L ++ + +  L        V    DRPY+ GE + +  G + N +LL++YGFV  +  N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K    +++G  + Q F V 
Sbjct: 326 PSDSVELALSLRKNDKCYKEKLDALKKHGLSTPQCFPVR 364


>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
 gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
          Length = 600

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/320 (27%), Positives = 140/320 (43%), Gaps = 25/320 (7%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYE+++ + S W PY++            +   L   E +   L GS   A++ ++   +
Sbjct: 126 MYEQERSQDSPWYPYVKVFPAS------TLSLLLWEQEEQEELLRGSSALAKVKDQLTSL 179

Query: 61  KREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           ++ ++ L DT+         +   D P E FTF  FK AF  V S  V+L    L   FA
Sbjct: 180 RQTFDALKDTL---------KDNKDFPMEKFTFSAFKTAFSVVLSRAVYLPSAEL---FA 227

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 178
           LVP G  +   SS+        +  V+L VD+ YK G+ +   +     ++  LI YGF+
Sbjct: 228 LVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNLTSADFLIRYGFL 287

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           DE +  D + +E  L + D     KR + Q  G    Q F ++  R     + +L Y RL
Sbjct: 288 DESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR---FPTQLLTYTRL 344

Query: 239 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 298
             + D+     +      I  VS   E   L  L    + +L     T+ ++   L   N
Sbjct: 345 ARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLMADCRTKLLSSSDTMEDEMQTLRRKN 402

Query: 299 LHPKKRVATQLVRMEKKMLN 318
           L  K+RVA QL   EK++L 
Sbjct: 403 LSYKQRVAAQLRLKEKRILT 422


>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 504

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+ TEL  YL  S  +   +ER   +   Y +
Sbjct: 162 KSSRWFNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L +       +F ++P   P E F  E FK +F  + S +V L   S+  RFALVP    
Sbjct: 215 LRS------RIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265

Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           +L ++ + +  L        V    DRPY+ GE + +  G + N +LL++YGFV  +  N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K    +++G  + Q F V 
Sbjct: 326 PSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364


>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
          Length = 504

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+ TEL  YL  S  +   +ER   +   Y +
Sbjct: 162 KSSRWFNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L +       +F ++P   P E F  E FK +F  + S +V L   S+  RFALVP    
Sbjct: 215 LRS------RIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265

Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           +L ++ + +  L        V    DRPY+ GE + +  G + N +LL++YGFV  +  N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVVFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K    +++G  + Q F V 
Sbjct: 326 PSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364


>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
          Length = 504

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+ TEL  YL  S  +   +ER   +   Y +
Sbjct: 162 KSSRWFNYISALPRQ-------PYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYED 214

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L +       +F ++P   P E F  E FK +F  + S +V L   S+  RFALVP    
Sbjct: 215 LRS------RIFSKHPQLFPKEVFNDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD- 265

Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           +L ++ + +  L        V    DRPY+ GE + +  G + N +LL++YGFV  +  N
Sbjct: 266 MLNHNCEVETFLDYDKSSKGVIFTTDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTN 325

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K    +++G  + Q F V 
Sbjct: 326 PSDSVELALSLRKNDKCYEEKLDALKKHGLSTPQCFPVR 364


>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
 gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
 gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
          Length = 497

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L WS+ EL  YL  S  +   +ER   +   YN+
Sbjct: 155 KSSRWFSYISALPRQ-------PYSLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND 207

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
                 M   +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    
Sbjct: 208 ------MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD- 258

Query: 127 LLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           ++ +S + +  L     +  +V   DRPY+ GE + +  G + N +LL++YGFV ++  N
Sbjct: 259 MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTN 318

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K  + ++ G    Q F + 
Sbjct: 319 PSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIR 357


>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
 gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
          Length = 494

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L WS+ EL  YL  S  +   +ER   +   YN+
Sbjct: 152 KSSRWFSYISALPRQ-------PYSLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND 204

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
                 M   +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    
Sbjct: 205 ------MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD- 255

Query: 127 LLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           ++ +S + +  L     +  +V   DRPY+ GE + +  G + N +LL++YGFV ++  N
Sbjct: 256 MMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTN 315

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K  + ++ G    Q F + 
Sbjct: 316 PSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPIR 354


>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 507

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 84/351 (23%), Positives = 158/351 (45%), Gaps = 36/351 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +        ++PL ++E E+ YL  +    ++  + +   R+Y    
Sbjct: 94  NSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY---- 142

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FTF+ ++ A  +V +    +     +R   AL+PL 
Sbjct: 143 -AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 199

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +S         DD  + V  + +K+GE I ++ G + N++ +I+ GF  E+N +
Sbjct: 200 DMCNHTNSLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLH 259

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+      
Sbjct: 260 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNED 318

Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              G++     +  + +      PVS   E  +  +L  + +AR    L  Y  T+ +D 
Sbjct: 319 ELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDN 374

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 342
            +L   ++     +A +L R+EK++L   L+  +D   L    +    P P
Sbjct: 375 KVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 425


>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
          Length = 512

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 161/351 (45%), Gaps = 33/351 (9%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 87  NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 135

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 136 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 192

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 193 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 252

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD- 243
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 253 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 311

Query: 244 --------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L 
Sbjct: 312 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 371

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 346
           +++L  + ++A +L   EK++L   ++  A    + P + +SP     APL
Sbjct: 372 NHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGKAPL 417


>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
          Length = 541

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 138/316 (43%), Gaps = 20/316 (6%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFWLPY++ L         +  SPL ++  +L  L  SPT +E++ +   I R+Y     
Sbjct: 168 SFWLPYLKILPH-------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFN 220

Query: 70  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLL 128
           + F    L  + P  I  +   ++ ++ A  +V +    +  +   R   AL+PL     
Sbjct: 221 L-FQGHELASKLP--IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCN 277

Query: 129 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
             + +     +  +D  +        AG  + ++ G + N++LLI+ GFV   N  DRL 
Sbjct: 278 HTNGQITTDFSLKNDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLT 337

Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 248
           +   ++  DP +  K  V  R    + ++F +H G      SD L +LR+  +++  +++
Sbjct: 338 IRLGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLR 395

Query: 249 SVISSLGPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 301
           + ++    I         VS   ER     LA      L  YP +  ED  +L   +L  
Sbjct: 396 TALACRQQISKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLST 455

Query: 302 KKRVATQLVRMEKKML 317
             R+A QL   EK +L
Sbjct: 456 HARLAVQLRHCEKNIL 471


>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 590

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 155/338 (45%), Gaps = 49/338 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++++ K+S W PY+  L           + P+ +SE +L++L GSP + ++ E+   I
Sbjct: 234 LLQERKNKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQVREKKADI 287

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KR+Y+++ +V       F +Y         TFE F  A +   S V  LQ ++  +  A 
Sbjct: 288 KRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-INEQKTDAF 333

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           VPL   L     + K      DD      +Q + D P   GE +    G + NS+  +NY
Sbjct: 334 VPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCNSRFFLNY 389

Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKEAISDMLP 234
           GF++ DN  + + +    + EDP  + K+ +    G +   +V+ +    +++ +S+ + 
Sbjct: 390 GFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQNVSEFMS 447

Query: 235 YLRLGYVSDTSEMQSVISSLG----------------PICPVSPCMERAVLDQLADYFKA 278
           YLR   + D S++  ++SSL                    P+S   E  +  ++++  + 
Sbjct: 448 YLRFILIRDNSKL-LMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRISNMCQT 506

Query: 279 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
            ++ Y  TL ED+ +L   NL   +R    L   EK++
Sbjct: 507 SISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544


>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
 gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
          Length = 595

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 81/330 (24%), Positives = 150/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P  + +P  E+FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPQAHKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEG 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+  L 
Sbjct: 394 ELKEHLLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++ L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHGLSVRATMAVKLRLGEKEILEKAVESAA 483


>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 568

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 62/366 (16%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K    SFW PYI+ L +  G       + L ++  EL  L GSPT    ++    +  +
Sbjct: 156 EKHNHSSFWTPYIKTLPKSYG-------TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQ 208

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFAL 120
           Y  +         LFQ     +   AFT++ F  A  AV   Q+ V      +L+  +AL
Sbjct: 209 YTYIH-------DLFQIRKDILHINAFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YAL 260

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           +P            +       D+ +    R YK GE + ++ GP+PNS LL++ GFV E
Sbjct: 261 IPAWDMCNHDHGDLQTFWDVNSDSTESHAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYE 320

Query: 181 DNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
           +N +D L +   L  +    +DK R++   N K+  Q +    G       D++ +LR+ 
Sbjct: 321 NNRFDALAIRVRLAPDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----LAVDLMAFLRI- 375

Query: 240 YVSDTSEMQSVISS-------------------------LGPICPVSPCMERAVLDQLAD 274
           +  +  E+Q V+ +                           P   ++   E A L     
Sbjct: 376 HAMNEQELQQVLGAYDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNELAALQLAEA 435

Query: 275 YFKARLAGYPATLS---------EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
              + L+ YP TL          ED+A L   +L P  R  T L   EK++LN  L    
Sbjct: 436 KCLSLLSLYPTTLQVANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEILNRTL---- 491

Query: 326 DMIMLL 331
           D I LL
Sbjct: 492 DAIRLL 497


>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 582

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 157/351 (44%), Gaps = 36/351 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +        ++PL ++E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FTF+ ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + +K+GE I ++ G + N++ +I+ GF  E+N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+      
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNED 393

Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              G++     +  + +      PVS   E  +  +L  + +AR    L  Y  T+ +D 
Sbjct: 394 ELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDN 449

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 342
            +L   ++     +A +L R+EK++L   L+  +D   L    +    P P
Sbjct: 450 KVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 500


>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
           tropicalis]
 gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 581

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 157/351 (44%), Gaps = 36/351 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +        ++PL ++E E+ YL  +    ++  + +   R+Y    
Sbjct: 168 NSFWLPYIKTLPNE-------YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY---- 216

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FTF+ ++ A  +V +    +     +R   AL+PL 
Sbjct: 217 -AYFY--KVIQTHPNANKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 273

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + +K+GE I ++ G + N++ +I+ GF  E+N +
Sbjct: 274 DMCNHTNGLITTGYNLEDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLH 333

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+      
Sbjct: 334 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNED 392

Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              G++     +  + +      PVS   E  +  +L  + +AR    L  Y  T+ +D 
Sbjct: 393 ELKGHLIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDN 448

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 342
            +L   ++     +A +L R+EK++L   L+  +D   L    +    P P
Sbjct: 449 KVLEQPDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 499


>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
          Length = 497

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 150/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 169 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 216

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 217 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWD 274

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 275 XCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 334

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------- 238
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+       
Sbjct: 335 RVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVFALHFT-EPPISAQLLAFLRVFCXTEEE 393

Query: 239 --GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
              ++   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 394 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 453

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  + + A +L   EK++L   ++  A
Sbjct: 454 HDLSVRAKXAIKLRLGEKEILEKAVKSAA 482


>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Spinacia oleracea]
          Length = 491

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 87/326 (26%), Positives = 149/326 (45%), Gaps = 22/326 (6%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M EKK G  S W PYI  L         +  S + WSE EL+ L GS      L   E +
Sbjct: 135 MREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELV 187

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
             E+ +L+    +     Q +P+D+  + F F  F        +C+   Q + L     L
Sbjct: 188 ANEFAKLEEEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCL-EGQSLVLIPLADL 243

Query: 121 VPLGPPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYG 176
               P + A  Y+ + + A L + +    L    P KAG+ +++ +   + N++L ++YG
Sbjct: 244 ANHSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYG 303

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
             +  +  +   +   +   D  Y DK  +A+ NG      F +    E+   ++MLPYL
Sbjct: 304 LTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYL 361

Query: 237 RLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
           RL  +   D   ++S+   S  G +  P+SP  E  +   + D   + L+GY  T++EDE
Sbjct: 362 RLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDE 421

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKML 317
            +L + ++ P+  +A  +   EKK+L
Sbjct: 422 KLLAEGDIDPRLEIAITIRLGEKKVL 447


>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
          Length = 594

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 152/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  + ++A +L   EK++L   ++  A
Sbjct: 455 HDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
 gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
 gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
 gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
          Length = 594

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 152/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  + ++A +L   EK++L   ++  A
Sbjct: 455 HDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 499

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 100/219 (45%), Gaps = 21/219 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           + S W  YI  L RQ         S L W++ EL  YL  S  +   +ER   +   YN+
Sbjct: 157 ESSRWSNYISALPRQ-------PYSLLYWTQAELDRYLEASQIRERAIERINNVIGTYND 209

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L         +F +YP   P E F  E FK +F  + S +V L   S+    ALVP    
Sbjct: 210 LRL------RIFSKYPDLFPDEVFNIESFKWSFGILFSRLVRLP--SMGGNVALVPWAD- 260

Query: 127 LLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           +L +S   +  L        +    DRPY+ GE + +  G + N +LL++YGFV ++  N
Sbjct: 261 MLNHSCDVETFLDYDKTSKGIVFTTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGAN 320

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           P D + +  +L   D  Y++K  + +  G  + Q F + 
Sbjct: 321 PSDSVELSLSLKKSDASYKEKLELLKNYGLSASQCFPIQ 359


>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
          Length = 585

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/330 (24%), Positives = 152/330 (46%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  +AFT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           D++L  +  +A +L   EK++L   ++  A
Sbjct: 454 DHDLSVRAAMAIKLRLGEKEILEKAVKSAA 483


>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
           [Spinacia oleracea]
 gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase II [Spinacia oleracea]
          Length = 495

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 152/329 (46%), Gaps = 24/329 (7%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M EKK G  S W PYI  L         +  S + WSE EL+ L GS      L   E +
Sbjct: 135 MREKKLGNSSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELV 187

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSL 114
             E+ +L+    +     Q +P+D+  + F   F + + +AF  ++     ++ L  + +
Sbjct: 188 ANEFAKLEEEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWV 245

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLI 173
            +      +  P  A+  +  A L + +    L    P KAG+ +++ +   + N++L +
Sbjct: 246 QQANHSPDITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELAL 304

Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 233
           +YG  +  +  +   +   +   D  Y DK  +A+ NG      F +    E+   ++ML
Sbjct: 305 DYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANML 362

Query: 234 PYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 288
           PYLRL  +   D   ++S+   S  G +  P+SP  E  +   + D   + L+GY  T++
Sbjct: 363 PYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIA 422

Query: 289 EDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           EDE +L + ++ P+  +A  +   EKK+L
Sbjct: 423 EDEKLLAEGDIDPRLEIAITIRLGEKKVL 451


>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 431

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 116/243 (47%), Gaps = 25/243 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           + E  + + S W PY   L +          S L W+E E+  +LT SP + + LE    
Sbjct: 94  ISEASREESSRWFPYFATLPK-------TPSSILQWTEEEVNTWLTASPVREKALECIRD 146

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           +   Y +L        ++F ++P   P++ +T   FK AF  + S +V L  V    + A
Sbjct: 147 VTETYRDL------RATIFLKHPEVFPSQVYTLAAFKWAFGILFSRLVRLPSVG---KLA 197

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           LVP    +L +S +  + L    +   +V  V DR Y++GE + +  G + + +L + YG
Sbjct: 198 LVPWAD-MLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQVFISYGKRSSGELFLAYG 256

Query: 177 FV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 235
           F+  E N +D + +E  ++++DP ++ K   A   G  S Q F V   R+    + +L Y
Sbjct: 257 FIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQRFPV---RKDGFPAQLLAY 313

Query: 236 LRL 238
            RL
Sbjct: 314 ARL 316


>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
          Length = 594

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 150/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 S          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHDLSARATMAIKLRLGEKEILERAVKSAA 483


>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
          Length = 488

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/324 (23%), Positives = 146/324 (45%), Gaps = 24/324 (7%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 177 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 224

Query: 70  VWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPL 127
            +F        + + +P  ++FT+E ++ A  +V +    +     +R   AL+PL    
Sbjct: 225 AYFYKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMC 284

Query: 128 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
              S          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+
Sbjct: 285 NHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRV 344

Query: 188 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--- 244
            ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    
Sbjct: 345 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELK 403

Query: 245 ------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 298
                 S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L +++
Sbjct: 404 EHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHD 463

Query: 299 LHPKKRVATQLVRMEKKMLNACLQ 322
           L  +  +A +L   EK++L   ++
Sbjct: 464 LSARATMAIKLRLGEKEILERAVK 487


>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
          Length = 601

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 150/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 177 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 224

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 225 AYFY--KVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 282

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                S          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 283 MCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 342

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 343 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 401

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L +
Sbjct: 402 LKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKN 461

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  +  +A +L   EK++L   ++  A
Sbjct: 462 HDLSARATMAIKLRLGEKEILERAVKSAA 490


>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 508

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 70/220 (31%), Positives = 104/220 (47%), Gaps = 23/220 (10%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+  EL  YL  S  +   +ER   +   YN+
Sbjct: 166 KSSKWSNYISALPRQ-------PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 218

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L         +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    
Sbjct: 219 LRL------RIFSKYPDLFPEEVFNLETFKWSFGILFSRLVRLP--SMDGKVALVPWAD- 269

Query: 127 LLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED-- 181
           +L +S + +  L   D + Q VV   DR Y+ GE + +  G + N +LL++YGFV  +  
Sbjct: 270 MLNHSCEVETFLD-YDKSSQGVVFTTDRQYEPGEQVFISYGKKSNGELLLSYGFVPREGT 328

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           NP D + +  +L   D  Y++K    +++G  + Q F V 
Sbjct: 329 NPSDSVELSLSLKKSDKSYKEKLEALKKHGFSASQCFPVR 368


>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
 gi|223949395|gb|ACN28781.1| unknown [Zea mays]
 gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 489

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 95/334 (28%), Positives = 148/334 (44%), Gaps = 38/334 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+
Sbjct: 136 LSEVARGADSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGV 184

Query: 61  KREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLA 115
           K EY  +E D+V     S  +    D+   + TF+ F  AF  ++S V   +   K++L 
Sbjct: 185 K-EYVQSEFDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALI 239

Query: 116 RRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSK 170
               LV   P + +  S    K K +    +    L      K+G+ I + +   + N++
Sbjct: 240 PFADLVNHSPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLDKSNAE 298

Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
           L ++YGFV+ +   D   V   ++  DP Y DK  +A+ NG      F V     +    
Sbjct: 299 LALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPLPP 356

Query: 231 DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
            MLPYLRL  +  T           SV   L    P+SP  E ++   + D  K+ LA Y
Sbjct: 357 QMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALADY 414

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
             T+ EDE +    NL P+  +A  +   EKK+L
Sbjct: 415 HTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVL 448


>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3 [Ovis aries]
          Length = 596

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 150/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 176 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 224

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 225 -AYFY--RVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 281

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 S          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 282 DMCNHTSGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 341

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 342 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 400

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 401 ELKEHLLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLK 460

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 461 NHDLSARATMAVKLRLGEKEILERAVKSAA 490


>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           troglodytes]
 gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
           troglodytes]
 gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
           paniscus]
 gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           paniscus]
 gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
 gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
 gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
 gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
          Length = 594

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
 gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
          Length = 595

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 77/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L 
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  + ++A +L   EK++L   ++  A
Sbjct: 454 NQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
          Length = 805

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 348 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 396

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 397 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 453

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 454 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 513

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 514 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 572

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED + L 
Sbjct: 573 ELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLR 632

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 633 NHDLSVRAAMAVKLRLGEKEILERAVKSAA 662


>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
           gorilla gorilla]
          Length = 594

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKTVKSAA 483


>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Pongo abelii]
 gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Pongo abelii]
          Length = 595

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Nomascus leucogenys]
 gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Nomascus leucogenys]
          Length = 595

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
          Length = 595

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
 gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
 gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
          Length = 595

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 77/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L 
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  + ++A +L   EK++L   ++  A
Sbjct: 454 NQDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Callithrix jacchus]
          Length = 610

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 185 SFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY----- 232

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 233 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 290

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 291 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 350

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 351 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 409

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 410 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 469

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 470 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 498


>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
          Length = 597

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 172 SFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY----- 219

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 220 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 277

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 278 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 337

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 338 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 396

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 397 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 456

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 457 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 485


>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 455 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 66/220 (30%), Positives = 105/220 (47%), Gaps = 23/220 (10%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           + S W  YI  L RQ         S L W+  EL  YL  S  +   +ER   +   YN+
Sbjct: 155 QSSRWSNYISALPRQ-------PYSLLYWTRAELDKYLEASQIRERAIERINDVTGTYND 207

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L         +F ++P+  P E F  E FK +F  + S +V L   S+  + ALVP    
Sbjct: 208 LRL------RIFSKHPHLFPEEVFNMETFKWSFGILFSRLVRLP--SMDEKIALVPWAD- 258

Query: 127 LLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED-- 181
           +L +S + +  L   D + Q VV   DR Y+  E + +  G + N +LL++YGFV  +  
Sbjct: 259 MLNHSCEVETFLD-YDKSSQGVVFTTDRTYQPSEQVFISYGKKSNGELLLSYGFVPREGT 317

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           NP D++ +  +L   D  Y++K    +++G  + Q F + 
Sbjct: 318 NPNDKVELLLSLKKSDKCYKEKSEAMKKHGLSTSQCFPIQ 357


>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Oryza sativa Japonica Group]
 gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 146/343 (42%), Gaps = 56/343 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+
Sbjct: 141 LREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGV 189

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVS 113
           K EY +         S F+    +I +E         TF  F  AF  ++S V    +  
Sbjct: 190 K-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR-- 237

Query: 114 LARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV- 161
              + AL+P    L+ +S            K K +    D    L      K+GE I + 
Sbjct: 238 -GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQ 294

Query: 162 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           +   + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F + 
Sbjct: 295 YDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIV 354

Query: 222 AGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLAD 274
            G  +     MLPYLRL  +  T           +V   L    PVS   E A+   + +
Sbjct: 355 LG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRN 410

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
             K+ L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 411 ACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 453


>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
          Length = 649

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 58/344 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+
Sbjct: 295 LREAARGAGSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGV 343

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVS 113
           K EY +         S F+    +I +E         TF  F  AF  ++S V    +  
Sbjct: 344 K-EYVQ---------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR-- 391

Query: 114 LARRFALVPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV- 161
              + AL+P    L+ +S            K K +    D    L      K+GE I + 
Sbjct: 392 -GDKLALIPFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQ 448

Query: 162 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           +   + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F + 
Sbjct: 449 YDLDKSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIV 508

Query: 222 AGREKEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLA 273
            G   E++   MLPYLRL  +  T           +V   L    PVS   E A+   + 
Sbjct: 509 LG---ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAICQVIR 563

Query: 274 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           +  K+ L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 564 NACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 607


>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
          Length = 597

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 153/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 172 SFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 219

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 220 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 277

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 278 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 337

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 338 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 396

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L +
Sbjct: 397 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKN 456

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  + ++A +L   EK++L   ++  A
Sbjct: 457 QDLSVRAKMAIKLRLGEKEILEKAVKSAA 485


>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
           novemcinctus]
          Length = 589

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 152/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQSLPGE-------YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L +
Sbjct: 395 LKEHLLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  +  +A +L   EK++L   ++  A
Sbjct: 455 HDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
          Length = 507

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 138/296 (46%), Gaps = 17/296 (5%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + WSE EL  + G+      L   E ++ E+ +++    +  S    +P  +    + 
Sbjct: 153 STIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWA 212

Query: 92  FEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQL 147
           F I + +AF  ++      Q + L     L+   P +    Y+ + K A L + D    L
Sbjct: 213 FGILRSRAFSRLRG-----QNLVLIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSL 267

Query: 148 VVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
                 KAGE +++ +   + N++L ++YGF++     +   +   ++  DP + DK  +
Sbjct: 268 RTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDI 327

Query: 207 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVS 261
           A+ NG   +  F +  G+   A   MLPYLRL  +   D   ++S+  +   G +  PVS
Sbjct: 328 AESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVS 385

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
              E  +   + D  K+ L+GY  T+ EDE +  + NLHP+  +A  +   EKK+L
Sbjct: 386 RANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441


>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
          Length = 595

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 152/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  +  +A +L   EK++L   ++  A
Sbjct: 455 HDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 138/296 (46%), Gaps = 17/296 (5%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + WSE EL  + G+      L   E ++ E+ +++    +  S    +P  +    + 
Sbjct: 153 STIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPVTLDDFLWA 212

Query: 92  FEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA--YSSKCK-AMLAAVDDAVQL 147
           F I + +AF  ++      Q + L     L+   P +    Y+ + K A L + D    L
Sbjct: 213 FGILRSRAFSRLRG-----QNLVLIPLADLINHSPSITTEEYAWEIKGAGLFSRDQLFSL 267

Query: 148 VVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
                 KAGE +++ +   + N++L ++YGF++     +   +   ++  DP + DK  +
Sbjct: 268 RTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDI 327

Query: 207 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-CPVS 261
           A+ NG   +  F +  G+   A   MLPYLRL  +   D   ++S+  +   G +  PVS
Sbjct: 328 AESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVS 385

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
              E  +   + D  K+ L+GY  T+ EDE +  + NLHP+  +A  +   EKK+L
Sbjct: 386 RANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVL 441


>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
          Length = 459

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/354 (24%), Positives = 151/354 (42%), Gaps = 53/354 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E   G+ SFW PYI  L         +  + L ++  EL  L+GS    E L+    I
Sbjct: 127 ILELSAGESSFWHPYISVLPD-------SFNTVLYFNIEELELLSGSAVLDEALKLHRSI 179

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV---QSCVVHLQKVSL--- 114
            R+Y     + F    L +  P+    + FT+++++ A  AV   Q+ V   +   L   
Sbjct: 180 ARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLYRWAVSAVMTRQNAVPWTESDGLGGD 235

Query: 115 -------ARRFALVPL--------GPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYK 154
                  A   ALVPL        G  L  Y S     +C AM             R + 
Sbjct: 236 DVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSASMVRCYAM-------------RDFD 282

Query: 155 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 214
            GE + ++ G + N++  I+ GFV EDN YD + ++  ++ +DP +  K  + + +  LS
Sbjct: 283 KGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDIKLGVSKKDPLFAVKSKLCE-DHDLS 341

Query: 215 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 274
           +        R++    D+  +LR+  + D S+ ++   S   I   S    R  L  L  
Sbjct: 342 LSGTFALVARDRPVSEDLSTFLRILVLKDASQPEAF--SAEHILTSSDSNARDALTFLVV 399

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
             +  L  +P +  E E ++ D   + + ++A +L  +E K+L + L+   + +
Sbjct: 400 RIELLLKAFPKSDEEYEDIIKDGASNARVKMAARLRLLESKVLASVLETLGNHV 453


>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
          Length = 591

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/331 (24%), Positives = 147/331 (44%), Gaps = 30/331 (9%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R + AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+ +    
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTE 392

Query: 245 SEMQSVISSLGPI----------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
            E++  +   G I           PVS   E  +   L D     L  Y  T+ ED+A+L
Sbjct: 393 EELREHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVL 452

Query: 295 TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
               L  +  +A +L   EK++L   ++  A
Sbjct: 453 RSPALSARAAMAVKLRLGEKEILEKAVRSAA 483


>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
           guttata]
          Length = 593

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 153/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFWLPYI+ L  +        ++PL + E E+ +L  +    ++  + +   R+Y     
Sbjct: 170 SFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +S+  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISAQLLAFLRVFCMSEEE 394

Query: 246 EMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LAGYPATLSEDEAMLTD 296
             + +I   ++G I  +         D   +L  + +AR    L  Y  T+  D++ L  
Sbjct: 395 LKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVDKSFLET 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L P   +A +L   EK++L   ++  A
Sbjct: 455 HDLTPHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
           garnettii]
          Length = 595

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 152/329 (46%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQSLPSE-------YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +
Sbjct: 395 LKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  +  +A +L   EK++L   ++  A
Sbjct: 455 HDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
           carboxylase/oxygenase large subunit N-methyltransferase,
           chloroplastic-like [Cucumis sativus]
          Length = 497

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 105/220 (47%), Gaps = 23/220 (10%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNE 66
           K S W  YI  L RQ         S L W+  EL  YL  S  +   +ER   +   YN+
Sbjct: 155 KSSRWNNYISALPRQ-------PYSLLYWTREELDRYLEASEIRERAIERITNVVGTYND 207

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           L         +F ++P   P E F  E FK +F  + S +V L   S+  + ALVP    
Sbjct: 208 LSI------RVFSKHPELFPEEVFNIETFKWSFGILFSRLVRLP--SMDGKVALVPWAD- 258

Query: 127 LLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED-- 181
           +L ++ + +  L   D A Q VV   DR Y+ GE + +  G + N +LL++YGFV ++  
Sbjct: 259 MLNHNCEVETFLD-YDKASQGVVFTTDRAYQPGEQVFISYGKKSNGELLLSYGFVPKEGS 317

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           NP D + +  +L   D  Y++K    +++G  + Q F + 
Sbjct: 318 NPSDSVELLLSLKKSDKCYKEKLEALKKHGLRASQCFPIQ 357


>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
           boliviensis boliviensis]
          Length = 513

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 152/330 (46%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 87  NSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY---- 135

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 136 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 192

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 193 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 252

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 253 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 311

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L 
Sbjct: 312 ELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLK 371

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  + ++A +L   EK++L   ++  A
Sbjct: 372 NQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401


>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
 gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
          Length = 439

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 147/339 (43%), Gaps = 35/339 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  +K    S W  YIR L R        ++SPL WSE ELA L G+   + +    E +
Sbjct: 71  LLHEKAKPHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFL 123

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSL 114
           KREY+++ T       + +  P       +T E F  AF  ++S         +L  V L
Sbjct: 124 KREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPL 177

Query: 115 A----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           A      F L    P    +  K   + A  +           +  E ++ +   + N++
Sbjct: 178 ADFVNHGFGLTNEDP---GWKVKSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQ 234

Query: 171 LLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
           L  +YGFVD D  N  D   +   ++  +    DK  +AQ  G  S   F+++  R +  
Sbjct: 235 LATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGP 292

Query: 229 ISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGY 283
             DM+ YLRL   + SD+  ++++  +        P+S   E A+ + + +  +A L  Y
Sbjct: 293 PEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREY 352

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            +T+ ED  +L    L  +K++A  +   EK++L   LQ
Sbjct: 353 SSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391


>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
           porcellus]
          Length = 789

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWLPYIQTLPSE-------YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + ++         +LG    PVS   E  +   L D     L  Y  T+ ED+A+L 
Sbjct: 394 ELKEHLLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
              L  + ++A +L   EK++L   +Q  A
Sbjct: 454 GPELPTRMKMAVKLRLGEKEILERTVQSAA 483


>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
           africana]
          Length = 597

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ +L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+A L 
Sbjct: 394 ELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            ++L  +  +A +L   EK++L   ++  A
Sbjct: 454 GHDLSIRATMAVKLRLGEKEILERAVKSAA 483


>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
 gi|194699272|gb|ACF83720.1| unknown [Zea mays]
 gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 101/214 (47%), Gaps = 21/214 (9%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
           S W+ YI  L RQ         S L W+  EL AYL  SP +   ++R   +   YN+L 
Sbjct: 163 SRWISYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLR 215

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
                   +F ++P   P E +  E F  +F  + S +V L   S+  R ALVP    +L
Sbjct: 216 D------RIFSRHPDLFPEEVYNIETFLWSFGILFSRLVRLP--SMDGRVALVPWAD-ML 266

Query: 129 AYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPY 184
            +S + +  L        +    DR Y+ GE + +  G + + +LL++YGFV ++  NP 
Sbjct: 267 NHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPN 326

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           D + +  +L+  D  Y++K    +RNG    + F
Sbjct: 327 DSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360


>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
          Length = 585

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 147/329 (44%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y     
Sbjct: 159 SFWQPYIQTLPSE-------YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY----- 206

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 207 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 264

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 265 MCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 324

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------- 238
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+       
Sbjct: 325 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEE 383

Query: 239 --GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
              ++   + +  + +      PVS   E  +   L D     L  Y  T  ED++ L +
Sbjct: 384 LKDHLLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKN 443

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L  + R+A +L   EK++L   +   A
Sbjct: 444 HDLSVRARMAIKLRLGEKEILEKAVTSAA 472


>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
           ferrumequinum]
          Length = 594

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y   + ED++ L 
Sbjct: 394 ELKEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
          Length = 591

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 148/328 (45%), Gaps = 36/328 (10%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
            +F    + Q +P+   +P  +AFT+E ++   V   S  +    + L     +   G P
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKP 272

Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
            +      +      DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR
Sbjct: 273 QITTGYNLE------DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDR 326

Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 246
           + ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++   
Sbjct: 327 VKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEEEL 385

Query: 247 MQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
            + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+A L ++
Sbjct: 386 KEHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKNH 445

Query: 298 NLHPKKRVATQLVRMEKKMLNACLQVTA 325
           NL  +  +A +L   EK++L   ++  A
Sbjct: 446 NLSVRATMAIKLRLGEKEILEKAVKSAA 473


>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 389

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 102/208 (49%), Gaps = 23/208 (11%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
           S W  YI  L RQ         S L W+ TE+ AYL  SP +   + R   +   YN+L 
Sbjct: 74  SRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL- 125

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
                   +F ++P   P E +  E F+ +F  + S +V L   S+  + ALVP    +L
Sbjct: 126 -----RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWAD-ML 177

Query: 129 AYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NP 183
            ++ +  A L   D + Q +V   DR Y+ GE + +  G + + +LL++YGFV ++  NP
Sbjct: 178 NHNPEVDAFLD-FDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNP 236

Query: 184 YDRLVVEAALNTEDPQYQDKRMVAQRNG 211
            D +    +LN  D  Y++K    +R+G
Sbjct: 237 NDSVEFSVSLNKSDDCYREKLQALKRHG 264


>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 588

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  +AFT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLR 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
          Length = 589

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  +AFT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLR 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
 gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
          Length = 430

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 147/339 (43%), Gaps = 35/339 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  +K    S W  YIR L R        ++SPL WSE ELA L G+   + I    E +
Sbjct: 71  LLHEKAKPHSDWSAYIRVLPR-------TLDSPLFWSEEELAELKGTQLLSSINGFKEFL 123

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSL 114
           KREY+++ T       + +  P       +T E F  AF  ++S         +L  V L
Sbjct: 124 KREYDKVMT------EVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPL 177

Query: 115 A----RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           A      F L    P    +  K   + A  +           +  E ++ +   + N++
Sbjct: 178 ADFVNHGFGLTNEDP---YWHVKSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQ 234

Query: 171 LLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
           L  +YGFVD D  N  D   +   ++  +    DK  +AQ  G  S   F+++  R +  
Sbjct: 235 LATDYGFVDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGP 292

Query: 229 ISDMLPYLRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGY 283
             DM+ YLRL   + SD+  ++++  +        P+S   E A+ + + +  +A L  Y
Sbjct: 293 PEDMIAYLRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREY 352

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            +T+ ED  +L    L  +K++A  +   EK++L   LQ
Sbjct: 353 SSTIDEDTMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391


>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
 gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
          Length = 449

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 82/336 (24%), Positives = 146/336 (43%), Gaps = 26/336 (7%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++K  + SFW PYI  L           + PL +   E A L G      +  +A+ +
Sbjct: 99  LLQEKHREGSFWKPYIDSLPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSL 152

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + +Y           SL Q  P     E FT   F  A ++V S +  L+K     +  L
Sbjct: 153 REDYL----------SLCQNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TL 198

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VP+   +L +      +    +D    V+  +    AG+ +    G + N  +L+++GFV
Sbjct: 199 VPMAD-MLNHRRPPDVLWETTEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLLHFGFV 257

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
            +DN +D   +   +   DP    K+M+       + + F +              +LR+
Sbjct: 258 TDDNEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRI 317

Query: 239 GYV--SDTSEMQS-VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                +D  ++ S V+S    + P+S   E  VL+ LA   +ARL+ +P +L++DE +L 
Sbjct: 318 AAAVPNDIEDISSRVMSGERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLR 377

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
             +L P  R    + R EK+++   L++T   + LL
Sbjct: 378 GESLSPNARNCVLVRRAEKQLIEDYLEMTRVCLKLL 413


>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
 gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
          Length = 445

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 85/331 (25%), Positives = 140/331 (42%), Gaps = 30/331 (9%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EK +G  SFW P++  L       +     PL +SE E A + GS      LER   +
Sbjct: 94  LEEKHRGADSFWKPFVDSLP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEV 142

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           +R+  E +         + Q    +P  E F FE +  A +++ S +  L+        +
Sbjct: 143 QRQSFEQE---------YAQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGP--S 191

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           LVPL   +  +      + +  +D    +++  R   AG  I    G + +   L++ GF
Sbjct: 192 LVPLSD-MFNHRQPPDVLWSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFLLHSGF 250

Query: 178 V-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
           V D +   D + +   L   DP    K+ +       +   F V    +  A   +  +L
Sbjct: 251 VPDGNEENDEVYLSVGLPPGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWSVFSFL 310

Query: 237 RLGYVSDTSEM---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
           R+ + S    +     ++S    I PVS   E  VL  LA   + RL  +P TL EDE +
Sbjct: 311 RMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLEEDERL 370

Query: 294 LTDYNLHPKKRVATQLVRMEKKMLNACLQVT 324
           L +  L P +R    L R EK++L   L++T
Sbjct: 371 LREGPLSPNERSCVLLRRQEKRLLGDYLELT 401


>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 591

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  +AFT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
          Length = 585

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  +AFT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEE 393

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L 
Sbjct: 394 ELKEHLLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           +++L  +  +A +L   EK++L   ++  A
Sbjct: 454 NHDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
           caballus]
          Length = 609

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 77/330 (23%), Positives = 147/330 (44%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 183 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY---- 231

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 232 -AYFY--RVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 288

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 289 DMCNHTTGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 348

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------ 238
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+      
Sbjct: 349 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 407

Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
               ++   + +  + +      PVS   E  +   L D     L  Y  T+ ED A L 
Sbjct: 408 ELKDHLLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLK 467

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  +  +A +L   EK++L   ++  A
Sbjct: 468 NSDLSVRATMAIKLRLGEKEILEKAVKSAA 497


>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 62/214 (28%), Positives = 100/214 (46%), Gaps = 21/214 (9%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
           S W+ YI  L RQ         S L W+  EL AYL  SP +   ++R   +   YN+L 
Sbjct: 163 SRWISYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLR 215

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
                   +F ++P   P E +  E F  +F  + S +V L   S+  R  LVP    +L
Sbjct: 216 D------RIFSRHPDLFPEEVYNIETFLWSFGILFSRLVRLP--SMDGRVVLVPWAD-ML 266

Query: 129 AYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPY 184
            +S + +  L        +    DR Y+ GE + +  G + + +LL++YGFV ++  NP 
Sbjct: 267 NHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPN 326

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           D + +  +L+  D  Y++K    +RNG    + F
Sbjct: 327 DSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360


>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
          Length = 492

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 143/314 (45%), Gaps = 28/314 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L 
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLK 453

Query: 296 DYNLHPKKRVATQL 309
           + +L  + ++A +L
Sbjct: 454 NQDLSVRAKMAIKL 467


>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
           rubripes]
          Length = 598

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 147/330 (44%), Gaps = 38/330 (11%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFWLPYIR L ++        ++PL + + E+  L G+    ++L +     R+Y     
Sbjct: 170 SFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY----- 217

Query: 70  VWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALV 121
            +F    L Q +P    +P  ++FTF+ ++ A  +V +    +     ++V+LA    L+
Sbjct: 218 AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA----LI 271

Query: 122 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
           PL       +          DD  + V  + YK  E I ++ G + N++ +I+ GF  ++
Sbjct: 272 PLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQE 331

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 241
           N +D++ ++  ++  +  Y  K  V  R G     +F ++   E+   + +L +LR+ + 
Sbjct: 332 NAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPISAQLLAFLRV-FC 389

Query: 242 SDTSEMQ---------SVISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
               E++         + I +LG +  PVS   E  +   L       L  Y  T  ED 
Sbjct: 390 MKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEEDS 449

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
           + L    L P  R+A QL   EK +L   L
Sbjct: 450 STLEKSELSPHSRMAIQLRLAEKWILEKAL 479


>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
 gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
          Length = 467

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 101/213 (47%), Gaps = 20/213 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
           E   GK S W PYI  L R+ G       S LLW+  ++ A+L+ +  K   L+    ++
Sbjct: 121 EASLGKSSPWYPYIAALPRRPG-------SILLWTALDVEAHLSATSIKDRALQCVREVE 173

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
             +N+L+   FM          + P E F  E FK AF  + S +V L   SL ++ AL+
Sbjct: 174 DTFNDLNKQVFMKNR------EEFPPEVFNLESFKWAFGILFSRLVRLP--SLGQKLALI 225

Query: 122 PLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 178
           P G  +L + ++    L   +   ++   +DR Y++ + + +  G + N +LL+ YGFV 
Sbjct: 226 PFGD-MLNHDTEVTTFLDFDSGSKSITCTLDRGYESNKEVFISYGKRSNGELLVAYGFVP 284

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
              N  D + +   L+  D  Y+ K    + +G
Sbjct: 285 SGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317


>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
          Length = 502

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 94/193 (48%), Gaps = 16/193 (8%)

Query: 32  SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEA 89
           S L W+  EL AYL  SP +   ++R   +   YN+L D ++     LF       P E 
Sbjct: 177 SLLYWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEV 229

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQL 147
           +  E F+ +F  + S +V L   S+  R ALVP    +L +S + +  L        +  
Sbjct: 230 YNLETFRWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVF 286

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 205
             DR Y+ GE + +  G + + +LL++YGFV ++  NP D + +  +LN  D  Y++K  
Sbjct: 287 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQ 346

Query: 206 VAQRNGKLSVQVF 218
             +RNG    + F
Sbjct: 347 ALKRNGLSEFESF 359


>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
 gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
          Length = 463

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 98/242 (40%), Gaps = 47/242 (19%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++EK+ G+ S W PYI  L R        + S   W + ELA +  S    E + R   I
Sbjct: 114 LFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELAMIQESSLSYETMSRRAAI 168

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E+NE+  +       FQ+Y + +     ++  FK A+V    C     ++    + A+
Sbjct: 169 REEFNEMQPI-------FQRYEH-VFGGPVSYASFKHAYVTATVCS-RAWRIDGLEKLAM 219

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD---------------------AVQLVVDRPYKAGESI 159
           VP     + +     AML    D                       QL  D+ Y AGE +
Sbjct: 220 VPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALFCAQLFADKNYAAGEQV 278

Query: 160 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
            +  GP  N+ L +++GF    NP+D++ +   ++  D   ++K           +Q  H
Sbjct: 279 TISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK-----------LQYLH 327

Query: 220 VH 221
            H
Sbjct: 328 AH 329


>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
          Length = 502

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 94/193 (48%), Gaps = 16/193 (8%)

Query: 32  SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEA 89
           S L W+  EL AYL  SP +   ++R   +   YN+L D ++     LF       P E 
Sbjct: 177 SLLYWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEV 229

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQL 147
           +  E F+ +F  + S +V L   S+  R ALVP    +L +S + +  L        +  
Sbjct: 230 YNLETFRWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSPEVETFLDYDKSSGGIVF 286

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 205
             DR Y+ GE + +  G + + +LL++YGFV ++  NP D + +  +LN  D  Y++K  
Sbjct: 287 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQ 346

Query: 206 VAQRNGKLSVQVF 218
             +RNG    + F
Sbjct: 347 ALKRNGLSEFESF 359


>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
          Length = 593

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/334 (24%), Positives = 153/334 (45%), Gaps = 36/334 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +         +PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +S+ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMSEE 393

Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              + +I         +LG    PVS   E     +L  + +AR    L  Y  T+ +D+
Sbjct: 394 ELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVEDDK 449

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + L  ++L     +A +L   EK++L   ++  A
Sbjct: 450 SFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
           pulchellus]
          Length = 485

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/326 (23%), Positives = 140/326 (42%), Gaps = 23/326 (7%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           G+ SFW PYI  L         +  + L +S  EL  L GS    E L+    I R+Y+ 
Sbjct: 163 GESSFWHPYISTL-------PASFNTVLYFSVEELELLHGSTVLDEALKLHRSIARQYSY 215

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVVHLQKVSLARR------- 117
              + F    L +  PY    + FT+++++ A  AV +    V L   +           
Sbjct: 216 FHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMTRQNAVPLTDTAGGDDEDGTDAM 271

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            A+VPL         K         + ++    R ++ G+ + ++ G + N++  I+ GF
Sbjct: 272 TAMVPLWDMCNHSDGKVFTDYDISANMLRCYAMRDFEKGQEVTIFYGRRTNAEFFIHNGF 331

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
           V  +N +D + ++  ++ +DP Y  K  +   +      +F +   RE+    D+  +LR
Sbjct: 332 VFPENRHDSVDIKLGISKQDPLYAVKAKLCDDHELTPSGIFAL-VPRERPVCEDLSTFLR 390

Query: 238 LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
           +  + D S+  S       I   +    R  L+ L    +  L  +P +  E E ++ D 
Sbjct: 391 ILVLKDASQAASFTDE--HIMVATDDNAREALNFLIVRIQLLLRAFPKSDQEYENIIADE 448

Query: 298 NLHPKKRVATQLVRMEKKMLNACLQV 323
             + + ++A QL  +E+K+L A L+ 
Sbjct: 449 GSNGRLKMAAQLRLLERKILTAVLET 474


>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
           3-like [Meleagris gallopavo]
          Length = 593

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 153/334 (45%), Gaps = 36/334 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEE 393

Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              + +I         +LG    P+S   E     +L  + +AR    L  Y  T+ +D+
Sbjct: 394 ELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDK 449

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
             L  ++L     +A +L   EKK+L   ++  A
Sbjct: 450 LFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483


>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
 gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
           gallus]
 gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
          Length = 593

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 79/334 (23%), Positives = 154/334 (46%), Gaps = 36/334 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCMNEE 393

Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              + +I         +LG    P+S   E     +L  + +AR    L  Y  T+ +D+
Sbjct: 394 ELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVEDDK 449

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + L  ++L     +A +L   EK++L   ++  A
Sbjct: 450 SFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483


>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 507

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 102/208 (49%), Gaps = 23/208 (11%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELD 68
           S W  YI  L RQ         S L W+ TE+ AYL  SP +   + R   +   YN+L 
Sbjct: 166 SRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLR 218

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
                   +F ++P   P + +  E F+ +F  + S +V L+  S+  + ALVP    +L
Sbjct: 219 D------RIFSKHPDLFPEKVYNMENFRWSFGILFSRLVRLE--SMGGKVALVPWAD-ML 269

Query: 129 AYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NP 183
            +S +  A L   D + Q +V   DR Y+ GE + +  G + + +LL++YGFV ++  NP
Sbjct: 270 NHSPEVDAFLD-YDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNP 328

Query: 184 YDRLVVEAALNTEDPQYQDKRMVAQRNG 211
            D +    +L   D  Y++K    +++G
Sbjct: 329 NDSVEFLVSLKKSDECYKEKLQALKKHG 356


>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
 gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 134/317 (42%), Gaps = 24/317 (7%)

Query: 12  WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
           WLPYI+ L  +        ++PL + E E+ +L  +    ++L + +   R+Y      +
Sbjct: 172 WLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY-----AY 219

Query: 72  FMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLA 129
           F            +P  +AFTF+ ++ A  +V +    +     +R   AL+PL      
Sbjct: 220 FYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCNH 279

Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
            +          DD  + V  + YK GE I ++ G + N++ +I+ GF  EDN +DR+ +
Sbjct: 280 TNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKI 339

Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 240
           +  ++  +  Y  K  V  R G  +  +F +H   E    + +L +LR+          Y
Sbjct: 340 KLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRDY 398

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +     +  + +      PVS   E  +   L       L  Y     ED +ML   +L 
Sbjct: 399 LVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDLS 458

Query: 301 PKKRVATQLVRMEKKML 317
              R+A +L   EK++L
Sbjct: 459 LHSRIAIKLRLAEKEIL 475


>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 444

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 143/329 (43%), Gaps = 36/329 (10%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFWLPYIR L ++        ++PL + + ++  L G+    ++L +     R+Y     
Sbjct: 88  SFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY----- 135

Query: 70  VWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALV 121
            +F    L Q +P    +P  ++FTF+ ++ A  +V +    +     ++V+LA    L+
Sbjct: 136 AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA----LI 189

Query: 122 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
           PL       +          DD  + V  + YK  E I ++ G + N++ +I+ GF  ++
Sbjct: 190 PLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFYQE 249

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL--- 238
           N +D++ ++  ++  +  Y  K  V  R G     VF ++   E    + +L +LR+   
Sbjct: 250 NAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPISAQLLAFLRVFCM 308

Query: 239 ------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
                  Y+     +  +++      PVS   E  +   L       L  Y  T  ED +
Sbjct: 309 MEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALLLKAYKTTAEEDSS 368

Query: 293 MLTDYNLHPKKRVATQLVRMEKKMLNACL 321
            L   +L P  R+A QL   EK +L   L
Sbjct: 369 TLDKTDLSPHSRMAVQLRLAEKAILEKAL 397


>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 492

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 82/334 (24%), Positives = 148/334 (44%), Gaps = 38/334 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +   E +
Sbjct: 138 LREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYV 190

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E++ ++    + G     +P     +  TF+ F  AF  ++S V    +     + AL
Sbjct: 191 QSEFDNVEAK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLAL 240

Query: 121 VPLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPN 168
           +P    L+ +S+           + K  L   D    L      K+GE + V +   + N
Sbjct: 241 IPFAD-LINHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLDKSN 298

Query: 169 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
           ++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F V  G  +  
Sbjct: 299 AELALDYGFTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--ESL 356

Query: 229 ISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGY 283
              M+ YLRL  +  T    ++++  +   G +  PVS   E ++   +    K+ L  Y
Sbjct: 357 PPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALTAY 416

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
             T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 417 HTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVL 450


>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
 gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
          Length = 499

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 65/209 (31%), Positives = 102/209 (48%), Gaps = 25/209 (11%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL- 67
           S W  YI  L RQ         S L W+  EL AYL  SP +   ++R   +   YN+L 
Sbjct: 159 SRWSSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLR 211

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
           D ++     LF       P E +  E F  +F  + S +V L   S+  + ALVP    +
Sbjct: 212 DRIFSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWAD-M 261

Query: 128 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 182
           L +S + +  L   D + Q +V   DR Y+ GE + +  G + + +LL++YGFV ++  N
Sbjct: 262 LNHSPEVETFLD-FDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTN 320

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
           P D + +  +L+  D  Y++K    +RNG
Sbjct: 321 PNDSVELLVSLDKSDKCYKEKLQALKRNG 349


>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
          Length = 466

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 151/330 (45%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 41  NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 89

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  E+FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 90  -AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 146

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +
Sbjct: 147 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 206

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++ 
Sbjct: 207 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEE 265

Query: 245 SEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
              + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L 
Sbjct: 266 ELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLK 325

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  +  +A +L   EK++L   ++  A
Sbjct: 326 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
           latipes]
          Length = 606

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 77/321 (23%), Positives = 140/321 (43%), Gaps = 28/321 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYIR L ++        ++PL + + ++  L G+    ++L + +   R+Y     
Sbjct: 170 SFWSPYIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    L Q +P    +P  + F+F+ ++ A  +V +    +  V  +R   AL+PL  
Sbjct: 218 AYFY--KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + YK  E I ++ G + N++ +I+ GF  +DN +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL------- 238
           R+ ++  ++  +  Y  K  V  R G  +  VF +H   +    + +L +LR+       
Sbjct: 336 RVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVFCMTEEE 394

Query: 239 --GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
              Y+     +  + +      PVS   E  +   L       L  Y  T  ED ++L  
Sbjct: 395 LKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSILEK 454

Query: 297 YNLHPKKRVATQLVRMEKKML 317
            +L    R+A QL   EK++L
Sbjct: 455 PDLSLHTRLAVQLRLAEKQIL 475


>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
          Length = 597

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y     
Sbjct: 173 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 220

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  E+FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 221 AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 278

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +D
Sbjct: 279 MCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHD 338

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++  
Sbjct: 339 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEE 397

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +
Sbjct: 398 LKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 457

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  +  +A +L   EK++L   ++  A
Sbjct: 458 PDLSVRATMAIKLRLGEKEILEKAVKSAA 486


>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
 gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=Endothelial differentiation inhibitory protein D10;
           AltName: Full=SET domain-containing protein 3
 gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
 gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
 gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
 gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
 gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
 gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
          Length = 594

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  E+FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  +  +A +L   EK++L   ++  A
Sbjct: 455 PDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 74/317 (23%), Positives = 133/317 (41%), Gaps = 24/317 (7%)

Query: 12  WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
           WLPYI+ L  +        ++PL + E E+ +L  +    ++L + +   R+Y      +
Sbjct: 172 WLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY-----AY 219

Query: 72  FMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLA 129
           F            +P  +AFTF+ ++ A  +V +    +     +R   AL+PL      
Sbjct: 220 FYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCNH 279

Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
            +          DD  + V  + YK GE I ++ G + N++ +I+ GF  EDN +DR+ +
Sbjct: 280 TNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNAHDRVKI 339

Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 240
           +  ++  +  Y  K  V  R G  +  +F +H   E    + +L +LR+          Y
Sbjct: 340 KLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTEEELRDY 398

Query: 241 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
           +     +  + +      PVS   E  +   L       L  Y     ED +ML   +L 
Sbjct: 399 LVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSMLEKPDLS 458

Query: 301 PKKRVATQLVRMEKKML 317
              R+  +L   EK++L
Sbjct: 459 LHSRITIKLRLAEKEIL 475


>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
 gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
          Length = 505

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 132/318 (41%), Gaps = 56/318 (17%)

Query: 5   KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 63
           + G    +  YIR L R+ G       S L W E E+   L GSP++    ER E +   
Sbjct: 150 ESGNAGTFGEYIRALPRRTG-------SVLDWPEDEVETLLKGSPSRLAAAERQESVNAA 202

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
             E+ +           +P DI   A  +     AF  + S ++ L   ++    ALVP 
Sbjct: 203 IAEIRS----------SFP-DITEGALRW-----AFDILFSRLIRLD--AMGGELALVPW 244

Query: 124 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 181
              +L +   C A +     AV L  DR Y AGE +    G +P+S+LLI+YGF  E  +
Sbjct: 245 AD-MLNHKPGCAAFIDLNGSAVNLTTDRAYAAGEQVWASYGQRPSSELLISYGFAPEVGE 303

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH-AGREKEAI----------- 229
           NP D   +   ++  DP  Q K  V +R G   V+ F +   G  ++ +           
Sbjct: 304 NPDDEYSLTLGVDVNDPYAQAKADVLRRMGLSPVETFPLRLNGYPRQLLQYASFILCNPD 363

Query: 230 --SDMLPYLRLGYVSDTSEMQSVISSL------------GPICPVSPCMERAVLDQLADY 275
             S++    R  +    +  QS+  S+            G I    P  E AV + LAD 
Sbjct: 364 KPSELEGLARTAFTGSANFGQSIFDSVRGLAQGQARGKQGVILGGVPG-EIAVREMLADM 422

Query: 276 FKARLAGYPATLSEDEAM 293
               L+ YP +L +D+ +
Sbjct: 423 CAEALSAYPNSLEKDKGI 440


>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
           griseus]
 gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
          Length = 577

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 76/329 (23%), Positives = 148/329 (44%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT- 244
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 245 --------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
                   S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  +  +A +L   EK++L   ++  A
Sbjct: 455 PDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
 gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
          Length = 594

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 151/329 (45%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y     
Sbjct: 170 SFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P+   +P  E+FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAESVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +
Sbjct: 395 LKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKN 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L  +  +A +L   EK++L   ++  A
Sbjct: 455 PDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
 gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
          Length = 467

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 99/213 (46%), Gaps = 20/213 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
           E   GK S W PYI  L R+ G       S LLW+  ++  +L+ +  K   L+    ++
Sbjct: 121 EASLGKSSPWYPYIAALPRRPG-------SILLWTALDVETHLSATSIKDRALQCVREVE 173

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
             +N+L+   FM          + P E F  + FK AF  + S +V L   SL ++ AL+
Sbjct: 174 DTFNDLNKQVFMKNR------EEFPPEVFNLKSFKWAFGILFSRLVRLP--SLGQKLALI 225

Query: 122 PLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 178
           P G  +L + ++    L   +   ++   +DR Y++   + +  G + N +LL+ YGFV 
Sbjct: 226 PFGD-MLNHDTEVTTFLDFDSGSKSITCTLDRGYESNREVFISYGKRSNGELLVAYGFVP 284

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
              N  D + +   L+  D  Y+ K    + +G
Sbjct: 285 SGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317


>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
          Length = 450

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/330 (22%), Positives = 148/330 (44%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 23  NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 71

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 72  -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 128

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +
Sbjct: 129 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 188

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 189 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 247

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L 
Sbjct: 248 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLK 307

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  +  +A +L   EK++L   ++  A
Sbjct: 308 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 337


>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
          Length = 468

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/330 (22%), Positives = 148/330 (44%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 41  NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 89

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 90  -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 146

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +
Sbjct: 147 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 206

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 207 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 265

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L 
Sbjct: 266 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLK 325

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  +  +A +L   EK++L   ++  A
Sbjct: 326 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/330 (22%), Positives = 148/330 (44%), Gaps = 28/330 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEE 393

Query: 245 ---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 295
                    S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L 
Sbjct: 394 ELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLK 453

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           + +L  +  +A +L   EK++L   ++  A
Sbjct: 454 NPDLSVRATMAIKLRLGEKEILEKAVKSAA 483


>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
 gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
          Length = 540

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 130/314 (41%), Gaps = 32/314 (10%)

Query: 36  WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 95
           W   +   L GSP     +    G  R+Y      +F    LF   P  + T  FT+E F
Sbjct: 185 WGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY--QLFDNNPGILQTSNFTYEAF 237

Query: 96  KQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPY 153
             A   VQS       V   +  AL+P        ++  K    +  V   +     + Y
Sbjct: 238 IWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGGKITTFIDPVKHVLTCSAAKSY 294

Query: 154 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ--YQDK-RMVAQRN 210
           K GE + ++ GP+PNS+  +  GF  + N  D    +  L+ ED +    DK  ++ +R 
Sbjct: 295 KKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERC 354

Query: 211 GKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVS--DTSEM-----------QSVISSLGP 256
           G    Q   +      E + ++++P+ R+  +S  +T ++                 + P
Sbjct: 355 GLRVGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKP 414

Query: 257 --ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 314
                +S   E+     L D  KARL+GYP TL++DE  + + N   ++R    ++  EK
Sbjct: 415 EAFNIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQEMKN-NPTTQRRYVLYILINEK 473

Query: 315 KMLNACLQVTADMI 328
           K+L   ++    +I
Sbjct: 474 KILERNIKYVEQLI 487


>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
           harrisii]
          Length = 602

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/333 (23%), Positives = 151/333 (45%), Gaps = 36/333 (10%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFWLPYI+ L  +        ++PL + E E+ +L  +    ++  + +   R+Y     
Sbjct: 177 SFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY----- 224

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P    +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 225 AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 282

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + +  GE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 283 MCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 342

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 343 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 401

Query: 246 EMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEA 292
             + +I         +LG    PVS   E     +L  + +AR    L  Y  T+ ED++
Sbjct: 402 LKEHLIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKS 457

Query: 293 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            L  ++L     +A +L   EK++L   ++  A
Sbjct: 458 FLATHDLTFHATMAIKLRLGEKEILEKAVKSAA 490


>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
 gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
          Length = 520

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 84/340 (24%), Positives = 145/340 (42%), Gaps = 39/340 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIK 61
           E+ +G  S W PY++ L       +   ++PL W++  + A L GSP  A+ +ER++  +
Sbjct: 132 ERIKGGASEWAPYVKTL-------RANPDAPLFWTDAKDFALLKGSPVAADAIERSKSAR 184

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
            EY  +  V        +  P   P EA+ F    +   A+ +       +  A+ +ALV
Sbjct: 185 TEYASITEV-------IKSDPSSYPPEAYEFLTEARFVDALATVCAKATWLPTAQCYALV 237

Query: 122 PL----------GPPLLAYSS-----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP- 165
           PL           P +L  S+     +C      VD A  ++      A  S V+     
Sbjct: 238 PLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDVDTASVVLRCATKAAANSEVIQLDAL 297

Query: 166 -QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAG 223
            + N +L +N G+VD+ +P D + ++  + T D  +  K+ V +  G  +  Q F V+  
Sbjct: 298 QRNNGELFLNTGYVDQKHPGDYIYMKTDIQTSDRLFTAKKQVLEGMGFTAADQYFPVYKD 357

Query: 224 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
           R     + +  YLR   V D  EM +V      I  VS   E  +L  L    +  +A Y
Sbjct: 358 RMP---TQLYSYLRFSRVQDPGEMMAVSFEEDKI--VSVMNEYEILQILMGDCRELMAEY 412

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 322
                ++  +L   +  P + +    +RM EKK++ A + 
Sbjct: 413 DTNEEDELNLLKLSDQMPVREIEAAKLRMSEKKLIGATMN 452


>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Monodelphis domestica]
          Length = 595

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 147/329 (44%), Gaps = 28/329 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFWLPYI+ L  +        ++PL + E E+ +L  +    ++  + +   R+Y     
Sbjct: 170 SFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY----- 217

Query: 70  VWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGP 125
            +F    + Q +P    +P  ++FT+E ++ A  +V +    +     +R   AL+PL  
Sbjct: 218 AYFY--KVIQTHPNANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWD 275

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
                +          DD  + V  + +  GE I ++ G + N++ +I+ GF  ++N +D
Sbjct: 276 MCNHTNGLITTGYNLEDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHD 335

Query: 186 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
           R+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++  
Sbjct: 336 RVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEE 394

Query: 246 EMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             + +I         +LG    PVS   E  +   L       L  Y  T+ ED++ L  
Sbjct: 395 LKEHLIGEHAIDRIFTLGNSEFPVSWDNEVKLWTFLEARASLLLKTYKTTIEEDKSFLAT 454

Query: 297 YNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           ++L     +A +L   EK++L   ++  A
Sbjct: 455 HDLTFHATMAIKLRLGEKEILEKAVKSAA 483


>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
          Length = 592

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 151/334 (45%), Gaps = 36/334 (10%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFWLPYI+ L  +         + L + E E+ YL  +    +I  + +   R+Y    
Sbjct: 169 NSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY---- 217

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
             +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+PL 
Sbjct: 218 -AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++ 
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTED 393

Query: 245 SEMQSVIS--------SLGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
              + +I         +LG    PVS   E     +L  + +AR    L  Y  T+ +D+
Sbjct: 394 ELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIHDDK 449

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +L   +L     +A +L   EK++L   ++  A
Sbjct: 450 FILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483


>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
           methyltransferase; Short=rbcMT; Flags: Precursor
 gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
 gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
          Length = 491

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 71/301 (23%), Positives = 136/301 (45%), Gaps = 20/301 (6%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           + +S + WSE EL+ + G+   +  +   + ++ E+ +++    +     Q +P+ I  +
Sbjct: 156 STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVEEEVILRNK--QLFPFPITLD 213

Query: 89  AF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 144
            F   F I + +AF  +++  ++ +    L    A V       A+  +  A L + D  
Sbjct: 214 DFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTEDH--AHEVRGPAGLFSWDLL 271

Query: 145 VQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
             L      KAG+ + + +   + N+ + ++YGF++  +  D   +   ++  D  Y DK
Sbjct: 272 FSLRSPLKLKAGDQLFIQYDLNKSNADMALDYGFIEPSSARDAFTLTLEISESDEFYGDK 331

Query: 204 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGP 256
             +A+ NG      F +  G+       M+PYLRL  +  T           SV   LG 
Sbjct: 332 LDIAETNGIGETAYFDIKIGQSLPPT--MIPYLRLVALGGTDAFLLESIFRNSVWGHLG- 388

Query: 257 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
             PVS   E  +   + D  K+ L+GY  T+ EDE ++ + NL  + ++A  +   EK++
Sbjct: 389 -LPVSRANEELICKVVRDACKSALSGYHTTIEEDEKLMEEGNLSTRLQIAVGIRLGEKRV 447

Query: 317 L 317
           L
Sbjct: 448 L 448


>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 482

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 142/313 (45%), Gaps = 27/313 (8%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           + +S + WSE ELA L G+   +  L   E ++ E+ +L+    +          D+ + 
Sbjct: 148 STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNK-------DLFSS 200

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------GPPLLA--YSSKCK-AMLA 139
             T + F  AF  ++S      +    +   L+PL       P +    Y+ + K A L 
Sbjct: 201 RITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLINHNPAITTEDYAYEIKGAGLF 257

Query: 140 AVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
           + D    L      KAGE + + +   + N++L ++YGFV+ +   +   +   +   DP
Sbjct: 258 SRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPNRNSYTLTIEIPESDP 317

Query: 199 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSL-- 254
            + DK  +A+ N       F V  G+   A   ML YLRL  +  SD   ++S+ ++   
Sbjct: 318 FFGDKLDIAETNKMGETGYFDVVDGQTLPA--GMLQYLRLVALGGSDAFLLESIFNNTIW 375

Query: 255 GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 313
           G +  PVS   E  +   + D  K+ L+G+  T+ EDE +L +  L P+  +A ++   E
Sbjct: 376 GHLELPVSRSNEELICRVVRDACKSALSGFSTTIEEDEKLLEEGKLDPRLEMALKIRIGE 435

Query: 314 KKMLNACLQVTAD 326
           K++L    Q+  D
Sbjct: 436 KRVLQQIDQIFKD 448


>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
 gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
          Length = 500

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 84/331 (25%), Positives = 144/331 (43%), Gaps = 34/331 (10%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K    SFW PYI               +PL + E E+ +L GS   +++L + + I R+
Sbjct: 177 EKYSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQ 229

Query: 64  YNELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV--SLARRF 118
           Y      +F    LFQ  P    +P  E FTF+ ++ A   V + +    +V  S  R  
Sbjct: 230 Y-----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWA---VSTVMTRQNQVPTSDGRHL 279

Query: 119 --ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
             AL+P+       + +         D+ + +  R +     + ++ G + N++ LI+ G
Sbjct: 280 ITALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAEFLIHNG 339

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
           FV  +N +DR+ V+  ++  D  +  K  V  R G  +   F VH G++     ++L +L
Sbjct: 340 FVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPPELLVFL 398

Query: 237 RL-----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATL 287
           R+     G + D  TSE QS  +S LG   C V+   E      L       +  Y  ++
Sbjct: 399 RVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIRSYRTSI 458

Query: 288 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 318
            + E  L   ++    R A QL   E ++L+
Sbjct: 459 KDVETELQAPDMTYHSRAALQLKLAEMQILS 489


>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Strongylocentrotus purpuratus]
          Length = 682

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/325 (25%), Positives = 138/325 (42%), Gaps = 26/325 (8%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PY+  L         +   PL ++  E+    GS    E L++ + I R+Y  L  
Sbjct: 171 SFWKPYLDVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL-- 221

Query: 70  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLL 128
             F   +L +     I  E FT++ ++ A   V +    +  K       +L+PL     
Sbjct: 222 --FKLLNLPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCN 278

Query: 129 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
             + + K       D+   +  R +  GE I +  G + ++ LL+  GFV   N YD + 
Sbjct: 279 HANGEMKTDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLYSGFVYPGNVYDGMA 338

Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSD--- 243
           ++  L++ D  Y  K  +     KL V  Q +H+ AG+E   + ++L +LR+  + D   
Sbjct: 339 IQLGLSSSDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQDLEL 396

Query: 244 ------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 297
                  +  Q++ S +     +S   E      LA         Y  ++ EDE  L D 
Sbjct: 397 RDRLLGDNRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEKLKDG 456

Query: 298 NLHPKKRVATQLVRMEKKMLNACLQ 322
           NL  ++R A QL+ +EK  L   L+
Sbjct: 457 NLSAQERSALQLLLIEKCTLENVLE 481


>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 96/213 (45%), Gaps = 19/213 (8%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  K  GK S W PYIR L   RG     +++ + W + EL  L  S    +   R   I
Sbjct: 113 LLHKHAGKASPWEPYIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLI 167

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
             +++ +  V         +YP ++  E  T E FK A+    S    ++ +       +
Sbjct: 168 SNQFDLVQAV-------VNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITM 216

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VP    +  + S  +A+LA  ++    ++V D+ Y  G  +V+  G  PNS L +++GF 
Sbjct: 217 VPF-VDMFNHDSSARALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLPNSSLALDFGFT 275

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
             DNP+D + +     + DP   +K  + + +G
Sbjct: 276 LPDNPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308


>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 101/224 (45%), Gaps = 25/224 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E+  G+ S W PYI  L         A+ S + W + EL  +  +    E ++R   I
Sbjct: 52  LVEQHLGQASQWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVI 106

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
             E+  +  V        Q+ P+ I  E      FKQA+   +S      + S   R   
Sbjct: 107 GSEFASVLPV-------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILT 153

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VP       + S C+A+L+  ++    +++ D+ Y  GE +V+  G  PN+ L +++GF 
Sbjct: 154 VPF-VDFFNHDSNCRALLSYDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFT 212

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 222
              NPYD++ V  AL+  DP  + K  +   +G  +V    VHA
Sbjct: 213 ISCNPYDQVEVWMALSHRDPLRKMKLALLHAHGMPTV----VHA 252


>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
 gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
          Length = 529

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/346 (26%), Positives = 140/346 (40%), Gaps = 48/346 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
           E+ +G  S W PY+  L           + PLLW+  EL   L GSP + + + R E   
Sbjct: 130 ERHKGPLSEWAPYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESAD 183

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFAL 120
            EY  +           +  P D P +A+ F + + AFV A+ + +     ++ A  +A+
Sbjct: 184 DEYASI-------ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAM 235

Query: 121 VPL-------------------------GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 155
           VPL                         G P LA ++      AA +    +  +   + 
Sbjct: 236 VPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQT 295

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLS 214
              + V    +    L +  G VDE +  D L   A+    D  Y+ KR + +  G    
Sbjct: 296 ARVVCVDPLARNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSAD 355

Query: 215 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 274
            Q F V A R       +L Y+R   V D  E+ SV      I  VSP  E  VL  L  
Sbjct: 356 GQTFPVFADRMP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQ 410

Query: 275 YFKARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 319
             +  LA Y ++  E E + L +  L  ++RVA +L   EK+++NA
Sbjct: 411 DAREMLAEYESSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456


>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
          Length = 394

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 107/234 (45%), Gaps = 19/234 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y    
Sbjct: 169 NSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFY 221

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
            V        Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 222 RV-------IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 274

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  R + AGE I ++ G + N++ +I+ GF  ++N +
Sbjct: 275 DMCNHTNGLITTGYNLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSH 334

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+
Sbjct: 335 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387


>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
          Length = 581

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/365 (23%), Positives = 151/365 (41%), Gaps = 64/365 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++K    SFW PY+  L       Q     P+ ++  +L +L GSP   +I ++   +
Sbjct: 219 LLQEKSRPNSFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDL 272

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K++YN++  V       F QY         +F  F  A +   S +  +  +   +  A 
Sbjct: 273 KKDYNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAF 318

Query: 121 VPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
           VPL        P L +  YS + +  +   D+ +    DR    G+ I    G + NS+ 
Sbjct: 319 VPLADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRKCNSRF 370

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQRNGKLSVQV 217
           L+NYGFV +DN  + + V  A    DP  Q K              R++   +G   +  
Sbjct: 371 LLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGINEITH 430

Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCMERAVLD 270
           F       +  + + + Y+R   + D +++Q +++      + P  I P+    E  + D
Sbjct: 431 FL------EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNELDMWD 484

Query: 271 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIML 330
            +       L+ YP TL +D+ +L   +L   +R    L   EK++L    Q +  M  L
Sbjct: 485 LIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSEKMKQL 544

Query: 331 LPDVT 335
           L +  
Sbjct: 545 LSNFN 549


>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
 gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
          Length = 389

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 97/206 (47%), Gaps = 20/206 (9%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  +K+ + S W PYI  L         ++ S + W++ EL YL  SP   E ++R + +
Sbjct: 52  LAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVV 106

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E+   +    +        P+   +     E FK A+  V S    ++ +   +  AL
Sbjct: 107 RMEFAAAENALLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLAL 155

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VP       + + C+AML+  +D    ++V DR Y  G+ +V+  G   N+ L +++GF 
Sbjct: 156 VPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFA 214

Query: 179 DEDNPYDRLV-VEAALNTEDPQYQDK 203
              NP+D++  +  +L+ +DP    K
Sbjct: 215 LPFNPHDQVAGIWLSLSEKDPLRDSK 240


>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
 gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase, chloroplastic; AltName:
           Full=Aldolases N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase-like; Short=AtLSMT-L;
           Short=LSMT-like enzyme; Flags: Precursor
 gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
          Length = 482

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 139/314 (44%), Gaps = 29/314 (9%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           + +S + WSE ELA L G+   +  L   E ++ E+ +L+    +          D+ + 
Sbjct: 148 STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNK-------DLFSS 200

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK----------AML 138
             T + F  AF  ++S      +    +   L+PL   L+ ++   K          A L
Sbjct: 201 RITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLAD-LINHNPAIKTEDYAYEIKGAGL 256

Query: 139 AAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 197
            + D    L      KAGE + + +   + N++L ++YGFV+ +   +   +   +   D
Sbjct: 257 FSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESD 316

Query: 198 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL- 254
           P + DK  +A+ N       F +  G+   A   ML YLRL  +   D   ++S+ ++  
Sbjct: 317 PFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIFNNTI 374

Query: 255 -GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
            G +  PVS   E  +   + D  K+ L+G+  T+ EDE +L    L P+  +A ++   
Sbjct: 375 WGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIG 434

Query: 313 EKKMLNACLQVTAD 326
           EK++L    Q+  D
Sbjct: 435 EKRVLQQIDQIFKD 448


>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
           variabilis]
          Length = 398

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 148/340 (43%), Gaps = 34/340 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++E+  G  S W PYI  L    G       SP+ W E +LA L GS     +       
Sbjct: 50  LHERAMGNASRWAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYF 102

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSL 114
           ++ +++L    F   S      +D P   F F+ F  A   V++         ++  V L
Sbjct: 103 QQRFDQLQAEVFGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPL 156

Query: 115 ARRFALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWCGPQ-PNSK 170
           A      P  PP  A +  K    L        LV++      AG++I +  GPQ  + +
Sbjct: 157 ADMVRSQPSWPPDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQ 216

Query: 171 LLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKE 227
           LL+++G +D   + P   L +E  L+ ED  Y DK  + + N +L+    H+  A R  +
Sbjct: 217 LLVDHGVIDPLVNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPD 273

Query: 228 AISDMLPYLRLGYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYP 284
           A    +  L     +D   ++S+  +        PVS   ER    QL D   A LA YP
Sbjct: 274 AGLLPVLRLLNLSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYP 333

Query: 285 ATLSEDEAMLTDYNLHPKKRVATQL-VRM-EKKMLNACLQ 322
            ++ ED A++   +L P  R  + + VR+ EK+ L+A L+
Sbjct: 334 TSIDEDLALMASGSLQPGSRRQSAVRVRLGEKEALDATLR 373


>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
 gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
          Length = 371

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 89/201 (44%), Gaps = 35/201 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEG 59
           + EK + ++S W PY+R L       QL  +ES + W + ELA+L  SPT  E +E  + 
Sbjct: 56  LSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKI 109

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           IK E++ L+   F        +  D+  E  +   F  A+   Q                
Sbjct: 110 IKSEFHVLEANVF-------PWCRDVLGEV-SLTDFMHAYSTDQ---------------- 145

Query: 120 LVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            +P       +   C+  L+     D    V D+ YKAG+ I +  G  PNS L ++YGF
Sbjct: 146 -IPFAD-FFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGF 203

Query: 178 VDEDNPYDRLVVEAALNTEDP 198
               NP++++ V   ++  DP
Sbjct: 204 AVASNPHEQVEVPMGVSLTDP 224


>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 148/344 (43%), Gaps = 49/344 (14%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 67
            S W PY+  L +       +++SPL WS+ ELA L G+     +    E ++ EYN L 
Sbjct: 127 SSKWRPYLDSLPK-------SLDSPLFWSDEELAELVGTQLLGSVTGYLEFLENEYNNLV 179

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---- 123
           + V      +F    Y       TF+ FK AF  ++S              ALVP+    
Sbjct: 180 EEVLEPNNKIFNPAVY-------TFDGFKWAFGILRSRTFSPLT---GEDIALVPIADLV 229

Query: 124 --------GPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP-QPNSKLL 172
                   G P  +   +S+   +     D + +     + AGE +++  G  + N+ L 
Sbjct: 230 NHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSAGEQVLMQYGATKSNADLA 289

Query: 173 INYGFVDEDNPY--------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
           ++YGFV+ D           D L +   ++ +D    DK  + + NG      F +  G+
Sbjct: 290 LDYGFVERDRGSQFSPGIERDSLALSLEISPDDRFVDDKADILEINGFQCSMQFDLSRGQ 349

Query: 225 EKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKAR 279
                 +M+ +LRL  +S  D+  ++++    + G +  PVS   E A+   + +  KA 
Sbjct: 350 GPS--DEMITFLRLSALSGPDSFLLEALFRNEAWGHVSLPVSRDNEEALCTSMLEGLKAA 407

Query: 280 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 323
           L GY  T+ +D  +L   +L  +  +A  +VR+ +K +   LQ 
Sbjct: 408 LDGYSTTVEQDMELLARGDLSTRMEIAV-VVRLGEKRVMQELQT 450


>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
          Length = 474

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 76/308 (24%), Positives = 138/308 (44%), Gaps = 25/308 (8%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           + +S + WSE ELA L G+   +  L   E ++ E+ +L+    +          D+ + 
Sbjct: 148 STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLEQEILLPNK-------DLFSS 200

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL----LAYSSKCKAMLAAVDDA 144
             T + F  AF      +++ + ++    F  +   P +     AY  K  A L + D  
Sbjct: 201 RITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIKTEDYAYEIKG-AGLFSRDLL 254

Query: 145 VQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
             L      KAGE + + +   + N++L ++YGFV+ +   +   +   +   DP + DK
Sbjct: 255 FSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDK 314

Query: 204 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL--GPI-C 258
             +A+ N       F +  G+   A   ML YLRL  +   D   ++S+ ++   G +  
Sbjct: 315 LDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLEL 372

Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 318
           PVS   E  +   + D  K+ L+G+  T+ EDE +L    L P+  +A ++   EK++L 
Sbjct: 373 PVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQ 432

Query: 319 ACLQVTAD 326
              Q+  D
Sbjct: 433 QIDQIFKD 440


>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 143/335 (42%), Gaps = 40/335 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +   E +
Sbjct: 95  LREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYV 147

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E++ ++     AG +      D+     TF+ F  AF  ++S V    +     + AL
Sbjct: 148 QSEFDNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLAL 197

Query: 121 VPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 169
           +P    +            +  K K  L   D    L      K+GE I V +   + N+
Sbjct: 198 IPFADLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNA 256

Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 229
           +L ++YGF + ++  D   +   ++  DP Y+DK  +A+ NG      F V  G  +   
Sbjct: 257 ELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLP 314

Query: 230 SDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
             M+ YLRL  +  T            V   L    PVS   E ++   + +  K+ LA 
Sbjct: 315 PQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAA 372

Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           Y  T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 373 YHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407


>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 453

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 143/335 (42%), Gaps = 40/335 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +   E +
Sbjct: 99  LREAARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYV 151

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E++ ++     AG +      D+     TF+ F  AF  ++S V    +     + AL
Sbjct: 152 QSEFDNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLAL 201

Query: 121 VPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 169
           +P    +            +  K K  L   D    L      K+GE I V +   + N+
Sbjct: 202 IPFADLINHDGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNA 260

Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 229
           +L ++YGF + ++  D   +   ++  DP Y+DK  +A+ NG      F V  G  +   
Sbjct: 261 ELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLP 318

Query: 230 SDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
             M+ YLRL  +  T            V   L    PVS   E ++   + +  K+ LA 
Sbjct: 319 PQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAA 376

Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           Y  T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 377 YHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411


>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 137/339 (40%), Gaps = 43/339 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIK 61
           E+ +G  S W PY++ L           ++PL W+E E  A L GSP   + +ER+   +
Sbjct: 131 ERIKGGASDWAPYVKTL-------AANPDAPLFWTEAEDFALLKGSPIVNDAVERSRSAR 183

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKVSLARRF 118
            EY  +  V        +  P   P EA+ F   E F  A   V +    L   S    +
Sbjct: 184 EEYAAIVEV-------IKGDPTAFPAEAYEFFTEERFVDALATVCAKATWLPTASC---Y 233

Query: 119 ALVPL-------GPPLLAY---SSKCKAMLAAVD---DAVQLVVDRPYKA-GESIVVWCG 164
           ALVPL       G P+      S+K      A D   D+  +V+    KA   S VV   
Sbjct: 234 ALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAVVKAPANSRVVQLD 293

Query: 165 P--QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVH 221
           P  + N +L +N G VD+ +P D L +   +   D  +  K+ V +  G     Q F V+
Sbjct: 294 PLQRNNGELFLNTGRVDQKHPGDYLYMRTEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY 353

Query: 222 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
              E    + +  YLR   V D  EM +V      I  VS   E  +L  L    +  ++
Sbjct: 354 ---EDRMPTQLYSYLRFARVQDPGEMMAVSFEEDKI--VSVMNEYEILQLLMGDCRELMS 408

Query: 282 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 320
            Y     ++  +L   +    + +    +RM +K L  C
Sbjct: 409 EYDTNEEDELNLLKLSDTMRVREIEAAKLRMSEKKLIGC 447


>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           terrestris]
          Length = 484

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 84/337 (24%), Positives = 143/337 (42%), Gaps = 31/337 (9%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K  + S W PY+  L            + L  +  ++  L GSPT    L++   I R+
Sbjct: 165 EKHKEYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQ 217

Query: 64  YNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
           Y      +F    LFQ+    +     + FT+E +  A   V  +  ++  +  SL    
Sbjct: 218 Y-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH- 269

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P+        SK      A  +  +    R +K  E I +  GP+ NS   ++ GFV
Sbjct: 270 ALIPMWDMCNHEDSKITTDFNATLNCCECYALRDFKKAEQIFISYGPRTNSDFFVHSGFV 329

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
             DN  D   +   ++  DP ++++  +  +    +V  F +  G   E ISD +L +LR
Sbjct: 330 YMDNEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLR 387

Query: 238 LGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDE 291
           + +     E+   I S  +  +  +   +E  V + +  +   RL    A YP TL ED 
Sbjct: 388 V-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDL 446

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
            +L +  L   K++A QL   EK++L   L+     I
Sbjct: 447 QLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482


>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 524

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 78/317 (24%), Positives = 124/317 (39%), Gaps = 54/317 (17%)

Query: 5   KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 63
           + G    +  YIR L R+ G       S L W E E+   L GSP++    ER + +   
Sbjct: 167 ESGTAGTFGEYIRALPRRTG-------SVLDWPEDEVDKLLKGSPSRLAAAERQDSVNAA 219

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
            +E+ +           +P        T    + AF  + S ++ L   ++    ALVP 
Sbjct: 220 IDEIRSY----------FPE------ITVGALRWAFDILFSRLIRLD--AMGGELALVPW 261

Query: 124 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 181
              +L +   C A +    DAV L  DR Y  GE +    G +P+S+LLI+YGF  E  +
Sbjct: 262 -ADMLNHKPGCAAFIDLNGDAVNLTTDRSYVKGEQVWASYGQRPSSELLISYGFAPEVGE 320

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR--------------EKE 227
           NP D   +   ++  DP    K  V +  G   V+ F +                    E
Sbjct: 321 NPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLRLNGYPRQLLQYASFILCNPE 380

Query: 228 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP-----------CMERAVLDQLADYF 276
             S++    +  +    +  QS+  S+  +                  E AV + LAD  
Sbjct: 381 KPSELKGLAQSAFTGSANIGQSIFDSVRGLTNGKARGKQGVILGGVAGEIAVREMLADLC 440

Query: 277 KARLAGYPATLSEDEAM 293
              L+ YP TL +D+ +
Sbjct: 441 AEALSAYPNTLEKDKGL 457


>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 247

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 92/196 (46%), Gaps = 23/196 (11%)

Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 204
           +   R + AGE +++  GP+ N  LL  YGFV++DNP D   +   ++   D   +D  R
Sbjct: 40  VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99

Query: 205 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 249
           ++ +  GKL           V  V  GR     EKE    ++P  RL  V D    E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158

Query: 250 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY--NLHPKKRVA 306
              SL      +SP  E A  D L         G+  TL+EDEA L+    +L  +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218

Query: 307 TQLVRMEKK-MLNACL 321
               RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233


>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 789

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/351 (24%), Positives = 148/351 (42%), Gaps = 58/351 (16%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEILERAEGI 60
           +K+  KSFW P++           L VE    P+L+++ EL +L GSP   ++ ER E I
Sbjct: 433 EKKNHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVKERRECI 483

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
            ++Y              Q     IP  A   T + F  A +   S +  L  ++  R  
Sbjct: 484 TQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FINKKRTD 528

Query: 119 ALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
           A VPL         AY++      K   ML A +D          + G+ I   CG + N
Sbjct: 529 AFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIYYSCGRKCN 579

Query: 169 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKE 227
           S+ L+NYGFV ++N  + + +    + +D     K +M+ +R  K    +F +H   E++
Sbjct: 580 SRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFRIHINYEEK 637

Query: 228 AISDMLPYLRLGYVSDTSEMQ-----SVISSLGPIC--PVSPCMERAVLDQLADYFKARL 280
           ++ +   +LR   + D   ++     S      P+   P S   E+ +  ++       +
Sbjct: 638 SVLEFFGFLRFVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIENEKQMWTEIHKICAEIM 697

Query: 281 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
             YP TL ED+ +L    L   ++    L   EK++L   + +   M  LL
Sbjct: 698 IQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEILMYYITMADRMKKLL 748


>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
           74030]
          Length = 483

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 146/357 (40%), Gaps = 50/357 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--E 58
           M  + Q   S W PY+  L ++       ++S + WSE ELA L  S    +I   +  E
Sbjct: 94  MMSEGQQSDSRWAPYLAVLPQK-------LDSLVFWSEEELAELQASSVAKKIGRSSAEE 146

Query: 59  GIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL--- 109
              +  + L    F         S+   Y +DIP E    E  KQ     +     L   
Sbjct: 147 MFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE----EPAKQENGGAEGETDDLVSD 202

Query: 110 ----QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
               +K  L+    ++PL   L A + +  A +   ++ +++   +P  AGE I    G 
Sbjct: 203 DGEDEKTILS----MIPLADMLNADAERNNARIYYENEDLEMRTIKPIMAGEEIFNDYGQ 258

Query: 166 QPNSKLLINYGFVDED-NPYDRLVVEAA----LNTEDPQ---------------YQDKRM 205
            P S LL  YG+V E+   YD + + +A    L TE PQ                +++  
Sbjct: 259 LPRSDLLRRYGYVTENYAQYDVVEISSASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVA 318

Query: 206 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 265
           +A R G L        A  E+ AI D L  L   ++ D   ++++++S   +   S    
Sbjct: 319 LADREGILEDSYDVNIANAEERAIPDELLALLYLFLLDNENLEAIVTSQSALPSRSKLAT 378

Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
             V   L    + R A Y  TL EDE +L   NL  +  +A Q+   EK++L   ++
Sbjct: 379 ELVGKVLVKVLRHREAEYATTLEEDEKLLQAANLPRRTAMAIQVRHGEKRVLRLAVE 435


>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
          Length = 230

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 100/219 (45%), Gaps = 12/219 (5%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           +AL+PL       +         V D  + +  + +KAGE + ++ G + N+ L ++ GF
Sbjct: 14  YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 73

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--MLPY 235
           V E+N YD   +   ++  DP  Q +  +    GKLS+      + R+  +  D  +L +
Sbjct: 74  VFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPIDGQLLAF 130

Query: 236 LRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 289
           LR+ +  +  ++   I+S     LG I C +   +E      L    K  L+ Y  TL E
Sbjct: 131 LRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKTTLDE 189

Query: 290 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
           DE +L +    P + +A ++   EK+++   L+     I
Sbjct: 190 DEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228


>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Sus
           scrofa]
          Length = 326

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 7/150 (4%)

Query: 73  MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 132
           M   +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    ++ +S 
Sbjct: 73  MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129

Query: 133 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLV 188
           + +  L     +  +V   DRPY+ GE + +  G + N +LL++YGFV ++  NP D + 
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVE 189

Query: 189 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           +  +L   D  Y++K  + ++ G    Q F
Sbjct: 190 LSLSLKKSDESYKEKLELLKKYGLSGSQCF 219


>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 463

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 96/205 (46%), Gaps = 16/205 (7%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EKK G+ S W PYI  L +Q   G+L   + + W+E+EL  +  S    E +++   I
Sbjct: 112 LIEKKLGQGSEWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQI 166

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++++  +  ++  +   F    Y     A T  +F    V +    V  +        AL
Sbjct: 167 EKDFLAIKHIFECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLAL 223

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
           +P     L +    +A++ + DD       ++Q++ DR Y  GE +++  G   N+ L++
Sbjct: 224 IPFAD-FLNHDGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLML 282

Query: 174 NYGFVDEDNPYDRLVVEAALNTEDP 198
           ++GF    N YD++ ++  +   DP
Sbjct: 283 DFGFTIPYNIYDQVQIQFDIPKHDP 307


>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
          Length = 380

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 123/262 (46%), Gaps = 14/262 (5%)

Query: 77  LFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSS 132
           + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL       + 
Sbjct: 9   VIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNG 68

Query: 133 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 192
                    DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  
Sbjct: 69  LITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLG 128

Query: 193 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT-------- 244
           ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++         
Sbjct: 129 VSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 187

Query: 245 -SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 303
            S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +++L  + 
Sbjct: 188 DSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRA 247

Query: 304 RVATQLVRMEKKMLNACLQVTA 325
           ++A +L   EK++L   ++  A
Sbjct: 248 KMAIKLRLGEKEILEKAVKSAA 269


>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
          Length = 667

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 12/213 (5%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           +AL+PL       +         V D  + +  + +KAGE + ++ G + N+ L ++ GF
Sbjct: 451 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 510

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD--MLPY 235
           V E+N YD   +   ++  DP  Q +  +    GKLS+      + R+  +  D  +L +
Sbjct: 511 VFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPIDGQLLAF 567

Query: 236 LRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 289
           LR+ +  +  ++   I+S     LG I C +   +E      L    K  L+ Y  TL E
Sbjct: 568 LRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLSTYKTTLDE 626

Query: 290 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
           DE +L +    P + +A ++   EK+++   L+
Sbjct: 627 DEKLLAEAQATPNRLLAIKMRATEKRIIRETLE 659


>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
          Length = 350

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 114/248 (45%), Gaps = 11/248 (4%)

Query: 88  EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 146
           E+FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 12  ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71

Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
            V  + ++AGE I ++ G + N++ +++ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 72  CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 131

Query: 207 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------SLGPI- 257
             R G  +  VF +H   E    + +L +LR+  +++    + ++         +LG   
Sbjct: 132 LARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFTLGNSE 190

Query: 258 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
            PVS   E  +   L D     L  Y  T+ ED+A L    L P+  +A +L   EK++L
Sbjct: 191 YPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLGEKEIL 250

Query: 318 NACLQVTA 325
              ++  A
Sbjct: 251 EKAVRSAA 258


>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
 gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
          Length = 374

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 90/206 (43%), Gaps = 42/206 (20%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEG 59
           + EK + ++S W PY+R L       QL  +ES + W   ELA+L  SPT  E +E  + 
Sbjct: 56  LSEKHKAQESQWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKI 109

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL----- 114
           IK E++ L     +A    Q +P+                     C   L +VSL     
Sbjct: 110 IKSEFHLLT----LANK--QVFPW---------------------CRDALGEVSLTDFMH 142

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
           A     +P       +   C+  L+     D    V D+ YKAG+ I +  G  PNS L 
Sbjct: 143 AYSTDQIPFAD-FFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILA 201

Query: 173 INYGFVDEDNPYDRLVVEAALNTEDP 198
           ++YGF    NP++++ V   ++  DP
Sbjct: 202 VDYGFAVASNPHEQVEVPMGVSLTDP 227


>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
 gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 353

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 55/226 (24%), Positives = 102/226 (45%), Gaps = 27/226 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EKK G+KS W+PYI  L +        + S + W E EL+ +  S    E +++   I++
Sbjct: 9   EKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEK 63

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           +++      F+A +  Q  P  I TE    E F  A+  V S     +    ++R +L+P
Sbjct: 64  DFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIP 110

Query: 123 LGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
               +         +L   D+       +Q+  DR Y  G+ + +  G   N+ L++++G
Sbjct: 111 FADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDFG 170

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 219
           F    N +D + ++  +  +DP    K  + Q +   +V+   +FH
Sbjct: 171 FTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216


>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Oreochromis niloticus]
          Length = 607

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/340 (23%), Positives = 138/340 (40%), Gaps = 51/340 (15%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----- 64
           SFWLPYIR L ++        + PL + + ++  L G+    ++L + +   R+Y     
Sbjct: 170 SFWLPYIRSLPQE-------YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYK 222

Query: 65  -----NELDTVWF---------MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
                  L +V           M G LF Q+       A +  + +Q  +  +       
Sbjct: 223 LVQDKGMLGSVELRLFASLTPVMGGKLFDQW-------AVSSVMTRQNQIPTEDG----S 271

Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           +V+LA    L+PL       +          DD  + V  + YK  E I ++ G + N++
Sbjct: 272 RVTLA----LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFYGTRSNAE 327

Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
            +I+ GF  +D+ +DR+ ++  ++  +  Y  K  V  R G  +  VF +H   E    +
Sbjct: 328 FVIHNGFFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPPISA 386

Query: 231 DMLPYLRLG---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
            +L +LR+          Y+     +  + +      PVS   E  +   L       L 
Sbjct: 387 QLLAFLRVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLK 446

Query: 282 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
            Y  T  ED +ML   +L    R+A QL   EK++L   L
Sbjct: 447 TYKTTSEEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486


>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
 gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
 gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
 gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 463

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 102/221 (46%), Gaps = 22/221 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EKK G+KS W+PYI  L +        + S + W E EL+ +  S    E +++   I++
Sbjct: 124 EKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEK 178

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           +++      F+A +  Q  P  I TE    E F  A+  V S     +    ++R +L+P
Sbjct: 179 DFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIP 225

Query: 123 LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
               +         +L   D+ + ++  DR Y  G+ + +  G   N+ L++++GF    
Sbjct: 226 FADFMNHDGLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPY 285

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 219
           N +D + ++  +  +DP    K  + Q +   +V+   +FH
Sbjct: 286 NIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326


>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
 gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
          Length = 206

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)

Query: 166 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 225
           + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +  G  
Sbjct: 10  KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67

Query: 226 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 279
            E++   MLPYLRL  +  T    ++++  +   G +  PVS   E A+   + +  K+ 
Sbjct: 68  -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126

Query: 280 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164


>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
 gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
          Length = 389

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 96/201 (47%), Gaps = 22/201 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  +K+ + S W PYI  L         ++ S + W++ EL YL  SP   E ++R + +
Sbjct: 52  LAHQKKKETSAWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVV 106

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E+   + V      L QQ    + +   T       ++ V S    ++ +   +  AL
Sbjct: 107 RMEFAAAENVCM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLAL 153

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VP       + + C+AML+  +D    ++V DR Y  G+ +V+  G   N+ L +++GF 
Sbjct: 154 VPF-VDFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFA 212

Query: 179 DEDNPYDRLV-VEAALNTEDP 198
              NP+D++  +  +L+ +DP
Sbjct: 213 LPFNPHDQVAGIWLSLSEKDP 233


>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
          Length = 485

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 133/300 (44%), Gaps = 27/300 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           ++  G    W PY+R L  + G         L WSE E+ +L GS   ++ LE     + 
Sbjct: 141 QRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAAAEA 193

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
            + E+  V   A +  +   +     AF     + AF  + S +V L    L  + AL+P
Sbjct: 194 SWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEALLP 247

Query: 123 LGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
               LL +     + L  +A + AV L  +R Y+AGE +++  G + + +LL++YGF  +
Sbjct: 248 WAD-LLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGFCPD 306

Query: 181 --DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY- 235
              NP+D  RL++E  L   D     K    +++G  + Q+F +   R   A  +++ Y 
Sbjct: 307 LGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL---RMAAAPFELVHYT 361

Query: 236 -LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
                 V    E + +   L     + P ++ A L+ +    KA LA YP +   D A L
Sbjct: 362 AFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAALAAYPRSFDGDRAEL 421


>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
 gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
          Length = 336

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 95/217 (43%), Gaps = 24/217 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M EK++G+ S W PYI  L      G+  + S + W   ELA L  SP      ER   +
Sbjct: 117 MAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALL 171

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + EY E+  V     S    Y  D+     + E FK  +  V S        S      +
Sbjct: 172 QSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTM 220

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLIN 174
           +PL      +  + + + +  DD   +VV  R Y+ G+      + +  G   N+ L ++
Sbjct: 221 IPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALD 279

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 210
           YGFV  DNP+D   +   + +EDP  + K + +AQ N
Sbjct: 280 YGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316


>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
 gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
          Length = 403

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 95/217 (43%), Gaps = 24/217 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M EK++G+ S W PYI  L      G+  + S + W   ELA L  SP      ER   +
Sbjct: 117 MAEKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALL 171

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + EY E+  V     S    Y  D+     + E FK  +  V S        S      +
Sbjct: 172 QSEYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTM 220

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGE-----SIVVWCGPQPNSKLLIN 174
           +PL      +  + + + +  DD   +VV  R Y+ G+      + +  G   N+ L ++
Sbjct: 221 IPL-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALD 279

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 210
           YGFV  DNP+D   +   + +EDP  + K + +AQ N
Sbjct: 280 YGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316


>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
 gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 470

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 86/348 (24%), Positives = 144/348 (41%), Gaps = 46/348 (13%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 57
           Q + S W PY+  L  +       ++S + WS++ELA L  S    +I ++         
Sbjct: 96  QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKSY 148

Query: 58  ---EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQK 111
              +G+K    E+        S+   Y +DIP  +       +   A    V      +K
Sbjct: 149 IAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDEK 205

Query: 112 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
             L+    ++PL   L A + +  A L   ++ +++   +P   GE I    G  P S L
Sbjct: 206 TILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSDL 261

Query: 172 LINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQDKRMVAQRNGKLS 214
           L  YG+V D  + YD       L+V    N          T+D   + +  +A+R G   
Sbjct: 262 LRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTRLELAEREGVYE 320

Query: 215 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 274
                VH+  ++ +I D L       + D S ++++++S   I   S          LA 
Sbjct: 321 DSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKLTTELAGQVLAT 380

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
             +AR   Y  TL EDE +L + +L  +  +A Q+   EKK+L A +Q
Sbjct: 381 LLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428


>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 135/313 (43%), Gaps = 29/313 (9%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
            +S + WSE ELA + G+   +  L   E +K E+ +++    +          D+    
Sbjct: 170 TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHK-------DLFPSR 222

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL----------LAYSSKCKAMLA 139
            T + F  AF  ++S      +    +   L+P    +           A+  K  A L 
Sbjct: 223 ITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLF 279

Query: 140 AVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
           + D    L      KAG+ + + +   + N+ L ++YGF+++ +  +   +   +   D 
Sbjct: 280 SWDVLFSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDL 339

Query: 199 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVI--SSL 254
            + DK  +A+ NG      F +    E+     MLP+LRL  +  T    ++S+   S  
Sbjct: 340 FFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVW 397

Query: 255 GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 313
           G +  PVS   E  +   + +  +A L+GY  T+ EDE  L + NL  + R+A  +   E
Sbjct: 398 GHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGE 456

Query: 314 KKMLNACLQVTAD 326
           K++L   +Q+  D
Sbjct: 457 KRVLQQIIQIFKD 469


>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
          Length = 605

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 76/351 (21%), Positives = 146/351 (41%), Gaps = 43/351 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++K+ + SFW PY+  L +           P+ +++++L +L GSP   ++ ++   +
Sbjct: 244 LLQEKKIQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDL 297

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K++Y ++  V   A    Q           +F+ F  A +   S +  +  +   +  A 
Sbjct: 298 KKDYCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAF 343

Query: 121 VPLG-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
           VPL        P L +  YS + +  +   D+ ++         G+ I    G + NS+ 
Sbjct: 344 VPLADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSKCNSRF 395

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 231
           L+NYGFV +DN  + + V    +      Q K  +++   +   + F +       + SD
Sbjct: 396 LLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPNDVSFSD 454

Query: 232 MLPYLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKARLAGYPAT 286
            + ++R   + +  E  +++     I P     +S   E A  + + +     L  YP T
Sbjct: 455 FMSFIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALNQYPTT 514

Query: 287 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 337
           L +D  +L    L   +R    L   EKK+LN   Q +  M  L  +   S
Sbjct: 515 LEQDLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565


>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 135/313 (43%), Gaps = 29/313 (9%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
            +S + WSE ELA + G+   +  L   E +K E+ +++    +          D+    
Sbjct: 170 TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHK-------DLFPSR 222

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL----------LAYSSKCKAMLA 139
            T + F  AF  ++S      +    +   L+P    +           A+  K  A L 
Sbjct: 223 ITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVNHSANVTTEEHAWEVKGPAGLF 279

Query: 140 AVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
           + D    L      KAG+ + + +   + N+ L ++YGF+++ +  +   +   +   D 
Sbjct: 280 SWDVLCSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGFIEQKSDRNAYTLTLEIPESDL 339

Query: 199 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVI--SSL 254
            + DK  +A+ NG      F +    E+     MLP+LRL  +  T    ++S+   S  
Sbjct: 340 FFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLRLLALGGTDAFLLESLFRNSVW 397

Query: 255 GPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 313
           G +  PVS   E  +   + +  +A L+GY  T+ EDE  L + NL  + R+A  +   E
Sbjct: 398 GHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK-LKEENLDSRLRIAVGIREGE 456

Query: 314 KKMLNACLQVTAD 326
           K++L   +Q+  D
Sbjct: 457 KRVLQQIIQIFKD 469


>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/197 (24%), Positives = 92/197 (46%), Gaps = 19/197 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EKK+G+KS W+PYI  L +        + S + W E E + +  S    E +++   I++
Sbjct: 124 EKKKGQKSRWVPYISRLPQP-----AEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEK 178

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           E++      F+A +  Q YP  I  E    E F  A+  V S     +    ++  +L+P
Sbjct: 179 EFS------FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIP 225

Query: 123 LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
               +         +L+  D+ + ++  DR Y  G+ + +  G   N+ L++++GF    
Sbjct: 226 FADFMNHDGLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTVPY 285

Query: 182 NPYDRLVVEAALNTEDP 198
           N +D + ++  +  +DP
Sbjct: 286 NIHDEVQIQMDVPNDDP 302


>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
           norvegicus]
          Length = 416

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 107/234 (45%), Gaps = 19/234 (8%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 68
            SFW PYI+ L  +        ++PL + E E+  L  +    ++  + +   R+Y    
Sbjct: 108 NSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFY 160

Query: 69  TVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLG 124
            V        Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL 
Sbjct: 161 KV-------IQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLW 213

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
                 +          DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +
Sbjct: 214 DMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSH 273

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+
Sbjct: 274 DRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326


>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           impatiens]
          Length = 484

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 142/337 (42%), Gaps = 31/337 (9%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K  + S W PY+  L            + L  +  ++  L GSPT    L++   I R+
Sbjct: 165 EKHKEYSKWKPYLDILPT-------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQ 217

Query: 64  YNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
           Y      +F    LFQ+    +     + FT+E +  A   V  +  ++  +  SL    
Sbjct: 218 Y-----AYF--NKLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH- 269

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P+       +SK      A  +  +    R +K  E I +  G + NS   ++ GFV
Sbjct: 270 ALIPMWDMCNHENSKITTDFNATLNCCECYALRDFKKAEQIFISYGARTNSDFFVHSGFV 329

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
             DN  D   +   ++  DP  +++  +  +    +V  F +  G   E ISD +L +LR
Sbjct: 330 YMDNEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLR 387

Query: 238 LGYVSDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDE 291
           + +     E+   I S  +  +  +   +E  V + +  +   RL    A YP TL ED 
Sbjct: 388 V-FSMRKEELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDL 446

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
            +L +  L   K++A QL   EK++L   L+     I
Sbjct: 447 QLL-ETTLPRIKKLAIQLRVTEKRILQGALEYVQQWI 482


>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
 gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
          Length = 377

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 85/190 (44%), Gaps = 26/190 (13%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E+ +G+ S W PYI  L +        +++  LW +TEL+YL  SP   +  ER E I
Sbjct: 119 LMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRERLEMI 173

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
             E+ ++     +   LF +          + E FK  +  V S     + +++     L
Sbjct: 174 TTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTL 219

Query: 121 VPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           V +  P+L +     +S  K     + +   +  DR Y   + I +  G   N++L ++Y
Sbjct: 220 VMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAELALDY 277

Query: 176 GFVDEDNPYD 185
           GF   +NPYD
Sbjct: 278 GFTVPENPYD 287


>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 73/342 (21%), Positives = 139/342 (40%), Gaps = 45/342 (13%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW PYI  L             P+ ++ ++L +L GSP   +I ++   ++++YN++  
Sbjct: 228 SFWKPYIDILPSSYPSF------PIFYNNSDLEWLKGSPFLKQIKDKLADLQKDYNDICN 281

Query: 70  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG----- 124
           V       F QY          F  F  A +   S +  +  ++  +  A VPL      
Sbjct: 282 V----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLADMLNH 327

Query: 125 --PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
             P L +  YS + +  +   D+ ++         G+ I    G + NS+  +NYGFV E
Sbjct: 328 KRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRKCNSRFFLNYGFVVE 379

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
            N  + + +    +  DP  Q K    + + +   + F +    ++ A+ D + ++R   
Sbjct: 380 GNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDETAVIDFMSHIRFLV 438

Query: 241 VSDTSEMQSVISSLGP-------ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
           + D ++++ +++             P+    E  +   +    K  L  YP T  +D+ +
Sbjct: 439 IRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKTLKQYPTTFEQDQEI 498

Query: 294 LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 335
           L    L   +R    L   EK++L    Q +  M  LL +  
Sbjct: 499 LQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540


>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
           carolinensis]
          Length = 311

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 9   DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 253
            K  V  R G  +  VF +HA  E    + +L +LR+  +++    + +I         +
Sbjct: 69  MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127

Query: 254 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 308
           LG    PVS   E     +L  + +AR    L  Y  T+SED+A L   +L     +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183

Query: 309 LVRMEKKMLNACLQVTA 325
           L   EK++L   ++  A
Sbjct: 184 LRLGEKEILEKAIKSAA 200


>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 679

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 98/203 (48%), Gaps = 15/203 (7%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +ER +  
Sbjct: 111 MGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAVERIKVW 165

Query: 61  KREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHLQKVSLA 115
           +++Y+    +LD + F      +QY +++   A T  I  +AF A V S  V    +   
Sbjct: 166 EQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQPDDLPED 221

Query: 116 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              AL+PL   L  +    K    A D+ + L+V   + AG+ I    GP+ N +LLINY
Sbjct: 222 GVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNEQLLINY 280

Query: 176 GFVDEDNPYDRLVVEAALNTEDP 198
           GF    NP D  +V   +  + P
Sbjct: 281 GFCIAGNPTDYRIVLLGVKPDSP 303


>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 357

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 90/201 (44%), Gaps = 32/201 (15%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIK 61
           E  +GK SFW PYI  L       Q+AV + P  W + ++AYL G+     I E  E +K
Sbjct: 122 EYLKGKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVK 178

Query: 62  REYNELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLAR 116
           RE+ +       A  L ++  + D+P  A+T  ++K AF    S        L   +  R
Sbjct: 179 REFKQ-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRR 229

Query: 117 RFALVPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVW 162
             AL+P G          PLL  ++            +V D  +L+   PY+ G  +   
Sbjct: 230 LSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQVYNN 289

Query: 163 CGPQPNSKLLINYGFVDEDNP 183
            G + NS+LL+ YGF+  + P
Sbjct: 290 YGLKTNSELLLAYGFILPETP 310


>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
          Length = 577

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 142/312 (45%), Gaps = 39/312 (12%)

Query: 40  ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFK 96
           E+ YL G+   +  + + + I R++  L  V+   GS  ++    +P +A   FTF+ ++
Sbjct: 195 EMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAFTFDTYR 252

Query: 97  QAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKCKAMLAA 140
               AV +       H+  V          AL+P+        GP   AY+        A
Sbjct: 253 WCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--------A 304

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
           +   ++ +  + +K GE + +  G + NS LLI+ GFV +++P+D++ +   ++ +DP Y
Sbjct: 305 LTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVSQKDPLY 364

Query: 201 QDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEMQSVISS 253
             K +++ + N ++S Q               +L +LR+ ++++        + ++ +SS
Sbjct: 365 SLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQKNELSS 424

Query: 254 LGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 310
           L  I     V    +  V + L +  K  L G+       E M+ D +L  + ++A Q  
Sbjct: 425 LREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSKLALQFR 484

Query: 311 RMEKKMLNACLQ 322
             E ++L+AC+ 
Sbjct: 485 IEEHRILSACVN 496


>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 519

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 91/343 (26%), Positives = 134/343 (39%), Gaps = 57/343 (16%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           K  G  SF+ PY   L             P+ WS  EL  L GS   ++I +R + I+ +
Sbjct: 125 KTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIADRGQAIQDD 178

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
           Y  +  V    G+L             T + FK A + V S    LQ +   R  ALVP 
Sbjct: 179 YEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDGHRTSALVPH 225

Query: 124 GPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
              L  Y  +  K     V     +   +  +AG  +    G + N + L+NYGF  EDN
Sbjct: 226 ADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLLNYGFAVEDN 285

Query: 183 PY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 236
                   + + +E  ++  D  +QDK     R               E   IS      
Sbjct: 286 RELDGFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETNQIS------ 325

Query: 237 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             G V+     Q+V  S+G   P S   E      +    + RLA YP T+S+D A L D
Sbjct: 326 --GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTISQDMADLQD 380

Query: 297 YNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 333
              +P+   +R A   VR EK++L+      +   DM+  + D
Sbjct: 381 EASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423


>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 431

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 76/162 (46%), Gaps = 15/162 (9%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           + L +S+ EL+YL GS       +R   I+ +Y+ + T       LF  YP     E F+
Sbjct: 138 TTLFFSDDELSYLEGSSLHHFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 191

Query: 92  FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
            + +K A   +  +S VV   K  L     +  + P     + + K  + AVD  +    
Sbjct: 192 LDAWKWALSVIWSRSFVVDEGKRGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 247

Query: 150 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYDRLV 188
             P K GE I V  G      N++LL++YGFV E+NP+D +V
Sbjct: 248 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHDAVV 289


>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
           florea]
          Length = 483

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 76/333 (22%), Positives = 139/333 (41%), Gaps = 23/333 (6%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           ++  + S W PY+  L            + L  +  ++  L GSPT    L++   I R+
Sbjct: 164 ERHKENSKWKPYLDILPT-------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQ 216

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVP 122
           Y+  + V+    +       D+    FT+E +  A   V +    +     +R   AL+P
Sbjct: 217 YSYFNKVFQNNNNAVSAILRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIP 272

Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
           +       + +      A  +  +    R +K GE I +  GP+ NS   ++ GFV  +N
Sbjct: 273 MWDMCNHENGRITTDFNATSNYCECYALRDFKKGEQIFISYGPRTNSDFFVHSGFVYMEN 332

Query: 183 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYV 241
             D   +   ++  D   +++  +  +    +V  F +  G   E ISD +L +LR+ + 
Sbjct: 333 KQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FS 389

Query: 242 SDTSEMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLT 295
              +E+   I S  +  +  +   +E  V + +  +   RL    A YP TL ED  +L 
Sbjct: 390 MRKAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL- 448

Query: 296 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
           +  L   K++  QL   EK++L   L+     I
Sbjct: 449 ETTLPQIKKLTIQLRVTEKRILQGALEYVEQWI 481


>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Chroomonas mesostigmatica
           CCMP1168]
          Length = 464

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 67/326 (20%), Positives = 134/326 (41%), Gaps = 31/326 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+  G+KS W  Y   L +++    +       W  +++ +L GS          E IK 
Sbjct: 147 ERAIGEKSKWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKI 201

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           ++  ++   F    L        P + F  + ++ A   + S  + LQ +   ++ ALVP
Sbjct: 202 QFLRIEKTIFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVP 252

Query: 123 LG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
                   P        K +  + ++ + +  D+ Y   + I    G + N +LL+ YGF
Sbjct: 253 YADFINHNPFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGF 312

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLP 234
           + E NP+D + +  AL+T+D  Y  K       ++  +++  VF+    +E      +  
Sbjct: 313 IIERNPFDSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------LYE 366

Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
           ++RL             S+L      +  +E+ +   +    K  L  Y  T++E++ + 
Sbjct: 367 FMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEKILN 426

Query: 295 TDYN---LHPKKRVATQLVRMEKKML 317
              N   L   ++ + +  + EKK+L
Sbjct: 427 NLSNIIVLTKNQKTSIKQRKCEKKIL 452


>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
 gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
          Length = 1037

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 153/385 (39%), Gaps = 83/385 (21%)

Query: 1    MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------AEI 53
            MYE  QG+KS W PY   L         + ++P+ WSE+EL  L  S  +       AE 
Sbjct: 672  MYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADAES 724

Query: 54   LERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
            + R               G  R  ++L  +    GS    Y +D+  +    E     +V
Sbjct: 725  MFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDGWV 784

Query: 101  AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
              +     +          +VP+   +L   ++  A +   ++++ +   RP KAGE I 
Sbjct: 785  EDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIF 834

Query: 161  VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
             + GP PNS+LL  YG+V E +  YD  VVE   +  +        V + N  +S QV  
Sbjct: 835  NYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VMRLNFGISGQVLE 885

Query: 220  V--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-----------------ICPV 260
               H   E+E   D     R     +T E+ S  +  GP                 +  V
Sbjct: 886  KLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPEDLQEQLKTFLKGV 940

Query: 261  SPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 310
                  A+ D+          LA   +A  + YP + SED  +L   +L  + R+A ++ 
Sbjct: 941  KKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQDLSQRTRMAIEVR 1000

Query: 311  RMEKKMLNACLQVTA--DMIMLLPD 333
              EKK+L   +  T+  D+ M + D
Sbjct: 1001 LGEKKLLQEAIASTSSVDVEMTVDD 1025


>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
 gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
          Length = 489

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 90/334 (26%), Positives = 142/334 (42%), Gaps = 39/334 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E  +G  S W PY+  L RQ              S    A L  S  + ++L    G+
Sbjct: 136 LSEVARGADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGV 183

Query: 61  KREY--NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLA 115
           K EY  +E D+V     S  +    D+   + TF+ F  AF  ++S V   +   K++L 
Sbjct: 184 K-EYVQSEFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRGDKLALV 238

Query: 116 RRFALVPLGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSK 170
               LV   P + +  S    K K +    +    L      K+G+ I + +   + N++
Sbjct: 239 PFADLVNHSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDLDKSNAE 297

Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
           L ++YGFV+ +   D   V   ++  DP Y DK  +A+ N       F +    ++    
Sbjct: 298 LALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL--DEPLPP 355

Query: 231 DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
            MLPYLRL  +  T           SV   L    P+SP  E ++   + D  K+ LA Y
Sbjct: 356 QMLPYLRLLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACKSALAAY 413

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
             T+ EDE +    NL P+  +A  +   EKK+L
Sbjct: 414 HTTIEEDEELSERENLQPRLTIAIGVRAGEKKVL 447


>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
          Length = 467

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 150/368 (40%), Gaps = 61/368 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
           MYE   G +S W PYI  L        L  ++P+ WSE EL+YL  S T  +I +  AE 
Sbjct: 102 MYEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEE 154

Query: 60  IKREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
           + R                       +L  +    GS    Y +D+  E    +     +
Sbjct: 155 MFRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGW 214

Query: 100 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 159
           V  +     +  V++A           +L   ++  A +   D+ + +   R  KAGE I
Sbjct: 215 VEDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEI 264

Query: 160 VVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQD------KRM 205
           + + GP PNS+LL  YG++ E +        P+D   V+ +L +E    QD      +RM
Sbjct: 265 LNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQDIMAETMERM 322

Query: 206 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPV 260
              ++    + V    +G      +   P +  G   D  E +++ I  L    G +   
Sbjct: 323 --DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKLDGNLISD 380

Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 320
               +  +   + +  +   + Y  T++EDE +L   +L  ++R+A Q+   EKK+L   
Sbjct: 381 KRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVRLGEKKLLQEA 440

Query: 321 LQVTADMI 328
               ++M+
Sbjct: 441 CDHFSEMV 448


>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
          Length = 318

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/230 (23%), Positives = 107/230 (46%), Gaps = 34/230 (14%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             +VP+   +L   ++  A +   +D + +V  RP KAGE I+ + GP PNS+LL  YG+
Sbjct: 71  LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129

Query: 178 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 213
           V   +        P+D  +V++ L  +               DP+ ++D  ++ + +G+ 
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187

Query: 214 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 273
             +       + +E  +++   L+       + ++++    G + P     +      +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240

Query: 274 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 323
              +  LA YP T  EDEA+L   NL  ++R+A ++   EK++L   LQ+
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQM 290


>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
          Length = 571

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 89/354 (25%), Positives = 147/354 (41%), Gaps = 54/354 (15%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIK 61
           EK + K S W PY++ L+           S L ++E E    L G+    E  +R +  K
Sbjct: 171 EKAKAKGSEWWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAK 223

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFAL 120
            EY  L            + P   P + + F + + AF+ A+       Q ++ A  +A+
Sbjct: 224 EEYEALRAA-------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAM 275

Query: 121 VPL--------GPPLLA-------------------YSSKCKAMLAAVDDA-VQLVVDRP 152
           VPL         PP ++                    + +C      VD A V L  +  
Sbjct: 276 VPLMDAIPICGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTR 335

Query: 153 YKAGESIV-VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
             AG  I+ +    + NS+L +++G VD+ +P D       L+  DP Y  K+ V +  G
Sbjct: 336 ASAGSKILQIDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQG 395

Query: 212 KLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLD 270
                Q F V+  R      + L YLR   V+++ E+ +V  +   +  VSP  E   L 
Sbjct: 396 FADKGQTFPVYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQ 450

Query: 271 QLADYFKARLAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 323
            L    + R++ Y  T  EDE +L    ++  K R A++L R EK+++   +  
Sbjct: 451 LLMADCRDRMSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503


>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 273

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 15/159 (9%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           + L +S+ EL+YL GS       +R   I+ +Y+ + T       LF  YP     E F+
Sbjct: 124 TTLFFSDDELSYLEGSSLHYFARQRRRAIESQYDTIFT------PLFVDYPEHFAPEQFS 177

Query: 92  FEIFKQAFVAV--QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
            + +K A   +  +S VV   K  L     +  + P     + + K  + AVD  +    
Sbjct: 178 LDAWKWALSVIWSRSFVVDEGKSGLVPWADMFNMAPE----TEQVKVAVDAVDHHLIYSA 233

Query: 150 DRPYKAGESIVVWCGPQ---PNSKLLINYGFVDEDNPYD 185
             P K GE I V  G      N++LL++YGFV E+NP+D
Sbjct: 234 RSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLENNPHD 272


>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
 gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
          Length = 484

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 26/312 (8%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
           +++PLLWSE EL  L G+    ++L+   G    +    T   +   LF   P   P   
Sbjct: 144 LDTPLLWSEDELQELQGT----QVLQTLGGYLTFFRS--TFQQLQSGLFTSNPAAFPPSI 197

Query: 90  FTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 146
           FT   F  A  AV+S     +   K++LA    L  L     A +SK     A +    Q
Sbjct: 198 FTLPRFLWAVAAVRSRSHPPLDGPKIALA---PLTELVSHRRAANSKLSVRSAGLFGRGQ 254

Query: 147 LVV---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 202
           ++V    R  + GE + +  GP + +  +L++YG +D  +P     +   +   D    D
Sbjct: 255 VLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPDSDRFIDD 314

Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPIC 258
           K  + + N      V+++    +++   +ML +LRL  +  SD   ++S+  +   G + 
Sbjct: 315 KLDILESNDLPQSVVYNLTP--DEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDVWGFMQ 372

Query: 259 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK--KRVATQLVRM-EK 314
            PVS   E AV + L++  +A L GY  T+ +D A L       K  +R A  L+R+ EK
Sbjct: 373 EPVSEGNEEAVCNTLSEGARAALGGYGTTIDQDLAELRAQGSRAKGSRREAALLIRLGEK 432

Query: 315 KMLNACLQVTAD 326
           + L+A  +   D
Sbjct: 433 EALDAVARFFED 444


>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 475

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 130/306 (42%), Gaps = 27/306 (8%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           +PLLWSE E   L G+     +       +  + +L         LF   P   P   FT
Sbjct: 146 TPLLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFT 199

Query: 92  FEIFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 148
              F  A  AV+S     +   K++LA    LV L     A ++K     + +    Q+ 
Sbjct: 200 LPNFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVA 256

Query: 149 V---DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
           V    R  + GE++ +   P + +  +L++YG +D  +P     +   L+  D    DK 
Sbjct: 257 VVEATRAIRKGEALGMDYAPGKLDGPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKA 316

Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSP 262
            + +  G      + +    +++   +M+ +LRL  +   D   ++S+  +      VS 
Sbjct: 317 DIVEGAGLRPSMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSE 369

Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNAC 320
             E AV   LA+  +A LAGYP TL +D A L   +     R  A  LVR+ EK+ L+A 
Sbjct: 370 GNEEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAV 429

Query: 321 LQVTAD 326
            +   D
Sbjct: 430 ARFFED 435


>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 411

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 14/182 (7%)

Query: 151 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           R   A + +VVW G          +PN +LL+  G + ++N  D L   A L   D  Y 
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 261
            K  V +  G  + + F V+A R       +L YLRL  V+D + +     +      +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
              E  +L  L    + RLA Y  +  ED  +    +L PK+R+A +L   EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340

Query: 322 QV 323
           + 
Sbjct: 341 EA 342


>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
 gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
          Length = 679

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 95/207 (45%), Gaps = 23/207 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +ER +  
Sbjct: 111 MAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAVERIKIW 165

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR--- 117
           + +Y   D+ +   G+    +P     E +T+E++  A   + S     + +S A +   
Sbjct: 166 EEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSGAVQPGD 217

Query: 118 ------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
                  AL+PL   L  +    K    A D  + L+V   + AG+ I    GP+ N +L
Sbjct: 218 LPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGPRNNEQL 276

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDP 198
           LINYGF    NP D  +V   +  + P
Sbjct: 277 LINYGFCIAGNPTDYRIVHLGVKPDSP 303


>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1118

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/368 (22%), Positives = 150/368 (40%), Gaps = 61/368 (16%)

Query: 1    MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
            MYE   G +S W PYI  L        L  ++P+ WSE EL+YL  S T  +I +  AE 
Sbjct: 753  MYEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEE 805

Query: 60   IKREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
            + R                       +L  +    GS    Y +D+  E    +     +
Sbjct: 806  MFRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGW 865

Query: 100  VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 159
            V  +     +  V++A           +L   ++  A +   D+ + +   R  KAGE I
Sbjct: 866  VEDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEI 915

Query: 160  VVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
            + + GP PNS+LL  YG++ E +        P+D   V+ +L +E    QD  ++ +   
Sbjct: 916  LNYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQD--IMTETMD 971

Query: 212  KL------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPV 260
            K+       + V    +G      +   P +  G   D  E +++ I  L    G +   
Sbjct: 972  KMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKVDGNLISD 1031

Query: 261  SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 320
                +  +   + +  +   + Y  T++EDE +L   +L  ++R+A ++   EKK+L   
Sbjct: 1032 KRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVRLGEKKLLQEA 1091

Query: 321  LQVTADMI 328
                ++M+
Sbjct: 1092 FDHFSEMV 1099


>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 705

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 116/251 (46%), Gaps = 14/251 (5%)

Query: 88  EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 146
           ++FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405

Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
            V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 465

Query: 207 AQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG 255
             R G    +   VF +H   E    + +L +LR+  +++    + +        I +LG
Sbjct: 466 LARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLG 524

Query: 256 PI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 314
               PVS   E  +   L D     L  Y  T  ED+A+L + +L  + ++A +L   EK
Sbjct: 525 NSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKLRLGEK 584

Query: 315 KMLNACLQVTA 325
           ++L   +Q  A
Sbjct: 585 EILEKAVQSAA 595


>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
          Length = 536

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 253
            K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 254 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
           LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412

Query: 313 EKKMLNACLQVTA 325
           EK++L   ++  A
Sbjct: 413 EKEILEKAVKSAA 425


>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 232

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 95/211 (45%), Gaps = 11/211 (5%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P+       + +      A  D  +    R +K GE + +  GP+ NS   ++ GFV
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
             DN  D   +   ++  D   +++  +  +    SV  F +  G   E ISD +L +LR
Sbjct: 78  CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135

Query: 238 LGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
           + +    +E+   + S     L  + C +   +E  V   L    +  +A YP TL ED 
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            +L +  L   K++A QL   EK++L+  L+
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALE 224


>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
          Length = 458

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 253
            K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274

Query: 254 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
           LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334

Query: 313 EKKMLNACLQVTA 325
           EK++L   ++  A
Sbjct: 335 EKEILEKAVKSAA 347


>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
          Length = 467

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 88/374 (23%), Positives = 153/374 (40%), Gaps = 53/374 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
           +YE  QG+ S W PY   L         + ++P+ WS+ EL  L  S  + +I +  AE 
Sbjct: 107 IYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRHKIGKADAEN 159

Query: 60  --------IKREYNELDTVWFMAGS--------LFQQYPYDIPTEAFTFEIFKQAFVAVQ 103
                   I R   E+    F AG+        +  +    I   AF  E  ++      
Sbjct: 160 MFQKTLLPIIRSNAEI----FNAGNKTDAELIEIAHRMGSTIMAYAFDLENDEEEEEEAD 215

Query: 104 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 163
             V      S+     +VP+   +L   ++  A +   ++++ +   RP KAGE I+ + 
Sbjct: 216 GWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEILNYY 271

Query: 164 GPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTE-----DPQYQDKRMVAQRN 210
           GP PNS+LL  YG+V E +        P+D  +VE+AL +          Q +  + +  
Sbjct: 272 GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIPGQVLEQIRGALEEDE 329

Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSLGPICP--VSPCMERA 267
                 V     G      +   P        D  E +++ +  +    P  +    +R 
Sbjct: 330 EFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKGIKKAQPDAIPDKRKRD 389

Query: 268 VLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            + Q  LA   +A +A YP ++SEDE +L   +L+ + R+A  +   EKK+L   +  ++
Sbjct: 390 EIHQAVLAKTLEALVARYPTSISEDENLLK-QDLNQRTRMAIAVRLGEKKLLQEAITASS 448

Query: 326 DMIMLLPDVTVSPC 339
             + +  D    P 
Sbjct: 449 GDVEMTMDDESGPA 462


>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 713

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 7/190 (3%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A  +
Sbjct: 134 MGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKANLL 188

Query: 61  KREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++       
Sbjct: 189 KEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEENVSV 248

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           L+P    +L +    K    A    V  +V     AG+ I    GP+ N +L++NYGF  
Sbjct: 249 LLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYGFCL 307

Query: 180 EDNPYDRLVV 189
            +NP D  +V
Sbjct: 308 PNNPCDYRIV 317


>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 470

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 142/350 (40%), Gaps = 44/350 (12%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 57
           Q   S W PY+  L  Q       + S + WS++ELA L  S    +I ++         
Sbjct: 96  QVDDSKWAPYLAILPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTY 148

Query: 58  ---EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQK 111
              +G++    E+        S+   Y +DIP  +       +   A    V      +K
Sbjct: 149 ITPQGLQHSSTEM---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEK 205

Query: 112 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
             L+    ++PL   L A + +  A L   ++ +++   +P   GE I    G  P S L
Sbjct: 206 TILS----MIPLADMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDL 261

Query: 172 LINYGFV-DEDNPYDRLVVEAAL----------NTEDPQY-QDKRMV----AQRNGKLSV 215
           L  YG+V D  + YD   + A L          +   P+  QDK  V    A+R G    
Sbjct: 262 LRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDE 321

Query: 216 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 275
               VH+  ++ +I D L       + D S +++++ S   +   S          LA  
Sbjct: 322 SFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAIL 381

Query: 276 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
            +AR   Y  T+ EDE +L + +L  +  +A Q+   EKK+L A ++  A
Sbjct: 382 LQARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREAA 431


>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 492

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 73/333 (21%), Positives = 141/333 (42%), Gaps = 33/333 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E  + + SFW  YI  L            + L +   +   +  SPT     +  + I R
Sbjct: 170 EHVKNENSFWHHYISSLPSD-------YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIAR 222

Query: 63  EYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF- 118
           +Y      +F   +LFQ    +   I    FT+++++ A   V +    +   S +    
Sbjct: 223 QY-----AYF--NNLFQNSNDEASLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVE 275

Query: 119 ----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
                L+PL   +  +++   +    VD +  L   +P+K GE ++++ G + NS  L++
Sbjct: 276 NGINGLIPLWD-MCNHTNGYLSTQYKVDRSECLAC-KPFKKGEQVLIFYGERSNSDFLVH 333

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
            GFV ++NP+D   +   ++  D  +  +  + +  G      F++++G E     ++L 
Sbjct: 334 NGFVYDENPHDSFRLRLGISKSDKLHGLRCELLKDLGIPDSGDFYLYSGSEP-VRENLLA 392

Query: 235 YLRLGY--VSDTSEMQSVISSLGPI----CPVSPCMERAVLDQLADYFKARLAGYPATLS 288
           +LR+    V + +  +S  S L  +    C +   +E  V + L D     L  Y     
Sbjct: 393 FLRIFNMDVENLNHWKSHSSRLSDLMWKDCALDTKIESKVWNFLYDRINLLLKTYKG--D 450

Query: 289 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
           E E  + D N    +R+    ++ EKK L++ L
Sbjct: 451 EVEVRVEDSNSTECRRLVRAQLKCEKKFLSSIL 483


>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
          Length = 408

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 253
            K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224

Query: 254 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
           LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284

Query: 313 EKKMLNACLQVTA 325
           EK++L   ++  A
Sbjct: 285 EKEILEKAVKSAA 297


>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
          Length = 703

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 7/190 (3%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A  +
Sbjct: 124 MGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKANLL 178

Query: 61  KREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++       
Sbjct: 179 KEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEENVSV 238

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           L+P    +L +    K    A    V  +V     AG+ I    GP+ N +L++NYGF  
Sbjct: 239 LLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYGFCL 297

Query: 180 EDNPYDRLVV 189
            +NP D  +V
Sbjct: 298 PNNPCDYRIV 307


>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 502

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 86/170 (50%), Gaps = 9/170 (5%)

Query: 154 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
           K+GE +++ +   + N++L ++YGF+++    +   +   ++  DP + DK  +A+ NG 
Sbjct: 293 KSGEQVLIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGS 352

Query: 213 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERA 267
                F +  G        MLPYLRL  +  T    ++S+  +   G +  P+S   E  
Sbjct: 353 GETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEEL 410

Query: 268 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
           +   + D  K+ L+GY  T+ EDE  L   +L+P+  +A  +   EKK+L
Sbjct: 411 ICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVL 459


>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
          Length = 963

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 81/358 (22%), Positives = 143/358 (39%), Gaps = 63/358 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE----- 55
           MYE  +G +S W PYI  L         + E+P+ WS+ EL  L  S T++++ +     
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKVGKASAEE 649

Query: 56  ----------RAEG----IKREYNELDTVWFMA--GSLFQQYPYDIPTEAFTFEIFKQAF 99
                     RA        + Y++ D +      GS    Y +D          F+   
Sbjct: 650 MFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFD----------FQNED 699

Query: 100 VAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 158
              +       +   A+    +VP+   +L   ++  A +   DDA+ +   R  KAGE 
Sbjct: 700 EEDEDETEEWVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEE 758

Query: 159 IVVWCGPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMVAQ 208
           I  + GP PNS+LL  YG+V  + + YD +          V A+L     Q  DK     
Sbjct: 759 IFNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARECL 817

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCMER 266
            + +L           E      +    R   + +    +++S++ ++    P S   +R
Sbjct: 818 DSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVDKR 877

Query: 267 A-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
                   +L +  D   +R   YP ++SEDE +L   ++  ++R A  +   EK+++
Sbjct: 878 KRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLI 932


>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 567

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 98/379 (25%), Positives = 148/379 (39%), Gaps = 78/379 (20%)

Query: 6   QGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKAEILE 55
           +G  SFW PY+  L       +G    ES    PL    LW+E E+ +L GSPT      
Sbjct: 184 RGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLLSARA 243

Query: 56  RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSL 114
               ++REY E          L++++P+    E AF  E F  AF  + S  V L   + 
Sbjct: 244 LRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLPAENG 297

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VDDA-- 144
               ALVP    L  +S+ C + + A                             DDA  
Sbjct: 298 M--LALVPYAD-LANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSDDAGA 354

Query: 145 ----------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
                           V    DR Y   E + V  G + N++LL+ YGFV + NPY+ + 
Sbjct: 355 VANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFVSDRNPYNSVE 414

Query: 189 VEAALNTEDPQ----YQDKRMVAQRNGK--LSVQVFHVHAGREKEAISDMLPYLRLGYVS 242
           V  +L+  +         KR      G+     + F ++A R    +  +L +  L    
Sbjct: 415 VCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELMQLLRFASL--TE 472

Query: 243 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN---- 298
             +   S +  +    PV+   E A    L    K  L  YP +  ED+A L D +    
Sbjct: 473 QDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSADEDDAALKDKSMAQL 532

Query: 299 LHPKKRVATQLVRMEKKML 317
           L  K+R++ +L R EK++L
Sbjct: 533 LSRKQRLSVRLRRSEKRIL 551


>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
          Length = 538

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 252
            K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
                 PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412

Query: 313 EKKMLNACLQVTA 325
           EK++L   ++  A
Sbjct: 413 EKEILEKAVKSAA 425


>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
           atroviride IMI 206040]
          Length = 956

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 83/387 (21%), Positives = 150/387 (38%), Gaps = 74/387 (19%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
           +YE  +G +S W PY+  L         + E+P+ WS+ E+  L  S T+++I +  AE 
Sbjct: 593 IYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNAEE 645

Query: 60  I--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
           +                     +   EL  +    GS    Y +D          F+   
Sbjct: 646 MFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQNED 695

Query: 100 VAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 158
              +       +   A+    +VP+   +L   ++  A +   DDA+ +   R  KAGE 
Sbjct: 696 EEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAGEE 754

Query: 159 IVVWCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED--------- 197
           I+ + GP PNS+LL  YG+V   +        P+    D L     L++E          
Sbjct: 755 ILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLDSAREHLD 814

Query: 198 -PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 256
             ++++  ++ + + + +      +  +  E   D+   L+         M   I  + P
Sbjct: 815 LDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLKAIRKVDP 866

Query: 257 ICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
            C V       V    L     A  + YP T+ EDE +L+  NL  +++ A  +   EK+
Sbjct: 867 SCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVTVRLGEKR 926

Query: 316 MLNACLQVTADMIMLLPDVTVSPCPAP 342
           +L     + ++   +  D  +   PAP
Sbjct: 927 LLQEARVLLSE---IASDAILDDAPAP 950


>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
 gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
          Length = 421

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 80/203 (39%), Gaps = 36/203 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++EK+ G+ S W PYI  L R        + S   W + EL  +  S    E + R   I
Sbjct: 114 LFEKRAGEASIWAPYISCLPRWG-----TIHSTAFWRKEELTMIQESSLSYETMSRRAAI 168

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + E+NE+ +V F          +D  + A            V+   V+   + +A   A 
Sbjct: 169 REEFNEMQSVPFA-----DFMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLYIALFCA- 222

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
                                    QL  D+ Y AGE + +  GP  N+ L +++GF   
Sbjct: 223 -------------------------QLFADKNYAAGEQVTISFGPLCNASLALDFGFTVP 257

Query: 181 DNPYDRLVVEAALNTEDPQYQDK 203
            NP+D++ +   ++  D   ++K
Sbjct: 258 YNPWDKVQLWLGISRRDSLRKEK 280


>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 232

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 93/211 (44%), Gaps = 11/211 (5%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P+       + +      A  D  +    R +K GE + +  GP+ NS   ++ GFV
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 237
             +N  D   +   ++  D   +++  +  + G  SV  F +  G   E ISD +L +LR
Sbjct: 78  YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135

Query: 238 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
           + +    +E+   + S          C +   +E  V   L    +  +A YP TL ED 
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            +L +  L   K++A QL   EK++L   L+
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALE 224


>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
           B]
          Length = 510

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L   +  +++V  +P  AGE I    G  PNS LL  YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319

Query: 179 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 226
           D          E NP D + + A L          R   +  G L V+V F +    +  
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRVDFWLEEADDDT 374

Query: 227 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
                  E   ++L ++RL  +S T +  + + + G +      +E  +L  + D  K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430

Query: 280 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 322
           L  YP T+ EDE++L   +  NL   KR A  +   EK++L   LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476


>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
           C-169]
          Length = 685

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 76/171 (44%), Gaps = 18/171 (10%)

Query: 37  SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 96
           +E E++ L G+P     +E  + I+ +Y     V     +L   YP DI  +  T + F 
Sbjct: 65  TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPVL---QALTAAYPDDITPDLVTEDKFI 121

Query: 97  QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 148
            A     S  + ++ V  A R  LVP+         P ++ Y       L A  D+++L 
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRY-----GRLDAATDSLRLR 176

Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 197
             R   AGE   +  GP PN KLL+ YGF   DNP+D   +  EA  N  D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227


>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 478

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 85/371 (22%), Positives = 149/371 (40%), Gaps = 72/371 (19%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE  QG  S W  Y   L  +        ++ + W   +L YL GS   ++I +     
Sbjct: 106 IYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEADE 158

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR--- 117
                 L  +   AG  F Q     P+E+   ++  +    + +    L+      +   
Sbjct: 159 MFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQEDE 217

Query: 118 -----------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 166
                        +VP+   +L   ++  A +   +D + +   RP KAGE I+ + GP 
Sbjct: 218 EWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYYGPH 276

Query: 167 PNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK--RMVAQRNGKLSVQ 216
           PNS+LL  YG+V   +        P+D  +V++ L TE  +  D+  + VA+      ++
Sbjct: 277 PNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVWKQVAEHVDPEDLE 333

Query: 217 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---------------GPICP-- 259
              V      E  S+       G++   +++Q V + L               G + P  
Sbjct: 334 DVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLKAIKKVRGDLIPDK 386

Query: 260 -----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 314
                V  C+  + L +L       LA YP T  EDEAML   N+  ++++A ++   EK
Sbjct: 387 RKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTSRQKLAVEVRLGEK 439

Query: 315 KMLNACLQVTA 325
           +++   LQV  
Sbjct: 440 RLIKEALQVAG 450


>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
          Length = 430

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 84/355 (23%), Positives = 142/355 (40%), Gaps = 36/355 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M+E+ +G +SF+ P+I  +           + P+ W+E EL  L G+          + +
Sbjct: 89  MHERSKGGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHL 141

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +R++  +        + F      +PT   T + +  A   + S       VS   ++  
Sbjct: 142 ERDFENIHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSKGGKYLH 193

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLV---------VDRPYKAGESIVVWCGPQPNSKL 171
           V L P +  ++     +   +DD V            V     AG ++ +  G   N+KL
Sbjct: 194 V-LCPAMDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQYSNAKL 251

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 231
           L +YGFV  +N    +     +   DP ++ K+ V   N     Q +  H       + +
Sbjct: 252 LYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLLSNDVDE 311

Query: 232 -MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 290
            +L  LR+  +++    Q   +    I  V    E AV + L    + +L+ Y  TL ED
Sbjct: 312 RLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYATTLEED 369

Query: 291 EAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYA 344
           EA+L +     K R+A  + VRME K      QVT  +I  L     S    P A
Sbjct: 370 EAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPDA 418


>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
 gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 499

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 48/197 (24%), Positives = 84/197 (42%), Gaps = 28/197 (14%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +GK S W PYI  L       + A+  P  W+E ++  L G+     I E  + +K EY 
Sbjct: 115 KGKSSLWAPYISTLTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYK 172

Query: 66  ELDTVWFMAGS----LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
           +   +    GS     + Q  Y+     FT   F+ + +  +S   ++++        L+
Sbjct: 173 QARKILKQEGSPDYRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LL 224

Query: 122 PLGPPLLAYS--------------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
           P G  +  +S              ++    L +   A +L+    Y+ G+ +    G + 
Sbjct: 225 PEGAKIDDFSILQPLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNYGKKT 284

Query: 168 NSKLLINYGFVDEDNPY 184
           NS+LL+ YGFV E+N Y
Sbjct: 285 NSELLLGYGFVTENNDY 301


>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
          Length = 465

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 83/383 (21%), Positives = 153/383 (39%), Gaps = 71/383 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
           +YE  +G  S W  Y   L         + ++P+ WSE EL  L  S  + +I +  AE 
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155

Query: 60  I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
           +                    R  +EL  +    GS    Y +D+  +    E  ++   
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215

Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
                 +            +VP+   +L   ++  A +   ++++ +   RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264

Query: 161 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 196
            + GP PNS+LL  YG+V E +        P+D  +VE+ L                   
Sbjct: 265 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILKQIRGEFEG 322

Query: 197 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 256
           + +++D  ++ +  G+++         + +    D+   L+    S    ++ V S   P
Sbjct: 323 EEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLKGIKKVQSDTIP 378

Query: 257 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
                  + +AVL +  +   AR   YP ++SED+ +L   NL  + R+AT +   EKK+
Sbjct: 379 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARMATVVRLGEKKL 435

Query: 317 LNACLQVTADMIMLLPDVTVSPC 339
           L   +   ++ + +  D    P 
Sbjct: 436 LQEAIATFSEDVEMTMDDESGPA 458


>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 612

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 107/252 (42%), Gaps = 40/252 (15%)

Query: 1   MYEK-KQGKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPT 49
           +YEK   G +S W P +  L           +   G L + +P  W +E   A L G+P 
Sbjct: 249 VYEKYALGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPL 307

Query: 50  KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 109
             + ++  E   R++  L    F A  L + +P   PTE +T   F+ A  A  +  + +
Sbjct: 308 LDDAVKARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTV 361

Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAG 156
           Q  ++          PP         A+L        AV      DDA+ L + R  +AG
Sbjct: 362 QAETVGGASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAG 421

Query: 157 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR-----NG 211
           E I V  G + N++LL+ YGF   DNPYD    +  L+ E PQ + + + A R       
Sbjct: 422 EEIFVSYGAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRERVLHRA 477

Query: 212 KLSVQVFHVHAG 223
           KLS+    V  G
Sbjct: 478 KLSLSPHSVRCG 489


>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 245

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 91/211 (43%), Gaps = 11/211 (5%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P+       + +      A  D  +    R ++ GE I +  GP+ NS   ++ GFV
Sbjct: 31  ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM-LPYLR 237
             DN  D   +   ++  D   +++  +  +    SV  F +  G   E ISDM L +LR
Sbjct: 91  YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148

Query: 238 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 291
           + +    +E+   + S          C +   +E  V   L    +  +A YP TL ED 
Sbjct: 149 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 207

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            +L +  L   K++  QL   EKK+L   L+
Sbjct: 208 ELL-ETTLPQVKKMTVQLRVTEKKILLGALE 237


>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 31/320 (9%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           P+ WSE EL++L GS    +I ER   I+++Y+ +  V             D     F+ 
Sbjct: 144 PIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPSFARFSL 190

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 151
           + F  A + V S    L  +   +  ALVP    L  Y  +  +       DA  +    
Sbjct: 191 DRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLG 249

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKR 204
               G  +    G + N + L+NYGF  EDN        + ++++  L+  D Q + DKR
Sbjct: 250 TIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSPADGQLFYDKR 309

Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---PVS 261
                +G  ++    +          +   + RL  +  T E  S +    P     P+S
Sbjct: 310 AYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEEEFSTMKMKSPAHSSPPIS 366

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNA 319
              E   L+ L +    +L+ Y  T+ ED  +L    Y L   +  A   +R EK++   
Sbjct: 367 FDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCRY 426

Query: 320 CLQVTADMIMLLPDVTVSPC 339
             Q  AD ++ L  + ++ C
Sbjct: 427 -FQELADKVIPLFSLPLAEC 445


>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
 gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
          Length = 425

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 88/212 (41%), Gaps = 29/212 (13%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELD 68
           S +  Y+R L R+ G         L W E ++   L GSP++   +ER   +    +E+ 
Sbjct: 87  SRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAAIDEIR 139

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
                       +P   P         + AF  + S ++ L     A   ALVP    +L
Sbjct: 140 A----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPWAD-ML 180

Query: 129 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDR 186
            +   C A +     AV L  DR YK GE +    GP+P+S+LLI+YGF     +NP D 
Sbjct: 181 NHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGENPDDE 240

Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
             V   ++  D     K    +R G   V+ F
Sbjct: 241 FEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272


>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
          Length = 1000

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 84/383 (21%), Positives = 155/383 (40%), Gaps = 71/383 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 59
           +YE  QG  S W  Y   L         + ++P+ WSE EL  L  S  + +I +  AE 
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690

Query: 60  I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
           +                    R   EL  +    GS    Y +D+  +    E  ++   
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750

Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
             +   +            +VP+   +L   ++  A +   ++++ +   RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799

Query: 161 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 196
            + GP PNS+LL  YG+V E +        P+D  +VE+ L                  E
Sbjct: 800 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILEQIRGEFEE 857

Query: 197 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 256
           + +++D  ++ +  G+++         + +    D+   L+  ++    ++Q   S   P
Sbjct: 858 EEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIKKLQ---SDTIP 913

Query: 257 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 316
                  + +AVL +  +   AR   YP ++SED+ +L   NL  + R+AT +   EKK+
Sbjct: 914 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARMATVVRLGEKKL 970

Query: 317 LNACLQVTADMIMLLPDVTVSPC 339
           L   +   ++ + +  D    P 
Sbjct: 971 LQEAIATFSEDVEMTMDDESGPA 993


>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
 gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 629

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 65/152 (42%), Gaps = 20/152 (13%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP--TEAF 90
           P L+SE EL YL G+     + E+ E IK +Y  +  V              IP     F
Sbjct: 260 PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV--------------IPEFKSEF 305

Query: 91  TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 150
           +FE F+ AF+   S V  + KV   +   +VPL   L    S  +      DDA      
Sbjct: 306 SFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQEDSEWVFDDATNCFTV 364

Query: 151 RPYKA---GESIVVWCGPQPNSKLLINYGFVD 179
           +  K     + I    G + NSKL +NYGFVD
Sbjct: 365 KALKKIQRNQQIHFSYGSKCNSKLFLNYGFVD 396


>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 269

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)

Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
           Y+ GE I ++ G + N+  L++ GFV  DN    + +  +LN  + Q++ ++ + ++ G 
Sbjct: 74  YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133

Query: 213 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 263
            S   F+V  G     IS ++L + R        LG+      +QS +  L P C   P 
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188

Query: 264 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 313
           +E ++ +++  Y   RL         TL +DEA+L +        L   K +  Q   +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248

Query: 314 KKMLNACLQ 322
           KK+L+  L+
Sbjct: 249 KKILSEALE 257


>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
          Length = 1078

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 62/321 (19%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
           + S L +S+ E   L GSP   +++ +    K  +  L        SLF++  +     A
Sbjct: 130 LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGRL-------KSLFKEQQFS--KCA 180

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVD---- 142
            T++  K A+  V S  ++ +  +L      F  V L P  L Y +  +   AA D    
Sbjct: 181 MTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAP-FLDYFNHAEDAQAAYDFDYD 239

Query: 143 -DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGFVDEDNPYDRLV---VEAALNT-- 195
             A+++V  +P K GE I +  G Q  NS LLI+YGF+D+ +     V   VE  LNT  
Sbjct: 240 ESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIP 299

Query: 196 -EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 252
             DPQ  +K  +  +  + +  +++F      E   IS  L Y                 
Sbjct: 300 ASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKISKYLSYKNF-------------- 345

Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQL 309
                         ++L  L      ++  YP T+ ED A++   T++    ++   + +
Sbjct: 346 --------------SLLPYLKSLIDMKMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSII 391

Query: 310 VRMEK----KMLNACLQVTAD 326
           +R+++    K + A +QV  D
Sbjct: 392 MRLQEKETLKEIGALIQVKID 412


>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
          Length = 539

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + ++ GE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 252
            K  V  R G  +  VF +H   +    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
                 PVS   E  +   L D     L  Y  T+ ED+++L   +L  +  +A +L   
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412

Query: 313 EKKMLNACLQVTA 325
           EK++L   ++  A
Sbjct: 413 EKEILERAVRSAA 425


>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
          Length = 442

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 77/189 (40%), Gaps = 28/189 (14%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 80
           + +PL WS+ E   L GS                YN LD  W M         A  L Q 
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205

Query: 81  YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 136
             + D+P   ++ + FK A+  + +    +   S     R+  + P+         K   
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265

Query: 137 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
              A     +L   + +  GE I +    + N++ L+ YGFV E NP+D + + A++  +
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGIAASIGND 325

Query: 197 DPQYQDKRM 205
            P Y+DK +
Sbjct: 326 QPFYRDKSL 334


>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
 gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
          Length = 403

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/333 (23%), Positives = 140/333 (42%), Gaps = 40/333 (12%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           S W PY+  L R       +  + L +S  ++  L GSP+  E L++  GI ++Y     
Sbjct: 86  STWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY----- 133

Query: 70  VWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 128
           V+F            +P   +FTF+ F+ A   V +    ++  S     AL+P+     
Sbjct: 134 VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALIPM----W 189

Query: 129 AYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
              + C        DD+   V+ +  +P +AG+ + ++ G + N+  L + GFV  +   
Sbjct: 190 DMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNADRLFHNGFVYTEAEE 249

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVSD 243
           D + ++  ++  D  Y  K  +    G L          R  E IS ++  +LR+ +  +
Sbjct: 250 DWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPISPELRIFLRV-FSMN 307

Query: 244 TSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDE 291
           T E++          ++ L  +C     +      +L  +F  R    L  Y  T  EDE
Sbjct: 308 TGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLILGQYKTTKQEDE 367

Query: 292 AMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 322
           A+L+  D  LH   R   +L   E+ +L + L+
Sbjct: 368 ALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398


>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
          Length = 486

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 126/319 (39%), Gaps = 29/319 (9%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           P+ WS+ EL +L GS    +I ER   I+++Y+ +  V             D     F+ 
Sbjct: 152 PIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARFSL 198

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 151
           + F  A + V S    L  +   +  ALVP    L  Y  +  +       DA  +    
Sbjct: 199 DRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLG 257

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQDKR 204
               G  +    G + N + L+NYGF  EDN        + ++++  L+  D Q + DKR
Sbjct: 258 TIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYDKR 317

Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPICPVSP 262
                +G  ++    +          +   + RL  V+   E  S  + S      P+S 
Sbjct: 318 AYLHESGIYTMDA-RLSCSHSDANTREGFSFARL-IVATEDEFSSMKMKSPAHSSPPISF 375

Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 320
             E   L  L D    +L+ Y  T+ ED  +L    Y L   +  A   +R EK++    
Sbjct: 376 DNEIRALQYLRDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVCR-Y 434

Query: 321 LQVTADMIMLLPDVTVSPC 339
            Q  AD ++ L  + ++ C
Sbjct: 435 FQELADKVIQLFSLPLAEC 453


>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
 gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
          Length = 464

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/323 (20%), Positives = 140/323 (43%), Gaps = 33/323 (10%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           G KS W  Y   L R+       +     W+  ++ +L GS T    L   E IK ++  
Sbjct: 156 GDKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLR 210

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP- 125
           L+   F    L  +YP  I    F    ++ A   + S  + LQ +   ++ +LVP    
Sbjct: 211 LEKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADF 261

Query: 126 ----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
               P        K +  + +  + +  D+ Y   + I    G + N +LL+ YGF+ E 
Sbjct: 262 MNHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELLLLYGFILER 321

Query: 182 NPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           NP+D + +  +L+ +D  ++ K+      ++  +++  +F+    +E      +  +LR 
Sbjct: 322 NPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE------LYEFLRF 375

Query: 239 GYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AML 294
             +S+  E+ S  +S        +  +E+ +   +    +  L  Y   +SE++   ++ 
Sbjct: 376 C-ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKVSEEKILNSLN 434

Query: 295 TDYNLHPKKRVATQLVRMEKKML 317
           +++ +   +++A +  + EKK++
Sbjct: 435 SNFLISKNQKMALKQSKCEKKII 457


>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
          Length = 541

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 93/197 (47%), Gaps = 22/197 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E  +GK+SFW PY+  L R+        ++PL +S+ +L +L G   +A  +E  + I R
Sbjct: 94  EWLKGKESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWR 146

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA--- 119
           E  E       A S+ Q+  YD  TE +T+E++  A     S     + +   R      
Sbjct: 147 EEFE------AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHED 198

Query: 120 -LVPLGPPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
             +P+  PL+   +   A +      D +++++      AG  +    GP+ N +LL+ Y
Sbjct: 199 DTMPILFPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEVYNNYGPKANEELLMGY 258

Query: 176 GFVDEDNPYDRLVVEAA 192
           GF    NP+D  +++++
Sbjct: 259 GFTLLQNPFDSFLLKSS 275


>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
           972h-]
 gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
 gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
           pombe]
          Length = 547

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/240 (25%), Positives = 103/240 (42%), Gaps = 31/240 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E  +G +S W  YI  L +          +PL ++E + A+L  +   +   ER    K 
Sbjct: 89  ESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQERLHIWKH 141

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSLARRFALV 121
           EY E       A SL     +  PTE FTF+++   A V    C        + +     
Sbjct: 142 EYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLIYKDSEST 186

Query: 122 PLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           P+  PL+ + + K K  +    D     +VQL+       G  +    GP+ N +LL+ Y
Sbjct: 187 PILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGNEELLMGY 246

Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREKEAISDML 233
           GF   DNP+D + ++ A++ + P    K  + + +   +LS  VF +    +KE    +L
Sbjct: 247 GFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDKEIFQKIL 306


>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
          Length = 489

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/254 (24%), Positives = 103/254 (40%), Gaps = 49/254 (19%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K    SFW PYI  L  +         + L ++  ELA L  SP     L+    I R+
Sbjct: 168 EKNNPNSFWKPYIDVLPEK-------YSTVLYFNSEELAELRPSPVFESSLKLYRSIVRQ 220

Query: 64  Y----NELDTVWFMAGSLFQQYPYDIPT-----EAFTFEIFKQAFVAV---QSCVVHLQK 111
           Y    N++ T+             D+P      + FTF+ ++ A   V   Q+ +V    
Sbjct: 221 YAYFYNKIHTI-------------DLPVLKNLQDIFTFDNYRWAVSTVMTRQNNIVQGTA 267

Query: 112 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCG 164
            +L    A +PL          C      +     L ++R        Y+  E I ++ G
Sbjct: 268 FTLTN--AFIPLW-------DMCNHKHGKITTDFNLELNRGECYALQDYRRDEQIFIFYG 318

Query: 165 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
            +PNS L ++ GFV  DN YD L +   ++  D     K  +  + G   V  F ++ G 
Sbjct: 319 ARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGA 378

Query: 225 EKEAISDMLPYLRL 238
              ++ ++L ++R+
Sbjct: 379 SPISV-ELLAFIRI 391


>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
          Length = 495

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/336 (22%), Positives = 134/336 (39%), Gaps = 25/336 (7%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL- 67
           KSFW PY+  L         + ++ + ++  E+  L GSP   + L     I R+Y+   
Sbjct: 167 KSFWKPYLDALPS-------SYDTVMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFY 219

Query: 68  ----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
                 V     +L   + Y+    A +  + +Q  +  Q  +    K  L    AL+PL
Sbjct: 220 SLLQKNVDPALSNLRANFTYNDYRWAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPL 279

Query: 124 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 183
                    +                 R +  GE + ++ G +  ++  I+ GFVD +N 
Sbjct: 280 WDFCNHQDGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNA 339

Query: 184 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ------VFHVHAGREKEAISDMLPYLR 237
           +D L ++  L+  DP    +  +  +   LS +       F + AG  +     +L +LR
Sbjct: 340 HDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLR 398

Query: 238 LGYVSDTSEMQSVISSLGPICPVSPC-MERAVLDQLADYFKAR----LAGYPATLSEDEA 292
           L  ++  S  + + S          C +E  V D+   + KAR    L  YP T   D  
Sbjct: 399 LFCMTKDSLDRWLQSDNASNLMHEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLK 458

Query: 293 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
           ML + +L   +R+   L   EK++L + ++  A  I
Sbjct: 459 MLEE-DLSSHRRMCVLLRLAEKRILLSAIECAAQRI 493


>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
 gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
          Length = 467

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/240 (25%), Positives = 100/240 (41%), Gaps = 55/240 (22%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYE+ +GK S W  Y++ L RQ          P LWSE E+   L G+     + E    
Sbjct: 82  MYERSKGKGSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLL 134

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRF 118
           +K ++ E          L ++ P + P + FTFE    +++A +S V     ++     +
Sbjct: 135 MKEDWEE------NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGY 184

Query: 119 ALVPLGPPLLAYSSKCKA-----MLAA-----------VDDA---------------VQL 147
            +VPL      ++ K  A     ML A           +DD                +++
Sbjct: 185 GMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEM 241

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 205
           V+ +   AG  I    G   N+ LL  YGF + +NP+D   L ++  L     ++Q KR+
Sbjct: 242 VMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301


>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 709

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 94/213 (44%), Gaps = 24/213 (11%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEILERAEGI----- 60
           +  FW PYIR L      G+  + +PL + E E  L +L G  + A   ER   I     
Sbjct: 119 EHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASRERRLAIWRGNY 172

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVVHLQKVSLARRF 118
           +R Y  L  + F      + Y +D+   A T  I  +AF A  + S +  L+   + R  
Sbjct: 173 ERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIPELKNAEVDRVS 228

Query: 119 ALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
            L+PL   + A + K   K    A  D++ LVV     AGE +    GP+ N +L++NYG
Sbjct: 229 VLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGPRNNEQLMMNYG 285

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
           F   DNP +  VV      + P  Q K    Q 
Sbjct: 286 FCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQH 318


>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
 gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
          Length = 460

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/331 (19%), Positives = 147/331 (44%), Gaps = 41/331 (12%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           + KKSFW PYI  L  +     L       W   EL ++ GS          + +K +Y 
Sbjct: 153 KNKKSFWFPYIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYE 207

Query: 66  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 125
            ++        +FQ+     P++ F ++ ++ +   + S  + LQ+    ++  L+P   
Sbjct: 208 MVNK------EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQE---TKKVVLIPY-I 257

Query: 126 PLLAYSSKCKAMLAA----VDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
            LL ++    + ++     + D+ ++VV  D+     + + +  G + N +LL  YGF+ 
Sbjct: 258 DLLNHNPFSSSFISYRKIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIA 317

Query: 180 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL- 238
           E NPYD +++  +++ +D  +++K+     N K     + +   +  +   +M+ ++++ 
Sbjct: 318 ERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIKIC 374

Query: 239 ---GYVSDTSEMQSVISSLGPICPVSPC----MERAVLDQLADYFKARLAGYPATLSEDE 291
                ++D +   + I +      +  C    +E+++     DY   R       L E+ 
Sbjct: 375 LFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDYENLR----NIMLKENL 430

Query: 292 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
             ++D      ++++ +   +EKK+LN  L+
Sbjct: 431 LHISD-----NQKISIKYNALEKKILNRFLE 456


>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 455

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 74/305 (24%), Positives = 128/305 (41%), Gaps = 20/305 (6%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           + + ++E EL    GS   A   +  + ++ +Y +L         L  Q+    P + FT
Sbjct: 107 ASIFFTEDELQVCAGSSLYALTRQLEQRVRDDYRQLLV------PLLSQHRDLFPLDQFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLV 148
            E +K A  ++ S  +    VS      LV     +L +S    +C A      D + ++
Sbjct: 161 IEDYKWALCSIWSRAMDF-AVSGTTSVRLVAPLADMLNHSPDVKQCHAYDPTSGD-LSIL 218

Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
             + Y+ G+ + ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K  +  
Sbjct: 219 AAKDYQVGDQVFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWA 278

Query: 209 RNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 265
             G  S     +     K+ + +++L YLR+  + D S +  +   L  G    VS   E
Sbjct: 279 LAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGKVSDGNE 334

Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQV 323
             VL  L D   + L G+   L + EA L   DY        A  +   E+++L    + 
Sbjct: 335 MQVLQFLVDSIGSLLEGFGIPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVLTRAKRT 394

Query: 324 TADMI 328
             D++
Sbjct: 395 AEDLL 399


>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 451

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 94/211 (44%), Gaps = 22/211 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  K  G+ S W PYI  L  Q       + + + W+E+EL  +  S    E + +   I
Sbjct: 113 LIHKNLGQDSEWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQI 167

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++++ E+  V       FQ  P+      FT++ F  A   V S     +     +  +L
Sbjct: 168 EKDFLEIKPV-------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSL 213

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +P     L +    +A++ + DD    ++  DR Y  GE +++  G   N+ L++++GF 
Sbjct: 214 IPFAD-FLNHDGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFT 272

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
              N YD++ ++  +   DP    K  + Q+
Sbjct: 273 IPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303


>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 499

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/183 (25%), Positives = 74/183 (40%), Gaps = 12/183 (6%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G+ SFW PYI  L +        +  P  W E ++ +L G+     I E    IKREY 
Sbjct: 119 KGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQANIKREYK 176

Query: 66  ELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-VSLARRFAL 120
           +   V     F     + Q  Y      FT   F+ + +  QS   ++   +  AR    
Sbjct: 177 QARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLLPSAREIDD 236

Query: 121 VPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
             +  PL   ++         D     +  QL+    Y+ G+ +    G + NS+LL+ Y
Sbjct: 237 FSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKTNSELLLAY 296

Query: 176 GFV 178
           GF+
Sbjct: 297 GFI 299


>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 673

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 89/211 (42%), Gaps = 7/211 (3%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G + FW PYIR L  Q G    ++ +P  +   +L +L G+   A   +R E +K +Y 
Sbjct: 95  RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 149

Query: 66  ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
           +  T    AG      Y +D+   A +  I +     V S V     +S  +   L+P+ 
Sbjct: 150 KGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGVFPETDLSEEKLSVLLPI- 208

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
             +  +    K    A  D V  VV     AG+ I    GP+ N +L++NYGF    NP 
Sbjct: 209 IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 268

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
           D  +V        P Y  K    Q    L+V
Sbjct: 269 DHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299


>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
 gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
          Length = 266

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 28/189 (14%)

Query: 4   KKQGK--KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
           K Q K   S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I 
Sbjct: 77  KGQAKLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMIT 131

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
            E+ ++     +   LF +          + E FK  +  V S     + +++     LV
Sbjct: 132 TEFGQVQNALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLV 177

Query: 122 PLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
            +  P+L +     +S  K     + +   +  DR Y   + I +  G   N++L ++YG
Sbjct: 178 MI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYG 235

Query: 177 FVDEDNPYD 185
           F   +NPYD
Sbjct: 236 FAVPENPYD 244


>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
          Length = 271

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 7/133 (5%)

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 150
           E FK +F  + S +V L   S+  + ALVP    +L +S   +  L        +    D
Sbjct: 2   ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQ 208
           RPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K  + +
Sbjct: 59  RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLK 118

Query: 209 RNGKLSVQVFHVH 221
           + G    Q F + 
Sbjct: 119 KYGLSGSQCFPIR 131


>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
          Length = 440

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)

Query: 12  WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           WL  I  L R+R R         G L    +S + WSE EL  L GS      +   E +
Sbjct: 77  WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 136

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
           K E  +L+    +        P  +    + F I + +AF  +++  +V +    L    
Sbjct: 137 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 196

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
           A V       AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 197 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 254

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
           ++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPYLR
Sbjct: 255 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 312

Query: 238 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
           L  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D  
Sbjct: 313 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 372

Query: 293 MLTDYNLHPKKRVATQLVRMEKKML 317
            L + NL  +  +A  +   EK +L
Sbjct: 373 -LKEGNLDSRLAIAVGIREGEKMVL 396


>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
           Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
           Precursor
 gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
           N-methyltransferase [Pisum sativum]
          Length = 489

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)

Query: 12  WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           WL  I  L R+R R         G L    +S + WSE EL  L GS      +   E +
Sbjct: 125 WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 184

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
           K E  +L+    +        P  +    + F I + +AF  +++  +V +    L    
Sbjct: 185 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 244

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
           A V       AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 245 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 302

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
           ++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPYLR
Sbjct: 303 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 360

Query: 238 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
           L  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D  
Sbjct: 361 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 420

Query: 293 MLTDYNLHPKKRVATQLVRMEKKML 317
            L + NL  +  +A  +   EK +L
Sbjct: 421 -LKEGNLDSRLAIAVGIREGEKMVL 444


>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
          Length = 444

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 136/325 (41%), Gaps = 24/325 (7%)

Query: 12  WLPYIRELDRQRGR---------GQL--AVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           WL  I  L R+R R         G L    +S + WSE EL  L GS      +   E +
Sbjct: 81  WLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQGSQLLKTTVSVKEYV 140

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
           K E  +L+    +        P  +    + F I + +AF  +++  +V +    L    
Sbjct: 141 KNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHS 200

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
           A V       AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 201 AGVTTEDH--AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF 258

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
           ++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPYLR
Sbjct: 259 IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLR 316

Query: 238 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 292
           L  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D  
Sbjct: 317 LVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE 376

Query: 293 MLTDYNLHPKKRVATQLVRMEKKML 317
            L + NL  +  +A  +   EK +L
Sbjct: 377 -LKEGNLDSRLAIAVGIREGEKMVL 400


>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1033

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/188 (23%), Positives = 83/188 (44%), Gaps = 24/188 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EK +G  S W  YI  L +        +  P+L  + EL  +   P   E+++  + I
Sbjct: 606 LREKAKGSSSAWASYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSI 658

Query: 61  KREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 117
           +  ++ L +V  +AG+ F++Y +    + + AFT  +F     A    + ++  ++    
Sbjct: 659 RESFSLL-SVDDLAGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYH 717

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           +    +  P+                 V++V  R  K GE +    GP+ N  L + YGF
Sbjct: 718 YQADWMSQPIWG-------------GKVEIVARRDIKKGEELFASFGPRANDNLFLYYGF 764

Query: 178 VDEDNPYD 185
           V +DNP+D
Sbjct: 765 VLKDNPFD 772


>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 340

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)

Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
           LL +SS+ +++       +++V ++  + GE++V+  GP  N  LL++YGFV   NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177

Query: 187 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 229
                         LV +  +++ +DP      ++ + N  G  S Q+  V  G  +   
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235

Query: 230 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
             +L  +R+ +  D  E+  V    + +     P+    ER  +  L       LA +P 
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295

Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
            + ED++ L   ++    R+A Q   ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327


>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
 gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
          Length = 464

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 147/370 (39%), Gaps = 62/370 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +R   +
Sbjct: 110 LYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKKRCRFL 161

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKVSLARR 117
            +  +EL        +  +  P +I   A  F        A  AV S    +  V+    
Sbjct: 162 LQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGVTNKLC 213

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVWCGPQP 167
            A++    PL+   +      A +++ +          ++V  R  + G +I +  GP  
Sbjct: 214 SAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLS 270

Query: 168 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH 219
           N  LL++YGFV  DNP+DR+     L  +    ++ RM+A   R G        S QV  
Sbjct: 271 NDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDR 326

Query: 220 --------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVS 261
                         V  G  +E    +L  LR+ +       E + ++S  + G    VS
Sbjct: 327 LKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVS 386

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
              E  VL  L          +  T+ EDEA L+D +L    R+A Q    +K+++   L
Sbjct: 387 SDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVL 446

Query: 322 QVTADMIMLL 331
           +     +M L
Sbjct: 447 ESLKKRLMDL 456


>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
          Length = 543

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 85/201 (42%), Gaps = 31/201 (15%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
           E     K  +  Y++ L R  G       S L W E+++   L GSP+    LER   + 
Sbjct: 179 EDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASVA 231

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
               E+   +        Q+ +DI        +F        S ++ L+  SL    ALV
Sbjct: 232 AAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLALV 273

Query: 122 PLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           P    +L +   C+A   L      V L  DR Y+ GE +    G +P+S+LLI+YGF  
Sbjct: 274 PWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFAP 332

Query: 180 E--DNPYDRLVVEAALNTEDP 198
              DNP D   +   ++ EDP
Sbjct: 333 AVGDNPDDEYALNLQIDEEDP 353


>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 561

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 121/301 (40%), Gaps = 38/301 (12%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           P+ WS+ EL++L GS    +I ER   I+++Y+ +                D     F+ 
Sbjct: 227 PIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA-------------DPSFSRFSL 273

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVVDR 151
           E F  A + V S    L  +   +  ALVP    L  Y  +  +       D   +    
Sbjct: 274 ERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGFTITSLE 332

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRM 205
              +G  +    G + N + L+NYGF  EDN        + ++V+  L+  D Q     +
Sbjct: 333 SICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMVDFQLDPGDGQ-----L 387

Query: 206 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDTSEMQSVISSLGPIC-- 258
           +  +   L     +    R   + SD  P  R G+     ++ T +  S +    P    
Sbjct: 388 LYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAATEDEFSSMKMRSPAHAS 445

Query: 259 -PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNLHPKKRVATQLVRMEKK 315
            P+S   E A L+ L      +L  YP +L E EA+L   +Y L+  +  A   +R EK+
Sbjct: 446 PPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPLYSNRIQALFFIRGEKQ 505

Query: 316 M 316
           +
Sbjct: 506 V 506


>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
          Length = 350

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)

Query: 153 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 207
           ++ GE +++  G  + N +L ++YGFV+ +    +  D   +   ++  DP + DK  +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195

Query: 208 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 260
           + NG  +   F +  G  +     ML +LRL  +  T           SV   L    PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251

Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
           S   E A+   + D  ++ L+GY  T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285


>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
          Length = 428

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 83/192 (43%), Gaps = 16/192 (8%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           +K  G+ S W P++  L R          +P+    T    L     + E+  R   I+R
Sbjct: 113 QKLLGEASRWWPFVDSLPR-------TFTTPVFLRRTVFESLP-KDLREEVHTRITSIQR 164

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF---- 118
            + +L  +  + G + ++      +  FT+  F  A+ AV +  +  Q  + +  +    
Sbjct: 165 TFLKLKVL--LGGHVEEEPEVQSLSTGFTWNNFVWAWTAVNTRCIFAQGSNSSSLWENDH 222

Query: 119 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            AL P     L +  K     A V +  +++  + + A E + +  GP  N +L ++YGF
Sbjct: 223 CALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISYGPHSNRRLFLDYGF 281

Query: 178 VDEDNPYDRLVV 189
           V  DNP D +VV
Sbjct: 282 VLPDNPNDVVVV 293


>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
           str. Neff]
          Length = 408

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 80/346 (23%), Positives = 131/346 (37%), Gaps = 39/346 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M  + +   SFW PY   L  +       V  P+ W++ ++  L GSP  A +L + +  
Sbjct: 52  MLHELRKPDSFWRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQA 104

Query: 61  KREYNELDTVWFMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVVHLQKV 112
           +  + E          + ++YP  + +  +      +++   F+     + S       +
Sbjct: 105 RDWHTE------HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDL 158

Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNS 169
                  +VP+   +    +         DD  Q     V +PY  GE + +      N 
Sbjct: 159 KDTWEPHMVPMADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNF 218

Query: 170 KLLINYGFVDEDN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 222
           +LL  Y  + EDN         D   +      E  +    R + QR   L+ Q + V  
Sbjct: 219 ELLKTYAMMVEDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV-- 274

Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
            + +E   D++  LRL ++  T       S+     PVS   E  V D +A   K  L+ 
Sbjct: 275 -KSEEFPLDLVQALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQ 331

Query: 283 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 327
           YP T  ED AML  D  L    R+A    R +K  L     V A+M
Sbjct: 332 YPITAQEDAAMLAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377


>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 471

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/360 (21%), Positives = 138/360 (38%), Gaps = 55/360 (15%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEG 59
           +YE  QG  S W PY   L  +         + + W  ++L  L GS   ++I  E A+ 
Sbjct: 104 IYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEADE 156

Query: 60  IKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 97
           + R                         EL  +    GS+   Y +D+  E    +    
Sbjct: 157 MFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEEDD 216

Query: 98  AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 157
            ++  +     L          +VP+   +L   ++  A +   DD + +   RP  AGE
Sbjct: 217 EWIEDRDGKTML---------GMVPMAD-ILNADAEFNAHVNHGDDELTVTALRPIPAGE 266

Query: 158 SIVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALNTEDPQYQDKRMVA 207
            I+ + GP PNS+LL  YG+V   +        P+D  +  V   L   D  ++  +   
Sbjct: 267 EILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLKIGDDVWKQVQEYV 326

Query: 208 QRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISSL----GPICPVSP 262
                  V V    +G  + E     +  +R        ++++V+ ++    G + P   
Sbjct: 327 DPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKAIKKINGDLIPDKR 386

Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
             +      +    +  L+ YP +  EDEA+L   +L  ++R+A  +   EKK+L   L+
Sbjct: 387 KRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLKEALE 446


>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 563

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 125/298 (41%), Gaps = 30/298 (10%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 91
           P+ W+E E+  L GS    ++ ER + I+ +Y          G +   YP   P  +  T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVD 150
            E FK A + V S    L  ++  R  ALVP    L  Y  +  K           +   
Sbjct: 259 LEEFKWARMCVCSRNFGLD-INGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTL 317

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKR 204
                G  +    G + N + L+NYGF  E+N        + + +   L+  DP  Q K 
Sbjct: 318 HRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKA 377

Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVS 261
              + +G    +V  +  G + +A+   L  LR+  V+D +EM +  +  ++  +  P+S
Sbjct: 378 RFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLS 434

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 317
              E A +++L       L  YP TL ED A L +  L P   +R A   V  EK +L
Sbjct: 435 VRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492


>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
 gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
          Length = 455

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/305 (23%), Positives = 128/305 (41%), Gaps = 20/305 (6%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           + + +++ EL    GS   A   +  + +  +Y +L        +L  Q+    P + FT
Sbjct: 107 ASIFFTDDELQVCKGSSLYALTPQLEQRVHDDYRQLLV------ALLSQHRDLFPLDQFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLV 148
            E +K A  ++ S  +       A    + PL   +L +S    +C A      D + ++
Sbjct: 161 IEDYKWALCSIWSRAMDFAVSETASVRLVAPLAD-MLNHSPDVKQCHAYDPTSGD-LSIL 218

Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
             + Y+ G+ I ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K  +  
Sbjct: 219 AAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWA 278

Query: 209 RNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCME 265
             G  S     +     K+ + +++L YLR+  + D S +  +   L  G    V+   E
Sbjct: 279 LAGLDSTCTIPLTV---KDPLPNNVLRYLRIQRL-DESNITDITLRLVNGTDGKVNDGNE 334

Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQV 323
             VL  L D   + L G+   L + EA L   DY        A  +   E+++L    + 
Sbjct: 335 IQVLQFLVDSIGSLLEGFGIPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVLTRAKKT 394

Query: 324 TADMI 328
             D++
Sbjct: 395 AEDLL 399


>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
 gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
          Length = 464

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 90/213 (42%), Gaps = 30/213 (14%)

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
           +++V  R  + G +I +  GP  N  LL++YGFV  DNP+DR+     L  +    ++ R
Sbjct: 248 LKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENAR 303

Query: 205 MVA--QRNGK------LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVS 242
           M+A   R G        S QV                V  G  +E    +L  LR+ +  
Sbjct: 304 MIAGLSRTGSPPFSSPASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAE 363

Query: 243 DTS--EMQSVIS--SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 298
                E + ++S  + G    VS   E  VL  L          +  T+ EDEA L+D +
Sbjct: 364 SQEPLERRELVSLQAWGVESMVSSDNEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKS 423

Query: 299 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
           L    R+A Q    +K+++   L+     +M L
Sbjct: 424 LAETSRIAVQFRLTKKRLVVRVLESLKKRLMDL 456


>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
 gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
          Length = 604

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 103/217 (47%), Gaps = 35/217 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EK +GK SFW PYIR L         +  +P+ ++E+EL  L+ S     + E+A  +
Sbjct: 244 LLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSPS-----LQEKARDL 291

Query: 61  KRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS--- 113
           K+E    +N+L+   F+   L +          FTF+ F+ A+  +++  ++ +      
Sbjct: 292 KKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVLKTRTLYQEDCRSPY 343

Query: 114 LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQP 167
           L+ +      LVP+   L+ +S   KA           ++ V  PY+  + + +  G + 
Sbjct: 344 LSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPYRKYDQVFISYGFEE 402

Query: 168 NSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 203
           N++L++ +GF   +NP D + +  +   E  PQ  D+
Sbjct: 403 NTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439


>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Vitis
           vinifera]
          Length = 480

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/199 (24%), Positives = 90/199 (45%), Gaps = 25/199 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E K G+ S W PYI  L  Q G     + S + WSE EL  +  S    E + +   I++
Sbjct: 123 EWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMIQQSSVYQETINQKAQIQK 177

Query: 63  EYNELDTVW-FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC-VVHLQKVSLARRFAL 120
           ++  +  V    + +LF+    DI  + F           + +C +V  +     +  +L
Sbjct: 178 DFLAIKPVLHHFSENLFK----DISLKEF-----------MHACALVGSRAWGSTKGLSL 222

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +P     + +     ++L   +D    +++ DR Y  GE +++  G  PN+ LL+++GF 
Sbjct: 223 IPFA-DFVNHDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFT 281

Query: 179 DEDNPYDRLVVEAALNTED 197
              N YD++ ++  +   D
Sbjct: 282 LPYNIYDQVQIQVNIPHHD 300


>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
 gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
          Length = 688

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           +K Q + S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I  
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           E+ ++     +   LF +          + E F   +  V S     + +++     LV 
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603

Query: 123 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           +  P+L + +   A  A +      +   +  DR     + I + CG   N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661

Query: 178 VDEDNPYDRLV 188
              +N YD ++
Sbjct: 662 TVPENRYDEVM 672


>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/357 (20%), Positives = 139/357 (38%), Gaps = 55/357 (15%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++ + + SFW PYI  L +           P  +   + A L GSPT   ++ + +  
Sbjct: 123 LLQENRRQNSFWKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTF 176

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 118
           + EY+ L      A   FQ+Y Y       T+  F           V  + ++++R F  
Sbjct: 177 QEEYDNLKE----AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPV 214

Query: 119 ---------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQP 167
                     LVPL    + + +          DA    +   R  + GE +    G   
Sbjct: 215 YIGENEQQQLLVPLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQWS 273

Query: 168 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGR 224
           N    +NYGF    NP ++   +  L+  D  ++ K  +   N   G   V        R
Sbjct: 274 NKYFFMNYGFASLTNPMNQFDFDICLDRNDRMFKMKVELTGGNICWGNRLVNETDHDTFR 333

Query: 225 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYFKA 278
           +  A    + + ++  + D  +++  + +     P    +P    +E+A    L D   +
Sbjct: 334 QSLAT---VRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLLVS 390

Query: 279 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 335
            LA + +T+ +D+  L D +    +R    L   EK+++   ++V  DM++ + D T
Sbjct: 391 ELANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446


>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 345

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           DD  + V  + + AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 42  DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 253
            K  V  R G  +  VF +H   E    + +L +LR+  +++    + +I         +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160

Query: 254 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 308
           LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L   +L     +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216

Query: 309 LVRMEKKMLNACLQVTA 325
           L   EK++L   ++  A
Sbjct: 217 LRLGEKEILEKAVKSAA 233


>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
          Length = 390

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 14/204 (6%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-D 68
           SFW PYI  L        ++   PL W   +L  L GS    E +   + + R+Y  L +
Sbjct: 118 SFWRPYIACL-------PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHN 170

Query: 69  TVWFMAGSLFQQYPY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVP 122
            +   A      +P    +  EAFTFE ++ A   V +    + +      +    AL+P
Sbjct: 171 KLSVRANPSCSCFPLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIP 230

Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
           L   +   +        +  + ++ V   P K G  I +W G + N + L++ GF    +
Sbjct: 231 LWDMINHANHPMSTQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGFFFAGH 290

Query: 183 PYDRLVVEAALNTEDPQYQDKRMV 206
             D + V  +L+  D  Y+ K ++
Sbjct: 291 ANDYVNVPFSLDETDSLYKIKALL 314


>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
 gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
          Length = 481

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/189 (28%), Positives = 77/189 (40%), Gaps = 18/189 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E  +G+ SFW PYI  L +       A+  P  W E ++AYL G+     I E    +K 
Sbjct: 112 EYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQANVKS 169

Query: 63  EYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHLQKVSL 114
           E+ +    L    F A   + Q  Y      FT   F+ + V    A Q     L     
Sbjct: 170 EFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELLPPGCQ 229

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQPNS 169
              F+++    PL   ++       A D A      QLV    Y+ GE +    G + NS
Sbjct: 230 LDDFSILQ---PLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYGLKTNS 286

Query: 170 KLLINYGFV 178
           +LL+ YGF+
Sbjct: 287 ELLLAYGFI 295


>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
          Length = 509

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/210 (21%), Positives = 78/210 (37%), Gaps = 36/210 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E+  G +S W PYI+ L  +       + + +LW   EL  +  S    E +E  E  
Sbjct: 110 IMEQHLGNESGWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQA 164

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K+E+  L         LF +        A   +      V     +   QK         
Sbjct: 165 KKEFLALKPALDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK--------- 215

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
                                 D  +++ DR Y  GE +++  G   N+ L +N+GF   
Sbjct: 216 ----------------------DVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFTLA 253

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
            N YD+ ++   +  +DP Y+ K  + Q++
Sbjct: 254 RNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283


>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 524

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 86/197 (43%), Gaps = 23/197 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEG 59
           +YE+ +G+ SFW PYI  L ++    +L    P+LWS+     +L GSP   ++ ER   
Sbjct: 161 LYERARGETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLA 216

Query: 60  IKREYNELDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL- 114
           I RE  E   +   AG L     +  P  I   A     +    +  +S  ++LQ   L 
Sbjct: 217 ICREDYEA-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLR 272

Query: 115 -----ARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQ 166
                    ALVP    L   SS  +      D       L   R Y  GE +    GP 
Sbjct: 273 PGSFAEDTIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQVFDSYGPS 332

Query: 167 PN-SKLLINYGFVDEDN 182
            + S+LL++YGFVDE+N
Sbjct: 333 CSPSRLLLDYGFVDEEN 349


>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
           magnipapillata]
          Length = 819

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 130/328 (39%), Gaps = 49/328 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEILERAEG 59
           M+E  Q K SFW  Y+  +      G      PL W E E      G P   +I+   E 
Sbjct: 191 MWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDIIVDREN 244

Query: 60  IKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKV 112
           I+ EY E   ++      + GSL          E ++ E FK+  AFV   S     +  
Sbjct: 245 IETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFTEDEESP 294

Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
           S+      VP+   +L + S   A L      +Q++  R  K GE +    G   N++LL
Sbjct: 295 SM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKLGNTELL 347

Query: 173 INYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
             YG+V+   N YD L++           +     +DP    K  +  R G   V  F +
Sbjct: 348 QMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAEVDAFFM 407

Query: 221 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQLADYFK 277
                     D++ +L++ + SD  E++ ++ +     P S     + +  L +  +  K
Sbjct: 408 FDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSKKTE--K 464

Query: 278 ARLAGYPATLSEDEAMLTDYNLHPKKRV 305
             L      ++ED+  +   N + +K V
Sbjct: 465 NSLGMTVIDITEDDTEMDIENFNKRKNV 492


>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 699

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 7/199 (3%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G + FW PYIR L  Q G    ++ +P  +   +L +L G+   A   +R E +K +Y 
Sbjct: 121 RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 175

Query: 66  ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
           +  T    AG      Y +D+   A +  I +     V S V     +S  +   L+P+ 
Sbjct: 176 KGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGVFPETDLSEEKLSVLLPI- 234

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
             +  +    K    A  D +  VV     AG+ I    GP+ N +L++NYGF    NP 
Sbjct: 235 IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 294

Query: 185 DRLVVEAALNTEDPQYQDK 203
           D  +V        P Y  K
Sbjct: 295 DHRIVSLRAPPGSPLYMAK 313


>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 677

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 88/195 (45%), Gaps = 27/195 (13%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           QG+  FW PYIR L +      L++ +PL +   +L +L G+       +R E +K  Y 
Sbjct: 105 QGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRMELLKEAYE 159

Query: 66  ELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
             + V  +  + FQ   +Y +D        I + AF+ ++  +AF  +    +    VS+
Sbjct: 160 --NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID---LPEDGVSV 214

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
                L+P    L+ +    K    A    V  +V     AG+ I    GP+ N +L++N
Sbjct: 215 -----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPRNNEQLMMN 268

Query: 175 YGFVDEDNPYDRLVV 189
           YGF   DNP D  +V
Sbjct: 269 YGFCLPDNPCDYRIV 283


>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
           1015]
          Length = 668

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 7/199 (3%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G + FW PYIR L  Q G    ++ +P  +   +L +L G+   A   +R E +K +Y 
Sbjct: 95  RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 149

Query: 66  ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
           +  T    AG      Y +D+   A +  I +     V S V     +S  +   L+P+ 
Sbjct: 150 KGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGVFPETDLSEEKLSVLLPI- 208

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
             +  +    K    A  D +  VV     AG+ I    GP+ N +L++NYGF    NP 
Sbjct: 209 IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 268

Query: 185 DRLVVEAALNTEDPQYQDK 203
           D  +V        P Y  K
Sbjct: 269 DHRIVSLRAPPGSPLYMAK 287


>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
          Length = 625

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 85/199 (42%), Gaps = 7/199 (3%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G + FW PYIR L  Q G    ++ +P  +   +L +L G+   A   +R E +K +Y 
Sbjct: 47  RGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGTSLLAAREKRLEVLKEKYE 101

Query: 66  ELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
           +  T    AG      Y +D+   A +  I +     V S V     +S  +   L+P+ 
Sbjct: 102 KGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGVFPETDLSEEKLSVLLPI- 160

Query: 125 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 184
             +  +    K    A  D +  VV     AG+ I    GP+ N +L++NYGF    NP 
Sbjct: 161 IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPRNNEQLMMNYGFCIPGNPC 220

Query: 185 DRLVVEAALNTEDPQYQDK 203
           D  +V        P Y  K
Sbjct: 221 DHRIVSLRAPPGSPLYMAK 239


>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
           206040]
          Length = 481

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/185 (25%), Positives = 83/185 (44%), Gaps = 14/185 (7%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G++SFW PYI+ L +       A+  P  W E E   L G+  +  + +  + +KRE+ 
Sbjct: 98  RGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIRDDLKREFR 155

Query: 66  ELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLAR 116
           E   +   +          L  +  Y+     F+   F+ + V  ++    L + VS+  
Sbjct: 156 EAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQALPEDVSVDD 215

Query: 117 RFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
              L+PL   G   +A   + +   A    A QL V R ++ G+ I     P+ N++LL+
Sbjct: 216 FSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSPKTNAELLL 275

Query: 174 NYGFV 178
            YGF+
Sbjct: 276 GYGFM 280


>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 474

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 84/351 (23%), Positives = 140/351 (39%), Gaps = 51/351 (14%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY 64
           Q + S W PY+  L  +       ++S + WSE+EL  L  S    +I    AE +  E+
Sbjct: 95  QQEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEH 147

Query: 65  -------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-----CVVHLQKV 112
                  N    +     S+   Y +DIP         K+     +S      +V   + 
Sbjct: 148 ISPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPESPEDGDDLVSDNEE 199

Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
                 +++PL   L A +    A L   ++ +++   +P   GE I+   G  P S LL
Sbjct: 200 EENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDYGQLPRSDLL 259

Query: 173 INYGFV-DEDNPYDRLVVE-------AALNTEDP-------------QYQDKRMVAQRNG 211
             YG++ D+   YD  V E       A+L+TE P             + + +  +AQR G
Sbjct: 260 RRYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLEQRVELAQREG 317

Query: 212 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 271
                    H G +  +I D L  L    + D   + ++ +S   +   S      V   
Sbjct: 318 VYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRSKLATSLVGQI 377

Query: 272 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
           L    ++R   Y  T+  D+A+L   NL  +KR+A ++   EK +L   +Q
Sbjct: 378 LTKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAIQ 428


>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 739

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 95/219 (43%), Gaps = 29/219 (13%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEILERAEGIKREYN 65
           ++ FW PYI+ L      G   + +PLL+ E   +LA+L  +   A    R +  K  Y 
Sbjct: 149 EEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRERRLQIWKVNYE 203

Query: 66  ELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQSCVVHLQKVSLA 115
           +       A S+ Q         Y +D+   A T  I  +AF A  + S +  LQ     
Sbjct: 204 K-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLASVIPKLQTAEEG 255

Query: 116 RRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
            R ++  L P + A + K   K    A  D++ LVV    +AG+ +    GP+ N +L++
Sbjct: 256 DRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNNYGPRNNEQLMM 313

Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
           NYGF   DNP +  VV      + P  Q K    Q   K
Sbjct: 314 NYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352


>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
          Length = 347

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 4/107 (3%)

Query: 88  EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 144
           E   F+ F+ A++ V S CV + L   +    F LVPL   +  +SS C       D A 
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 191
            +L + RP K GE IV   G    + L   YGF++  NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERIDLTA 227


>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
          Length = 915

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 78/352 (22%), Positives = 149/352 (42%), Gaps = 51/352 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M+E  +G +S W PY+  L         + E+P+ WS  EL  L  S T+ ++  +A+  
Sbjct: 550 MFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV-GKADAE 601

Query: 61  KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 107
           +  + ++  V      +F   Q Y  D   +          ++ F+   +     +    
Sbjct: 602 EMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEEDEEDEEE 661

Query: 108 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
            ++         +VP+   +L   ++  A +   DDA+ +   R  KAGE I+ + GP P
Sbjct: 662 WVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEILNYYGPHP 720

Query: 168 NSKLLINYGFVD-EDNPYD------RLV---VEAALNTEDPQYQDKRMVAQRNGKLSVQV 217
           NS+LL  YG+V  + + YD      +LV   + A+L   + Q  + R     +      V
Sbjct: 721 NSELLRRYGYVTPKHSRYDVVELPWKLVENSLAASLGLSEQQLDNAREYLDMDEIEDTFV 780

Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSE-----MQSVISSLGPICPVSPCMER------ 266
                 RE +  +    +      SD  E     ++ ++ ++  + P S   +R      
Sbjct: 781 L----DRESDEPNPDGTFTGSARFSDIPEDLRDQLKLLLKAVRKVDPSSVADKRKRDEIQ 836

Query: 267 -AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
            +VL +  D   ++   YP T+ EDE +L+  ++  ++R A  +   EK+++
Sbjct: 837 HSVLIKALDALASQ---YPTTIPEDERILSGSDISERQRAAVTVRLGEKRLI 885


>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
 gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
          Length = 490

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/349 (22%), Positives = 144/349 (41%), Gaps = 46/349 (13%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  +K  + S W PY+  L         +  S + WSE ELA L G+   +  L     +
Sbjct: 128 LIREKLKEDSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYL 180

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +RE+ +++    +     Q +P  +     T + F  AF  ++S      +    +   L
Sbjct: 181 RREFLKVEEEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR---GQNLVL 230

Query: 121 VPLG-----------------PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 159
           +PL                  P +      Y  K  A L + D    L      KAGE +
Sbjct: 231 IPLADLCNFLHTWLLDQVNHSPDITIEDGVYEIKG-AGLFSRDLIFSLRSPISLKAGEQV 289

Query: 160 VVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           ++       N++L ++YGF++  +  +   +   ++  DP + DK  +A+ NG   +  F
Sbjct: 290 LIQYNLNLSNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIADF 349

Query: 219 H-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLAD 274
             V        +   L  + LG  +D+  ++S+  +   G +  PVS   E  +   + D
Sbjct: 350 DIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVVRD 408

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 323
             K+ L+GY  T+ EDE  L    L+P+  +A  +   EKK+L    ++
Sbjct: 409 ACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456


>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 91/205 (44%), Gaps = 31/205 (15%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E K G+ S W PYI  L  Q G     + S + WSE EL  +  S    E + +   I++
Sbjct: 124 EWKMGQDSEWAPYINRLP-QPGE----MHSTIFWSEGELKMIQQSSVYQETINQKAQIQK 178

Query: 63  EYNELDTVWF-MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC-VVHLQKVSLARRFAL 120
           ++  +  V    + +LF+    DI  + F           + +C +V  +     +  +L
Sbjct: 179 DFLAIKPVLHHFSENLFK----DISLKEF-----------MHACALVGSRAWGSTKGLSL 223

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--------AVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
           +P     + +     ++L   +D         ++++ DR Y  GE +++  G  PN+ LL
Sbjct: 224 IPFAD-FVNHDGFSDSVLLGDEDKQLSESSSTLEVIADRNYAPGEQVLIRYGKFPNATLL 282

Query: 173 INYGFVDEDNPYDRLVVEAALNTED 197
           +++GF    N YD++ ++  +   D
Sbjct: 283 LDFGFTLPYNIYDQVQIQVNIPHHD 307


>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
           98AG31]
          Length = 509

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VPL   L A S    A L      + +   +  + GE I       PN+ LL  YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319

Query: 179 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 220
           D++NP+D          RL  E +L+  DPQ Q+     +   K +++V  +        
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378

Query: 221 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
            + RE KE + D L  +    +S   E Q+   S G +    P M   +          R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435

Query: 280 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
           L  Y  T+  D+ +L D +L  ++++ +  VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470


>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 463

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/200 (23%), Positives = 83/200 (41%), Gaps = 17/200 (8%)

Query: 5   KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 64
           ++ KKSFW  Y+  L        +   +P+ +   +L  L GSP     L+    I R+Y
Sbjct: 174 RKDKKSFWHSYLTTL-------PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226

Query: 65  NELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFAL 120
                ++ ++       P   I  + FT+E ++ A    ++ Q+ V      S     AL
Sbjct: 227 AYFKKLFQLSND-----PASVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSAL 280

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           +PL       S +         +      D  Y A E + ++ G + N+  L++ GFV  
Sbjct: 281 IPLWDMFNHRSGRLSTDFVKSSNVCVCYADGDYAADEQVYIFYGVRTNADFLVHNGFVYP 340

Query: 181 DNPYDRLVVEAALNTEDPQY 200
           DN +D + +   ++  DP Y
Sbjct: 341 DNEHDAVKIRLGVSRSDPLY 360


>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
 gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
          Length = 459

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 91/218 (41%), Gaps = 35/218 (16%)

Query: 5   KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 64
           K   KSFW PY+  L ++         + + +SE EL  L  S  K   + R +GI+R Y
Sbjct: 123 KLNDKSFWYPYVSVLPKE-------FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHY 175

Query: 65  NELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL- 120
           N   T     G + +  P    T   + +T E+F  A     SCV        +R F+L 
Sbjct: 176 NSTFTRLSNRG-IAEFSPTSTQTLQQKGYTLELFTWAL----SCV-------WSRAFSLS 223

Query: 121 ------VPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSK 170
                 VPL     A   SK K      D  +        + GE I    G   P  +S+
Sbjct: 224 DSDGGMVPLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSSQ 283

Query: 171 LLINYGFV-DEDNPYDRLVVEAAL-NTEDPQYQDKRMV 206
           +L++YGFV D   P D + +   + + ++P  Q K+ +
Sbjct: 284 MLMDYGFVFDHGTPSDNVAISVPIFHPDEPNIQVKQSI 321


>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
 gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
          Length = 496

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 80/198 (40%), Gaps = 32/198 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
           +YE  QG+ S W PY+  L           ++P+ WS TEL+ L  S   A++  RAE  
Sbjct: 114 IYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-GRAEAD 165

Query: 60  ---------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIFKQAFV 100
                    + R + E   V+F  G           L  +    I   AF  E       
Sbjct: 166 RMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLENDDSDND 222

Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 160
                   ++         +VP+   +L   ++  A +   DDA+     RP +AG+ I+
Sbjct: 223 EADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRAGDEIL 281

Query: 161 VWCGPQPNSKLLINYGFV 178
            + GP PN +LL  YG+V
Sbjct: 282 NYYGPLPNGELLRRYGYV 299


>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 797

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           R AL+P+   L   ++ C    +   +A  +  DR Y+AGE +    G   N  LL  YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650

Query: 177 FVDEDNPYDRLVVEAAL 193
           FV  DNP+D+L ++  L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667


>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
          Length = 353

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 93/203 (45%), Gaps = 26/203 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+ + K SFW  YI+ L +       +  +P+ ++  E+ +L     K ++ +  E I  
Sbjct: 116 ERSRQKDSFWYNYIKVLPK-------SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINT 167

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR- 116
            Y EL  ++ +  S F  +        F +  F+ A+  V +  V++ +     +S+ R 
Sbjct: 168 AYRELKNLFSILESTFVSFK-----GIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERD 222

Query: 117 RFALVPLGPPLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
            +AL P    LL +++  + KA    V    ++      K  + + ++ GP  N KL I 
Sbjct: 223 HYALAPF-LDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMFIYYGPHDNVKLFIE 281

Query: 175 YGFVDEDNPYDRLVVEAALNTED 197
           YGFV   N ++  VVE  L+ ED
Sbjct: 282 YGFVLPQNQHN--VVE--LDFED 300


>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
 gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
          Length = 540

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 77/192 (40%), Gaps = 19/192 (9%)

Query: 1   MYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 51
           MYE K    +S W  Y   L        L     + W++ EL++LT        G     
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197

Query: 52  EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 107
            +  R      E+NE D +    GS+    F   P  I   +F  EI             
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256

Query: 108 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
             ++       +++PL   L A + KC A L    D+++++  +P K GE +    G  P
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTYGELP 316

Query: 168 NSKLLINYGFVD 179
           NS+LL  YG+V+
Sbjct: 317 NSELLRKYGYVE 328


>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
          Length = 470

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 21/186 (11%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + +SE EL    G+       +  + IK +Y +L      A  LF Q+P   P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160

Query: 92  FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 138
            E           +K A   V S  +       +    L P    +L +SS   +C A  
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219

Query: 139 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 198
               D + +   + Y+ G+ + ++ GP PN++LL  YGFV  DNP D   +    +   P
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLTTHPMAP 278

Query: 199 QYQDKR 204
            Y+ K+
Sbjct: 279 FYEQKQ 284


>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 567

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 74/345 (21%), Positives = 129/345 (37%), Gaps = 45/345 (13%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           S W PYI+ L +Q         +   W   E     GSP     +    G  R+Y     
Sbjct: 164 SEWGPYIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY----- 211

Query: 70  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLG 124
             ++   + +     +P  +FT++ F  A   VQS     Q    A        AL+P  
Sbjct: 212 -CYLYSMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFW 266

Query: 125 P--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
                 +  SK  +      + +     + +K GE + ++ GP+ N++LL++ GF  + N
Sbjct: 267 DFCNHSSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTN 326

Query: 183 PYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 240
            +D    E  L     + ++    ++ +R  +  V V         E   +++P+ R+ Y
Sbjct: 327 LHDSYPFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-Y 385

Query: 241 VSDTSEMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGY 283
                E +++                  L P+    ++   E      L    K +LA Y
Sbjct: 386 ALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASY 445

Query: 284 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
           P TL EDE  L   N    +R        EKK+L+  ++    +I
Sbjct: 446 PTTLEEDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489


>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
 gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
          Length = 447

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 99/242 (40%), Gaps = 57/242 (23%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEG 59
           MYE+ +GK S W  Y++ L  Q          P LWSE E+   L G+     + E    
Sbjct: 58  MYERSKGKGSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLL 110

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRF 118
           +K ++ E          L ++ P + P + FTFE    +++A +S V     ++     +
Sbjct: 111 MKEDWEE------NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGY 160

Query: 119 ALVPLGPPLLAYSSKCKA-----MLAA-------------VDDA---------------V 145
            +VPL      ++ K  A     ML A             +DD                +
Sbjct: 161 GMVPLAD---LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVL 217

Query: 146 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDK 203
           ++V+ +   AG  I    G   N+ LL  YGF + +NP+D   L ++  L     ++Q K
Sbjct: 218 EMVMVKDVAAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKK 277

Query: 204 RM 205
           R+
Sbjct: 278 RV 279


>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 476

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 82/210 (39%), Gaps = 44/210 (20%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
           +YE   G  S W PY   L  +         + + W+E ELA L  S    ++       
Sbjct: 91  LYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKVGKESADE 143

Query: 55  ---------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFT 91
                                ERA+   +E   L+ +  M   GSL   Y +D+     T
Sbjct: 144 AFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKMGSLIMAYAFDVEPATPT 203

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
            E+ ++   A +     L K        +VPL   L A + +C A L    D +++   +
Sbjct: 204 KEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCLEMKALK 255

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDED 181
           P +AGE I    GP P S LL  YG+V ++
Sbjct: 256 PIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285


>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 69/342 (20%), Positives = 130/342 (38%), Gaps = 32/342 (9%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++ + K SFW PYI  L +           P  +   + A L GSPT   ++ + +  
Sbjct: 124 LLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVINQRKVF 177

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           K EY  L      A   FQ+Y Y       T++ F +  +   S    +Q     ++  L
Sbjct: 178 KEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEKEQQQLL 226

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFV 178
           VPL    + + +          DA    +   R  + GE +    G   N    +NYGF 
Sbjct: 227 VPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFA 285

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAISDMLPY 235
              NP ++  ++  LN  D  +  K  + + N   G   V        R+  A    + +
Sbjct: 286 SLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLAT---VRF 342

Query: 236 LRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGYPATLSE 289
            ++  + D  +++  + +     P       +  +E+A    L       L  + +T+ +
Sbjct: 343 TQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNFASTIED 402

Query: 290 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 331
           DE  L D      ++    L   EK+++   + +   M+ ++
Sbjct: 403 DERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444


>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
           23]
          Length = 462

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 112/271 (41%), Gaps = 24/271 (8%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + +++ EL    G+       +  + I+ +Y +L         +  QYP   P + FT
Sbjct: 107 SSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRDLVV------RVLVQYPDLFPLDKFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 148
              +K A  AV S  +  Q    +    L P    +L +SS   +C    A+  D + ++
Sbjct: 161 LHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD-MLNHSSESKQCHVYDASSGD-LSVL 218

Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
             + Y+AG+ + +  G  PN +LL  YGF+   NP D   +  A +   P ++ K+ +  
Sbjct: 219 AGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWA 278

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPC 263
             G  S     +        ++D LP     YLR+  + D S++ S+         +S  
Sbjct: 279 LAGLDSTCTISL-------TLTDPLPKNVIRYLRIQRL-DESDLASIALGQAADEKISNS 330

Query: 264 MERAVLDQLADYFKARLAGYPATLSEDEAML 294
            E  VL  L +   + L  +   L + E  L
Sbjct: 331 NEVQVLQSLVESIASLLGSFGTRLEKLEEQL 361


>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 316

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
           KKS W PY+R L R        + + + W   EL  +  S    E +ER E   +E++ +
Sbjct: 38  KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
                    LF +            E F  A     S +V  +    +R  +L+P     
Sbjct: 93  KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137

Query: 128 LAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
           L +     ++L      D  +++ DR Y  GE ++V  G   N+ L +N+GF    N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197

Query: 186 R 186
           +
Sbjct: 198 Q 198


>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 648

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 85/361 (23%), Positives = 138/361 (38%), Gaps = 51/361 (14%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA-EILERAEGIKREYNELD 68
           S WLPY+  L +           PL+W+   +  L G  +    ++ER E I+   N   
Sbjct: 146 SHWLPYLASLPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTK 195

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----------- 117
            V    G  F       PT A T +  + A  ++ S      K SL  +           
Sbjct: 196 VVENACGHYF-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWI 248

Query: 118 -FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
             + + L P L  ++ K   +      +  V  +       G  ++   GP+ N  LL N
Sbjct: 249 GLSEICLFPILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSN 308

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ--RNGKLSVQVFHVHAGREKEAISDM 232
           YGFV E+NP D   V   L  EDP Y  K+ V +      LS  VF      + +  ++M
Sbjct: 309 YGFVIENNPEDYFKVFLGLQQEDPLYTAKKAVLEVVSENDLSHLVF-----LKDDLPTNM 363

Query: 233 LPYLRLGYVSDTSEMQSVISSLGPICPVSP-----CMERAV--LDQLADYFKARLAGYPA 285
           +   R+  V+++ E+  + + L     VSP     C+   +  L  L +   ++L    +
Sbjct: 364 ISISRV-LVANSWELAILEARLTSSTKVSPHSTPICLHNEILALSTLYNLLNSKLKVLKS 422

Query: 286 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 345
           T    E    ++   P    A +LV + +    + L+        L  +T S C   Y  
Sbjct: 423 TTPNYEQDTIEF---PNDDEARKLVHIYRAGQQSILEHAIQNTRTLASMT-SSCHKQYET 478

Query: 346 L 346
           L
Sbjct: 479 L 479


>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
 gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 95/213 (44%), Gaps = 35/213 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++ K+QG +S   P+I +L    G        PL WS+ +LA L      A++ E+    
Sbjct: 146 LWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQVKEQ---- 194

Query: 61  KREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 102
           +RE+  L DT+                 W+  G + +   +  P    T     +    V
Sbjct: 195 QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSDRLRLAGLV 253

Query: 103 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIV 160
            + VV L + SL +++A+ PL   L  ++S  ++ ++     D+  +V  R +K GE + 
Sbjct: 254 AALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRDFKKGEQVF 310

Query: 161 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           +  G Q N  L+  YGF + DNP D  V+   L
Sbjct: 311 ITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343


>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
           heterostrophus C5]
          Length = 476

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 78/327 (23%), Positives = 130/327 (39%), Gaps = 54/327 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
           +YE   G  S W PY   L           ++ + W+E EL  L  S    +I       
Sbjct: 91  LYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANE 143

Query: 55  ---------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFT 91
                                ERA+ + +E    + +  M   GSL   Y +D+   A +
Sbjct: 144 VFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AIS 202

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
            +   +   A +     L K        +VPL   L A + +C A L    D +++   +
Sbjct: 203 DKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALK 255

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDKRMV 206
           P +AGE I    GP P S LL  YG++ E+   YD + + A L ++    D  + +KR+ 
Sbjct: 256 PIQAGEEIFNDYGPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEKRIE 315

Query: 207 AQRNGKLSVQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 263
                ++    + + A      +E++S  L  L    +  + E +  + S G + P +  
Sbjct: 316 YLDEQEIVDTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PKAEK 373

Query: 264 MERAVLDQLADYFKARLAGYPATLSED 290
           M     + L    +AR+A YP TL +D
Sbjct: 374 MTGKAAEILYKIVQARIAQYPTTLEQD 400


>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
 gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
          Length = 394

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 83/186 (44%), Gaps = 20/186 (10%)

Query: 12  WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
           W P +R L  +         S L W+  +L  + G      + E    +K +Y+ L    
Sbjct: 77  WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLANRLREYRRSVKVQYDAL---- 125

Query: 72  FMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQK-VSLARRFALVP----LGP 125
           F A  L +Q P   P  AF  +  F++A+    S  + +Q   SL  R  +VP       
Sbjct: 126 FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKVQDPDSLQIREVIVPGVFLCNH 183

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
            L A+S +  ++      A +L + R    GE+I +  G   N+ LL+ YGF  E+NPYD
Sbjct: 184 SLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYGRLDNADLLMFYGFSLENNPYD 242

Query: 186 RLVVEA 191
           R+ + +
Sbjct: 243 RVSLHS 248


>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
          Length = 1313

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)

Query: 119  ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            A+VP+   L A      A L    D +Q++  +P   GE I    G  PNS LL  YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119

Query: 179  DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
            D         +P D + + A    E  +   YQD+       G     V  V       +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174

Query: 229  ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 288
            + D +  L    + +  + +   S      P  P ++    + L    + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231

Query: 289  EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 322
            E E ML   N L+ K+R A  +   E+++L+  L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266


>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
          Length = 478

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 82/355 (23%), Positives = 138/355 (38%), Gaps = 51/355 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE  QG+ S W  Y+  L +       A E+P+ W+  EL  L G+    E + + E  
Sbjct: 114 LYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEGTSLTTEKIGKKESD 166

Query: 61  K--RE--------------------YNELD--TVWFMAGSLFQQYPYDIPTEAFTFEIFK 96
           +  RE                     NE D  ++    GS    Y +D+  E    E  +
Sbjct: 167 RMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYAFDLENEEEQSEDEE 226

Query: 97  QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 156
             ++  +         SL     +VP+   +L  +++  A +   D      +     AG
Sbjct: 227 DGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGDQLQVTSLRESIPAG 276

Query: 157 ESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTEDPQYQDKRMVAQRN 210
             I+ + GP P+S+LL  YG+V  E + YD       +V  AL  E    +D     +R 
Sbjct: 277 SEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEELKLSEDTIADIERK 336

Query: 211 GKLSVQVFHVHAGREKEAIS----DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 266
            +  ++ F V      E  S       P LR        + ++ + +L    P     E 
Sbjct: 337 LESELEEFFVIERDAGEPSSYGTLTQPPVLREISTELEEQTKAFLKALKKRDPKRKRSET 396

Query: 267 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
                L    + RL  YP +  +DE++L+   L  + R+A ++   EK++L   L
Sbjct: 397 ICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRLGEKRLLQEAL 451


>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 426

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 79/357 (22%), Positives = 147/357 (41%), Gaps = 38/357 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M+E+ +   SF+ P+I  L           + P+ WSE+EL  L G+          + +
Sbjct: 89  MHERSKRSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQL 141

Query: 61  KREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 117
           +R++  +        ++ + +P     +PT   T E +  A   + S    + +     R
Sbjct: 142 QRDFENIHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR 193

Query: 118 FALVPLGPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKL 171
             L P    +  +    + +L    + D+  Q++   V +   AG ++ +  G   N+KL
Sbjct: 194 -VLCP-AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKL 251

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAIS 230
           L +YGFV ++N    +     +   DP  + K+ V   N     Q +       E +   
Sbjct: 252 LFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFENDVDE 311

Query: 231 DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 290
            +L  LR+  +++  E++ +         +S   E AV + L +  + +LA +  TL ED
Sbjct: 312 RLLATLRVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFATTLEED 369

Query: 291 EAMLTDYNLHPKKRVATQL-VRMEKK--------MLNACLQVTADMIMLLPDVTVSP 338
           EA+L +       R++  + VR+E K         L    QV A  + + P  T  P
Sbjct: 370 EAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTTRP 426


>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 502

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L   +  +++V  +P +AGE I    G  PNS LL  YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316

Query: 179 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
           D            NP D + V A L     ++  +Y  +  V     +    VF +    
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374

Query: 225 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 284
           + E   +++ + RL ++S+   +++   S  P   V P     VL    D   ARL  YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430

Query: 285 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 322
            ++ EDE +L+       +L+ K  V  +L   EK++L   L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471


>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/237 (23%), Positives = 95/237 (40%), Gaps = 48/237 (20%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYE ++   SFW PY   L R+         +P+ W++ +L  L G+   ++I  + E  
Sbjct: 90  MYEIEK-PDSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESE 140

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR- 117
           +  +NEL+ +        ++YP     +  T E+F    + +   S    LQK       
Sbjct: 141 ELFHNELEPI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNK 193

Query: 118 -----------------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 154
                                   ++VP+   L   +    A L    D++Q+   +  K
Sbjct: 194 EEEKEEEEEEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIK 253

Query: 155 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 205
            GE I    G   N+ LL  YGFVDE N +D       L+VE     +D   +++++
Sbjct: 254 EGEQIYNTYGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310


>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
 gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
          Length = 465

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 77/305 (25%), Positives = 122/305 (40%), Gaps = 34/305 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MY K   +KSFW PY+  L         +  + L +S+ EL  L  S  K   + R +GI
Sbjct: 125 MYSK-LNEKSFWHPYVTVLPE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGI 176

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-F 118
           +R Y           S F +    +P E     ++ Q  F    SCV   +  SLA    
Sbjct: 177 ERHYE----------STFSRLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDG 224

Query: 119 ALVPLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLIN 174
            +VPL     A   SK K +    D  +          GE I    G   P  +S++L++
Sbjct: 225 GMVPLADMFNAEDRSKSKVLPKVTDTTLDYYASDDIAEGEQIFTPYGVYKPLSSSQMLMD 284

Query: 175 YGFV-DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 232
           YGF+ DE    D + +     + ++P    K+ + + N  ++ +VF +       A  D+
Sbjct: 285 YGFIFDEGTVSDNVAITVPVFHNDEPNLSTKQEILEENDIIN-EVFLLQKTDPLPA--DL 341

Query: 233 LPYLRLGYVSDTSEMQSVISSLGP---ICPVSPCMERAVLDQLADYFKARLAGYPATLSE 289
           L Y R+  +      Q+    L P     P++   E+  L  L +     L  Y   L  
Sbjct: 342 LLYARVKNLIAKECDQAKKHFLSPNTRNTPLNTRNEKVSLRFLENLIHRYLDSYGTNLES 401

Query: 290 DEAML 294
           D+ +L
Sbjct: 402 DKNLL 406


>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
 gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
          Length = 439

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 77/347 (22%), Positives = 136/347 (39%), Gaps = 52/347 (14%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G++S W PY+  L +           P+   E ++  L   P + +  E+   +  
Sbjct: 116 EKHAGERSLWKPYLDVLPK-------TYSCPVC-LEQDVVQLLPEPLRKQAQEQRTAVHE 167

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
            Y      +    SLF +    I    F +   + A+  + +  +++ K S    F+L P
Sbjct: 168 LYMSSKAFFSSLQSLFAENTATI----FNYSALEWAWCTINTRTIYM-KHSQRECFSLEP 222

Query: 123 ----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
               L P   LL +S   + KA         ++  +   K  E + +  GP  N +LL+ 
Sbjct: 223 DVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQCKKYEEVFICYGPHDNQRLLLE 282

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
           YGFV  DNP+  + V +    +     DK    Q+N KLS+   H           D+L 
Sbjct: 283 YGFVAVDNPHSSVYVSSDTLLKYFPSLDK----QKNAKLSILKEH-----------DLLE 327

Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM- 293
            L  G+   +  + + +  L        C  RA+   L D   AR        +E +A+ 
Sbjct: 328 NLTFGWDGPSWRLLTALKVLSLGGDEFTCWRRAL---LGDVISAR--------NEQQALN 376

Query: 294 ----LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 336
               +  + +   + V  Q+ +M++   N   Q+T    + L D+ +
Sbjct: 377 ITTKICHFLIEETQHVLLQISQMKRDKENLKEQLTLVEALRLEDLKI 423


>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 509

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L   +  +++V  +P KAGE I    G  PNS LL  YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323

Query: 179 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 224
           D            NP D + V A L       + K  + +R      +    VF +    
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381

Query: 225 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 284
           + E   +++ ++RL  +      ++   S  P     P +++ VL    D  + RL  YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437

Query: 285 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 322
            TL EDEA+        L   KR A  +   EK++L   L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478


>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
          Length = 440

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/225 (25%), Positives = 93/225 (41%), Gaps = 25/225 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +           P+ W E E+  L   P +A+  E+   ++ 
Sbjct: 117 EKHAGDQSSWKPYLETLPK-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
            +      +     LF +   +I    FT+     A+  V +  V+L+   L R F+  P
Sbjct: 169 FFTSFRDFFSSLQPLFSEAVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEP 223

Query: 123 ----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
               L P   LL +S   + KA         ++V     +  E + +  GP  N +LL+ 
Sbjct: 224 DTCALAPYLDLLNHSPDVQVKAAFNEKTRCYEIVAVSSCRKHEEVFICYGPHDNHRLLLE 283

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
           YGFV   NP+  + V   +  +     DK+M    N K+S+   H
Sbjct: 284 YGFVSTRNPHACVYVSRDILVKYLPSTDKQM----NKKISILKDH 324


>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
           FGSC 2508]
 gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 504

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 84/193 (43%), Gaps = 20/193 (10%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +GK SFW PYI  L       + A+  P  W+E ++  L G+     I E    +K EY 
Sbjct: 115 KGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYK 172

Query: 66  ELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARR 117
           +   +    G      + Q  Y+     FT   F+ + V  +S   +++++    S    
Sbjct: 173 QARKILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDD 232

Query: 118 FALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
           F+++    PL      ++ +     L +   A +L+ +  Y  G+ +    G + NS+LL
Sbjct: 233 FSILQ---PLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELL 289

Query: 173 INYGFVDEDNPYD 185
           + YGF+   NP D
Sbjct: 290 LGYGFI--INPKD 300


>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 704

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 86/195 (44%), Gaps = 27/195 (13%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G + FW PYIR L +      L++ +PL +   +L +L G+       +R    K +Y 
Sbjct: 132 KGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRMGVWKEKYK 186

Query: 66  ELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
              T    AG  FQ   QY +D+         + AF+ ++  +A   V+   +    VS+
Sbjct: 187 NGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LPEDGVSV 241

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
                L+P    L+ +    K    A    V  VV     +G+ I    GP+ N +L++N
Sbjct: 242 -----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPRNNEQLMMN 295

Query: 175 YGFVDEDNPYDRLVV 189
           YGF   DNP D  +V
Sbjct: 296 YGFCLPDNPCDYRIV 310


>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 339

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 82/193 (42%), Gaps = 13/193 (6%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M+E+ +G +S W PY+  L             PL W   E   L G+    ++L RA+  
Sbjct: 125 MWERWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHP 178

Query: 61  KREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
                ++  +W  + G    ++P   +P     +E+++ A  AV S    L      +  
Sbjct: 179 SDAPTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQ 235

Query: 119 ALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           A+VP+   L   +      L   +    +Q++  R   AG  +V   G   N++LL  YG
Sbjct: 236 AMVPVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYG 295

Query: 177 FVDEDNPYDRLVV 189
           FV+  N Y+ + V
Sbjct: 296 FVERANRYNHIPV 308


>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 492

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/221 (27%), Positives = 94/221 (42%), Gaps = 25/221 (11%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L    D +++V  +P K GE I    G  PN++LL  YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304

Query: 179 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
           D          NP D + ++A L            V     K  +  + +  G E   I 
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLII-SAVSSTPEAVKDDEAKERID-WWLEEGGEDVFIL 362

Query: 231 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
           D        M+ +++L  ++     ++   S     P    +E  + D L    + RLA 
Sbjct: 363 DYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEKRLAE 418

Query: 283 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 322
           YP T+  D+A+LT D  L+ K  +  +L   EK++L+  LQ
Sbjct: 419 YPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457


>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
          Length = 1234

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 82/199 (41%), Gaps = 17/199 (8%)

Query: 1    MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
            +YEK +   SFW P+   L            + + ++ TEL  L G+   AE L+    +
Sbjct: 820  IYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----V 867

Query: 61   KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            K     +  + F    L +QYP   P   F++E F  A     S  + L K+       L
Sbjct: 868  KEHLQSIRDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCL 924

Query: 121  VPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            VP+   +  + +    +      D   ++V          I +  G   N +L + YGFV
Sbjct: 925  VPMADMINHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRELALYYGFV 984

Query: 179  DEDNPYDRLVVEAALNTED 197
             +DNPYD +++   L  ED
Sbjct: 985  IQDNPYDSMLIGFDLPDED 1003


>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
          Length = 433

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 74/308 (24%), Positives = 126/308 (40%), Gaps = 40/308 (12%)

Query: 31  ESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
           +S + WS+ EL  L GS    +I  E AEG    Y+ L         +F+      P EA
Sbjct: 110 DSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSILVPYLSKHADIFK------PLEA 161

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSL---------------ARRFALVPLGPPLLAYSSKC 134
           ++  ++ +    + S   H+                         +VP+   L A S   
Sbjct: 162 YSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAAYHEVETVGMVPMADVLNAKSGSA 221

Query: 135 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL- 193
            A L    DA+ +   +   AGE I       PN+ LL  YG VDE N  D + + A L 
Sbjct: 222 NACLVYHPDALVMTTTKEIAAGEQIFNTYNDPPNADLLRRYGHVDEVNLNDNVEISADLI 281

Query: 194 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVIS 252
             +D          +R   L  ++  V+   + E +  D +  +++   +  SE + +  
Sbjct: 282 GCKD---------LERVDWLLDRLDDVYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQK 331

Query: 253 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 312
           +     P    ++ A   ++ +  + RLA Y +T+ EDE++L    +   +  A  LVR+
Sbjct: 332 A--DDLP-DDVLDEATAMRVREILQMRLAQYSSTIEEDESLLASSTMLTSRSRAALLVRL 388

Query: 313 -EKKMLNA 319
            EK++L A
Sbjct: 389 GEKRILAA 396


>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
 gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
          Length = 692

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/195 (24%), Positives = 87/195 (44%), Gaps = 23/195 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++E+ +G+ S W PY+  L R        + S L + +++L +L G+            +
Sbjct: 113 VHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRAYRNTV 167

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
           K EY+       +A S+ +   Y +  E++T++IF  A+  +A ++    +    L+   
Sbjct: 168 KEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIASRAFTSRVLDAYLSNHP 219

Query: 119 AL-----VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           +L       +  PL+ +S+     K    A    ++L V  P   GE +    GP  N +
Sbjct: 220 SLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEVHNNYGPLNNQQ 279

Query: 171 LLINYGFVDEDNPYD 185
           L+  YGF   DNP D
Sbjct: 280 LMTTYGFCIVDNPCD 294


>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
          Length = 1753

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 121/302 (40%), Gaps = 18/302 (5%)

Query: 34   LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 93
            + ++E EL    GS       +  + ++ ++ +L         L  Q+    P + FT E
Sbjct: 993  IFFAEDELQVCEGSSLHTLTTQLEQRVQDDFRQLLV------QLLSQHRDLFPLDQFTIE 1046

Query: 94   IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQLVVD 150
             +K A   + S  +            + PL   +L +S    +C A      D + ++  
Sbjct: 1047 DYKWALCTIWSRAMDFAVSDTTSVRLVAPLAD-MLNHSLDVKQCHAYDPTSGD-LSILAA 1104

Query: 151  RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
            + Y+ G+ I ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K  +    
Sbjct: 1105 KDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALA 1164

Query: 211  GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAV 268
            G  S     + A  +     ++L YLR   + D +++  +   L  G    V+   E  V
Sbjct: 1165 GLDSTCTIPLTA--KHPLPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVNDGNEIQV 1221

Query: 269  LDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTAD 326
            L  L D   + L G+   L + EA L    Y        A Q+   E+ +L    +   D
Sbjct: 1222 LQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRAKKTAED 1281

Query: 327  MI 328
            M+
Sbjct: 1282 ML 1283


>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
           purpuratus]
          Length = 455

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/198 (24%), Positives = 82/198 (41%), Gaps = 19/198 (9%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT--KAEILERAE 58
           + E+ +GK SFW PYI  L +          +P   S T+ A     PT  ++  + + +
Sbjct: 127 LTERSKGKSSFWYPYINVLPKD-------FTTPAFGS-TKQADFDVLPTIARSRAINQLQ 178

Query: 59  GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQSCVVHLQKVSL 114
            I+  +     ++      F QY      ++F +  F    +  ++    C     K S 
Sbjct: 179 DIRAAFESASCLFEDIERTFPQYRIFFSLDSFVWAWFVINSRSVYIEPSGCEAFDPKAS- 237

Query: 115 ARRFALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 172
              FAL P    LL +S  ++  A    V +  ++     Y A + + +  GP  N  LL
Sbjct: 238 -DDFALAPFLD-LLNHSPGAEVTAGFDPVSNCYRIKTLDSYHAYDQVFIHYGPHDNVNLL 295

Query: 173 INYGFVDEDNPYDRLVVE 190
           + YGFV   NP+D +  E
Sbjct: 296 LEYGFVIPSNPHDAVSFE 313


>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
 gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
          Length = 472

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 69/329 (20%), Positives = 133/329 (40%), Gaps = 67/329 (20%)

Query: 31  ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 90
           ++ L + E E+ YL GSP   +I+   E   + Y++L    F    + +       +   
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220

Query: 91  TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 142
            ++ F+ A   + +  +++       S  ++  L P+ PP++ Y +      A +D    
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280

Query: 143 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGF----VDEDNPYDRLVVE-- 190
             +V +   +  K GE I V     +CG    S LL++YG+    +D+ +  + L+ E  
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYLPNQIDDKSCVNVLMEELL 336

Query: 191 AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 245
             +N +DP   DK     +++  ++ KL +                              
Sbjct: 337 ETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------------- 367

Query: 246 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYN-LHPK 302
            M S+   L  I       + ++L+ L      ++  YP T+ +D+  L   +YN L  +
Sbjct: 368 -MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEYNQLSAR 426

Query: 303 KRVATQLVRMEKKMLNAC---LQVTADMI 328
            ++A  L   EK++L+     LQ   D I
Sbjct: 427 SKLAFNLAFQEKQILSNVYTKLQENIDTI 455


>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
           2508]
 gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 459

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 82/177 (46%), Gaps = 13/177 (7%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S +L++E +L    G+       +  + I+ ++  L         LF Q+P   P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 147
            E +K A   V S  +      LA   ++  L P   +L ++S+ K   +       + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGNLSV 217

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
           +  + Y+AG+ + +  GP PNS+LL  YGFV   NP D   +  + + + P ++ K+
Sbjct: 218 LAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274


>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 308

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 88/208 (42%), Gaps = 31/208 (14%)

Query: 15  YIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFM 73
           Y++ L R+ G         L W E ++   L GSP++    ER   +     E+      
Sbjct: 93  YVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGAIEEIRA---- 141

Query: 74  AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 133
                 ++P   P         + AF  + S ++ L   +     ALVP    +L +   
Sbjct: 142 ------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPWAD-MLNHKPG 186

Query: 134 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEA 191
           C A +      V L  DR YK GE +    G +P+++LLI+YGF  E  +NP D   +  
Sbjct: 187 CNAYIDDSGGKVCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGENPDDEYEITL 246

Query: 192 ALNTEDPQYQDKRMVA-QRNGKLSVQVF 218
            ++  D +Y D +  A ++ G   V+ F
Sbjct: 247 GIDPND-RYADAKAAALEKIGLRPVESF 273


>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
 gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
          Length = 510

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 150/378 (39%), Gaps = 70/378 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++ K+QG +S   P+I +L    G        P+LW E ++A L      A++ E+    
Sbjct: 143 LWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQVKEQQREW 195

Query: 61  KREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ-----AFV 100
           ++ Y +L               D  W M  S  +   +  P    T +   +     A +
Sbjct: 196 QQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLRTAGLVAVL 253

Query: 101 AVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAYSSKCKAML 138
           A  + V+ L   QK   A                   +++A+ PL   L  +SS  ++ +
Sbjct: 254 AAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNHSSAVQSEV 312

Query: 139 AA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-- 194
           A     D+  +V  R +K GE + +  G Q N  L+  YGF + +NP D  V+   L   
Sbjct: 313 AYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYVMTDMLRWL 372

Query: 195 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 254
           T        R+ A +   L+  +  V   R     S+ L  +R    +D SE  + +SS 
Sbjct: 373 TAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQAVRFLLAAD-SEAGADVSSF 430

Query: 255 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---------V 305
                 SP  E  + + +A+  +  L    ++L ED A+L+        R         V
Sbjct: 431 SKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLSSTGASAGGRKGGTAAAAAV 488

Query: 306 ATQLVRMEKK-MLNACLQ 322
           A    R+EKK +L A LQ
Sbjct: 489 AAVAFRVEKKRLLTAVLQ 506


>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 437

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 13/135 (9%)

Query: 77  LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYS 131
           +  QYP   P + FT E +K A   V S  +       + + L   FA       +L +S
Sbjct: 146 MLGQYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHS 199

Query: 132 SKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
           S+ K   +  A    + ++  + Y+AG+ + +  GP PN++LL  YGFV   NP D   +
Sbjct: 200 SEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVPGNPNDSYDL 259

Query: 190 EAALNTEDPQYQDKR 204
             A +   P ++ K+
Sbjct: 260 VLATHPMAPFFKQKQ 274


>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 499

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 81/189 (42%), Gaps = 22/189 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E K G  S W PYI  L +        + + + W E+EL  +  S    E L +   I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           KRE+  +           + +P  I  +  + + F  A+  V S     +        +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +P     L +    +AML   DD    ++V DR +  GE +++  G   N+ L++++GF 
Sbjct: 228 IPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFA 286

Query: 179 DEDNPYDRL 187
              N +D++
Sbjct: 287 LPYNIHDQV 295


>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
 gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 459

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 13/177 (7%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S +L++E +L    G+       +  + I+ ++  L         LF Q+P   P + FT
Sbjct: 107 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDDAVQL 147
            E +K A   V S  +      LA   ++  L P   +L ++S+ K   +       + +
Sbjct: 161 VEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSV 217

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
              + Y+AG+ + +  GP PNS+LL  YGFV   NP D   +  + + + P ++ K+
Sbjct: 218 FAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274


>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 674

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/193 (26%), Positives = 84/193 (43%), Gaps = 23/193 (11%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +G + FW PYIR L +      L++ +PL +   +L +L G+       +R    K +Y 
Sbjct: 102 RGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRMGVWKEKYE 156

Query: 66  ELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 116
              T    AG     QY +D+         + AF+ ++  +A   V+   +    VS+  
Sbjct: 157 NGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LPEDGVSV-- 211

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
              L+P    L+ +    K    A    V  VV     +G+ I    GP+ N +L++NYG
Sbjct: 212 ---LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNNEQLMMNYG 267

Query: 177 FVDEDNPYDRLVV 189
           F   DNP D  +V
Sbjct: 268 FCLPDNPCDYRIV 280


>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 456

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 46/199 (23%), Positives = 93/199 (46%), Gaps = 24/199 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + ++K G++S W PYI  L  Q G     + S + WS++EL  +  S    E +++   I
Sbjct: 98  LVDQKVGQESKWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQI 152

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           ++++  +  V        + +P    + + TF+ F  A+  V+S     +     +  +L
Sbjct: 153 EKDFLTIKPV-------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSL 198

Query: 121 VPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +P     L +    +A++   +D    ++  DR Y   E +++  G   N+ LL+++GF 
Sbjct: 199 IPFAD-FLNHDGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFS 257

Query: 179 DEDNPYDRLVVEAALNTED 197
              N +++  VE  +N  D
Sbjct: 258 LPYNIHEQ--VEIQINIPD 274


>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 502

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 60/231 (25%), Positives = 102/231 (44%), Gaps = 34/231 (14%)

Query: 119 ALVPLGPPLLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           A+VP+   L A Y S+   +    +D ++++  +P   GE I    G  PNS LL  YG 
Sbjct: 254 AMVPMADMLNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRRYGH 312

Query: 178 VDE--------DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSVQVF 218
           VD          NP D  +VE     A  +  +   Q     A+R       G   V + 
Sbjct: 313 VDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDVFIL 370

Query: 219 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
             +     E   +++P+ RL   S +   ++   S  P   V    + ++L  +A+  + 
Sbjct: 371 ETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANALER 422

Query: 279 RLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
           RLA YP ++ ED+ +LT+  +L+ K  V  +L   EK++L+  L    + +
Sbjct: 423 RLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471


>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 813

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/195 (27%), Positives = 80/195 (41%), Gaps = 23/195 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+ +G+K+    ++  L R       + ++   WSE EL  LTG+    E +   E  K 
Sbjct: 77  ERFRGEKATRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKN 129

Query: 63  EYNELDTVW--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           +Y  L         G   +++  D       +E +  A   + S    L      R  A+
Sbjct: 130 DYETLTKKMEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAM 182

Query: 121 VPLGPPLLAYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQPNSKLLIN 174
           VP    +  +S+K        L A  + V +     YKAGE   I    G   NSKLL  
Sbjct: 183 VPT-FDIFNHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAANSKLLTW 241

Query: 175 YGFVDEDNPYDRLVV 189
           YGF  +DNPY+ L V
Sbjct: 242 YGFCIDDNPYEELDV 256


>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
           [Ostreococcus tauri]
          Length = 446

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 86/346 (24%), Positives = 137/346 (39%), Gaps = 51/346 (14%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           G++S + P++R L  +        ++ + W E EL  L GS   A      +  + EY+ 
Sbjct: 85  GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 118
           LD         F     D P EAF        TFE F  A   V S  + L+  S +   
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185

Query: 119 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
            ++ +  P+L     A S K      A  +AV++     +K    +      +P+   L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245

Query: 174 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 233
            YGF+ E NP + + V   L+  D   + K  + +R+G    +       R  +   D+L
Sbjct: 246 QYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKVRGLD--YDLL 303

Query: 234 PYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 288
              R+    D SE+    S     S   +   +    +AVL +        L GY  TL 
Sbjct: 304 AAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLITSLDGYGTTLG 359

Query: 289 EDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 329
           ED + +  +N      PKKR     L+RM +K     L  +AD + 
Sbjct: 360 EDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402


>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
 gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
          Length = 544

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 136/343 (39%), Gaps = 43/343 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIK 61
           E+ +G  S W PY+  L          V+ PL W E+E+ + L GSPT  + + RA   +
Sbjct: 145 ERAKGSLSDWAPYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAR 199

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
            EY  +          +    Y+  TE      F  A   V +  V L   ++   +ALV
Sbjct: 200 EEYASIRAAIAADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANV---YALV 252

Query: 122 PL--------------GPPLLAYSSKCKAMLAAV-----DDAVQLV-VDRPYKAGESIVV 161
           PL               P   A  +  + + AAV     D A + V V     A ++  V
Sbjct: 253 PLVDLLPVVGAPPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPV 312

Query: 162 WCGP---QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQV 217
            C     +    L ++ G V+  +  D L    +    D  Y  K+ + +  G     Q 
Sbjct: 313 VCADALGRNAGDLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQA 372

Query: 218 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 277
           F V A R       +  Y+R   V + SE+ +V      I  VSP  E  +L  L    +
Sbjct: 373 FPVFADRMP---LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAR 427

Query: 278 ARLAGYPATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 319
             LA Y  +  E E + L + N+  ++  A +L   EK+++NA
Sbjct: 428 EMLAEYENSSEEFELLQLKETNISERQMTAAKLRLGEKRLINA 470


>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
          Length = 436

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 59/275 (21%), Positives = 116/275 (42%), Gaps = 35/275 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK +G +S + P+I  L             PLLW++ E+  L G+    EIL   E ++ 
Sbjct: 115 EKMKGSESSYKPFIDVL-------PWDSLHPLLWTDEEVDLLEGTYAHREILAFREQVEV 167

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF----KQAFVAVQS----CVVHLQKVSL 114
                + V    G  ++Q+   I TE  T E F    + AF +V S      +      L
Sbjct: 168 ATELFEPVLNPKG--WKQFFQTIETEKMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGL 225

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY-----------KAGESIVVWC 163
             R  +    P L  ++          D A++   ++ +           + GE +  + 
Sbjct: 226 EERVVI----PLLDIFNHGSYGPSITFDTALERDNEKGFPVRVADKGKSIEEGEELFGFY 281

Query: 164 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHA 222
           G +PN  +L  YGFV  +       +  +++ +DP +  K  + +  G ++V Q+F +  
Sbjct: 282 GDKPNWNMLTTYGFVSPNPKCQETTLSVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR- 340

Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 257
             + + +  ++ Y R+  +S+ +++  V ++ G +
Sbjct: 341 -HDTDPMGPLINYFRIREISNEADLTKVQTNYGEM 374


>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
 gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
          Length = 447

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 79/192 (41%), Gaps = 20/192 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEK   K SFW PYIR L            +P  ++  EL +L     + +  E+   I
Sbjct: 135 MYEKYLEKGSFWAPYIRTLPD-------TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQI 186

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR--- 116
           K+ Y      +     + Q +  +       FE FK A+  V +  V H +    A+   
Sbjct: 187 KQSYKSFAKAY---NDVLQDFDCNF-WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPID 242

Query: 117 -RFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
              AL PL   LL +  K +      +     ++ V   Y+ G  + +  GP  N++L +
Sbjct: 243 GNCALAPL-LDLLNHCDKAEMCGRFNSSSKNYEINVITEYQKGTQVFINYGPHDNTRLFL 301

Query: 174 NYGFVDEDNPYD 185
            YGFV   N ++
Sbjct: 302 EYGFVLPRNVHN 313


>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
 gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
          Length = 504

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 79/191 (41%), Gaps = 28/191 (14%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           +GK SFW PYI  L       + A+  P  W+E ++  L G+     I E    +K EY 
Sbjct: 115 KGKSSFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYK 172

Query: 66  ELDTVWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
           +   +    G      + Q  Y+     FT   F+ + V  +S   ++++        L+
Sbjct: 173 QARKILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER--------LL 224

Query: 122 PLGP---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
           P G          PL      ++ +     L +   A +L+ +  Y  G+ +    G + 
Sbjct: 225 PEGTKIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKT 284

Query: 168 NSKLLINYGFV 178
           NS+LL+ YGF+
Sbjct: 285 NSELLLGYGFI 295


>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
 gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
          Length = 454

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 90/223 (40%), Gaps = 38/223 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +Y  K G  + W PY   L  Q       ++S ++W++ EL  L GS    +I     G 
Sbjct: 89  LYMMKAGSMTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGA 139

Query: 61  KREYNE---------------LDTV---WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 102
           + +Y E                DT    +   G L   Y +D P ++F+    +      
Sbjct: 140 EEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDE 194

Query: 103 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-VQLVVDRPYKAGESIVV 161
                 L    L +  A+VPL   L A++  C A L A DD    +   +P K GE +  
Sbjct: 195 DIEHDDLYNEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYN 252

Query: 162 WCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 203
             G  PN   L  YG+V +E   +D  +VE +++     Y +K
Sbjct: 253 TYGELPNCDFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293


>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
          Length = 510

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 12/104 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE   G +S W  YI+ L        L+V+ P+LW++ +L  L G+  +  + E  E +
Sbjct: 107 LYEASIGSQSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETV 159

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 104
              YN+     ++   L   +P       F+ + FK+A   V S
Sbjct: 160 DATYNK-----YIKSKLIANHPDVFNEHVFSLDNFKRASCLVSS 198


>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 6/140 (4%)

Query: 54  LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 113
           LE  E +++   E    W    + F+    D+P E +T+        +  +      K  
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168

Query: 114 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
              R AL+P+          CK   +A  D   +V DR YK GE + +      N  +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226

Query: 174 NYGFVDEDNPYDRLVVEAAL 193
            YGF+ +++  D + ++ A+
Sbjct: 227 EYGFIPDESLDDDVYIDDAV 246


>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
          Length = 440

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 67/285 (23%), Positives = 112/285 (39%), Gaps = 39/285 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G+KS W PY+  L +           P+   E ++  L   P + +  E+   +  
Sbjct: 117 EKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVSLLPEPLRKKAQEQRTKVHE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
            Y      +     LF +    I    F +   + A+  + +  +++ K S  + F+L P
Sbjct: 169 LYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTINTRTIYM-KHSQRKCFSLEP 223

Query: 123 ----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
               L P   LL +S   + KA       + ++  +   K  E + +  GP  N +LL+ 
Sbjct: 224 DVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEVFICYGPHDNQRLLLE 283

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
           YGFV  DNP+  + V +A   +     DK    QRN K+S+   H           D+L 
Sbjct: 284 YGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSILKDH-----------DLLE 328

Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
            L  G+   +  + + +  L        C  R +   L D   AR
Sbjct: 329 NLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDVISAR 370


>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 475

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 80/212 (37%), Gaps = 48/212 (22%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------- 53
           +YE   G  S W PY   L  +         + + WSE ELA L  S   A+I       
Sbjct: 91  LYEHLNGDASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANE 143

Query: 54  ------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
                                    ++AE ++ E N   T+    GSL   Y +DI    
Sbjct: 144 AFLGQLVPVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT 201

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
                  +  V  +      +  +L +   ++PL   L A + +C A L      +++  
Sbjct: 202 ------PRKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKA 253

Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
            +P KAGE I    GP P S LL  YG+V E+
Sbjct: 254 LKPIKAGEEIFNDYGPLPRSDLLRRYGYVTEN 285


>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 466

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 9/175 (5%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + ++E EL    G+       +  + I+ +Y +L         +    P   P   FT
Sbjct: 107 SSIFFTEDELEVCAGTSLYTITKQLKQRIEDDYKDL------IARVLGPRPDLFPLNKFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA--MLAAVDDAVQLVV 149
              +K A   V S  +  +    +    L P    +L +SS+ K   +  A    + ++ 
Sbjct: 161 IHHYKWALCTVWSRAMDFELYDGSSMRLLAPFAD-MLNHSSESKQCHVYDASTGNLSILA 219

Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
            + Y+AG+ + +  G  PNS+LL  YGFV  DNP D   +  A +   P ++ K+
Sbjct: 220 GKDYEAGDQVYIHYGSIPNSRLLRLYGFVIPDNPNDSYDLVLATHPMAPFFEQKQ 274


>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
 gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
          Length = 236

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
           + S W PYI  L    G     +++  LW +TEL+YL  SP   +  ER E I  E+ ++
Sbjct: 74  QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
                +   LF +          + E F   +  V S     + +++     LV +  P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172

Query: 128 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
           L + +   A  A +      +   +  DR     + I + CG   N++L ++YGF     
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229

Query: 183 PYDRLV 188
           P +RL+
Sbjct: 230 PENRLI 235


>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
 gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
          Length = 517

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 44/193 (22%), Positives = 81/193 (41%), Gaps = 13/193 (6%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYEK +G++S W PY+  +             PL W   E   L G+    +++ + +  
Sbjct: 192 MYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGTAAYDKMMGKVQCP 245

Query: 61  KREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
                ++  +W  +     Q++P  ++P     +++++ A  AV S    L         
Sbjct: 246 ADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASYSFILGDDKYQ--- 302

Query: 119 ALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           A+VP+   L   + +    L   A    + ++  R    GE +V   G   N++LL  YG
Sbjct: 303 AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNYGELSNAELLRGYG 362

Query: 177 FVDEDNPYDRLVV 189
           FV+  N  + + V
Sbjct: 363 FVEARNRNNHVQV 375


>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 513

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
           A+VP+   L    +   A L   D+ V +++     KAGE I    G  PNS LL  YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330

Query: 178 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
           +D            NP D       LVVEAA      + QD+  V     +    VF V 
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387

Query: 222 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
            G + E   +M+   RL  +   +E +    + G +    P M+  +     D  ++RL 
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442

Query: 282 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 322
            YP ++ EDE +L D + L   +++A  +   EK++L   L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484


>gi|426197159|gb|EKV47086.1| hypothetical protein AGABI2DRAFT_203917 [Agaricus bisporus var.
           bisporus H97]
          Length = 491

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 94/221 (42%), Gaps = 25/221 (11%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L    D +++V  +P ++GE I    G  PN++LL  YG V
Sbjct: 244 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIRSGEQIWNTYGDLPNAELLRRYGHV 303

Query: 179 D--------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
           D          NP D + ++A L            V     K  +  + +  G E   I 
Sbjct: 304 DFLSLPSEGHGNPGDVVEIKADLII-SAVSSIPEAVKDDEAKERID-WWLEEGGEDIFIL 361

Query: 231 D--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
           D        M+ +++L  +      ++   S     P    +E  + D L    + RLA 
Sbjct: 362 DYEYDLPPVMISFVKLLLLPQADWEKAREKS----KPPKSKLEGILYDILISTLEKRLAE 417

Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 322
           YP T+  D+A+LT  N  P  +    +VR+ EK++L+  LQ
Sbjct: 418 YPTTIETDKALLT--NDTPLNKKNAIIVRLGEKEILHGILQ 456


>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 692

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 47/200 (23%), Positives = 78/200 (39%), Gaps = 35/200 (17%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
           +E+ +G+ S W PY+  L R        + S L + +++L +L G+            +K
Sbjct: 114 HEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQAYRNTVK 168

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
            EY+   ++    G L          E+++++IF  A+  + S      +   +R     
Sbjct: 169 EEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTSRVLDAY 214

Query: 122 PLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCGP 165
               P L    + + ML  VD +                + L V  P   GE I    GP
Sbjct: 215 LSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEIHNNYGP 274

Query: 166 QPNSKLLINYGFVDEDNPYD 185
             N +L+  YGF   DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294


>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
 gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
          Length = 353

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 37/72 (51%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             ++PL       S+K    +  V++  Q++ ++P K GE I    G   N  LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230

Query: 178 VDEDNPYDRLVV 189
           ++ DNP+D  + 
Sbjct: 231 IEFDNPHDHFIT 242


>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
 gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
          Length = 429

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 44/192 (22%), Positives = 81/192 (42%), Gaps = 16/192 (8%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           +K  G+ S W P++  L R          +P+         L     + E+      I+R
Sbjct: 114 QKLLGEASRWWPFVDSLPR-------TFTTPVFLRRKVFESLP-KDLREEVQTGITFIQR 165

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF---- 118
            + +L  +  + G + ++      +  FT+  F  A+ AV +  +  Q  + +  +    
Sbjct: 166 TFLKLKVL--LGGHVEEEPEVQCLSTGFTWNNFVWAWTAVNTRCIFAQGSNSSSLWEDDH 223

Query: 119 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            AL P     L +  K     A V +  +++  + + A E + +  GP  N +L ++YGF
Sbjct: 224 CALAPF-LDCLNHHWKASIETAMVGENFEILSHKSHDANEQVFISYGPHSNRRLFLDYGF 282

Query: 178 VDEDNPYDRLVV 189
           V  DNP D +VV
Sbjct: 283 VLPDNPNDVVVV 294


>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 71/324 (21%), Positives = 135/324 (41%), Gaps = 18/324 (5%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  ++    S W  Y   L ++        +S + WSE EL+ L G+          + +
Sbjct: 130 LIRERSRSNSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYV 182

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRF 118
           + EY  L+    +        P  +    + F I + +AF  +++  +V +         
Sbjct: 183 ENEYRRLEEEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHS 242

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 177
           A V       AY  K  A L + D    L      KAG+ + + +   + N++L ++YGF
Sbjct: 243 ARVTTEDH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGF 300

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYL 236
           ++ +   +   +   ++  DP + DK  +A+ NG      F + ++      +   L  +
Sbjct: 301 IEPNADRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLV 360

Query: 237 RLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
            LG  +D   ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+ ED+  
Sbjct: 361 ALG-GTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK- 418

Query: 294 LTDYNLHPKKRVATQLVRMEKKML 317
           L +  L  +  +A  +   EK++L
Sbjct: 419 LKEAKLDSRHAIAVGIREGEKQLL 442


>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
          Length = 692

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 47/200 (23%), Positives = 78/200 (39%), Gaps = 35/200 (17%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
           +E+ +G+ S W PY+  L R        + S L + +++L +L G+            +K
Sbjct: 114 HEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQAYRNTVK 168

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 121
            EY+   ++    G L          E+++++IF  A+  + S      +   +R     
Sbjct: 169 EEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTSRVLDAY 214

Query: 122 PLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCGP 165
               P L    + + ML  VD +                + L V  P   GE I    GP
Sbjct: 215 FSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEIHNNYGP 274

Query: 166 QPNSKLLINYGFVDEDNPYD 185
             N +L+  YGF   DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294


>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
 gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
          Length = 588

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 62/240 (25%), Positives = 99/240 (41%), Gaps = 32/240 (13%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLW-SETELAYLTGSPTKAEIL 54
           + E+  G+ S W   I  L    G G+ +       +P+ W  E   A L G+P  A+ +
Sbjct: 242 IAERALGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAI 300

Query: 55  ERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVS 113
             +E + R++  L    F A     ++  D+ P  A+T + F++A  A  S  + +Q   
Sbjct: 301 AASEKLARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASP 353

Query: 114 LARRFALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 164
                  +P         L P ++ YS          D  ++L V R   AGE + V  G
Sbjct: 354 GEPAATCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSYG 406

Query: 165 PQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 223
            + N++LL+ YGF    NPYD + +   L   E       R  A     L++    V AG
Sbjct: 407 AKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLSPHAVRAG 466


>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
 gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
          Length = 306

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 65/151 (43%), Gaps = 19/151 (12%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 67
            S W+ YI  L RQ         S L W+  EL AYL  SP +   ++R   +   YN+L
Sbjct: 162 SSRWISYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL 214

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
                    +F ++P   P E +  E F  +F  + S +V L   S+  R ALVP    +
Sbjct: 215 ------RDRIFSRHPDLFPEEVYNIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DM 265

Query: 128 LAYSSKCKAML--AAVDDAVQLVVDRPYKAG 156
           L +S + +  L        +    DR Y+ G
Sbjct: 266 LNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 21/169 (12%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + ++E EL   TG+   A   +    I+ +Y  L         L  Q+    P   FT
Sbjct: 107 SSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKALVV------RLLIQHRDLFPLSKFT 160

Query: 92  FEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDA 144
            E +K A   V S  +       + + L   FA       +L +SS  +   A   +   
Sbjct: 161 IEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA------DMLNHSSDVRQCHAYDPLSGN 214

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEA 191
           + ++  + YKAG+ + ++ G  PN++LL  YGF+   NP D   LV+E 
Sbjct: 215 LSILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPNDNYELVLET 263


>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
          Length = 468

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 75/189 (39%), Gaps = 29/189 (15%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE- 66
           K S+W PYI  L +       A+  P +W E ++  L  +     + E    IK+EY   
Sbjct: 114 KDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQANIKKEYKHA 171

Query: 67  ---LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
              L  V F     + Q  Y      FT   F+ + +  Q    H+          L P 
Sbjct: 172 RKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV--------LGLTPH 223

Query: 124 GP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
           G          PLL       +S+ +  L  VD   QL+ +  Y+ G+ +    G + NS
Sbjct: 224 GTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQVFNNYGLKSNS 282

Query: 170 KLLINYGFV 178
           +LL+ YGF+
Sbjct: 283 ELLLGYGFI 291


>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
          Length = 381

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 51/211 (24%), Positives = 86/211 (40%), Gaps = 19/211 (9%)

Query: 14  PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 73
           PY+   D    R QL   +PL ++  ELA L G+   A   +R    + E      V   
Sbjct: 78  PYV---DSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDREADWRSECERCRAVLGH 132

Query: 74  AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL-LAYSS 132
            G       Y   +   +   F    ++ +  ++     + +    LVPL   L  A + 
Sbjct: 133 WGEHLTWEHYLTASTHLSSRAFPSTLLSPEPALI----PTPSSHPVLVPLIDSLNHARAH 188

Query: 133 KCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
                ++  D+      +V   P  AG  ++   GP+PN++L++ YGF   DNP D LV+
Sbjct: 189 PVSWSVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELVLGYGFALPDNPDDTLVL 248

Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
           + +         D+R + +  G L   +F V
Sbjct: 249 KVS------GAADRREIWRAGGGLQRILFDV 273


>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1188

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 22/202 (10%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           PL W++ EL +L G+     I ER   ++ ++  +  V      L ++ P   P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298

Query: 93  EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKC------------KAMLA 139
           E +  AF    S    L+  V+         +G P+     +C             A + 
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358

Query: 140 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 197
              D  +V+       + GE +    GP+ N +LL+ YGF   +N  D + ++  +   D
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQLTVGN-D 417

Query: 198 PQYQDKRMVAQRNGKLSVQVFH 219
           P  + K  + + +G       H
Sbjct: 418 PDGEAKLAILRWHGLSLTHFLH 439


>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
           8797]
          Length = 496

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 81/193 (41%), Gaps = 29/193 (15%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERAE 58
           G+KS W PY++ L ++    QL     + W++ EL  L         G+    E+ E   
Sbjct: 102 GEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENVV 156

Query: 59  GI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 108
            I  K    E D+      W    +  S+   Y +D+  + +  E  K+     +     
Sbjct: 157 DIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDNE 212

Query: 109 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
            + V   +   ++PL   L + + KC A L    + +++   +  K GE I    G  PN
Sbjct: 213 SENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQIYNIYGDHPN 270

Query: 169 SKLLINYGFVDED 181
           S++L  YG+++ D
Sbjct: 271 SEILRRYGYIEPD 283


>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
 gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
          Length = 527

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 78/197 (39%), Gaps = 34/197 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEG 59
           + E+ +GK+SFW PYI  L       +L++  PLLW +ET    L GSP    I  R + 
Sbjct: 117 LVERSRGKESFWHPYISALPSVE---ELSISHPLLWPAETIQELLQGSPMLDTIATRLKL 173

Query: 60  IKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA- 115
            + ++  L T     F+ G                 E   +  V   S V+  +  SL  
Sbjct: 174 CQEDHEALLTAGIEKFLPGG----------------ETLSEGDVRWASAVLLSRAFSLEL 217

Query: 116 ------RRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQ 166
                     LVP    L   SS  +      D       L   + Y  G+ +    GP 
Sbjct: 218 DVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPA 277

Query: 167 -PNSKLLINYGFVDEDN 182
              S+L ++YGFVD++N
Sbjct: 278 LTGSQLFLDYGFVDDEN 294


>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 84/377 (22%), Positives = 147/377 (38%), Gaps = 71/377 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KA 51
           +YE   G++S W  Y + L  +        ++ + WSE E + L  S           + 
Sbjct: 100 IYEYSLGEQSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEE 152

Query: 52  EILERAEGIKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTF 92
           +I E+   + R + +L            DT       +    GSL   Y +DI       
Sbjct: 153 DIFEKVLPLVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEE 212

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 152
              +  ++        L K        +VPL   L A + +  A L   + A+ +   +P
Sbjct: 213 SEGEDGYLTDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKP 263

Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQD 202
            KAG+ I    G  P S LL  YG+V  DN     VVE  L          N ED +Y  
Sbjct: 264 IKAGDEIFNDYGELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQ 322

Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPIC 258
            +++ Q   ++    + +     ++ + D+LP     L      D+ E+Q ++S      
Sbjct: 323 LKLLDQL--EILEDGYCILRPSPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPK 380

Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY-------NLHPKKRVATQLVR 311
           P+    E  +   L D  ++++  Y  T+ ED+ +L  +           ++ +A Q+  
Sbjct: 381 PILGAREARI---LLDAAQSKMGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRV 437

Query: 312 MEKKMLNACLQVTADMI 328
            EK++L A L +  D +
Sbjct: 438 GEKEILQALLMMLQDFL 454


>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 487

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 78/339 (23%), Positives = 134/339 (39%), Gaps = 50/339 (14%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + +++ EL    GS   A        I+ +Y  L         L  Q+    P + FT
Sbjct: 125 SSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRALVV------RLLVQHQDLFPLDKFT 178

Query: 92  FEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDA 144
            E +K A   V S  +       + + L   FA       +L +SS+ +   A   +   
Sbjct: 179 IEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA------DMLNHSSEVRQCHAYDPLSGN 232

Query: 145 VQLVVDRPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 199
           + ++  + Y+AG+      + ++ G  PN++LL  YGFV   NP D   +    +   P 
Sbjct: 233 LTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLYGFVMPGNPNDSYDLVLETHPMAPF 292

Query: 200 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV----SDTSEMQSVISSLG 255
           ++ KR +    G  S     +        ++D LP   LGY+    SD S++ S+     
Sbjct: 293 FEQKRKLWDLAGFDSTSTISI-------TLTDPLPKNVLGYLRIQRSDESDLASIARQ-- 343

Query: 256 PICP----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR---VATQ 308
            I P    +S   E  VL  L + F   L  +   L   E  L +  ++P +     A  
Sbjct: 344 RIDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQLESLEKQLAE-GVYPSRGNAWAAAH 402

Query: 309 LVRMEKKMLNACLQVTADMIMLLPDVTVS-----PCPAP 342
           +   E+++L    +   DM+  +   + +     P PAP
Sbjct: 403 VSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPAPAP 441


>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
           NZE10]
          Length = 445

 Score = 47.0 bits (110), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 80/348 (22%), Positives = 136/348 (39%), Gaps = 50/348 (14%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------------- 53
           G +S W PY   L        ++ ++ + WS+ EL +L GS    +I             
Sbjct: 97  GSQSRWKPYFDVL-------PVSFDNLMFWSDRELRHLEGSTVVGKIGKEAADATFREQL 149

Query: 54  ---LERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 109
              +ER    K   NE L  +    GS    Y +D+ T +   +   + +         L
Sbjct: 150 IPVIERISKAKAADNEELLRMCHRMGSTIMAYGFDLETSSDQAKNDGEEWEEDSDAGETL 209

Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
            K        +VPL   L A + +  A L   DD V +   +P KAGE +    G  P +
Sbjct: 210 PK-------GMVPLADMLNADADRNNAKLFYEDDKVVMKTIKPVKAGEELYNDFGSLPRA 262

Query: 170 KLLINYGFVDEDN---------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
            LL  YG++  DN         P D +   A L T+D    ++   A+  G L       
Sbjct: 263 DLLRRYGYL-TDNYAQYDVVEIPADLIKERAGLRTQD--VDERWQYAEEQGVLDDGYDVS 319

Query: 221 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 280
            A  E+    + L  L        +E + V +      P      + +L  +  Y   R 
Sbjct: 320 RASSEEGQFPEELCVLLNLLALPRAEFEKVKNKDKIPKPDLTTNAKKLLRTILVY---RY 376

Query: 281 AGYPATLSEDEAMLTDYNLHPKKR-VATQLVRMEKKMLNACLQVTADM 327
           A YP  + +   M +D +L+ ++R +A  +++ EK++L   +   +++
Sbjct: 377 AAYPGNVDQ---MHSDVSLNDRRRKMAIVVIQGEKQVLQEAVDAISEI 421


>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 72/316 (22%), Positives = 130/316 (41%), Gaps = 20/316 (6%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           S W  Y   L ++        +S + WSE EL+ L G+          + ++ E+  L+ 
Sbjct: 139 SLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLEE 191

Query: 70  VWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGP 125
              +     + +P  I  + F   F I + +AF  +++  +V +    L    A V    
Sbjct: 192 EIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTDD 249

Query: 126 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPY 184
              AY  K  A L + D    L      KAG+ + + +   + N++L ++YGF++ +   
Sbjct: 250 H--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTDR 307

Query: 185 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 244
           +   +   ++  DP + DK  +A+ NG      F +   R           L     +D 
Sbjct: 308 NAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPYLRLVALGGTDA 367

Query: 245 SEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 301
             ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+ ED+  L +  L  
Sbjct: 368 FLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDS 426

Query: 302 KKRVATQLVRMEKKML 317
           +  +A  +   EK +L
Sbjct: 427 RHAIAVGIREGEKNLL 442


>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
 gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
          Length = 511

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 64/310 (20%), Positives = 121/310 (39%), Gaps = 34/310 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK +G  S W PYI  L  +         + L ++  ++  L G+   +  L +   I R
Sbjct: 166 EKSRGADSIWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIAR 218

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----- 117
           +Y +L    +   S  +            +E+++ A   V +   +L    +A +     
Sbjct: 219 KYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNS 277

Query: 118 --FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              AL+P          K  +   +    ++       KAG    ++ G +PN+ LL++ 
Sbjct: 278 PISALIPCWDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPNADLLVHN 337

Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----EAIS 230
           GFVD +N  D + +   L+  D        +A++  +L  ++   H G  +     E IS
Sbjct: 338 GFVDPNNNKDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPEYIS 390

Query: 231 -DMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 283
             +L ++R+  +S        S+++  +  L   C +   +E          FK  L   
Sbjct: 391 GQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLLGVL 450

Query: 284 PATLSEDEAM 293
            ATL E + +
Sbjct: 451 EATLREADEL 460


>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
          Length = 736

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 38/193 (19%), Positives = 74/193 (38%), Gaps = 22/193 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+ +G +SFW PY+R L  +          P+ ++ +E   +        +  R   +  
Sbjct: 267 ERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQSRCRMLVN 320

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
                     ++    +Q  +  P E    +      +    C    + +         P
Sbjct: 321 ----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNIPGLGSTP 370

Query: 123 LGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           L  P++ +     +    + D      ++QLV  R  + G+++ +  G   N +LL++YG
Sbjct: 371 LMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNPQLLLDYG 430

Query: 177 FVDEDNPYDRLVV 189
           F   DNP+DR  V
Sbjct: 431 FTLSDNPHDRFEV 443


>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
 gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
          Length = 427

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 113/271 (41%), Gaps = 27/271 (9%)

Query: 57  AEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
           AE + R Y+E D V  W  A   F+ +P     + FT   F++A   V++    + +   
Sbjct: 134 AEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIVRANSFEVTRADG 192

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
            R   LVPL   L+  +S     +  VDD   + VD  ++AG+ +    G   +++    
Sbjct: 193 VRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCSHGEYSDAETFAR 251

Query: 175 YG----FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL--SVQVFHVHAGREKEA 228
           +G    + +E+N  D +            + D+  + +  G    +  +     G    A
Sbjct: 252 FGTSAVYSEENNARDVITF---------TFPDEVHLKEEIGSCGPAEDIGFTRDG----A 298

Query: 229 ISDMLPYLRLGYVSDT--SEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYP 284
            ++++  LRL   + T  SEM+     L  +   P+S   E AV D L       L  YP
Sbjct: 299 SAELMCALRLVSANATEWSEMRKPNFDLQSLKNRPLSEESEVAVYDALFATLTDLLNSYP 358

Query: 285 ATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
            +  +DE +L    L   +R A ++   EK+
Sbjct: 359 YSDVDDEHLLRGDRLADDERRAVKIRLREKR 389


>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 595

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 96/209 (45%), Gaps = 21/209 (10%)

Query: 12  WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
           W  YIR L R        +  P +W+E E   L G+  +A +  +   + +E++ L  V 
Sbjct: 223 WTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEAALEAKLLSLGKEFDTLREV- 274

Query: 72  FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAY 130
                  + +P+         E+  + +V V +     + + L R   A+VP G  ++ +
Sbjct: 275 ------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRCLELPRSGTAMVP-GLDMVNH 326

Query: 131 SSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQPNSKLLINYGFVDEDNPYDR 186
           SSK  A     D D V L++    P ++GE + +  G  +P S++L +YGF+D +N  D+
Sbjct: 327 SSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAKPASEMLFSYGFIDPNNIVDK 386

Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
           L +      +DP  + K  ++     L++
Sbjct: 387 LTLRLDPFPDDPLARAKLRISNSGPTLTI 415


>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 481

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 111/277 (40%), Gaps = 24/277 (8%)

Query: 34  LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 93
           + +S+ E+    GS       +    I  +Y +L T   M      ++    P   F  E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168

Query: 94  IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 150
            +K A   V S  +    VS      L+     +L +SS   +C A      D + ++  
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
           + Y  G+ + ++ GP PN++LL  YGFV  +NP+D   +    +   P Y+ K  + +  
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERLWKLA 286

Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPICPVSPCM 264
           G  +     + A       +D LP   L Y+      +S++ ++      G    +S   
Sbjct: 287 GLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADEKISDDS 339

Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 301
           E  +L  L D   A L G+   L    A L   +++P
Sbjct: 340 ETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376


>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
 gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 36/72 (50%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             ++PL       S+K    +  V+   Q++ ++P K GE I    G   N  LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230

Query: 178 VDEDNPYDRLVV 189
           ++ DNP+D  + 
Sbjct: 231 IEFDNPHDHFIT 242


>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
 gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
          Length = 486

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 90
           S   W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F 
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369

Query: 91  --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 143
             + E  K  +  V S     + +++     LV +  P+L +     +S  K     + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
              +  DR Y   + I +  G   N++L ++YGF   +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464


>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
 gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 80/210 (38%), Gaps = 44/210 (20%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
           +YE   G  S W PY   L  +         + + W+E ELA L  S    +I       
Sbjct: 91  LYEYHNGSASNWAPYFAVLPTE-------FNTLMFWTEDELAELQASAVVGKIGKESADE 143

Query: 55  ---------------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFT 91
                                E+A+   +E      +  M   GSL   Y +D+     T
Sbjct: 144 AFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNLELMHKMGSLIMAYAFDVEPATPT 203

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
            E+ ++   A +     L K        +VPL   L A + +C A L    D +++   +
Sbjct: 204 KEVDEEG-FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDCLEMKALK 255

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDED 181
           P +AGE I    GP P S LL  YG+V ++
Sbjct: 256 PIQAGEEIFNDYGPLPRSDLLRRYGYVTDN 285


>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
          Length = 482

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 36/74 (48%)

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           R AL+PL   +  YS     M    D A+ L V   +  G+      GP  N  LL  YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274

Query: 177 FVDEDNPYDRLVVE 190
           FV++DNP D  V+E
Sbjct: 275 FVEQDNPSDTYVLE 288


>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 484

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 52/220 (23%), Positives = 82/220 (37%), Gaps = 75/220 (34%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE  +G  S W PY+  L           E+P+ W+  EL  L    T  ++       
Sbjct: 112 LYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGALQAGATAGKV------- 157

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            RE  E DT   +   + + +P          ++F+ +       +V     +LA R   
Sbjct: 158 GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDEALV-----ALAHR--- 198

Query: 121 VPLGPPLLAYS---------------------SKCKAMLAAV------------------ 141
             +G  ++AY+                        KAM+  V                  
Sbjct: 199 --MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPMADILNADAEFNAHVNH 256

Query: 142 -DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
            D+ + +   RP KAGE I+ + GP PNS+LL  YG+V E
Sbjct: 257 GDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296


>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
 gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
          Length = 493

 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 77/345 (22%), Positives = 136/345 (39%), Gaps = 55/345 (15%)

Query: 9   KSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
           KS W PY+  L DR          +PL ++  ++  L  +      L+  + I R+Y  +
Sbjct: 163 KSDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFI 214

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVP 122
                     F Q   D   + FT+++F+ A   V +      V+L +   +    AL+P
Sbjct: 215 RR--------FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIP 266

Query: 123 L------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQPNSKLL 172
           L        P  A  ++C A     A ++ ++  + R     A   I +  G + +++ L
Sbjct: 267 LWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRTDAEFL 326

Query: 173 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 232
           ++ GFV   NP+  +     L    P Y+++  + +  G  +   F     RE  A +  
Sbjct: 327 VHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAATTT 386

Query: 233 LP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVLDQLAD 274
            P     + L  VS  +              +  + +   + P   C   ER     LA 
Sbjct: 387 TPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER----WLAT 442

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
             K  L  YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 443 VMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 105/270 (38%), Gaps = 21/270 (7%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + ++E EL    G+       +  + I+ ++  L         LF Q+    P + F+
Sbjct: 116 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 169

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 149
            E +K A   V S  +  Q         L P    +L +SS+ K   +       + ++ 
Sbjct: 170 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 228

Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
            + Y+ G+ + +  G  PNS+LL  YGFV   NP D   +  + + + P Y+ K  +   
Sbjct: 229 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 288

Query: 210 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCM 264
            G  S     +        ++D LP     YLR+     +      + +      VS   
Sbjct: 289 AGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSDSN 341

Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAML 294
           E  +L  L + F   L G+   L + E  L
Sbjct: 342 EVEILQFLVESFGHLLGGFGTPLEKLEEQL 371


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 105/270 (38%), Gaps = 21/270 (7%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + ++E EL    G+       +  + I+ ++  L         LF Q+    P + F+
Sbjct: 107 SSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL------VMQLFIQHRDLFPLDKFS 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVV 149
            E +K A   V S  +  Q         L P    +L +SS+ K   +       + ++ 
Sbjct: 161 IEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-MLNHSSEAKPCHVYDVSSGNLSVLA 219

Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
            + Y+ G+ + +  G  PNS+LL  YGFV   NP D   +  + + + P Y+ K  +   
Sbjct: 220 GKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVS 279

Query: 210 NGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCM 264
            G  S     +        ++D LP     YLR+     +      + +      VS   
Sbjct: 280 AGLDSTSTIPL-------TLTDPLPKNVLRYLRIQRADASDLAAMALQNAKADEKVSDSN 332

Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAML 294
           E  +L  L + F   L G+   L + E  L
Sbjct: 333 EVEILQFLVESFGHLLGGFGTPLEKLEEQL 362


>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
          Length = 403

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 14/151 (9%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           P+ W   EL +L GS    +I ER   IK +Y  +  +W          P  I  +  T 
Sbjct: 141 PIFWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTL 188

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDR 151
           E FK A + V S    +  V+ AR  A+VP    L  +  +  K        A  +   +
Sbjct: 189 EEFKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQ 247

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
               G  I    G + N + L+NYGF  EDN
Sbjct: 248 KISVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278


>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
 gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
          Length = 484

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    +++++   +P   GE +    G  PNS+LL  YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272

Query: 179 D-EDNPYD 185
           + E + YD
Sbjct: 273 EWEGSKYD 280


>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
 gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
          Length = 418

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 22/172 (12%)

Query: 36  WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 92
           W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F   + 
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 147
           E  K  +  V S     + +++     LV +  P+L +     +S  K     + +   +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 199
             DR Y   + I +  G   N++L ++YGF   +NPYD    E  L T+ P+
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD----ETELLTQFPE 359


>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
          Length = 496

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 81/192 (42%), Gaps = 19/192 (9%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-E 58
           ++S W PY +  ++      L     + W + EL  L         G     E+ ER  +
Sbjct: 103 ERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERIIK 157

Query: 59  GIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
            IK+   E  T      S F  + Y    I + +F  E+   +    +      +++   
Sbjct: 158 SIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELENE 217

Query: 116 RRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
           R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  
Sbjct: 218 RYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRR 277

Query: 175 YGFVDED-NPYD 185
           YG+V+ D + YD
Sbjct: 278 YGYVEWDGSKYD 289


>gi|219125216|ref|XP_002182882.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405676|gb|EEC45618.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 488

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 63/161 (39%), Gaps = 17/161 (10%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           P  WSE+EL  L GSP    + +  EGI+++YN +   W       + +    P+    F
Sbjct: 138 PRQWSESELKALRGSPLLIRVRKAREGIQKDYNLIREAW------KEIHSSHSPSTTANF 191

Query: 93  ---EIFKQAFVAVQSCVVH-----LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV--- 141
              E F  A   V S           K        ++PL      +  +C     +    
Sbjct: 192 PKLEEFSSAMAVVSSRAFSGMAGFETKGGAVNDTTMIPLLDLCNHHRGRCVTKNVSYRFK 251

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
           D  V +        G+++ +  G Q N++L +NYGF   DN
Sbjct: 252 DGTVMVKAVTDIAIGDTLKITYGAQGNAQLFLNYGFCVADN 292


>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
          Length = 456

 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 45/194 (23%), Positives = 84/194 (43%), Gaps = 19/194 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+  G++S W PY+  +             P+ W E E+ +L  +P + + LE+    K 
Sbjct: 117 ERFAGERSQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KT 164

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA--- 119
           E  EL T      +  Q    D   + +T++  + A+  V +  V+++     R  A   
Sbjct: 165 EVQELHTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQD 224

Query: 120 LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           +  L P   LL +S + +  A  +      ++  +   +  +   +  GP  N +LL+ Y
Sbjct: 225 VCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEY 284

Query: 176 GFVDEDNPYDRLVV 189
           GFV  +NP+  + V
Sbjct: 285 GFVAANNPHRSVYV 298


>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
 gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
          Length = 536

 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 78/197 (39%), Gaps = 34/197 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEG 59
           + E+ +GK+SFW PYI  L       +L++  PLLW +ET    L GSP    I  R + 
Sbjct: 117 LVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAETIQELLQGSPMLDTIATRLKL 173

Query: 60  IKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA- 115
            + ++  L T     F+ G                 E   +  V   S V+  +  SL  
Sbjct: 174 CQEDHEALLTAGIEKFLPGG----------------ETLSEGDVRWASAVLLSRAFSLEL 217

Query: 116 ------RRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQ 166
                     LVP    L   SS  +      D       L   + Y  G+ +    GP 
Sbjct: 218 DVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPA 277

Query: 167 -PNSKLLINYGFVDEDN 182
              S+L ++YGFVD++N
Sbjct: 278 LTGSQLFLDYGFVDDEN 294


>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
          Length = 420

 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 27/212 (12%)

Query: 14  PYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE---- 66
           PYIR L   D+ R        +PL +S+TEL  L GS      L+R    + E+ +    
Sbjct: 117 PYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSEWEQCQKT 168

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
           + TV    G  F    Y   +   +   F    ++    +V     S    + ++  G  
Sbjct: 169 IATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYPVLLPGID 223

Query: 127 LLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
            L +S     S   ++  + D + + LV+ +   AG  ++   GP+PN++L++ YGF   
Sbjct: 224 SLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELILGYGFSLP 283

Query: 181 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
           +NP D +V++   N+     Q K  V  RN +
Sbjct: 284 ENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314


>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
          Length = 482

 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 81/206 (39%), Gaps = 41/206 (19%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY- 64
           G    W PY+  L         A  +P+ WS  EL  L  SP  A++  E A+ + R   
Sbjct: 121 GAACRWRPYLDVL-------PAAFATPMFWSPAELGALQASPAVAKVGRESADNMFRGIL 173

Query: 65  -------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 105
                               ++  +    GS    Y +D+  E    E  +  +V  +  
Sbjct: 174 LPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLDKEEDEDEDGEDGWVEDRDG 233

Query: 106 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 165
              +  V +A           +L   ++    +   DD + +   RP +AGE I+ + GP
Sbjct: 234 KALMGMVPMA----------DILNADAEFNVHVNHGDDDLTVTALRPIRAGEEILNYYGP 283

Query: 166 QPNSKLLINYGFVDEDNP-YDRLVVE 190
            PNS+LL  YG+V E +  YD  VVE
Sbjct: 284 HPNSELLRRYGYVTERHARYD--VVE 307


>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
          Length = 410

 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 87/199 (43%), Gaps = 21/199 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EK + ++S W  +IR +         + ++   WSE  +A L   P  A    R   +
Sbjct: 31  LVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKPFLAIAARRKRVV 83

Query: 61  KREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 112
             E+++L  +        W    +  +     +   +F+ E +  A   V +   H ++ 
Sbjct: 84  NDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAGTVSTRSCHYERK 143

Query: 113 S-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYKAGESIVVWCGPQ 166
           S  + R   V    P+L + +   A +AA     DA+  ++   R Y+ GE +++  G  
Sbjct: 144 SGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYEEGEQVMIHYGNW 203

Query: 167 PNSKLLINYGFVDEDNPYD 185
            N+ LL +YGFV EDNP D
Sbjct: 204 SNAGLLEHYGFVLEDNPLD 222


>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
          Length = 823

 Score = 45.8 bits (107), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 72/317 (22%), Positives = 122/317 (38%), Gaps = 79/317 (24%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
            K  +K F  PYI  L             PL +SE EL  L G+    ++L   E  K  
Sbjct: 93  NKTNEKCFHFPYINTLPT-------TFSCPLSYSENELKMLKGT----KLLVTVEKTKTF 141

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV-- 121
             +L   +    +L  QYP    T    F+ F Q  V          +V  +R F ++  
Sbjct: 142 LKKLSDYY---ETLTHQYP----TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYP 187

Query: 122 -PLGP-----PLLAYSSKCKAMLAAVDDAVQLVVDRPYKA----GESIVVWCGPQ----- 166
            P+G      P   +S+         +  V  V +R  +         V+ CG Q     
Sbjct: 188 DPIGDVASLIPFADFSNH------NTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNY 241

Query: 167 ---PNSKLLINYGFVDEDNPYDRLVV-----EAALNTEDPQYQDKRM------------- 205
              PN K+L+ YGFV  +NPYD +++     E     +  + ++ +M             
Sbjct: 242 RIRPNEKMLLGYGFVISENPYDEVLLRINFKERHFEKQVEESEESKMEVENKENERMEVE 301

Query: 206 -------VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGP 256
                  + Q   +  V  F  +  REKE  +D+L  LR+  +S  + ++    +  L  
Sbjct: 302 EEDNEDEITQILKREGVDRFDYYLTREKELPTDLLRVLRIVNLSLVEANQYSQALLDLSY 361

Query: 257 ICPVSPC-MERAVLDQL 272
           + P++     R++++Q+
Sbjct: 362 VSPINEIKATRSLMEQI 378


>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 714

 Score = 45.8 bits (107), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 81/214 (37%), Gaps = 46/214 (21%)

Query: 7   GKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 64
           G KS+W PYI  L         Q   E+ LLW       L G+  KA     A   K  Y
Sbjct: 138 GDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFAAEAARWKEMY 190

Query: 65  NELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQKVSLARRFAL 120
                   + G   L Q    +    A+T+E F+ A      +S    +   +L    AL
Sbjct: 191 --------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVLDATLPADKAL 242

Query: 121 VP------------LGP----------PLLAYSSK---CKAMLAAVDDAVQLVVDRPYKA 155
           +             LG           PL+  S+     K    A    V L V  PY++
Sbjct: 243 LQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFVGLQVLEPYES 302

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
           G+ I    GP+ N  LL+ YGF   DNP+D +V+
Sbjct: 303 GQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336


>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 622

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            A+VPL   L A +    A L    D +++   R  K GE I    G  PNS LL  YG 
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404

Query: 178 VDEDNPYDRLVVEAALNT 195
           VD+ N +D  VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420


>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
           occidentalis]
          Length = 381

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 5/101 (4%)

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
             FE F  A+ AV +  ++++         L P    LL +  K     + V++   +  
Sbjct: 165 LNFETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRS 219

Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           +  Y+AG  + +  G   N  L +NYGFV ++NP D + VE
Sbjct: 220 NVGYEAGSEVFIGYGSHDNRTLFLNYGFVLDENPNDCITVE 260


>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
 gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
          Length = 692

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 77/201 (38%), Gaps = 35/201 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++E+ +G+ S W PY+  L R          S L + + +L +L G+            +
Sbjct: 113 VHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQAYRNAV 167

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + EY+   ++    G L          E++ ++IF  A+  + S      +   +R    
Sbjct: 168 QEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTSRVLDA 213

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESIVVWCG 164
                P L    + + ML  VD +                + L V  P  +GE +    G
Sbjct: 214 YFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEVHNNYG 273

Query: 165 PQPNSKLLINYGFVDEDNPYD 185
           P  N +L+  YGF   DNP D
Sbjct: 274 PLNNQQLMTTYGFCIVDNPCD 294


>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 966

 Score = 45.8 bits (107), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 78/206 (37%), Gaps = 41/206 (19%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           S W PY   L R           P+L S  +LA  TG+    E++     + R+Y  L  
Sbjct: 652 SLWQPYFDLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQH 704

Query: 70  VWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL 123
           +       F+  P   IP   FTF  F  A   V +     C  +     + +   L+PL
Sbjct: 705 I-------FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPL 757

Query: 124 --------GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 173
                   G     + +K  C   +A+ D             GE I +  G + N ++L+
Sbjct: 758 FDMFNHKPGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLL 807

Query: 174 NYGFVDEDN-PYDRLVVEAALNTEDP 198
             GFVD  N  YD + +  ++   DP
Sbjct: 808 YSGFVDPTNIEYDHIKLSVSIPQSDP 833


>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
          Length = 452

 Score = 45.8 bits (107), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 77/192 (40%), Gaps = 36/192 (18%)

Query: 30  VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
           V+ PL W + +   L G       ++      R Y+++    F A + F      +  EA
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHG---ARFYSQVYKHLFGANNQF------VNAEA 193

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY---SSKCKAMLAAVDDAVQ 146
           F + I         S ++        + FAL+P           S  C+  L + D+ VQ
Sbjct: 194 FFWAI---------SILMSRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQ 243

Query: 147 ---------LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL-- 193
                    +   R Y+ GE + +  G   N +LL NYGF   +NPYD   L + AAL  
Sbjct: 244 DFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQ 303

Query: 194 -NTEDPQYQDKR 204
            N  DP +  KR
Sbjct: 304 PNEADPAFAQKR 315


>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
          Length = 502

 Score = 45.8 bits (107), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 51/275 (18%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K    S WLPY+  L +          +PL +S  EL  L+ SP   E L     + R+
Sbjct: 167 QKLSPDSSWLPYLDALPQ-------TFSTPLYFSALELRKLSPSPAYEESLIMYRNVARQ 219

Query: 64  YNELDTVWFMAGS----------------------LFQQYPYDIPTEAFTFEIFKQAFVA 101
           +     V+F+A                        LF   P+ +    FTF++++ A   
Sbjct: 220 F-----VYFLAAVQRSERSRSAKKDKNHAAVGMEPLFLNAPFTVSN--FTFDLYRWAVAC 272

Query: 102 VQSCVVHL----QKVSLARRFALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRP 152
           V + +  +     K S  +  A VP   PLL      +        +   D   +   + 
Sbjct: 273 VTTRINFIPSQYAKDSNGQPVA-VPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKATKD 331

Query: 153 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMV---AQ 208
           YKAG+ + ++ G + N +  ++ GFV D +N  D   ++      D Q + +  +   A 
Sbjct: 332 YKAGDEVTIFYGIRTNRQFFLHNGFVPDGENKNDTYKLKIGFPRGDKQVRARLKLMHDAG 391

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 243
            N +  V VF V+A      +S +L + R+  V +
Sbjct: 392 FNAESRVFVFEVNASERPVPLS-LLDFARVFLVEN 425


>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
          Length = 1114

 Score = 45.4 bits (106), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 80/331 (24%), Positives = 130/331 (39%), Gaps = 54/331 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK---------- 50
           +YE+++G KS W P+I  L            + L WSE ELA L GS             
Sbjct: 106 LYERRKGAKSRWGPHIALLP------ATPPHALLRWSEAELAELAGSDALELANRWRSQV 159

Query: 51  ----AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 106
               +EI++++     E +    +   A     ++P+ +  E F++ +           +
Sbjct: 160 SSDFSEIVDKSRAAVEESDPGKQL-SAAVKASLRFPW-LDLEGFSWAV----------SM 207

Query: 107 VHLQKVSLARRFALVPLGPPLLAY--------SSKCKAMLAAVDDAVQLVVDR---PYKA 155
           +  + VS++R+ A     PP+ A+                   DDA    V R     K 
Sbjct: 208 IWSRCVSVSRKGA-----PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFVLRRTGNAKK 262

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
           G+ + +     PN+ LL+ YGF  +   +    + A L+ E P Y+ KR   +   KL +
Sbjct: 263 GDELKLCYDGLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALE---KLGL 319

Query: 216 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 275
                 A   + A  D LP  RL  ++     ++ +  L  +   S    RA    L   
Sbjct: 320 GATADGAAPFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEATARAAAGDLVAA 376

Query: 276 FKARLAGYPATLSEDEAMLTDYNLHPKKRVA 306
             A LA Y  +  ED A L D    P+ R+A
Sbjct: 377 CDALLAAYRGSEDEDAAALADPATPPRLRLA 407


>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
          Length = 371

 Score = 45.4 bits (106), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           R AL+P+       S  C    +   +   +  DR Y+A E +    G   N  LL  YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235

Query: 177 FVDEDNPYDRLVVEAAL 193
           F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252


>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
 gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
          Length = 508

 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 8/160 (5%)

Query: 139 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTE 196
            A   A+ L  DRPY AG+ + V  GP+ + +LL++YGF      NP+    +  A++ +
Sbjct: 242 GATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDRQ 301

Query: 197 -DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSVISS 253
            DP    K     R+G  S   F +      E +   L +L  R     +T E+ SV+  
Sbjct: 302 GDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASVLFE 361

Query: 254 LGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 291
            G   P+    +  V  L  L++   A L  YP ++  D+
Sbjct: 362 SGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400


>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
          Length = 353

 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)

Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           ++ L+R FA    G   L +S              Q+V ++ +K G+S+ +  G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241

Query: 171 LLINYGFVDEDNPYDRLVV 189
           LL++YGF+  DNP D  V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260


>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
          Length = 527

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 131 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           S  CK + +A+  +VQ   DR YK GE + V  GP  N  LL  YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251

Query: 191 AAL 193
             +
Sbjct: 252 EVI 254


>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
           24927]
          Length = 611

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 68/296 (22%), Positives = 123/296 (41%), Gaps = 36/296 (12%)

Query: 8   KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA-EILERAEGIKREYNE 66
           +  FW PYIR L +         ++PL +++ E+  L G+   A ++L R +    E+  
Sbjct: 125 QSQFW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLGAGDVLLRKQLWMEEWEA 176

Query: 67  LDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV--QSCVVHLQKVSLARRF 118
                F+ G      ++Y +D+   A   +T   F    V +   S +     +S    F
Sbjct: 177 GKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGITMDSSIEENTMLSDDNGF 234

Query: 119 -ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             L+PL   +L +    K +      +  L+       G  +    GP+ N +LL+ YGF
Sbjct: 235 PVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVFNNYGPKGNEELLMGYGF 293

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYL 236
           V  +NP D L ++  ++   P+ Q  ++  QR  K +  +VFH+    +    +  +P L
Sbjct: 294 VIPENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFHLTKSADSGQKTSTVPAL 350

Query: 237 RLGY-----------VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
              +           V++ +E+  + +      P+S   E AV   L    K +LA
Sbjct: 351 ESDWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAVALGLKAAIKQKLA 406


>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 503

 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L      +++V  +P  +GE I    G  PNS LL  YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314

Query: 179 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 222
           D           NP D + V A   LN  + + Q   +  +       G   V VF    
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVF---- 370

Query: 223 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 282
            R+ E  S+++  +RL  +  T   ++      P   V       +L  +      RL  
Sbjct: 371 TRDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427

Query: 283 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
           YP ++ +DEA+L    L   KR A  +   EK +L   L 
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466


>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
 gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
          Length = 442

 Score = 45.4 bits (106), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 76/190 (40%), Gaps = 20/190 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E  +GK SFW PY+  L          +  P  W E ++AYL  +     I E    +K 
Sbjct: 118 EYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQANVKG 175

Query: 63  EYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVS 113
           E+ +    L    F   + +    Y      FT   F+ + +   +   H+     Q V 
Sbjct: 176 EFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLLPQSVE 235

Query: 114 LARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPN 168
           L   F+++    PLL  ++     + + D     DA  LV    Y  G  +    G + N
Sbjct: 236 L-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNYGLKTN 291

Query: 169 SKLLINYGFV 178
           S+LL+ YGF+
Sbjct: 292 SELLLGYGFI 301


>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 406

 Score = 45.1 bits (105), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 54/227 (23%), Positives = 100/227 (44%), Gaps = 12/227 (5%)

Query: 76  SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK 135
            +F ++    P + F+F+ +K A  ++ S  +    +S      L+     +L ++S  K
Sbjct: 147 GVFMRHRDLFPLDKFSFQHYKWALSSIWSRGMDF-TISEGNSVRLMAPFADMLNHASDAK 205

Query: 136 AMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
              A       L V   R Y+ G+ + ++ G   NS+LL  YGFV  DNP D   +    
Sbjct: 206 QCHAYDPSTGSLTVLACRDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPNDNYELVLQT 265

Query: 194 NTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 251
           ++  P Y+ K    QR  KL+   ++  +    +      +L YLR+  + D S++ ++ 
Sbjct: 266 SSMAPLYEQK----QRLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMT 320

Query: 252 SSLG--PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             +       +S   E  +L  L+   +A L G+  +L + E  L +
Sbjct: 321 MQIATESYTKISDENESQILLFLSQSIEALLEGFEISLEKLETQLAE 367


>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 45.1 bits (105), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 30/202 (14%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M EKK GK S W  Y+  L +           PL W   +L  L G+     ++E+  G 
Sbjct: 88  MTEKKLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGC 137

Query: 61  KR----------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 110
           K           ++N +  + F+  +   + P++  +    + ++  A V+  S  +   
Sbjct: 138 KAMPDRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGED 195

Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
           +       A+VP+   L   +      L   A   A++++       GE ++   G  PN
Sbjct: 196 RFQ-----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSYGDLPN 250

Query: 169 SKLLINYGFVDED-NPYDRLVV 189
           S+LL  YGFV+ D NP+D L V
Sbjct: 251 SELLRRYGFVETDPNPHDCLEV 272


>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 503

 Score = 45.1 bits (105), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 82/322 (25%), Positives = 132/322 (40%), Gaps = 40/322 (12%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           PL WSE EL+ L G+   + +    +  + +Y EL+        LF  +    P ++   
Sbjct: 155 PLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQL 208

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAVQ 146
           + F  A   V+S V            ALVPL   L+ +     ++ +  LA      A  
Sbjct: 209 DDFLWAVATVRSRV---HSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQA 264

Query: 147 LVVD--RPYKAGESIVVWCGP--------QPNSKLLINYGFVDEDNPY-------DRLVV 189
           LVV+  R Y  GE + +  G         + +S++L++YG +D D P           ++
Sbjct: 265 LVVEAQRDYAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFIL 324

Query: 190 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 249
             AL  +D  Y DK  + + NG      F + A  E          L      D   ++ 
Sbjct: 325 SLALPEDDKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLEP 384

Query: 250 VISSLG---PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 306
           +  +      + PVS   ERAV + + +  +A L GY  ++ +D   L D    P  R+ 
Sbjct: 385 LFRNEAWGHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRLE 442

Query: 307 TQ-LVRM-EKKMLNACLQVTAD 326
              LVR+ EK+ L+A L    D
Sbjct: 443 KAILVRLGEKETLDATLAFFED 464


>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
          Length = 478

 Score = 45.1 bits (105), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 2/67 (2%)

Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
           L  +   CK + +A+  +VQ   DR YK GE + V  GP  N  LL  YGF+ + N +D 
Sbjct: 187 LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDE 244

Query: 187 LVVEAAL 193
           + ++  +
Sbjct: 245 VYLDEVI 251


>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
 gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
          Length = 499

 Score = 45.1 bits (105), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 47/85 (55%), Gaps = 3/85 (3%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L + ++KC A L   +D++++      KAGE +    G  PN+++L  YG+V
Sbjct: 229 SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQVYNIYGNHPNAEILRRYGYV 288

Query: 179 D-EDNPYD--RLVVEAALNTEDPQY 200
           + E + YD   L +E  + T   QY
Sbjct: 289 EWEGSKYDFGELPLEVIIETLHEQY 313


>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 476

 Score = 45.1 bits (105), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 84/377 (22%), Positives = 155/377 (41%), Gaps = 51/377 (13%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEG 59
           MYE    K SFW PY+  L          +++P+ W++ +L  L G+ T + +  E AE 
Sbjct: 106 MYEI-TNKDSFWRPYLDLLPE-------TLDTPMFWNDDDLELLEGTSTLSHLGKEDAET 157

Query: 60  IKRE----YNELDTVWF----MAGSLFQQYPYDIPTEAFTFEIFKQAFV----------- 100
           I  E    + +L    F       +L+ +    I   +F+ +  +               
Sbjct: 158 IFTEQIVPFMKLHPTHFDLKVHNMALYHRVASVIMAYSFSEDDDEDDDDEDDDEEEDCCD 217

Query: 101 --AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGE 157
             A   C    ++  +  + A+VPL   +L + + C  A L      + +    P  AG 
Sbjct: 218 GDANNECCSQKRQKRM-EKIAMVPLAD-MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGH 275

Query: 158 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLS 214
            +    G   NS+LL  YGF+D+ N ++ + +   +     E   + ++ M A       
Sbjct: 276 ELYNTYGDLSNSELLRKYGFIDDVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCW 335

Query: 215 VQVFHVHAG----REKEAISDML-----PYLRLGYVSDTSEMQSVISSLGPICPVSPCME 265
           +  FH+ A     +E EA   +L         L  + D  E++S +++L     V+ C  
Sbjct: 336 LPEFHIPADALPPQELEASIALLFQSPKQVRALRALDDEDEIRSFLATL-----VNKC-R 389

Query: 266 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           R V + L  + + R   Y  T  EDE  L + +L  ++++A ++   E+ +L+  +    
Sbjct: 390 RKVSETLLAFGQKRAEEYTTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLK 449

Query: 326 DMIMLLPDVTVSPCPAP 342
           + +   P    +  PAP
Sbjct: 450 ERLETTPPDQETKEPAP 466


>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
          Length = 508

 Score = 44.7 bits (104), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 63/227 (27%), Positives = 92/227 (40%), Gaps = 37/227 (16%)

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
           +VP+   L A      A L   +  +++V  +P KAGE I    G  PNS LL  YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317

Query: 180 ----------EDNPYD------RLVVEAALNTEDPQYQDK----RMVAQRNGKLSVQVFH 219
                       NP D       L V AA      + QDK      VA  +       F 
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDD------TFV 371

Query: 220 VHAGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
           +  G + +   +++ + RL ++  D  E     S L       P ++  VL    D    
Sbjct: 372 I--GTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSR 424

Query: 279 RLAGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 322
           R+  Y  T+ +DEA+L   N  P    K+ A  +   EK++L+  LQ
Sbjct: 425 RINEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471


>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
          Length = 504

 Score = 44.7 bits (104), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 54/124 (43%), Gaps = 16/124 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYE  +G +S W  YI+ +          ++ P+LW +  + YL G+  +  ++E  E +
Sbjct: 92  MYETSKGVQSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETL 144

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
           + +Y E          + + +P       FT E FK A   V S   ++ +       +L
Sbjct: 145 EEQYRE------DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SL 195

Query: 121 VPLG 124
           VPL 
Sbjct: 196 VPLA 199


>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
          Length = 437

 Score = 44.7 bits (104), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 87/349 (24%), Positives = 147/349 (42%), Gaps = 51/349 (14%)

Query: 1   MYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 59
           +YEK  +G KS W  +I  L R          + L +   EL  L GS       +  E 
Sbjct: 102 LYEKFVRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEK 154

Query: 60  IKREYNELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
           +  +Y  L     +   LF+  P     D+  E F+ E +K A   + S       V +A
Sbjct: 155 VAHDYARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVA 209

Query: 116 RRF--ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSK 170
           ++   A+VP+   +L +  + + M    D + Q   LV  + + AG  + +  GP  N K
Sbjct: 210 KQSFKAMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHK 267

Query: 171 LLINYGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKE 227
           LL  YGFV   NP+D   VE  L  ++     +Q+K  +   NG         HA    E
Sbjct: 268 LLALYGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHATNPFE 318

Query: 228 AISD-----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 280
            ++D     +L   R+  +      E + + +       +S   E+  L +L    +  L
Sbjct: 319 LVADESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYTLEKML 378

Query: 281 AGYPATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 323
             +P ++ ED+ +L      TD NL+  +R+A  + R +K +L+  + +
Sbjct: 379 ESFPTSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425


>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
          Length = 476

 Score = 44.7 bits (104), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 90/377 (23%), Positives = 143/377 (37%), Gaps = 71/377 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL------ 54
           +YE   G  S W PY   L           ++ + W+E ELA L  S    +I       
Sbjct: 91  LYEYHNGSASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANE 143

Query: 55  ---------------------ERAEGIKREYNELDTVWFMA--GSLFQQYPYDIPTEAFT 91
                                ERA+   +E    + +  M   GSL   Y +D+   A +
Sbjct: 144 VFIEQLLPVIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AIS 202

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
            +   +   A +     L K        +VPL   L A   +C A L    D +++   +
Sbjct: 203 DKEVDEEGFAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALK 255

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTEDPQ 199
           P +AG+ I    GP P S LL  YG++ D    YD            L  +   + E  +
Sbjct: 256 PIQAGDEIFNDYGPLPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEERIE 315

Query: 200 YQDKRMVAQRNGKLSVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 258
           Y D++ +      ++  + F +     +E++S  L  L    +    E +  + S G + 
Sbjct: 316 YLDEQEIVDTGYDIAASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKGRL- 368

Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQLVRM 312
           P +  M       L    +AR+A YP TL +D     E          ++RVA  + VR+
Sbjct: 369 PKAEKMTGKAAKFLYKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARAVRI 428

Query: 313 -EKKMLNACLQVTADMI 328
            EKK+L    +  AD I
Sbjct: 429 GEKKLLVQTEERLADKI 445


>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 451

 Score = 44.7 bits (104), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 38/70 (54%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           +D + LV  + + +G  +    GP PNS+LL+ +GF   DNP++ + + A +    P + 
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288

Query: 202 DKRMVAQRNG 211
           +K  +   NG
Sbjct: 289 EKNKIMLDNG 298


>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
           206040]
          Length = 378

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 72/171 (42%), Gaps = 24/171 (14%)

Query: 28  LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 87
             V  P++W   EL +L     +  + +R +  + ++++           F +   DI  
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149

Query: 88  EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 142
           E +T+     A++ V +   + +     K     R AL+P+        + C+   +   
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           +   +V DR YK GE + +      N   L+ YGFV ++NP D + ++  +
Sbjct: 203 EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYIDDVI 253


>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
          Length = 355

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 74/188 (39%), Gaps = 18/188 (9%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MY  K  K+SFW PY   +          + SP+ WSE EL  L GS          + +
Sbjct: 143 MYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNMARIEKKRL 193

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-A 119
           +  Y+EL         +F+        +AFT + +  A     S V+ L + +       
Sbjct: 194 RAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQTGHGNVPT 247

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            +PL   +    S+ K  +       A  +  DR    G  +    G + N + L+  GF
Sbjct: 248 FIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESYGNKSNYEYLLYNGF 307

Query: 178 VDEDNPYD 185
           V EDNP D
Sbjct: 308 VMEDNPND 315


>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 691

 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 83/185 (44%), Gaps = 15/185 (8%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-- 63
           +G + FW PYI  L +    G L   +PL +   +L +L G+ + A   E+ E + +E  
Sbjct: 123 RGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQKESLLKEKY 176

Query: 64  ---YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
              + EL    F      ++Y +++   A T  + +     V + VV   ++       L
Sbjct: 177 QSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAELPEENVSVL 233

Query: 121 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
           +P    +L +    K    A +  V  VV     AGE +    GP+ N +L++NYGF  +
Sbjct: 234 LPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLMMNYGFCLQ 292

Query: 181 DNPYD 185
           +NP D
Sbjct: 293 NNPCD 297


>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
 gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
          Length = 593

 Score = 44.7 bits (104), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 72/350 (20%), Positives = 133/350 (38%), Gaps = 44/350 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EK +G  S W P+I  L  +         + L ++  ++  L G+   +  +     I
Sbjct: 252 IMEKLRGADSPWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVI 304

Query: 61  KREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHL 109
            R Y  +         D+V     +LF  Y   Y++   A +    +Q  V  Q      
Sbjct: 305 ARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDA 364

Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
             + ++   AL+P        S K  +        ++      YK+GE   ++ G + N+
Sbjct: 365 ANLPIS---ALIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSNA 421

Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGR 224
             L++ GFVD  NP D + +   L+  D   + + ++      +R  +L V     H   
Sbjct: 422 DRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHISG 481

Query: 225 EKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
           E      +L ++R+  +S        S+++  +  L   C +   +E      L    K 
Sbjct: 482 E------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKL 535

Query: 279 RLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            L    ATL E       EA+    +      +  Q  R+E+++L+  LQ
Sbjct: 536 LLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585


>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
 gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
          Length = 456

 Score = 44.3 bits (103), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 46/194 (23%), Positives = 82/194 (42%), Gaps = 19/194 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+  G  S W PY+  +             P+ W E E+ +L  +P + + LE+    K 
Sbjct: 117 ERFAGDCSQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KT 164

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA--- 119
           E  EL T      S  Q    D   + +T++  + A+  V +  V+++     R  A   
Sbjct: 165 EVQELHTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQD 224

Query: 120 LVPLGP--PLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           +  L P   LL +S + +  A  +      ++  +   +  +   +  GP  N +LL+ Y
Sbjct: 225 VCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEY 284

Query: 176 GFVDEDNPYDRLVV 189
           GFV  +NP+  + V
Sbjct: 285 GFVAANNPHRSVYV 298


>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
           Y34]
          Length = 373

 Score = 44.3 bits (103), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 24/163 (14%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 87
           P +W + EL  L   PT A +    E  + +YN E +TV     S+    FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161

Query: 88  EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 147
             F +E      V+   C       S   R ALVPL          C+  ++ + +   +
Sbjct: 162 RTFLYE------VSETECY------SWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
             DR Y+AGE + +  G   N  LL  YGF+   N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250


>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 538

 Score = 44.3 bits (103), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 142/359 (39%), Gaps = 64/359 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
           ++EK  G  S W  YIR L R       A+++PL WS  ELA L G+    ++L  A G 
Sbjct: 163 LHEKSLGDDSRWAAYIRCLPRVE-----ALDAPLFWSSEELAELAGT----QLLANAAGY 213

Query: 60  ---IKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
              ++  +  L    F    +LF     D    AF+   F  AF  ++S    L  V   
Sbjct: 214 DSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWAFGVLRSRA--LPPVDQG 271

Query: 116 RRFALVPLGPPLLAYSSKC-------KAMLAAV---------DDAVQLVVDRP----YKA 155
              AL+P G  +  +   C          +AAV           +V L V++      K 
Sbjct: 272 ESIALIP-GIDMANHDGLCSQTWQLNNGGIAAVFGGRGGADGGGSVLLRVEKTKAGGAKR 330

Query: 156 GESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKL 213
           GE I    GP   +S+  ++YGFVD        V+   ++  +D    DK  V    G  
Sbjct: 331 GEEIRCNYGPANIDSQFALDYGFVDAFCSRPGYVLGPLSIPEDDVNAFDKMDVLSVAGLK 390

Query: 214 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTS----------EMQSVISSLGPICPVSPC 263
               F + A   ++   +M+ ++RL  + +            E  ++IS      PVSP 
Sbjct: 391 ESPAFTIRA--FEDPPPEMVVFMRLLNLKNDDAFLLEAIFRQECWALISD-----PVSP- 442

Query: 264 MERAVLDQLADY-FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
                 D  AD   +  L  Y   + +D  +  D +  P+ R+A ++   EK+ L   L
Sbjct: 443 ------DNEADAGCEEALGAYATKIEDDRGVADDADASPRLRLAARVRMGEKQALEEVL 495


>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 518

 Score = 44.3 bits (103), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 50/214 (23%), Positives = 92/214 (42%), Gaps = 25/214 (11%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 201
           D+   +   +P K G+ I    G + N  LL+ YGF    N YD   +   +N    Q  
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309

Query: 202 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 242
           +   +++V Q               +  K+++     +   +K  I+ D++ YLRL  + 
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369

Query: 243 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 300
                +++ V+ SL    PVSP  E  VL          L+ +  T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425

Query: 301 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 334
            K R A      +K++L   + +    ++LL  +
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILLNQL 459


>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
          Length = 422

 Score = 44.3 bits (103), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 22/130 (16%)

Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
           + P +L Y    K     V  +++  + RP KAGE   +  G  P S L+  YGF+  DN
Sbjct: 302 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 356

Query: 183 PYDR--LVVEAALNTEDP----------QYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 230
           PYD   L ++ +++ ED            +  +RM+  R  +  +Q     + ++   + 
Sbjct: 357 PYDVIPLDLDTSVDEEDSSSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVH 411

Query: 231 DMLPYLRLGY 240
               YLRLG+
Sbjct: 412 CYFVYLRLGH 421


>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
          Length = 749

 Score = 44.3 bits (103), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 84/221 (38%), Gaps = 22/221 (9%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
            K   K F  PYI  L           + PL ++  EL  + G+   A + E+       
Sbjct: 47  NKTNPKCFHWPYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-- 96

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
              +  V +    L QQ+P     ++F  ++FK+   A QS       V   + F  V  
Sbjct: 97  ---MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGS 150

Query: 124 GPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             P   +S+ C           Q         +   K GE I      + N KLL+ YGF
Sbjct: 151 LIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGF 210

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           V+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 211 VEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251


>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 791

 Score = 44.3 bits (103), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 54/222 (24%), Positives = 88/222 (39%), Gaps = 24/222 (10%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
            K   K F  PYI  L +         + PL ++  EL  + G+     + E+       
Sbjct: 89  NKTNPKCFHWPYINVLPK-------TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-- 138

Query: 64  YNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
              +  V +    L QQ+P Y  P +    ++FK+   A QS       V   + F  V 
Sbjct: 139 ---MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVG 191

Query: 123 LGPPLLAYSSKC-KAMLAAVDDAVQLVV-----DRPYKAGESIVVWCGPQPNSKLLINYG 176
              P   +S+ C +A +  + +           +   K GE I      + N KLL+ YG
Sbjct: 192 SLIPFCDFSNHCTQAKVTYISNTRTETFSFQTNEEVVKPGEQIFNNYRIRSNEKLLLGYG 251

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           FV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 252 FVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
          Length = 643

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 74/348 (21%), Positives = 136/348 (39%), Gaps = 52/348 (14%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN---- 65
           S+W  Y+  L +Q         + + ++  E+  L  SP       R  G+ R Y+    
Sbjct: 302 SYWYEYVSILPKQ-------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTIS 354

Query: 66  ---------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 116
                    E D+      S       +   + +T + FK A   V S    L +     
Sbjct: 355 RLRFGYEGGEDDSTKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSLSE----E 409

Query: 117 RFALVPLGPPLLAYS----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNS 169
              +VPL     A +    SK    ++A   ++     +  +AGE I    G      + 
Sbjct: 410 DGGMVPLADMFNADTVISRSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSG 469

Query: 170 KLLINYGFVDEDNPY--DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 227
           ++L++YGF+ ED       +V  A +   +P Y  KR + Q NG +  + F +    + +
Sbjct: 470 QMLMDYGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNK 525

Query: 228 AISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 285
              ++  + R+  ++  ++ +  +   S      ++P  E+A L  L++     L  Y  
Sbjct: 526 LAKELFLFARIKSINKKESDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQT 585

Query: 286 TLSEDEAML-------TDYNLHPKKRVAT----QLVRMEKKMLNACLQ 322
           T+ +D  +L       T+ N H      T    +L  MEK +LN+ L+
Sbjct: 586 TIDQDNQILKEIEKDKTNTN-HSSVTFNTINAIKLRLMEKNILNSFLK 632


>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Entamoeba nuttalli P19]
          Length = 791

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 84/222 (37%), Gaps = 24/222 (10%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
            K   K F  PYI  L           + PL ++  EL  + G+   A + E+       
Sbjct: 89  NKTNPKCFHWPYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-- 138

Query: 64  YNELDTVWFMAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
              +  V +    L QQ+P Y  P +    ++FK+   A QS       V   + F  V 
Sbjct: 139 ---MKVVDYYNNKLIQQFPQYFQPFD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVG 191

Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKLLINYG 176
              P   +S+ C           Q         +   K GE I      + N KLL+ YG
Sbjct: 192 SLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEALVKPGEQIFNNYRIRSNEKLLLGYG 251

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           FV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 252 FVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 791

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 84/221 (38%), Gaps = 22/221 (9%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
            K   K F  PYI  L           + PL ++  EL  + G+   A + E+       
Sbjct: 89  NKTNPKCFHWPYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-- 138

Query: 64  YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
              +  V +    L QQ+P     ++F  ++FK+   A QS       V   + F  V  
Sbjct: 139 ---MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGS 192

Query: 124 GPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             P   +S+ C           Q         +   K GE I      + N KLL+ YGF
Sbjct: 193 LIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGF 252

Query: 178 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           V+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 253 VEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
          Length = 417

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 69/299 (23%), Positives = 127/299 (42%), Gaps = 36/299 (12%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
            K S+W  Y   L +Q         + + + + E+  L  S  +     R  GI+R YN 
Sbjct: 102 NKDSYWHQYTSILPKQ-------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYN- 153

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 126
              ++    SL  ++      + +TFE+FK A   + S    L     +    +VPL   
Sbjct: 154 --VIFKKLSSLNDEFK----KKEYTFELFKWALSCIWSRAFSLS----SDDGGMVPLADM 203

Query: 127 LLAYSSKCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQP---NSKLLINYGFVDED 181
             A   K K+ +     A QL+    +  + GE +    G      N+++L++YGF   D
Sbjct: 204 FNAIE-KAKSKVRPDSRADQLIYYASKDIERGEQVFTPYGVYKTIGNAQMLMDYGFA-FD 261

Query: 182 NPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 239
           +P +   ++  L+  ++D  Y D ++       + V+ F++   +  +   ++L Y R+ 
Sbjct: 262 DPSEGDTIQLTLDNFSDDELYIDTKIDLLEQLDI-VREFNL---KRNQLPQELLIYARVK 317

Query: 240 YVSDTSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
            + + +E+Q       +      PVS   E+  L  L++Y    L  Y  TLS+D  +L
Sbjct: 318 NLKE-NELQLAKEHYRNDDNRNKPVSRRNEKTALRYLSNYLSRYLDSYETTLSDDLELL 375


>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
 gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
          Length = 504

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 77/356 (21%), Positives = 135/356 (37%), Gaps = 66/356 (18%)

Query: 9   KSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 67
           KS W PY+  L DR          +PL ++  ++  L  +      L+  + I R+Y  +
Sbjct: 163 KSDWKPYLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFI 214

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCVVHLQKV------- 112
                     F Q   D   + FT+++F+            AV + +    KV       
Sbjct: 215 RR--------FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEF 266

Query: 113 -SLARRFALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVV 161
             +    AL+PL        P  A  ++C A     A ++ ++  + R     A   I +
Sbjct: 267 DGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFI 326

Query: 162 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 221
             G + +++ L++ GFV   NP+  +     L    P Y+++  + +  G  +   F   
Sbjct: 327 VYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFG 386

Query: 222 AGREKEAISDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC-- 263
             RE  A +   P     + L  VS  +              +  + +   + P   C  
Sbjct: 387 PAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCAR 446

Query: 264 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
            ER     LA   K  L  YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 447 TER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 497


>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
          Length = 449

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 107/251 (42%), Gaps = 12/251 (4%)

Query: 82  PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---GPPLLAYSSKCKAML 138
           P ++  E FT+E F+ A+  V +  +    +    R   VPL   GP   ++      + 
Sbjct: 185 PQNVTREDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVK 244

Query: 139 AAVD-DAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 195
            + D    + V    +K  ++  I +  G   N +LL N GF+ ++N  + ++++  L+ 
Sbjct: 245 ISYDAQEHRCVFSALHKVAKNSPISIGVGMSSNMELLANRGFMMQNNGNNFVLMKFQLDR 304

Query: 196 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 255
               +   R    +   LS  + +V   R  E    +L  LR+  +S   E  S   +L 
Sbjct: 305 NSDMHASARESMMKQLNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGKALA 361

Query: 256 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 315
              PV+   E      L     + LA YP T+ EDE +LT        R A  L+R E+K
Sbjct: 362 --TPVTLENEWRAYRLLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRREEK 418

Query: 316 MLNACLQVTAD 326
           ++   ++  A+
Sbjct: 419 LIYESIKTWAN 429


>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 354

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 80  SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 140 EWDGSKYD 147


>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
          Length = 471

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 45/181 (24%), Positives = 81/181 (44%), Gaps = 17/181 (9%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S +L++E +L    G+       +  + I+ ++  L         LF Q+P   P + FT
Sbjct: 115 SSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRALVV------RLFVQHPDLFPLDKFT 168

Query: 92  FE----IFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSSKCKA--MLAAVDD 143
            E     +K A   V S  +      LA   ++  L P   +L ++S+ K   +      
Sbjct: 169 VEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSG 225

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
            + +   + Y+AG+ + +  GP PNS+LL  YGFV   NP D   +  + + + P ++ K
Sbjct: 226 TLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQK 285

Query: 204 R 204
           +
Sbjct: 286 Q 286


>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 400

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 84/192 (43%), Gaps = 23/192 (11%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREYNEL 67
           +SFW PY+ EL         AV +   W++ ELA +  +    E++E + + I  +Y  +
Sbjct: 61  QSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEYKKKKIAADYAAI 113

Query: 68  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 127
              +    +  Q +   IP+E    E +++A   V S      +  L      +P     
Sbjct: 114 LLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LIGEHVFIPF-VDF 164

Query: 128 LAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 181
           L +S      + C        D  +L+    Y  GE + +  G + +S+LL +YGF+ E+
Sbjct: 165 LNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSSQLLASYGFMYEN 224

Query: 182 NPYDRLVVEAAL 193
           N  D + + A+L
Sbjct: 225 NAEDTVDITASL 236


>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
          Length = 565

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 208
           G  I +  GP  N +LL  YGFV E N +DRL       EAA+  E   +  +R  +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
               L+   +  H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473

Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTD 296
              +AD  +  L  +  +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501


>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
 gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
          Length = 448

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 54/233 (23%), Positives = 96/233 (41%), Gaps = 41/233 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +Y+K  G+ SFW PY+  L  +          P+ + E +  YL  S  +A I  + +  
Sbjct: 112 IYQKHIGETSFWKPYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQEC 163

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF 118
            + Y EL   +     L   +        FTF+ ++ A+  V  +S  V  +  ++ R  
Sbjct: 164 IKSYEELKPFFPSLEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNL 218

Query: 119 --------ALVPLGPPLLAYSSKCKAML------AAVDDAVQLVVDRPYKAGESIVVWCG 164
                   +LVP+   LL +S   +  L         D    +  +  YK G+ ++    
Sbjct: 219 DKSGLGVTSLVPM-VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYR 277

Query: 165 PQPNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQDKRMV 206
              N  LL+NYGFV  DN  D +          ++E      EDP+++ ++++
Sbjct: 278 RADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVL 330


>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 492

 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
           +S    A +   ++ V+LV  R  KAG+ I +  G   N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303


>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 391

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 10/120 (8%)

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
           ++FK  +  V S   H +       F  L P    +    S     +       ++  +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 202
            Y AGE ++   G  PN KLL++YGF+         D+D   D  +++   NT   Q QD
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSNTTRDQLQD 283


>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
          Length = 515

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
          Length = 441

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 94/223 (42%), Gaps = 26/223 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIK 61
           E+ +G+ S W PYI  L         +   P  +++T +A L +G   +AE  E+ EG++
Sbjct: 117 ERHRGEASDWFPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQ 167

Query: 62  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR 116
             Y  +   +FM+      +P   P E  T+E  + A+ ++ +  V + +     +S   
Sbjct: 168 HLY-AVHQDFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPD 223

Query: 117 RFALVPLGPPLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
            +AL P    LL +    + KA         ++      +      +  G   N +LL+ 
Sbjct: 224 NYALAPF-LDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINYGSHDNQRLLLE 282

Query: 175 YGFVDEDNPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKL 213
           YGFV   NP+  + VE  L  E    D    +K    + NG L
Sbjct: 283 YGFVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325


>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
          Length = 457

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 65/275 (23%), Positives = 107/275 (38%), Gaps = 36/275 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G+KS W PY+  L +       A   P    E ++  L   P + +  E+   I+ 
Sbjct: 134 EKHAGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQE 185

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
            +      +     LF +   +I    F F   + A+  V +  +++ K      F+L P
Sbjct: 186 LFQSSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEP 240

Query: 123 ----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
               L P   LL +S   + KA       + ++  D   K  + +++  GP  N +LL+ 
Sbjct: 241 DVYALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICYGPHDNQRLLLE 300

Query: 175 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 234
           YGFV  DNP+  + V A    +     DK    QR  K+S+   H           D L 
Sbjct: 301 YGFVATDNPHSSVYVSADTLLKYFSSLDK----QREAKVSILKDH-----------DFLE 345

Query: 235 YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 269
            L  G+   +  + + +  L        C  R +L
Sbjct: 346 NLTFGWEGPSWRLLTALKVLSLAADEFACWRRILL 380


>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
          Length = 515

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
 gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
          Length = 573

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)

Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 185
           LL + +  K      +D V  V     K GE +    G + N  LL++YGFV + NPYD 
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288

Query: 186 --------RLVVEAALNTE 196
                   + +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307


>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
          Length = 254

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 68/162 (41%), Gaps = 25/162 (15%)

Query: 135 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 194
           K+ L  V+  VQL+   P KAGE I ++ G    +  L  +GF D DNP D +  E  L 
Sbjct: 74  KSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELDL- 128

Query: 195 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 254
           +E  + Q K M          +V+      ++    D LP  RL            +  +
Sbjct: 129 SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAMLRI 168

Query: 255 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             +  +S   E+ V   + +   A  AGYP  L ED + L +
Sbjct: 169 LHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEEDISRLEE 210


>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
 gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
           Full=SET domain-containing protein 7
 gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
 gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
 gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
 gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
 gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
 gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
 gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Metaseiulus occidentalis]
          Length = 278

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 37/182 (20%), Positives = 83/182 (45%), Gaps = 12/182 (6%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
           R YK  E + ++ G + N++ +++ GFV ++N +D L ++  L+  D  ++ KR + ++ 
Sbjct: 96  REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155

Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--------SDTSEMQSVISSLGPICPVSP 262
              +  VF +    + + +  ++P + L  V        SD +   + + +  P     P
Sbjct: 156 KIPTSDVFELKKAPDGDGV--LVPKVLLHLVHILQWKAPSDGTTSGTDVGA-DPSDATDP 212

Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML-NACL 321
              +     L    +  +   P ++ E   +L D     + ++A +    E++ML NAC 
Sbjct: 213 VRTKKAKTFLHVRCQLLMKALPRSVEELTEILNDPTTSLESKLAIRYRLSEQRMLTNACN 272

Query: 322 QV 323
           ++
Sbjct: 273 KI 274


>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
           24927]
          Length = 468

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 23/215 (10%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + +++ EL    GS       +  + I+ +Y  L         LF QY        FT
Sbjct: 107 SSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL------VERLFGQYLDIFSLGKFT 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLV 148
            E +K A   V S  +   +        L P    +L +SS   KC     +  D + ++
Sbjct: 161 IEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-MLNHSSDVKKCHVYDTSSGD-LSIL 218

Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 208
             + Y+ G+ + +  G  PN++LL  YGFV  +NP D   +      E P ++ K+ +  
Sbjct: 219 AGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWV 278

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 238
             G  SV    +       +++D LP     YLR+
Sbjct: 279 SAGLDSVSTISL-------SLNDPLPKSVLQYLRI 306


>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
          Length = 417

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 220 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
 gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
          Length = 372

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           R A++P+          C+A  A+  +    + DR Y+AGE + +  G      LL  YG
Sbjct: 179 RLAILPVADLFNHADVGCEAQFAS--ENYSFIADRTYRAGEELYISYGTHSTDFLLAEYG 236

Query: 177 FVDEDNPYDRLVVEAAL 193
           FV  +N +D + ++ A+
Sbjct: 237 FVPAENRWDVVCLDEAI 253


>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 41/166 (24%), Positives = 66/166 (39%), Gaps = 27/166 (16%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 205
           + Y+ GE + +  G   N +LL NYGF   +NPYD + +   +     N  DP +  KR 
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306

Query: 206 VAQR-NGKLSVQVFHVHAGREKE----------------AISDMLPYLRLGYVSDTSEMQ 248
           + Q   G  S  +  + + R                   A  + L  +     S +    
Sbjct: 307 LLQSATGSHSTDIPALRSLRFNHDGQLAPNAEHWLEILLATPEELSEIITQAASQSGAAD 366

Query: 249 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
           S IS   P+      ++  V  ++     ARL  + +TL ED+A L
Sbjct: 367 STISLALPMS-----LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407


>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 227

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 45/98 (45%), Gaps = 11/98 (11%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           G  SFW PYI  L R       +  + L +S  EL  LTGS    E L+    I R+Y  
Sbjct: 94  GSASFWHPYISILPR-------SFNTVLYFSVDELQLLTGSSVLDEALKLHRSIARQYAY 146

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 104
              + F    L +  PY    + FT+++++ A  AV +
Sbjct: 147 FHKI-FRTHPLAKSLPY---KDCFTYDLYRWAVSAVMT 180


>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
          Length = 553

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
            D+ V +++    KAGE I    G   NS LL  YGF  EDNP+D + +   L  E  Q 
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHDVVDLSEEL-VEYGQK 377

Query: 201 QDKRM 205
             KR+
Sbjct: 378 NSKRI 382


>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
 gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
          Length = 523

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 57/261 (21%), Positives = 102/261 (39%), Gaps = 62/261 (23%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP----TKAEI--- 53
           + E+ +G+ SFW PY+  L    G       +PL + + EL  L G+     TKA++   
Sbjct: 104 IIERARGRASFWAPYLEMLPSGFG-------TPLWFEDEELMELDGTTLFEATKAQVFFP 156

Query: 54  -----------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA------------- 89
                      L R +  + E+ E   +W  A  +F     +IP  A             
Sbjct: 157 STFVSTCMSLYLFRPDDRELEFQEF--LW--ANCIFWTRALNIPCPASFVTSSSPEVAKD 212

Query: 90  ----FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM-------- 137
                   +    F++  +     + VS      LVP G     ++ +   +        
Sbjct: 213 DGNRLVIYVLPHPFISCSA-----KDVSTIWIEGLVP-GIDFCNHTRRASGLWEIDGSDG 266

Query: 138 -LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNT 195
             + V  ++ L+ D  +  G  +++  G + N +LL  YGFV+EDN  D ++V    +  
Sbjct: 267 STSGVPHSMYLIADVVFPPGSEVLINYGDKGNEELLFLYGFVEEDNSNDYVMVHFPKMFL 326

Query: 196 EDPQYQDKRMVAQRNGKLSVQ 216
           ++    D ++   R   LS+Q
Sbjct: 327 DEDNTMDFKLQLLRELDLSLQ 347


>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
 gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 421

 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312


>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
          Length = 353

 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 16/172 (9%)

Query: 87  TEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVD 142
           +E FT+ +F  A+ AV + C+    K      F       L P L   +   KA +    
Sbjct: 185 SENFTWHLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTV 242

Query: 143 DA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
           +    ++V +  Y+  + + +  G   N KLL+ YGFV  DNP D + +     T+   Y
Sbjct: 243 EGSYFEIVTNNNYEPNDQVFISYGSHDNKKLLLEYGFVLADNPNDVVAI-----TKGHLY 297

Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 252
              ++ +Q+N  +      +    EK+ ISD   +   G   +   +  V+S
Sbjct: 298 ---KLNSQQNDTVLYFATKLSFLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346


>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
          Length = 1233

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 114/293 (38%), Gaps = 58/293 (19%)

Query: 29   AVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 87
            A  +P+LW + E    L GSP   E   R + +++E+ ++  +        + YP  +  
Sbjct: 929  ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQDIAAIAAQTSGGPEAYPAVVYN 988

Query: 88   EAFTFEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDA 144
            E        QAF+   S V+ H   +  A+ FAL+PL   L    S   A+L      +A
Sbjct: 989  E--------QAFLEAMSVVLAHAAYLPKAQCFALLPLVGGLCRTGSSSGALLDYDLEREA 1040

Query: 145  VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
            V +V  R    G+ + ++C                       L + A+L   D  Y  KR
Sbjct: 1041 VTVVAQR--TPGQEVALYC-----------------------LFMAASLVAADRLYTTKR 1075

Query: 205  MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 264
             + +  G      F +    ++ A   ++ +          E  ++I         SP  
Sbjct: 1076 EILEELGLGVKAEFPIF--EDRLATQQLINF----------EQDTII---------SPEN 1114

Query: 265  ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
            E  +L  L    + R+  Y     +D   L   +L P++R+A QL   EK++L
Sbjct: 1115 EYEILQLLMGDLRDRIQAYATEFDDDIKDLQRTDLTPRQRLAAQLRLGEKRIL 1167


>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
 gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
          Length = 377

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 22/171 (12%)

Query: 28  LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 87
           LA   PL WS   L      P +A  L RA+  K   +     W    + F         
Sbjct: 104 LATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP-------- 148

Query: 88  EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVD 142
            A   + F+ A++   +   + +    AR     R  L P+   L  +++     +A   
Sbjct: 149 -ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAFTP 206

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
            +  +  DR Y  GE +++  G   N  LL+ YGFV E N +D + ++ A+
Sbjct: 207 ASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257


>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 356

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 41/168 (24%), Positives = 69/168 (41%), Gaps = 22/168 (13%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           P  W       L G  T   +LE+ +    R++  L +           YPY +P+E + 
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150

Query: 92  FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 148
              F    V+ ++     Q+  L     R A++P+       S  CK    A  ++  +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205

Query: 149 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
            DR Y  G+ +    G   N  LL  YGF+ ++N  DR   +  +++E
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253


>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 724

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 88/234 (37%), Gaps = 44/234 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE++ G  S   PY+  L        +   +PL WSE +L  L       E+  + EG+
Sbjct: 157 LYERQLGPASKLAPYVAALP-------VDFSTPLSWSEAQLQALCYPQLIREVATQREGL 209

Query: 61  KREYNEL------------DTVWFMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCV 106
           KR + EL            D +W +     + +  PY  PT     + F        + +
Sbjct: 210 KRLHAELAVSTPGTPITEQDLIWALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASI 269

Query: 107 VHLQKVSLARRFALVPL-GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKA----- 155
                ++ A   AL  L    +L+   K  AM   VD       VQ  V+  Y A     
Sbjct: 270 TVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSV 329

Query: 156 --------GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV----EAALNTED 197
                   GE + +  G Q N  LL  YGFV+   P+D   +     AAL   D
Sbjct: 330 RCQSYFSKGEQVFISYGKQSNDSLLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383


>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 419

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 83/191 (43%), Gaps = 31/191 (16%)

Query: 12  WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 71
           W PYI  L R      L ++  LL     L Y        +I E A   K + ++LDT +
Sbjct: 107 WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLDTDY 152

Query: 72  FMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RFALVPL 123
               +      Y+ IP +     IFK A++ V +  + +   ++++       +  ++ L
Sbjct: 153 AFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPIITL 207

Query: 124 GPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 179
            P L   +    A ++A  D V+    +    PYK G  + +  GP  N+ LL  YGF  
Sbjct: 208 APFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINYGPHDNNFLLAEYGFAI 267

Query: 180 EDNPYDRLVVE 190
             NP++ +V++
Sbjct: 268 LKNPFNHVVLD 278


>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
 gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
          Length = 195

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTED 197
           V  +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 14  VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72


>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
          Length = 514

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 80/187 (42%), Gaps = 16/187 (8%)

Query: 41  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
           LA L G+    EI++  + ++++Y+EL  +      L   +P     +  T++ F  A  
Sbjct: 235 LAALEGTLLFDEIIQARQHLRQQYDELFPL------LCTNFPEIFRKDVCTWDDFLWACE 288

Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGE 157
              S  + +   S      LVP+   L    S        VD+A + +   + RP  AGE
Sbjct: 289 LWYSNSMMIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGE 348

Query: 158 SIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 216
              +  G  P S LL  YGF+   DNPYD + ++      D    D+ + AQ +   S Q
Sbjct: 349 QCFLSYGKHPGSHLLTFYGFLPRGDNPYDVIPLDL-----DTSADDEDITAQSSATTS-Q 402

Query: 217 VFHVHAG 223
             H+  G
Sbjct: 403 TTHMVRG 409


>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
 gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
          Length = 504

 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319


>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 480

 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 85/193 (44%), Gaps = 20/193 (10%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            +VPL   L A + +  A L   D A+ +   +P K G+ I    G  P S LL  YG+V
Sbjct: 230 GMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGELPRSDLLRRYGYV 289

Query: 179 DEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 228
             DN     VVE  L          N E  +Y   +++ +   ++    + +     +++
Sbjct: 290 -TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILEDGYCILRPSAEDS 346

Query: 229 ISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 284
           ++D+LP     L      +  E+Q + S   P  P+    E  +   L D  K++L+ Y 
Sbjct: 347 LTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI---LLDSVKSKLSQYG 403

Query: 285 ATLSEDEAMLTDY 297
            T+ +D+A+L  +
Sbjct: 404 TTVEQDKAILQQF 416


>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3-like [Glycine max]
          Length = 449

 Score = 42.4 bits (98), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 193
           D  +++V +   K  + +++  G   N   L++YGFV   NPYD   L  + AL      
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291

Query: 194 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 246
                  N   P    + +++Q N         V  G ++     +L  LR+   ++   
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351

Query: 247 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
           MQ    S++ SL    P+    E AV   L       L  +P  + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403


>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
           Y34]
          Length = 464

 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 3/129 (2%)

Query: 77  LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 136
           L  Q+    P E FT E +K A   V S  +            L P    +L +S   K 
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204

Query: 137 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 194
             A  +    + ++  + Y+AG+ + ++ GP  NS+LL  YGFV   N  D   +  A +
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSNDNYDLVLATH 264

Query: 195 TEDPQYQDK 203
            E P +  K
Sbjct: 265 PEAPFFARK 273


>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
           FGSC 2508]
 gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 503

 Score = 42.4 bits (98), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319


>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
 gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
          Length = 278

 Score = 42.4 bits (98), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 119 ALVPLGPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           A+VPL   +L +S  S+C A+  +  +  +++V RP + GE I +  G   N  L I YG
Sbjct: 186 AIVPL-IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYG 244

Query: 177 FVDEDNPYDRLVVEAALNT 195
           F  +DN  D+  VE +L +
Sbjct: 245 FYLKDNICDK--VEISLGS 261


>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 403

 Score = 42.4 bits (98), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 70/164 (42%), Gaps = 40/164 (24%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M +   G  + W  Y   L         AV+S ++WS+ EL  L GS  +   + R +  
Sbjct: 54  MQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRAVFRRDLC 106

Query: 61  KREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           KREY+ L    F A  L +  P      EA++F++F+ A+  V           +AR F 
Sbjct: 107 KREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV-----------MARAFV 149

Query: 120 LVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 157
           L  L         +C A+L  +D      DA + VV+R   A E
Sbjct: 150 LPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184


>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
 gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
 gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
 gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
          Length = 491

 Score = 42.4 bits (98), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 200
           +A  L   R Y+ GE +++  G   N +LL +YGF+ E+N  D++ +  E +L +    +
Sbjct: 257 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 316

Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 260
               +   ++GKLS                 ++  LRL  +  +   +SV+  +     +
Sbjct: 317 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 361

Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
           S   E  V+  +++   + L   P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 362 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415


>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 454

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 57/263 (21%), Positives = 108/263 (41%), Gaps = 26/263 (9%)

Query: 31  ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 90
            S + +++ EL    GS         A  I+ +Y +L         LF ++    P++ F
Sbjct: 106 SSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARLFGRHRDVFPSDKF 159

Query: 91  TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKCKAMLAAVDDAVQL 147
           T + +K A   V S  +   K+       L+     +L +S    +C        + + +
Sbjct: 160 TIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQCHVYDPQSGN-LSI 217

Query: 148 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 207
           +  + Y+ G+ + +  GP PN++L   YGFV   NP D   +  + +   P ++ K  + 
Sbjct: 218 LAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPNDSYDLVLSTHPMAPFFEQKHKLW 277

Query: 208 QRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSVISSLGPIC--PV 260
              G  S     +        ++D LP     YLR+  +++T ++ +V +    +    +
Sbjct: 278 IAAGLDSTSTVSL-------TLTDPLPRSVLRYLRIQRLNET-DLAAVGTRQSDVAFEKI 329

Query: 261 SPCMERAVLDQLADYFKARLAGY 283
           S   E  VL  L +   A L G+
Sbjct: 330 SDSNETEVLTFLVESISALLDGF 352


>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 508

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 64/236 (27%), Positives = 94/236 (39%), Gaps = 30/236 (12%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           A+VP+   L A      A L   + A+++   R  + G+ I    G  PNS LL  YG V
Sbjct: 261 AMVPMADLLNARYGSSNAKLFYEEHALEMRTTRFIRRGDQIWNTYGDPPNSDLLRRYGHV 320

Query: 179 DE--------DNPYDRLVVEAALNTEDPQYQDKRM-VAQR------NGKLSVQVFHVHAG 223
           D          NP D + V A L T+        + VA+R       G   V V      
Sbjct: 321 DLVPLAQGGLGNPADVVEVRADLVTDVVSASGSSIPVAERIDWYLEMGGDDVFVLETDLD 380

Query: 224 REKEAIS--DMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 281
             +  I    +L  L   +   T E +          P  P M+ AV+  L +    RL 
Sbjct: 381 IPEPLIVLVRLLQLLEPDW-EKTREKEK---------PPKPKMDGAVIGVLVEVLHRRLK 430

Query: 282 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 337
            YP T+ EDEA+L   +     +    +VRM +K++   LQ T + +  L   +V 
Sbjct: 431 EYPTTIDEDEALLYKEDTLSINKKNAIIVRMGEKII---LQRTLESLQRLASQSVG 483


>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
          Length = 319

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 40/77 (51%), Gaps = 7/77 (9%)

Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 182
           + P +L Y    K     V  +++  + RP KAGE   +  G  P S L+  YGF+  DN
Sbjct: 166 VSPHILNYGRVDK-----VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDN 220

Query: 183 PYDR--LVVEAALNTED 197
           PYD   L ++ +++ ED
Sbjct: 221 PYDVIPLDLDTSVDEED 237


>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 440

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 64/143 (44%), Gaps = 14/143 (9%)

Query: 81  YPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPL-LAYSSKCKA 136
           +P D+ +  F ++ FK A+  + +  V   H Q+   +R      L P L L   +    
Sbjct: 184 FPKDVAS-VFNYQAFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQ 242

Query: 137 MLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 192
           + A  ++  +      V + +   E  + + GP  N +LL+ YGFV  DNP+  + V   
Sbjct: 243 VKAGFNEKTKCYEITTVTQCHHYNEVFICY-GPHDNQRLLLEYGFVSRDNPHSSVYVGTD 301

Query: 193 LNTEDPQYQDKRMVAQRNGKLSV 215
              ++   +DK    QR  KLS+
Sbjct: 302 TLLKNVFPEDK----QRPKKLSI 320


>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
          Length = 462

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 28/46 (60%)

Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 192
           LV D P  AG+ +    GP+ N + L+ YGF+  +NP D +V++ A
Sbjct: 225 LVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270


>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
 gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
          Length = 402

 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 35/206 (16%)

Query: 3   EKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-EGI 60
           ++  G KS W  ++  L DR+ G     ++ PL WS+ +   LT  P    +L++  +  
Sbjct: 92  QQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--PEGIVVLKKTLDTF 145

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 119
           + +Y++  T  F+A     +Y  D P +A+ +     A++ V S C+     ++  ++ A
Sbjct: 146 EADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCLYFDLTLTTGKKDA 192

Query: 120 L-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
                 + L P   L+ +S     + C+   +++   + L   R Y A E I +  GP+ 
Sbjct: 193 QEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYTADEEIFLCYGPRS 251

Query: 168 NSKLLINYGFVDEDNPYDRLVVEAAL 193
           NS LL  YGF   +NP+D + +  AL
Sbjct: 252 NSVLLCEYGFTVPENPWDDVDISDAL 277


>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
          Length = 712

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 210
           RP +AG+ + +  GP PN KLL  YGFV   NP+D  +V   L   +   + +++ A   
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501

Query: 211 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 251
             L ++    H+ ++      +L  LRL  V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537


>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
 gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
          Length = 525

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 86/425 (20%), Positives = 154/425 (36%), Gaps = 128/425 (30%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +YE   G+KS W  YI  L        L V+ P+LW +     L G+  +  I +    I
Sbjct: 130 IYEASIGEKSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILI 182

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 118
              Y ++     +   L + +P     E F+ EIF      + + +V  +   +      
Sbjct: 183 NHAYADI-----VESLLIKNHP-----EYFSKEIFSFENFKIANSIVSSRAFCIDSYHGD 232

Query: 119 ALVPLG-----------------------------------PPLLAYSSK---------- 133
           +LVPL                                     PL+  S+K          
Sbjct: 233 SLVPLADIFNHKTGRENVHIESNGDVCNKCGSIKTCKHRKVTPLITKSAKSYKKLTNKKK 292

Query: 134 -----------------CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
                            C  +    D+ + + V +  KA + +    G   N+ LL  YG
Sbjct: 293 MELIEKQKQQQINDEENCGDIAEEDDEHLYIKVVKAVKANQEVYNTYGDHSNATLLSKYG 352

Query: 177 FVDEDNPYDRLVVEAAL-----------NTEDPQYQDKRM--------VAQRNGKLSVQV 217
           F++ DNP D L VE +L           N  D     KR+        +  RN   S+++
Sbjct: 353 FIEMDNPCDNLPVEKSLVDTNLISLCKENGFDSNELSKRISFYASLFDIDSRNTH-SIEI 411

Query: 218 FHVHAGREKEA-----------ISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCM 264
               +GR  +A           +S+   +L++       +++     L    I   +  +
Sbjct: 412 ----SGRLDDALVCSVGIALAPLSEFEGWLKMS----EHKLEKYFEKLEAEDIVKQNAQV 463

Query: 265 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDY--NLHPKKRVATQLVRMEKKMLNACLQ 322
           ++A++  L +    +L+ YP TL +D+  L +   N   +K ++T L   EKK++   ++
Sbjct: 464 KKAIVQILNN----KLSNYPTTLEQDQNKLKELKENEENRKIISTSLNICEKKLIYKSIK 519

Query: 323 VTADM 327
              D+
Sbjct: 520 YYEDL 524


>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
          Length = 437

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 197
           +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355


>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
           rotundata]
          Length = 277

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---DPQYQDKRMVA 207
           R +K G+ I +  GP+PNS   ++ GFV  D+ +D L        E   DP   ++R + 
Sbjct: 80  RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139

Query: 208 QRNGKLSVQVFHVHAGRE 225
           ++        F V++GRE
Sbjct: 140 KKLHLQPWSEFVVNSGRE 157


>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
          Length = 510

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 34/66 (51%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           ++VPL   L A + KC A L      ++++  R   +GE +    G   NS+LL  YG+V
Sbjct: 238 SMVPLADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYV 297

Query: 179 DEDNPY 184
           + D  Y
Sbjct: 298 EWDGSY 303


>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
          Length = 479

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 7/188 (3%)

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
           + ++  + Y+ G+ I ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K 
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298

Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSLGPICPVSP 262
            +    G  S     + A  +     ++L YLR   +  +D ++M   + + G    V+ 
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRLDAADVADMTLQLLN-GTDGKVND 355

Query: 263 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 320
             E  VL  L D   + L G+   L + EA L    Y        A Q+   E+ +L   
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415

Query: 321 LQVTADMI 328
            +   DM+
Sbjct: 416 KKTAEDML 423


>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 79/168 (47%), Gaps = 15/168 (8%)

Query: 164 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 223
           G + +++ L+  GFV   NP++ + +   ++  D     +  + +     S  + H+   
Sbjct: 63  GKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLILHITG- 121

Query: 224 REKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVLDQLAD 274
            +  ++SD + + R+ +V D+ ++ + +S            P+CP  P  ++A+   L  
Sbjct: 122 -DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA-FLIM 178

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 322
            F+  ++ Y   +SEDE      NL P +R   +L   E ++L +C++
Sbjct: 179 RFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224


>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 640

 Score = 42.0 bits (97), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 47/185 (25%), Positives = 73/185 (39%), Gaps = 23/185 (12%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNE 66
           KS W  +I  L +        ++S + WS  E   L   P   K +ILER   +++ YN 
Sbjct: 265 KSMWREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA 315

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGP 125
                     L   +P+      +++E+FK A++ V S  +          +  L PL  
Sbjct: 316 ------TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVD 369

Query: 126 PLLAYSSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
            L     +    L    + V      L   R  K GE +V   G  PN +LL+ +G    
Sbjct: 370 LLHHDPVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGELPNHQLLLRFGLAMP 429

Query: 181 DNPYD 185
            NPY+
Sbjct: 430 RNPYE 434


>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
 gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
          Length = 287

 Score = 42.0 bits (97), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 77/196 (39%), Gaps = 24/196 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M EK + +KSFW PYI  L  +          P  ++E +   L  S  + +   +    
Sbjct: 103 MCEKYRREKSFWRPYIDILPEE-------YSCPTFFTEDDFRLLPNS-LRGKAKAKKYEC 154

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV----------HLQ 110
            +EY EL   + M   LF         +AF F+ FK A+ A+++  +          HL+
Sbjct: 155 HKEYKELAPFFKMLADLFPD-----QEDAFNFKDFKWAWSAIKTRALDVPIGRESCRHLR 209

Query: 111 KVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
                    + PL   +  A  +K +         ++   +  Y+    ++   G   N 
Sbjct: 210 DAEDTPTPTMFPLVDSINHAAQAKIRHRYNEKSRCLESRTETVYRRHAEVMNSYGRADND 269

Query: 170 KLLINYGFVDEDNPYD 185
            LL+ +GFV   NP D
Sbjct: 270 NLLLEFGFVVPGNPED 285


>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
 gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
          Length = 588

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 63/266 (23%), Positives = 108/266 (40%), Gaps = 46/266 (17%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
           + E+++G    +  Y+  L +        ++SPL WS  EL  + G+    ++L+ A G 
Sbjct: 133 LKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAGY 181

Query: 60  ---IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHLQKVSL 114
              ++  Y EL         +F +Y      + AF    F+ AF  ++S  +  L   ++
Sbjct: 182 DAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGANV 235

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGESI 159
           A    LVP G  L+ +SS   A                      A  +  DR Y  G  I
Sbjct: 236 A----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEI 290

Query: 160 VVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
            V   P+  +SK  ++YGF+D  NP     +  ++  +D    DK  V +  G      F
Sbjct: 291 FVNYDPEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTF 350

Query: 219 HVHAGREKEAISDMLPYLRLGYVSDT 244
            +    + +   ++  +LRL +  DT
Sbjct: 351 TLRPYSDPD--RELRTFLRLLHCKDT 374


>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
          Length = 376

 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 83/194 (42%), Gaps = 31/194 (15%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           + K+S+W PY   L        +   S LL      ++L  S  K E +++ + I  +Y 
Sbjct: 51  RDKQSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY- 102

Query: 66  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 125
               V  +     +Q P DI     T E FK A++ V +  +H+       +   + L P
Sbjct: 103 ----VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAP 153

Query: 126 PL--LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYG 176
            L  L ++++ K     ++    +   R        YK GE + +  GP  N  +L  YG
Sbjct: 154 MLDFLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYG 208

Query: 177 FVDEDNPYDRLVVE 190
           FV  +N Y+ ++++
Sbjct: 209 FVLNENIYNFVLLD 222


>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
          Length = 458

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 37/67 (55%)

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
           + ++  + Y+A + + ++ GP PNS+LL  YGFV  DNP D   +  + +   P Y+ K+
Sbjct: 216 LSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHPLAPFYEQKQ 275

Query: 205 MVAQRNG 211
            +    G
Sbjct: 276 KLWASAG 282


>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 524

 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 94/235 (40%), Gaps = 46/235 (19%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW----SETELAYLTGSPTKAEILER 56
           M E+++G  SFW  Y+R L          V++PL W    +E E   L G  T   +L R
Sbjct: 90  MAERRKGDGSFWKQYLRTLPDD-------VDTPLRWLVEQAEEEFRLLDG--TMVGLLSR 140

Query: 57  --AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK--- 111
                +++++ E          L + +P  +     TFE +  A  ++ S     Q+   
Sbjct: 141 MMHSQVRKDWEEFHL------PLVEAHPEILG--GVTFEDYLWAMSSIWSRSFDYQEPGP 192

Query: 112 -VSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDD------AVQLVVDR 151
             S   R A+VP+                ++ + ++   +   + +       +++   R
Sbjct: 193 DDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGGLSMGIGEPGRARGTLRVSAGR 252

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
            Y A E   +  G   N+KLL +YGFV   NPY  L     +   DP +  K+ +
Sbjct: 253 DYAAREQFFILYGRYSNAKLLYSYGFVLASNPYGGLDYWVRVPQTDPGFAWKQAL 307


>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
          Length = 494

 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A +SKC A L     +++++  R  +  E +    G  PNS++L  YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279

Query: 179 DED-NPYD 185
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           A  + L +    +  L G+    E+++  E ++++YNEL    F A  L   +P   P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 140
            +++E F  A     S  + +       R  LVP        L P +L Y       + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 197
             D+++  + RP +AGE   +  G    S L+  YGF+ E DN  D + ++     +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425


>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 41.6 bits (96), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 20/178 (11%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           A  + L +    +  L G+    E+++  E ++++YNEL    F A  L   +P   P E
Sbjct: 259 AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL----FPA--LCNNHPDIFPEE 312

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAA 140
            +++E F  A     S  + +       R  LVP        L P +L Y       + +
Sbjct: 313 FYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGK-----VDS 367

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTED 197
             D+++  + RP +AGE   +  G    S L+  YGF+ E DN  D + ++     +D
Sbjct: 368 DTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGDNVNDVIPLDIDFGDDD 425


>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
          Length = 447

 Score = 41.6 bits (96), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 200
           +A  L   R Y+ GE +++  G   N +LL +YGF+ E+N  D++ +  E +L +    +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272

Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 260
               +   ++GKLS                 ++  LRL  +  +   +SV+  +     +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317

Query: 261 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 319
           S   E  V+  +++   + L   P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371


>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 465

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 77/204 (37%), Gaps = 47/204 (23%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+  G KS +  Y R L R       A      W++ E +YL G+  +  + + A   K 
Sbjct: 106 ERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLAGTEVETSLRDEAAAAKN 159

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           E+        +   +F+++  +      +FE F +A   V S           R F L P
Sbjct: 160 EWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSS-----------RAFTLSP 197

Query: 123 -----LGPPLLAYSSKCKAMLAAVDDA--------------VQLVVDRPYKAGESIVVWC 163
                L P   A++         V D               V++  ++  + G+ I    
Sbjct: 198 NAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVRRGDEIFNTY 257

Query: 164 GPQPNSKLLINYGFVDEDNPYDRL 187
           G   N+KLL +YGF   DNP D +
Sbjct: 258 GFHGNAKLLNSYGFTQNDNPADEV 281


>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 471

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)

Query: 139 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 188
           A  +  V++V +   K  + +++  G   N   L++YGFV E NPYD +           
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311

Query: 189 VEAALNTEDPQYQD-----KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 243
              A     P++        ++++Q N    +    V  G  +     +L  +R+    +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371

Query: 244 TSEMQS----VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
             E++      + SL  I P+    E AV   +       L+ +P  + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426


>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
 gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
          Length = 573

 Score = 41.6 bits (96), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 28/54 (51%)

Query: 132 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
           SK +  L   DD V +V  R    GE I +  GP PN+ LL   GF   DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388


>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 514

 Score = 41.6 bits (96), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 88/357 (24%), Positives = 144/357 (40%), Gaps = 62/357 (17%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           G KS + P++R L R        V+S + W+E EL+ L+GS             + EY+ 
Sbjct: 144 GSKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGSNVVVFTRAIKAQWRMEYDA 196

Query: 67  LDTVWFMAGSLFQQYPYDIPTE---AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 123
           LD       +L +++P     E    +TF+ F  A   + S  + L   S       + +
Sbjct: 197 LDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIRV 249

Query: 124 GPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
             PLL     A   K +    A  +AV++     ++    +      +P+   L+ YGF+
Sbjct: 250 LVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGFI 309

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE------AIS-D 231
            E NP +   VEA +   D         + R+ K  +   H    +++       +I  D
Sbjct: 310 PETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLDPKKRNFEWKPRSIDYD 360

Query: 232 MLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-LDQLADYFKARLAGYPA 285
           +L   R+    D +EM    S     S   +   +    +AV L  LA +    L  Y  
Sbjct: 361 LLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLLKSLASF----LESYTT 415

Query: 286 TLSEDEAMLT----DYNLHP----KKRVATQLVRMEKKMLNACLQVTADMIML-LPD 333
           TL+ED   +     + N  P    +KR A  L+RM +K +   L  +AD +   LPD
Sbjct: 416 TLAEDNEYVARVDDESNDEPLPGKRKRFAV-LLRMREKQI---LLASADALFKELPD 468


>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
 gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
          Length = 593

 Score = 41.2 bits (95), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 67/154 (43%), Gaps = 6/154 (3%)

Query: 37  SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 96
           S+ ++A L G+P   + + RA     E  E  +  F   SL   YP     E F++E + 
Sbjct: 276 SQEDVALLEGTPLHGDAV-RARQHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYL 332

Query: 97  QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPY 153
            A     S  + +Q  S   R  L P    +  +        + VD     +++   RP 
Sbjct: 333 WAAELWYSYGIQVQFASGDIRTCLAPYLGLMNHHPLPHVVHFSKVDPETGCLRVRAFRPC 392

Query: 154 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
           +AG  + +  GP  N+KLL+ YGF   DNP D +
Sbjct: 393 EAGNQLFLSYGPYSNAKLLLFYGFAVRDNPADEV 426


>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 41.2 bits (95), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 80/184 (43%), Gaps = 31/184 (16%)

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 208
           G  I +  GP  N +LL  YGFV E N +DRL       E  +  E   +  +R  +VA 
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
               L+ + +  H GR   A   ++  LR+ ++++  E  + +   GP   +    E  V
Sbjct: 419 YGLHLAGRCWICHDGRPPPA---LVALLRV-HLAEAEEFDT-MERKGPFASLGAGTEARV 473

Query: 269 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 313
           +  +AD  +  L  +  +L EDE +L             D N  P    KR+A  L+RM 
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532

Query: 314 KKML 317
            K +
Sbjct: 533 MKRI 536


>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 41.2 bits (95), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)

Query: 164 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 219
           GP  N +LL  YGFV E+N +DRL             ++  +R  +VA     L+ + + 
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429

Query: 220 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 279
            H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V+  +AD  +  
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484

Query: 280 LAGYPATLSEDEAMLTD 296
           L  +  +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501


>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 984

 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 42/205 (20%), Positives = 81/205 (39%), Gaps = 33/205 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           +  ++  + +FW PY+R L +   R            + ++ +L G+     + E+ + I
Sbjct: 90  LIHQRAAQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQI 138

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVVHLQKVSL 114
           +  ++       +  +L   +P   P + FT++ F   F A       Q+ V      + 
Sbjct: 139 RESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQQPTATTS 192

Query: 115 ARR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 164
           A    + L+ +   LL         Y  K    L      ++ V +   + G       G
Sbjct: 193 AHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGTEAFNNYG 252

Query: 165 PQPNSKLLINYGFVDEDNPYDRLVV 189
           P+ N +LL+ YGF  EDN  D +++
Sbjct: 253 PKGNEELLMGYGFCIEDNEQDYVMI 277


>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 494

 Score = 41.2 bits (95), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L + +  C A L    + +++   +P +AGE +    G  PNS++L  YG+V
Sbjct: 221 SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEHPNSEILRRYGYV 280

Query: 179 D 179
           +
Sbjct: 281 E 281


>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 486

 Score = 41.2 bits (95), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 83/369 (22%), Positives = 143/369 (38%), Gaps = 67/369 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKA 51
           MYE   G +S W  Y + L R+        ++ + W+ +EL  L GS             
Sbjct: 100 MYEYLLGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADE 152

Query: 52  EILERAEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAF 90
            ILE    I R +  L                  +  +A   GSL   Y +DI  P +  
Sbjct: 153 SILEMIAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDED 212

Query: 91  TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 150
                +  ++  +      ++  L++   +VPL   L A + +  A L   ++A+ +   
Sbjct: 213 EEGDGEGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAI 264

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQD 202
           +P   GE I    G  P + LL  YG+V D   PYD + V        A L+  DP+ Q 
Sbjct: 265 KPIAKGEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGLSDSDPEKQP 324

Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVISSLGPICPVS 261
                     L          +E + ++D+LP  L +   + T   + +        P  
Sbjct: 325 PLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQQRSKNKPPK 384

Query: 262 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLT------------DYNLHPKKRVATQL 309
           P    A    LA   + + A Y  T+++D+ +L+            D + H ++++A Q+
Sbjct: 385 PAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLNSSEVSSGFVDESTH-RRKMAVQV 443

Query: 310 VRMEKKMLN 318
              EK++L+
Sbjct: 444 RIGEKEILH 452


>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 377

 Score = 41.2 bits (95), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
           A R ALVP+        + C A      D   +  DR Y  G+ I +  G   N  LL  
Sbjct: 177 ADRLALVPIADLFNHADTGCGASFTP--DGFVVSTDRKYHVGQEIYISYGTHTNDLLLAE 234

Query: 175 YGFVDEDNPYDRLVVE 190
           YGFV   N +D+  ++
Sbjct: 235 YGFVPMANRWDKTCLD 250


>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
 gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
          Length = 177

 Score = 41.2 bits (95), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 6/98 (6%)

Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 186
            L  S  CK  + AV +++++   R  KAG    +  G  PN  LL  YGFV E+NP+D 
Sbjct: 54  FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111

Query: 187 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 224
           + VE     E P+   K  + +R   +S   F +   R
Sbjct: 112 IPVE----LEVPESPAKVALMERYNLVSHISFELRGFR 145


>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
          Length = 396

 Score = 41.2 bits (95), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 12/111 (10%)

Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
           ++++   + AL P+   L      C+ +         +  DR YK GE + +  G   N 
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251

Query: 170 KLLINYGFVDEDNPYDRL----VVEAALNTEDPQYQDKR------MVAQRN 210
            L++ YGF  E+N +D +    VV   ++T   ++ D R      ++ +RN
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEEMSTARKKWLDGRDFLGKYLIDERN 302


>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
 gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
          Length = 656

 Score = 41.2 bits (95), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)

Query: 42  AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 101
           A L GSP  AE  +    +   +      +    SL + YP       F++E +  A   
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAFE---SLLKAYPDYFQPHWFSWESYLWAAEL 220

Query: 102 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 158
             S  + +Q  +   R  LVP    +  +        + VD A   +++   RP   G  
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280

Query: 159 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
           + +  GP PNSKLL+ YGF   DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309


>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
          Length = 495

 Score = 41.2 bits (95), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 72/276 (26%), Positives = 102/276 (36%), Gaps = 46/276 (16%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER-AEG 59
           +YE  +G+ S W PY + L            + + W++ ELA L  S    +I  R AE 
Sbjct: 103 IYEYLRGESSPWHPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGRRQAEE 155

Query: 60  ------------------IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 101
                             +     +L  +  MAGSL   Y +DI  +    +       A
Sbjct: 156 EWQNTIIPTMADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDNDKDGA 215

Query: 102 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 161
             +     +         +VP    L A + K  A L    D + +   +P  AGE I  
Sbjct: 216 DSADDEFEEDDEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAGEQIFN 275

Query: 162 WCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 220
             GP P S LL  YG+V D    YD +     L  E         VA ++ K   QV+  
Sbjct: 276 DYGPLPRSDLLRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKDQVW-- 324

Query: 221 HAGREKEAISDMLPYLRLGYV-----SDTSEMQSVI 251
              RE+E   D L  L  GY       DT  +Q V+
Sbjct: 325 ---REREQQLDELGVLDDGYAITRPEYDTQGLQDVL 357


>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 441

 Score = 41.2 bits (95), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 50/206 (24%), Positives = 88/206 (42%), Gaps = 19/206 (9%)

Query: 155 AGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 212
           A +   +  G   ++KLL +YGFV  +E N ++ + +   +   DP ++ K+ + + N  
Sbjct: 238 ANDPFYISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAA 297

Query: 213 LSVQVFHVHAGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVL 269
              Q +        + + +  L   R+  +S  +  E +    S      VS   E AV 
Sbjct: 298 TRDQTYDFRGTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVRNELAVY 353

Query: 270 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACLQVTADM 327
             + D  + RLA +P +L +D   L +  ++   R    + VRME KK+L +  ++  + 
Sbjct: 354 AAIHDVCEKRLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEW 413

Query: 328 IMLL--------PDVTVSPCPAPYAP 345
             LL        PDVT    P    P
Sbjct: 414 RNLLENDSNIYPPDVTRQQQPQLSMP 439


>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
 gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
          Length = 509

 Score = 41.2 bits (95), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 79/347 (22%), Positives = 142/347 (40%), Gaps = 46/347 (13%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           ++E+  G+KS W  Y+  L  Q       +++PL WS  E+A LTG+    ++L+ A G 
Sbjct: 140 LHERSIGEKSRWAAYVNALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY 188

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 119
             +     T   +  S F   P   P++AF    F  AF  ++S C      V      A
Sbjct: 189 --DSYARGTWARLKESAFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVDQGADIA 243

Query: 120 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV------------------ 161
           LVP     +A  S   +    +++     V    K+G S+++                  
Sbjct: 244 LVP--GLDMANHSGLSSQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAKGAEVFM 301

Query: 162 -WCGPQPNSKLLINYGFVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKLSVQVFH 219
            +   + +++L ++YGF D        V+   A+   DP   DK  V +  G      F 
Sbjct: 302 NYGQRKIDNQLALDYGFTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFV 361

Query: 220 VHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLAD 274
           + A  + E   ++  ++RL  +   D   ++++    + G I  PVS   E+     + +
Sbjct: 362 LRAFEDPE--PELRVFMRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMIN 419

Query: 275 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 321
             +  L GY   + +D  +  D  +  + R+A ++   EK+ L   L
Sbjct: 420 GCEEALRGYATRVEDDRRVAEDPGVGHRLRLAARVRMGEKQALADAL 466


>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
 gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 486

 Score = 41.2 bits (95), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 142 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           D+A Q   +VV R   AG+ +++  G   N +LL +YGFV +DNP+D   ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298


>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
 gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
          Length = 491

 Score = 41.2 bits (95), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 50/236 (21%), Positives = 88/236 (37%), Gaps = 58/236 (24%)

Query: 1   MYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAE 58
           MYE+ +G  S W  Y++ L DR+          PL+W   E    L G+     + +  E
Sbjct: 88  MYERARGTDSPWDAYLQLLPDRE--------SVPLVWPADEAECLLAGTELDKIVKQDRE 139

Query: 59  GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
            +  ++ E      ++G L      D+  + F+ E +  A   V S    +        F
Sbjct: 140 FLCEDWKECIEPLLLSGEL------DVDPDDFSLEKYFSAKTLVSSRSFQIDSY---HGF 190

Query: 119 ALVPLGPPLLAYSSKCKAML--------------------------------------AA 140
            +VPL   L  + + C+ +                                        +
Sbjct: 191 GMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEDADDDQSDASADDESTIENPTSSSPGS 249

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
            D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 250 KDEDLEMIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 305


>gi|145354661|ref|XP_001421597.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581835|gb|ABO99890.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 341

 Score = 41.2 bits (95), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 73/320 (22%), Positives = 126/320 (39%), Gaps = 30/320 (9%)

Query: 21  RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG-----IKREYN-ELDTVWFMA 74
           R+R +G ++  +P + S  E        T      R EG       REY  + +  W  A
Sbjct: 2   RERAKGGVSAYAPFVESLYEHTPARAVETSRAARARLEGHAAAETMREYERDAEDGWRAA 61

Query: 75  GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 134
              F+ +P       FT   F++A   V++     +     R  ALVP+   LL  +   
Sbjct: 62  RRTFETFPSIFSVHEFTRAAFEEALAIVRANSFEARSEDGTRARALVPMAHLLLHDTGSE 121

Query: 135 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG----FVDEDNPYDRLVVE 190
              +  VD    + VD  ++ G+ +    G   +++    +G    +  E N  ++  ++
Sbjct: 122 VPCVKIVDGVFVINVD-EHEEGDELSCSHGDYSDAETFARFGVSAFYSAEKNARNK--IK 178

Query: 191 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 250
            A  +E   Y  K +   R G +    F   AG    A  + +  LRL   ++T E  ++
Sbjct: 179 FAFPSE--IYSMKSL--DRCGSVENIAF-TDAG----ATEEFMCALRLASANET-EWAAI 228

Query: 251 ISSLGPI-----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKK 303
             S   +      P+S   E AV + L       L  YP++ +EDE +L    L   P +
Sbjct: 229 SKSKASVRALRKKPLSEESEIAVYEALFATLTELLNSYPSSDNEDERLLQSRTLQSAPDE 288

Query: 304 RVATQLVRMEKKMLNACLQV 323
             A  +   EK++  + L  
Sbjct: 289 ERAVTIRLREKRLALSSLNA 308


>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 442

 Score = 41.2 bits (95), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 44/194 (22%), Positives = 87/194 (44%), Gaps = 19/194 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+   ++S W PY+  L         +   P+ W E+E+  L  +P + + LE+   +K 
Sbjct: 117 ERVARERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFA 119
            + E    W    SL   +  +I T+ +T+   + A+  V +  V   H ++  L+ +  
Sbjct: 169 LHTE---SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQD 224

Query: 120 LVPLGPPL-LAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           +  + P L L   S    + AA ++     ++  +   +  +   +  GP  N +LL+ Y
Sbjct: 225 VYAMAPYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEY 284

Query: 176 GFVDEDNPYDRLVV 189
           GF+  +NP+  + V
Sbjct: 285 GFIAANNPHRSVYV 298


>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
           [Brachypodium distachyon]
          Length = 495

 Score = 41.2 bits (95), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
           D+ +++++ R   AGE +    G   N+ LL  YGF + DNPYD + ++  L T+
Sbjct: 251 DEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFTELDNPYDIVNIDLTLVTK 305


>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 415

 Score = 41.2 bits (95), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 111 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 170
           K++ A  +A+ P         S C+    A      ++ DR Y+AGE + V  GP  N  
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247

Query: 171 LLINYGFVDEDNPYD 185
           LL+ YGF+ + N  D
Sbjct: 248 LLVEYGFLLDANSND 262


>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
 gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
          Length = 619

 Score = 41.2 bits (95), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 79/193 (40%), Gaps = 18/193 (9%)

Query: 6   QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 65
           + +KS   PY+  L R+         + L + E E+A L  +     +    + +K+ Y 
Sbjct: 120 EKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYKSVQSIRQNLKQIY- 171

Query: 66  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-VSLARRFALVP-L 123
           E    + M      +YP     + F++E F  AF AV S V  ++          +VP L
Sbjct: 172 ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEYPAENGEGVEIVPTL 226

Query: 124 GPPLLAYSSKCKAMLA---AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 180
            P +   + K  A +      D    L      K+G+ +    G + N   L++YGFV  
Sbjct: 227 LPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAKSNDSFLLSYGFVIP 286

Query: 181 DNPYDRLVVEAAL 193
           +N  D L V+  +
Sbjct: 287 NNSEDTLYVQFGI 299


>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 498

 Score = 41.2 bits (95), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 43/186 (23%), Positives = 85/186 (45%), Gaps = 11/186 (5%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           ++  P  ++E EL  L G+  +  +  +   +++E+  L           Q+Y +D  T 
Sbjct: 144 SIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS-TEGIPWCQKYWWDEDTG 202

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLLAYSSKCKAMLAAVDDAV 145
             TF+ +K      +S VV L +       A+VP   +       S K +   ++ +DA+
Sbjct: 203 RLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHASEDSVKARYDESSTEDAL 258

Query: 146 -QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQD 202
            QL   R   +GE + +  G + P S+++ +YGFV +E     ++ ++  +  +DP    
Sbjct: 259 LQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTDAKQIFLDLEIPDDDPLRMA 318

Query: 203 KRMVAQ 208
           K+M  +
Sbjct: 319 KQMFCK 324


>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
           strain 10D]
          Length = 576

 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 3/62 (4%)

Query: 127 LLAYSSKCKAMLAA--VDDAVQLVV-DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 183
           L  +SS+ ++ +A     DA  L + +R   AG+ + +  G   N +LL  YGFV+EDNP
Sbjct: 411 LFNHSSRVQSKVAYEYFYDAFSLSISNRDTHAGDQVFISYGTLTNDELLALYGFVEEDNP 470

Query: 184 YD 185
           +D
Sbjct: 471 HD 472


>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
 gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
 gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
          Length = 451

 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)

Query: 146 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 199
            L  D  Y+AGE I +  G   N+KLL+ YGF    NP D   VE  + T       DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333

Query: 200 YQDKRMVAQR-----NGKLSVQVFHV 220
            +  R+  ++     + +L  Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359


>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 495

 Score = 40.8 bits (94), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 88/199 (44%), Gaps = 21/199 (10%)

Query: 151 RPYKAGESIVV-WCGPQ--PNSKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQ 201
           RP   G+ ++  + G +   N +L+++YG    +NP D + +      E A+  +     
Sbjct: 282 RPVAEGQELLTPYGGAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRL 341

Query: 202 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPIC 258
              M   R  +L + V        KE    +L + R+ YVS   D S+++ V+  +    
Sbjct: 342 LMAMSLDRFDRLQLPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHR 396

Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EK 314
            ++P  ER  L+ L       +  Y  T+ EDE ML + +    P       +V R+ EK
Sbjct: 397 AINPSNERRALELLLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEK 456

Query: 315 KMLNACLQVTADMIMLLPD 333
           ++L++  Q+    I  LP+
Sbjct: 457 RILSSLWQLLDSAIEALPE 475


>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 372

 Score = 40.8 bits (94), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           R A++P+          C+A  A+  +    + DR Y+ GE + +  G      LL  YG
Sbjct: 179 RLAILPVADLFNHADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYG 236

Query: 177 FVDEDNPYDRLVVEAAL 193
           FV  +N +D + ++ A+
Sbjct: 237 FVPTENCWDVVCLDEAI 253


>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
 gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
          Length = 536

 Score = 40.8 bits (94), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 80/187 (42%), Gaps = 16/187 (8%)

Query: 41  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
           LA L G+    EI++  + ++++Y+EL  +      L   +P     +  T++ F  A  
Sbjct: 266 LAALEGTLLFDEIIQARQHLRQQYDELFPL------LCTNFPEMFRKDVCTWDDFLWACE 319

Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGE 157
              S  + +   S      LVP+   L    S        VD+A + +   + RP  AGE
Sbjct: 320 LWYSNSMMIVLSSGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGE 379

Query: 158 SIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 216
              +  G  P S L+  YGF+   DNPYD + ++      D    D+ + AQ +   S Q
Sbjct: 380 QCFLSYGKHPGSHLVTFYGFLPRGDNPYDVIPLDL-----DTSVDDEDIAAQSSATTS-Q 433

Query: 217 VFHVHAG 223
             H+  G
Sbjct: 434 TTHMVRG 440


>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
          Length = 699

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 142 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           D+A Q   +VV R   AG+ +++  G   N +LL +YGFV +DNP+D   ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450


>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
          Length = 577

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 87/222 (39%), Gaps = 58/222 (26%)

Query: 14  PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 72
           P+I  L   R  G     SP  W+E E + +    T A++      I    N+L   W+ 
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157

Query: 73  MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 116
           +   L   FQ Y Y    + F            FE F    V+  S   +L   ++  +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217

Query: 117 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 155
            F      P L++ + +C+     ML  + D                  +    ++  K 
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 197
           G+ I    GP+ N +LL  YGFV ++N YD+ V+  AL  +D
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL--ALRLQD 311


>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score = 40.8 bits (94), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 25/43 (58%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           + YK GE +++  G  PN +LL+ YGFVD+ N  D    E  L
Sbjct: 350 KSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLL 392


>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
 gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
          Length = 685

 Score = 40.8 bits (94), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            ++ P+   L A      A L      +++   +P  AGE I       PNS LL  YG 
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427

Query: 178 VDEDNPYD------RLVVEAALN 194
           VDE N  D      +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450


>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
 gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
          Length = 524

 Score = 40.8 bits (94), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 98/228 (42%), Gaps = 32/228 (14%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            A+VP+   L A      A L   ++ ++++  RP K GE I    G  PN++LL  YG 
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335

Query: 178 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 218
           VD        + NP D       L+V  A     L+T+D   +    + +    +    F
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGDDVFDLYF 395

Query: 219 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
            +      E    ++  +RL  + D  E + +        P  P M+   L  L +  + 
Sbjct: 396 DL------EIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445

Query: 279 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 325
           RL  YP ++ +DE +L T  +L+ +  +  +L   EKK+L+  L  TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491


>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
          Length = 320

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 40/168 (23%), Positives = 69/168 (41%), Gaps = 29/168 (17%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SF+ PY+  L          V++P+ WS+ E   L GSP     ++    + R + E   
Sbjct: 87  SFFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE--- 136

Query: 70  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 129
              M   +F +YP   P   F++E ++ A+  ++S             + L+PL   +  
Sbjct: 137 ---MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG--------NYTLMPLIDLMNH 185

Query: 130 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINY 175
           +     A     D +  L+  R Y       VW   G + ++ LL+NY
Sbjct: 186 HPDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLLLNY 227


>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 457

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 29/48 (60%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
           DDA+ + V +  KAGE +    G   N+KLL +YGF   DNP D++ +
Sbjct: 232 DDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNPADKVTI 279


>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
          Length = 377

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/142 (21%), Positives = 60/142 (42%), Gaps = 15/142 (10%)

Query: 54  LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ--- 110
           LE    ++R   E    W      F++   D+P + +T+     A++ V +   + +   
Sbjct: 120 LESQVTLERREKEFQDNW----DDFKEAFPDVPRDDYTY-----AWLVVNTRTFYHETPE 170

Query: 111 --KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 168
             K     R AL+P+       +  C+   +  +    +V DR YK GE + +      N
Sbjct: 171 TLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYHVVADRAYKKGEELFISYSSHSN 229

Query: 169 SKLLINYGFVDEDNPYDRLVVE 190
              L+ YGF+ ++N  D + ++
Sbjct: 230 DYNLLEYGFIPDENSLDDVYID 251


>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
           206040]
          Length = 463

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 61/269 (22%), Positives = 108/269 (40%), Gaps = 41/269 (15%)

Query: 32  SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 91
           S + ++E EL    G+       +  E I+ +Y        +   +F Q+P  +P    +
Sbjct: 107 SSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR------VLVMRVFTQHPDLLPLAKIS 160

Query: 92  FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---------LAYSSKCKAMLAAVD 142
            + +K A   V S  +            ++P G PL         + +S + K   A   
Sbjct: 161 IQDYKWALCTVWSRAMDF----------VLPNGKPLRVLAPFADMINHSPEVKQCHAYDP 210

Query: 143 DAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
            +  L V   + Y+ G+ I +  G  PN++LL  YGFV  +NP D   +  + +   P Y
Sbjct: 211 SSGNLSVLAGKDYEIGDQIYISYGSIPNNRLLRLYGFVIPENPNDSYDLVLSTHPMAPFY 270

Query: 201 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV-ISSL 254
           + K+ +    G  S     +        + D LP     YLR+  + D S++ ++ +  L
Sbjct: 271 EQKQKLWASAGLDSASTIPL-------TLIDPLPKSVLRYLRIQRL-DASDLAAIALQKL 322

Query: 255 GPICPVSPCMERAVLDQLADYFKARLAGY 283
                +S   E  +L  L +   A L G+
Sbjct: 323 DTNEKISNSKEVEILQFLVESISALLDGF 351


>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
 gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
          Length = 471

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 51/221 (23%), Positives = 90/221 (40%), Gaps = 36/221 (16%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYN 65
           G++SFW PYI+ L +       A+  PLLW E++L +L G+  +  +  ++ + +KR   
Sbjct: 121 GEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQKVDHVKR--- 175

Query: 66  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-----------------H 108
                W  A    Q+Y +D P++ FT E+   A+    S                    H
Sbjct: 176 -----WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPDVANIKPEFQH 228

Query: 109 LQKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 166
           L K  +        L P L  L ++ +        D  + +  +   K G+   +    +
Sbjct: 229 LVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKNIELKPGDPFYIAYDKE 288

Query: 167 P----NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
                N+ LL ++GF+  DN    LV+ +  +   P + D 
Sbjct: 289 TQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHLDH 329


>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
          Length = 482

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 48/246 (19%)

Query: 7   GKKSF----WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           G ++F    W  YIR L R        +  P +W+  E   L G+  +A +  +   +  
Sbjct: 111 GTRAFASTPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSS 163

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           EY++L      A +L   +   + +E+ T E +  A    +S  + L +       A+VP
Sbjct: 164 EYDKLCE---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP 214

Query: 123 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFV 178
            G  +  +S    A      D   +++ RP     AG  I +  G  +P +++L +YGF+
Sbjct: 215 -GLDMANHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFI 273

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           D+D+    L +      +DP              L    FH++ G          P +RL
Sbjct: 274 DKDSTVKELTLHLEALPDDP--------------LGRAKFHIYKGP---------PTVRL 310

Query: 239 GYVSDT 244
             ++D 
Sbjct: 311 SIINDN 316


>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
 gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
          Length = 263

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 33/63 (52%)

Query: 134 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           C   +   D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL
Sbjct: 15  CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74

Query: 194 NTE 196
            T+
Sbjct: 75  VTK 77


>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
          Length = 371

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           R   +P           CK   +A+  +VQ   DR Y  GE + V  GP  N  LL  YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235

Query: 177 FVDEDNPYDRLVVEAAL 193
           F+ + N +D + ++  +
Sbjct: 236 FILDTNRWDEVYLDEVI 252


>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 684

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 80/188 (42%), Gaps = 11/188 (5%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
           +++ + K S W PY+  L R    G+L   S L +   +L +L  +          + +K
Sbjct: 117 HQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQMYHDAVK 171

Query: 62  REYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            EY+   ++    G  L + Y ++I   A+T  I  +AF + +    ++ K    R+   
Sbjct: 172 TEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPALRQDDE 229

Query: 121 VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             +  PL+  S+     K    A    + L V  P  A E I    GP  N +L+  YGF
Sbjct: 230 FQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQLMATYGF 289

Query: 178 VDEDNPYD 185
              DNP D
Sbjct: 290 CIVDNPCD 297


>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 441

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 29/46 (63%)

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           + LV+  P   G  ++   GP+PN++L++ YGF   +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295


>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
 gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
          Length = 508

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L + + +C A L   D  +++   +P K GE +    G  PN+++L  YG+V
Sbjct: 215 SMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNAEILRRYGYV 274

Query: 179 D 179
           +
Sbjct: 275 E 275


>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
 gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
          Length = 395

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 37/79 (46%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 229 ALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 288

Query: 179 DEDNPYDRLVVEAALNTED 197
           D++N  D + +   L+  D
Sbjct: 289 DDNNLKDYVNIRVGLSLTD 307


>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
            V+L+  R   +GE I +  G   N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317


>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 265

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)

Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 211
           YK GE +    G + N++LL+ YGF   DN ++ + +        P  +Q         G
Sbjct: 18  YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77

Query: 212 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 261
            +         GR +    D+L    L +   T   SE+ +   +L   C ++       
Sbjct: 78  SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129

Query: 262 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 294
                   PC   ER     L  +  A LA +P ++ EDE  L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172


>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
 gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
          Length = 494

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 37/63 (58%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           ++VPL   L A + +  A L    +++++V  +P K G+ +  + G  PN+++L  YG+V
Sbjct: 226 SMVPLADTLNADTKQFNAHLVYDKESLKMVSVKPIKMGQQVYNFYGEHPNAEILRRYGYV 285

Query: 179 DED 181
           + D
Sbjct: 286 EWD 288


>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
 gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
          Length = 2474

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           ++ L+       G+ +    GP+PNS+L+++YGF  +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255


>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
 gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
          Length = 529

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 63/263 (23%), Positives = 112/263 (42%), Gaps = 49/263 (18%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG- 59
           + E+++G++S +  Y++ L          + SPL W+  EL+ + G+    ++L+ A G 
Sbjct: 147 LKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNAAGY 195

Query: 60  ---IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 114
              ++  Y  L T       +F ++   +D+   AF+ + F+ AF  ++S          
Sbjct: 196 DGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMAPCD 245

Query: 115 ARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPYKAG 156
               ALVP G  L+ +SS  +A       +  AV         DD V   V  DR     
Sbjct: 246 GANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRALNVN 304

Query: 157 ESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
           E + V   P+  ++   +++GFVD   P     +  ++  +DP   DK  V    G    
Sbjct: 305 EPLYVNYNPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKLDVLDVCGLGET 364

Query: 216 QVFHVHAGREKEAISDMLPYLRL 238
             F + A  + +   D+  +LRL
Sbjct: 365 PTFTLRAYSDPD--PDLRTFLRL 385


>gi|453088140|gb|EMF16181.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 307

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 45/185 (24%), Positives = 70/185 (37%), Gaps = 24/185 (12%)

Query: 33  PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 92
           PL WS     +L  S    + LER     RE  E D  + ++         +   + +  
Sbjct: 112 PLFWSPELCHHLPES--VKQPLERM----REDYEKDLTYMLS--------LNCDDQTWKE 157

Query: 93  EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 151
           E FK  +  V S   H +   L   F  L P    +    +     ++      ++V DR
Sbjct: 158 EDFKYYWAIVNSRCFHFKPPGLKPGFMVLCPFIDYMNHGPTGTGVKVSQSPKGYEVVADR 217

Query: 152 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 202
            Y+    I+   G  PN KLL++YGF          D+D   D +++ A       Q QD
Sbjct: 218 DYEPNTEILATYGSHPNDKLLVHYGFCLSYKPNEPSDDDIRLDHILLPAMSANTKSQLQD 277

Query: 203 KRMVA 207
             M+ 
Sbjct: 278 VGMLG 282


>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
          Length = 440

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 52/232 (22%), Positives = 92/232 (39%), Gaps = 32/232 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       +   P+   E E+  L   P KA+  E+   +++
Sbjct: 117 EKHAGDQSVWKPYLDILPK-------SYTCPVCL-EPEVVNLLPEPLKAKAEEQRMSVQQ 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
            +      +     LF++    +    F++     A+  V +  V+L+     RR   + 
Sbjct: 169 FFASSRDFFSSLQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLS 220

Query: 123 LGPP---------LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 171
           L P          LL +S   + KA         ++     Y+  + + +  GP  N +L
Sbjct: 221 LEPDTCALAPYLDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFICYGPHDNHRL 280

Query: 172 LINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           L+ YGFV   NP+      R ++   L + D Q   K  + + +G L    F
Sbjct: 281 LLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLENLTF 332


>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
          Length = 1658

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 68/297 (22%), Positives = 115/297 (38%), Gaps = 25/297 (8%)

Query: 1    MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
            +YEK +   SFW P+   L            + + ++ TEL  L G+     + E    I
Sbjct: 1266 IYEK-ENPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQI 1313

Query: 61   KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            K     +  + F    L  QYP   P   FT E F  A     S  + L K+       L
Sbjct: 1314 KEHLESIRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCL 1370

Query: 121  VPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            VP+   +  +      +      +D  +++      A   I +  G   + +L + YGFV
Sbjct: 1371 VPMADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWELALYYGFV 1430

Query: 179  DEDNPYDRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 237
              +N YD + +   +  ED P+ ++++        L+V   ++H        S +L  LR
Sbjct: 1431 ISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN---IPSKLLASLR 1487

Query: 238  LGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 294
            +  +++  E    +    PI   S   E  VL  L       L  + +T  ED+ +L
Sbjct: 1488 VALLAE-DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSSTCDEDQQLL 1540


>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
          Length = 1916

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 188
           DA  +   R Y AG+ +    G + N++L+ NYGF++  NP+D  V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339


>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
          Length = 486

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 47/97 (48%), Gaps = 13/97 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EKK G+KS W+PYI  L +        + S + W E EL+ +  S    E +++   I++
Sbjct: 124 EKKMGQKSRWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEK 178

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 99
           +++      F+A +  Q  P  I TE    E F  A+
Sbjct: 179 DFS------FVAQAFKQHCP--IVTERPDLEDFMYAY 207


>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
 gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
          Length = 573

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/95 (24%), Positives = 44/95 (46%), Gaps = 3/95 (3%)

Query: 115 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKL 171
           AR   +  L P     + +C  + +  D A Q         +KAGE   ++ G + N+  
Sbjct: 339 ARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECTAQEAFKAGEQFFIYYGDRSNADR 398

Query: 172 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 206
           L+++GF+D  N  D + +   L+  DP  + + ++
Sbjct: 399 LVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433


>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 79/180 (43%), Gaps = 23/180 (12%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQ 208
           + YK GE +++  G   N +LL +YGF+ ++NP D+  +  E  +       +D   + Q
Sbjct: 257 KNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYASSSWPKDSLYIHQ 316

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
            NGK S                 +L  LRL + +  S+ +SV   +     +S   E  V
Sbjct: 317 -NGKPSFA---------------LLSALRL-WATPASQRRSVGHLVYSGTQLSSENEIFV 359

Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 328
           ++ +A      L   P ++ ED  +L+      + ++A Q     K++L  C+    ++I
Sbjct: 360 MEWIAKSCHVVLENLPTSVEEDSLLLS----MERWKLAVQWRLRHKRILVDCISRCTEII 415


>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
          Length = 549

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 69/151 (45%), Gaps = 14/151 (9%)

Query: 41  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
           +  L G+    EI++  + +  +Y+EL    F A  L   +P   P E +T+E F  A  
Sbjct: 260 ITMLDGTLLLEEIMQARQHLHAQYDEL----FPA--LCNNFPDIFPPELYTWEKFLWACE 313

Query: 101 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVD---DAVQLVVDRPYKA 155
              S  + +       R  L+PL   L    S C  ++    VD   ++++  + RP ++
Sbjct: 314 LWYSNSMKIMYSDGKLRTCLIPLAGFL--NHSLCPHVMHYGKVDPATNSLKFCLSRPCRS 371

Query: 156 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
           GE   +  G   +S L+  YGF+ + DN YD
Sbjct: 372 GEECCLSYGNFSSSHLITFYGFLPQGDNSYD 402


>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
 gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
           Y34]
 gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
           P131]
          Length = 390

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 2/84 (2%)

Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
           +++    R  L P+        + C   +A  D+   +  DR Y AGE +++  G   N 
Sbjct: 173 ERLGKEDRMVLQPVADLFNHADAGC--AVAFNDEDFTIRADRDYDAGEEVLICYGNHSND 230

Query: 170 KLLINYGFVDEDNPYDRLVVEAAL 193
            LL  YGFV   N +D + ++ A+
Sbjct: 231 FLLAEYGFVLAANRWDEVCIDDAI 254


>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
          Length = 424

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 59/262 (22%), Positives = 102/262 (38%), Gaps = 40/262 (15%)

Query: 31  ESPLLWSETELAYLTGSPTKAEILER-AEGIKREYNELDTVWFMAGSLFQ-QYPYDIPTE 88
           ES   W++ EL  L          ER  +   R  +E+D  +       Q +YP + P  
Sbjct: 118 ESVQDWTQAELGELHD--------ERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEA 169

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAV 145
            +TF+ FK A+  +Q+        +  RR    ALVP    L   +   K      D+ +
Sbjct: 170 LYTFDRFKFAWKTIQA-------RTFGRRLPWTALVPFADCLNHTNVATKYDFDVNDNGL 222

Query: 146 QLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 202
             +       +  G  +    G + N +LL++YGF   DN +D + VE   +   P    
Sbjct: 223 FRLYPSGATSFAQGAEVFNSYGRRSNFQLLLDYGFALPDNEWDYVDVEIGKDRAGP---- 278

Query: 203 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD------TSEMQSVISSLGP 256
                 R  KL      V   R+  ++ ++ P   L  ++D       SE  + +S    
Sbjct: 279 ------RGRKLRFMKRVVRIDRQS-SLDELFPPSFLAGLADPVPDEEQSEAAAELSERTA 331

Query: 257 ICPVSPCMERAVLDQLADYFKA 278
           +C     +   +++ +AD+  A
Sbjct: 332 LCDALEWLRSILIETIADWGTA 353


>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
 gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
          Length = 555

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 83/184 (45%), Gaps = 27/184 (14%)

Query: 32  SPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAGS--LFQQYPYDIPT 87
           S + ++E EL    G+   T  + L+R+  I+ +Y  L  V  +A S  LF       P 
Sbjct: 194 SSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYRTL-VVRVLAQSRDLF-------PL 243

Query: 88  EAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVPLGPPLLAYSSKCK--AMLAA 140
           + F+ E +K A   V S  +         + L   FA       +L +SS+ +   +  A
Sbjct: 244 DKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPFA------DMLNHSSEVEPCHIYDA 297

Query: 141 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 200
               + ++  + Y+AG+   ++ G  PNS+LL  YGFV   NP D   +  + +   P +
Sbjct: 298 SSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYGFVMPGNPNDSYDLVISTHPSAPFF 357

Query: 201 QDKR 204
           + K+
Sbjct: 358 ERKQ 361


>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
           domestica]
          Length = 437

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 46/194 (23%), Positives = 81/194 (41%), Gaps = 19/194 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G KS W PY+  L +          + L+  E E+  L   P + +  E+   +++
Sbjct: 118 EKHAGNKSPWKPYLDVLPK--------AYTCLVCLEPEVVRLLPRPLQMKAEEQRMQVQK 169

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFA 119
            +      +    SLF +   D+    F +  F  A+  + +  V   H QK  L+    
Sbjct: 170 LFISSRGFFSSLQSLFTE---DV-KHVFHYHAFLWAWCTINTRTVYMKHAQKQCLSAEPD 225

Query: 120 LVPLGP--PLLAYSSKCKAMLAAVDDAV--QLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
           +  L P   LL +S +     A  ++    ++      K  E + +  GP  N +LL+ Y
Sbjct: 226 VYALAPYLDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFICYGPHDNHRLLLEY 285

Query: 176 GFVDEDNPYDRLVV 189
           GFV  +NP+  + +
Sbjct: 286 GFVASNNPHSAVYI 299


>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
          Length = 606

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           +Q     P K G  + +  GP  N++LL+ YG+ ++DNPY    +E  L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488


>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
 gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
          Length = 544

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 37/79 (46%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P          K  +  AAV   ++       +AGE   ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378

Query: 179 DEDNPYDRLVVEAALNTED 197
           D +N  D + +   L+  D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397


>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 492

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            +VPL   L A + +  A L   +D++ +   +P  AG+ I    G  P S LL  YG+V
Sbjct: 234 GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGDEIFNDYGELPRSDLLRRYGYV 293

Query: 179 -DEDNPYDRLVVEAALN 194
            D   PYD  V+EA+L+
Sbjct: 294 TDNYAPYD--VIEASLD 308


>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
 gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
          Length = 1335

 Score = 39.7 bits (91), Expect = 1.9,   Method: Composition-based stats.
 Identities = 49/192 (25%), Positives = 78/192 (40%), Gaps = 19/192 (9%)

Query: 1    MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
            +YEK +   +FW P+   L            + + +S TEL  L G+   AE L     +
Sbjct: 885  IYEK-ENPNTFWRPFYDTLPSY-------FTTSIHYSSTELLELEGTNLFAETL----AV 932

Query: 61   KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            K++        F    L  QYP   P   F++E F  A   + S  + L K+    +  L
Sbjct: 933  KQQLQAFRDYLF--PELSNQYPDIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCL 989

Query: 121  VPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            VP+   ++ + +  +      D   +  ++V      A   I +  G   NS L + YGF
Sbjct: 990  VPMAD-MINHHTNAQISERHFDQDSNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGF 1048

Query: 178  VDEDNPYDRLVV 189
            V  +N YD   V
Sbjct: 1049 VIPNNIYDSFHV 1060


>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
 gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
          Length = 1017

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/88 (25%), Positives = 38/88 (43%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P        S +      +    V         AGE   ++ G + N++ L+N GFV
Sbjct: 802 ALIPFWDMANHGSGRITTFFDSTAGEVSCNAQAACSAGEQFFIYYGDRTNTEFLVNNGFV 861

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMV 206
           D DN  D + +   L+  DP  + + ++
Sbjct: 862 DPDNRNDYVNIRLGLSPTDPLAEKRAII 889


>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
           gorilla]
          Length = 440

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 90/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G++S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGRRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|392587357|gb|EIW76691.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 430

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 30/51 (58%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           + +  + +V    +  G+ I    GP+PNS+L++ YGF   +NP D +V++
Sbjct: 230 STETTISIVSHDAFNEGQEIYNNYGPKPNSELILGYGFSLPNNPDDTIVLQ 280


>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
 gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
          Length = 539

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 62/311 (19%), Positives = 118/311 (37%), Gaps = 36/311 (11%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + EK +G  S W P+I  L  +         + L ++  ++  L G+   +  +     I
Sbjct: 196 IIEKVRGAASNWQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVI 248

Query: 61  KREYNEL---------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHL 109
            R Y  +         D+V     +LF +Y   Y++   A +    +Q  V  Q      
Sbjct: 249 ARIYASMYKCAYMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATDS 307

Query: 110 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 169
             V  +   AL+P          K  +        ++ +    +KAGE   ++ G + N+
Sbjct: 308 DGVRNSPMSALIPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCNA 367

Query: 170 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGR 224
             L+++GF+D +N  D + +   L+  D   + + ++      +R  +L V     H   
Sbjct: 368 DRLVHHGFLDMNNLKDYVHIRLGLSPTDALAEQRALLLSELNIERKAELRVLPAPEHISG 427

Query: 225 EKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 278
           E      +L ++R+  +S        S+++  +  L   C +   +E      L    K 
Sbjct: 428 E------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKL 481

Query: 279 RLAGYPATLSE 289
            L    ATL E
Sbjct: 482 LLGVLDATLKE 492


>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
          Length = 492

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 134/337 (39%), Gaps = 48/337 (14%)

Query: 9   KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKR----- 62
           +S W PY   L  +       ++S + WS  ELA L  S    ++  ++AE I       
Sbjct: 104 ESKWAPYFNVLPTK-------LDSLVFWSPEELAELQASAVLKKVGKDKAEEIFHQSISK 156

Query: 63  ---EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH-LQKVSLARRF 118
              E  ++D ++    S    Y +DIP      E   +        V    QK SLA   
Sbjct: 157 VTPEGTDVD-IFHRVASTIMAYAFDIPD----IEQEDEEGANEDDLVDDDEQKTSLA--- 208

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            ++PL   +L   +   A L    + +++    P K GE I+   G  P S LL  YG+V
Sbjct: 209 -MIPLAD-MLNADADNNARLHYDGEELEMRTINPIKTGEEILNDYGQLPRSDLLRRYGYV 266

Query: 179 -DEDNPYDRLVVEAALNT-EDPQYQD---KRMVAQRNGKLSVQVFHV------------- 220
            D+   +D  V E + +T  D  YQD   +  V  R G++  ++                
Sbjct: 267 TDKYATFD--VAEISTSTITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAHDVG 324

Query: 221 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 280
           HA  E   ISD L  L    +     + ++ SS       S      V   L    + R 
Sbjct: 325 HATEEWPCISDELVALVYLLLVGEETLAAIQSSKMSFPSRSKMETELVGKALQRILERRE 384

Query: 281 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 317
             Y  T+ EDE +L   N   + ++A Q VRM +K++
Sbjct: 385 REYATTVVEDENLLQSGNHSNRVKMAIQ-VRMGEKVV 420


>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
           DL-1]
          Length = 499

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 34/61 (55%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           ++VPL   L + ++   A L+  +DA+ +   +  K GE I    G  PNS++L  YG+V
Sbjct: 219 SMVPLADTLNSNTNLVNANLSYENDALVMTATKDIKKGEQIYNIYGELPNSEILRKYGYV 278

Query: 179 D 179
           +
Sbjct: 279 E 279


>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
 gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
          Length = 537

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 36/79 (45%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378

Query: 179 DEDNPYDRLVVEAALNTED 197
           D+ N  D + +   L+  D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397


>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 693

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 47/188 (25%), Positives = 79/188 (42%), Gaps = 11/188 (5%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 61
           +++ + K S W PY+  L R        + S L +   +L +L G+            +K
Sbjct: 114 HQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQAYMNAVK 168

Query: 62  REYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            EY+   ++    G L  + Y +D+   A+T  I  +AF + +   V+L +    ++   
Sbjct: 169 EEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPALKQDEE 226

Query: 121 VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             +  PL+  S+     K    A    + L V  P  + E I    GP  N +L+  YGF
Sbjct: 227 FQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQLMTTYGF 286

Query: 178 VDEDNPYD 185
              DNP D
Sbjct: 287 CIVDNPCD 294


>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
 gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
          Length = 323

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 24/35 (68%)

Query: 156 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           GE +V+  G + N +LL  +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233


>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
 gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
          Length = 537

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 49/215 (22%), Positives = 86/215 (40%), Gaps = 26/215 (12%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+ +G+KS W  YI  L +         ++PL +++ EL  L  +       +R    K 
Sbjct: 86  ERLKGEKSQWAKYIEYLPK-------TFDTPLYFTDDELKSLEHTNIFYGCNDRKRIWKE 138

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 122
           E+         A  L      D P + F++ ++  A     S       +         P
Sbjct: 139 EHA-------TAAKLL-----DNP-DNFSWNMYLWAATVFSSRCFSSALLGEEDTDDAAP 185

Query: 123 LGPPL---LAYSSKCKAMLAAV---DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
           +  PL   L +  +C  +   V     AVQLV  +P  +G  +    GP+ N +LL+ YG
Sbjct: 186 ILIPLVDSLNHKPRCPIIWNKVTKESHAVQLVSVKPISSGGQVYNNYGPKGNEELLMGYG 245

Query: 177 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 211
           F   +N ++   +  +L+      + KR +   +G
Sbjct: 246 FCLPNNEFETFALRLSLDKAVYNSEKKRSILASHG 280


>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 467

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 43/181 (23%), Positives = 72/181 (39%), Gaps = 13/181 (7%)

Query: 9   KSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           KSFW PYIR L +  Q  + Q A+     W + E   L G+  +  I +    ++R+  E
Sbjct: 106 KSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDKIRNDVRRDLQE 163

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFE--IFKQAFVAVQSCVVHLQKVSLARRFALVPLG 124
              +  + G     +   + TE + +   IF          +   Q+ SL R   +    
Sbjct: 164 AQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRSLPRGVTMDDFS 223

Query: 125 PPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
             L  +      M   +    DD  Q   L V R +  G+ +      + N++LL+ YGF
Sbjct: 224 VLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMKTNAELLLGYGF 283

Query: 178 V 178
           +
Sbjct: 284 M 284


>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 49/201 (24%), Positives = 79/201 (39%), Gaps = 25/201 (12%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------PTKAEI 53
           M E   GK S W  Y+  +      G     S L W + ++  L G+           E 
Sbjct: 115 MCEMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDET 170

Query: 54  LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKV 112
           L+     +  +  + T++F    LF +    +  EA         A VA  S  +   ++
Sbjct: 171 LDLPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEI 227

Query: 113 SLARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPN 168
                 A+VP    +L ++  C+A +    D     +Q++  R  K GE +    GP  N
Sbjct: 228 Q-----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTYGPLRN 281

Query: 169 SKLLINYGFVDEDNPYDRLVV 189
           ++LL  YGFV   NP+    V
Sbjct: 282 AELLRRYGFVLPRNPHGGTTV 302


>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 363

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 74/327 (22%), Positives = 123/327 (37%), Gaps = 74/327 (22%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEG 59
           M E+++G++S W P++  L         A  SP+LW  E +   L  SPT  E   R   
Sbjct: 41  MAERQKGEESRWAPFLECLPE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAA 93

Query: 60  IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 119
           +++E++                                  +A +      ++ S      
Sbjct: 94  LQQEWD---------------------------------VIAQRIATGDARRFSGGDELK 120

Query: 120 L-VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           L + LG P   +      +L A+ D                      +PN +L +  G V
Sbjct: 121 LWITLGSP--GWGGTSDKLLMAIYDG---------------------RPNGELAMATGRV 157

Query: 179 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 238
           ++DN  D L V   L   D  +  K+ + +  G   VQ F +   R     + +L YLRL
Sbjct: 158 EDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR---MPTQLLAYLRL 214

Query: 239 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP----ATLSEDEAML 294
             ++D + +  V  S      ++P  E  VL  L    + RL  Y      +  ED  +L
Sbjct: 215 ARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGMHMGSAEEDVKLL 272

Query: 295 TDYNLHPKKRVATQLVRMEKKMLNACL 321
               L  ++R+A +L + EK +L   L
Sbjct: 273 QRPGLTAQERLAARLRKAEKAILQGTL 299


>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
 gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
          Length = 548

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 36/79 (45%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 323 ALIPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV 382

Query: 179 DEDNPYDRLVVEAALNTED 197
           D +N  D + +   L+  D
Sbjct: 383 DVNNLKDYVNIRVGLSPTD 401


>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
 gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
          Length = 658

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)

Query: 142 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 193
           D+A Q   +VV RPY+ GE +++  G   N +LL  YGFV E N +D   ++ AL     
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371

Query: 194 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 220
                   P           NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402


>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
 gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
          Length = 558

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 36/149 (24%), Positives = 66/149 (44%), Gaps = 20/149 (13%)

Query: 46  GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 105
           G+    EI++  E ++ +Y+EL        +L   YP   P E +T+E F  A     S 
Sbjct: 277 GTLLFDEIMQAKEHLRVQYDEL------VPALCNNYPDVFPPELYTWEQFLWACELWYSN 330

Query: 106 VVHLQKVSLARRFALVP--------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 157
            + ++ +    R  L+P        L P ++ Y       + ++ + ++  + RP + GE
Sbjct: 331 SMKIKFLDGKLRTCLIPIAGFLNHSLHPHIIHYGK-----VDSITNTLKFPLSRPCRVGE 385

Query: 158 SIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
              +  G    + L+  YGF+ + DN YD
Sbjct: 386 QCCLSYGNFSGAHLITFYGFLPQGDNRYD 414


>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
           B]
          Length = 457

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           D AV L++  P   G  ++   GP+PN++L++ YGF    NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323


>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
 gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
          Length = 474

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 57/228 (25%), Positives = 85/228 (37%), Gaps = 49/228 (21%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------- 53
           +YE   G  S W PY   L  +         + + WSE ELA L  S    +I       
Sbjct: 91  LYEYINGDASNWAPYFSVLPTE-------FNTLMFWSEDELAELQASAVLNKIGKEGANE 143

Query: 54  ------------------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 89
                                    +RAE ++ E N L  +    GSL   Y +D+   A
Sbjct: 144 AFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNVL--LMHKMGSLIMAYAFDVEP-A 200

Query: 90  FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 149
            + +   +   A +     L K        ++PL   L A +    A L   +  +++  
Sbjct: 201 TSRKDVDEEGFAEEEEDEALPK-------GMIPLADMLNADADCNNARLFYEEKYLEMKA 253

Query: 150 DRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTE 196
            +P +AGE +    GP P S LL  YG+V D    YD + +   L TE
Sbjct: 254 LKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYDVVEINMDLVTE 301


>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
           GS115]
 gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
           GS115]
          Length = 538

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           ++VP    L A ++ C A L    + + +   +P K GE +    G  PNS++L  YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313

Query: 179 D 179
           +
Sbjct: 314 E 314


>gi|242210759|ref|XP_002471221.1| predicted protein [Postia placenta Mad-698-R]
 gi|220729780|gb|EED83649.1| predicted protein [Postia placenta Mad-698-R]
          Length = 256

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 33/64 (51%)

Query: 144 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 203
           ++ LV+   + AG  +    GP+PN++L++ YGF    NP D +V++    +     Q  
Sbjct: 77  SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLKLGGASAAQHAQHN 136

Query: 204 RMVA 207
             VA
Sbjct: 137 NAVA 140


>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
          Length = 352

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 31/48 (64%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
           ++D +V+++ ++  K  E+I +  G  PN   L++YGFV   NPYD++
Sbjct: 136 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 183


>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
          Length = 533

 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 47/211 (22%), Positives = 89/211 (42%), Gaps = 35/211 (16%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 69
           SFW P+ +   R+       ++    W +  L  L  +  +  I +    I+ EY++LD 
Sbjct: 163 SFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRDAIRQLEALIEYEYDQLDL 215

Query: 70  VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------ 123
                 +L  ++P     + F+++ FK AF+ + S  + +  V+ A    ++P       
Sbjct: 216 -----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-SVNNAPCTVMIPFVDFFNH 269

Query: 124 -GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRPYKAGESIVVWCGPQPNSK 170
            G   +A+S   +A  A+       DD+V+      +  +  +  GE + +      N  
Sbjct: 270 NGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHSNEV 329

Query: 171 LLINYGFVDEDNPYDRLVVEAALN---TEDP 198
           LL++YGF    N +D  +V    +   T DP
Sbjct: 330 LLLHYGFALPHNEHDTFLVRLHFDREKTNDP 360


>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 429

 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 17/44 (38%), Positives = 28/44 (63%)

Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
           LV+  P   G  ++   GP+PNS+L++ YGF   +NP D +V++
Sbjct: 245 LVIHTPTPRGSELLNNYGPKPNSELILGYGFSLPNNPDDTIVLK 288


>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
 gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
 gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
          Length = 491

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 31/48 (64%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
           ++D +V+++ ++  K  E+I +  G  PN   L++YGFV   NPYD++
Sbjct: 275 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 322


>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
 gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
 gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
          Length = 416

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 93  EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 144

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 145 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 200

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 201 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 260

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 261 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308


>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
           troglodytes]
 gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
 gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
 gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
 gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
          Length = 307

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 46/199 (23%), Positives = 78/199 (39%), Gaps = 19/199 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPYDRLVVEAALN 194
           GFV   NP+  + V    N
Sbjct: 285 GFVSVHNPHACVYVSRGWN 303


>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 1 [Galdieria
           sulphuraria]
          Length = 487

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 76/325 (23%), Positives = 129/325 (39%), Gaps = 35/325 (10%)

Query: 10  SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LD 68
           S W PYI  L      G +       WS +ELA L   P   E+      I + Y E L 
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEV-----KINQYYREALY 214

Query: 69  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 127
           T  F + S     P  +  +     +F  A   VQS    +  V   + +AL+P+   L 
Sbjct: 215 TRVFESLS----SPVRVWLQNEKENVFFWALDMVQSRAFGIPDVG-NKTYALLPMMDMLN 269

Query: 128 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
              +S+   +  ++ +  ++        G  I +  GP  N  LL  YGF+  +NP D  
Sbjct: 270 HRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYF 329

Query: 188 VVEAALNTEDPQYQDKRMVAQRNG----KLSV----------QVFHVHAGREKEAISDML 233
            V+         Y+ +   AQ +     KLS+          + FH++     + I D++
Sbjct: 330 QVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGKTFHLYHDHYDDEI-DII 388

Query: 234 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 293
             + +   +D  ++Q   +       +S   +  V   +    K  L     ++ EDE +
Sbjct: 389 LRVFMASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSVEEDEQL 448

Query: 294 LTDYN-LHPKKRVATQLVRMEKKML 317
           L + + L  K ++A Q  R+EKK +
Sbjct: 449 LKNKDQLSTKLQLAIQF-RLEKKYI 472


>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
 gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
          Length = 562

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 29/59 (49%)

Query: 127 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
            L + +  K      D  +  V     K+GE +    G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282


>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
 gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
 gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
          Length = 307

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 46/199 (23%), Positives = 78/199 (39%), Gaps = 19/199 (9%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPYDRLVVEAALN 194
           GFV   NP+  + V    N
Sbjct: 285 GFVSVHNPHACVYVSRGWN 303


>gi|403331500|gb|EJY64700.1| hypothetical protein OXYTRI_15263 [Oxytricha trifallax]
          Length = 741

 Score = 38.9 bits (89), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 36/64 (56%), Gaps = 4/64 (6%)

Query: 153 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN----TEDPQYQDKRMVAQ 208
           YK G+ +    G + N  LL+NYGF   +N Y+ L     +N    +E  + ++K++ A+
Sbjct: 440 YKKGQQVFHCYGRRTNRFLLLNYGFCMNNNKYNSLSFRVWINFDWQSERQKDEEKKLKAE 499

Query: 209 RNGK 212
           ++GK
Sbjct: 500 KDGK 503


>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
          Length = 440

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S +   KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
          Length = 255

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 85/200 (42%), Gaps = 30/200 (15%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E+ +G  SFW PY+  L         +    L W+  E+  L    TK    +     
Sbjct: 62  LCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKA 113

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA-- 115
           +  +N L   +     L +Q P      AFT+++FK A+ +V +  V++   Q   L+  
Sbjct: 114 EESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPD 169

Query: 116 --RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQP 167
              + AL P    LL ++   +   A  DD+        L   +PY   + + +  GP  
Sbjct: 170 EEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHS 224

Query: 168 NSKLLINYGFVDEDNPYDRL 187
           N KLL+ YGF    NP++ +
Sbjct: 225 NEKLLLEYGFTLPCNPHNNI 244


>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
          Length = 732

 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 54/113 (47%), Gaps = 15/113 (13%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E ++ ++SFW PYIR L +       + ++ L ++  E   L GSP    +LE  E I  
Sbjct: 153 ETQKQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE--ESINT 199

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 115
             N L    F+    F + P  + T  FT+E F+Q  +A    +V L K  LA
Sbjct: 200 YRNTLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250


>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
 gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
          Length = 1339

 Score = 38.9 bits (89), Expect = 3.7,   Method: Composition-based stats.
 Identities = 45/189 (23%), Positives = 77/189 (40%), Gaps = 17/189 (8%)

Query: 1    MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
            +YEK     SFW P+   L            + + +S TEL  L G+     + E     
Sbjct: 839  IYEKGNAN-SFWRPFYDTLPSY-------FTTSIHYSATELLELEGT----NLFEETLHT 886

Query: 61   KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 120
            K++ N      F    L +QYP   P   F++E F  A   + S  + L K+  + +  L
Sbjct: 887  KQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIKSCL 943

Query: 121  VPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
            VP+   +  +++    +        + +++      A   I +  G   N +L + YGF+
Sbjct: 944  VPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQNWELALYYGFI 1003

Query: 179  DEDNPYDRL 187
              +N YD L
Sbjct: 1004 IPNNIYDSL 1012


>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
 gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
 gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
 gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
          Length = 440

 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 468

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 42/98 (42%), Gaps = 10/98 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M E+  G +S W  Y   L   RG   L    P+ W+E  L  L G+     + E AE +
Sbjct: 106 MVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEARLEALKGTDLATHVREDAENL 160

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 98
           + +Y+E      +   L   +P     E  TFE + +A
Sbjct: 161 RADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193


>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
          Length = 440

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
           troglodytes]
 gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
 gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
 gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
 gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
          Length = 440

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGHRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
          Length = 903

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 38/87 (43%), Gaps = 14/87 (16%)

Query: 7   GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 66
           G  S W P+   L            SPLL+  TEL  L G+    E  E  +G++  +  
Sbjct: 481 GSASRWAPFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHES 533

Query: 67  LDTVWFMAGSLFQQYPYDIPTEAFTFE 93
           L       G L + YP   PT+AFT+E
Sbjct: 534 L-------GPLAEAYPALFPTDAFTYE 553


>gi|242210761|ref|XP_002471222.1| predicted protein [Postia placenta Mad-698-R]
 gi|220729781|gb|EED83650.1| predicted protein [Postia placenta Mad-698-R]
          Length = 264

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 29/48 (60%)

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 190
            ++ LV+   + AG  +    GP+PN++L++ YGF    NP D +V++
Sbjct: 215 SSISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLK 262


>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 706

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 79/189 (41%), Gaps = 22/189 (11%)

Query: 7   GKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTGSPTKAEILERAEGIK-- 61
           G +SFW PYI+ L  D Q  R +      L W E T L  L     +    +   G++  
Sbjct: 137 GDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLREKLLERLKAKYETGLRLL 196

Query: 62  REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 116
           +E+   +T    W  F+  S        I + AF+ E+ K       + V  L+  S+  
Sbjct: 197 KEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYIKGTPTRVKPLEDFSV-- 248

Query: 117 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 176
              LVPL   +  +    +   A   + + L+V +    GE +    GP+ N +L++NYG
Sbjct: 249 ---LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEVPNNYGPRSNERLMMNYG 304

Query: 177 FVDEDNPYD 185
           F    N  D
Sbjct: 305 FCIRGNVCD 313


>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
 gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
          Length = 480

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 83/202 (41%), Gaps = 19/202 (9%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS------ETELAYLTGSPTKAEILE 55
           +EK++G  SFW P+I  L  +     LA   PL+W         EL  L  + TK  + +
Sbjct: 155 FEKQRGSSSFWKPFIDMLP-ETSDFDLA---PLVWKVLKVDHYEELLKLLPNSTKRHMDK 210

Query: 56  RAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFT----FEIFKQAFVAVQSCVVHL- 109
             +  + +YN + D +      +      +  T+A       E++  +++ + S  +++ 
Sbjct: 211 IYDRFQTDYNVVKDLISIKLKEISDNERSNDLTDAIRHLVPIELYLWSWMCINSRCLYME 270

Query: 110 --QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 167
             Q  + A  F + P     L +S   +  L       Q+     Y   E + +  GP  
Sbjct: 271 IPQSKNAADNFTMAPY-VDFLNHSCDDQCGLKIDGTGFQVYTTCSYNPDEQLFLSYGPHS 329

Query: 168 NSKLLINYGFVDEDNPYDRLVV 189
           N  LL  YGF   +N ++ L V
Sbjct: 330 NEFLLCEYGFTLPENKWNDLDV 351


>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
 gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
          Length = 391

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)

Query: 147 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 189
           +  DR YKAGE + V  G   N  LL+ YGF+ + N     P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250


>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 429

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 209
           + Y+ GE +++  G   N +L+  YGFVD DN  D    E  ++     Y   KR +   
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305

Query: 210 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 258
             +LS  +   H     A  E   ISD      +L  LR   V  T E  +    +    
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362

Query: 259 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 318
                 ER   D +    ++RLA YP T+ EDE  L   N    +R A Q    +K++L 
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420

Query: 319 ACLQV 323
              ++
Sbjct: 421 DASRI 425


>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
          Length = 495

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
           D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309


>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
          Length = 495

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
           D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309


>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 306

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 13/98 (13%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           MYE   GK+SFW  Y+ EL ++          PL W+E E + L G+  +    E  E  
Sbjct: 52  MYELSIGKESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELT 104

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 98
           + ++            L +Q+   +  ++FT E F+ A
Sbjct: 105 QEDFET------HVPPLVEQHADRLRADSFTLESFRVA 136


>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
 gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
          Length = 495

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 32/63 (50%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           +++PL   L A + +C A L      +++   +P KA E I    G   N +LL  YG+V
Sbjct: 220 SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQIYNTYGNHANFELLRRYGYV 279

Query: 179 DED 181
           + D
Sbjct: 280 EVD 282


>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
          Length = 440

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 50/228 (21%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L+   +  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELD 224

Query: 120 LVPLGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 579

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 28/47 (59%)

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 189
           +A  L  D+   +G+ + +  GP+ N +LL  YGFV+ +NP D  V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407


>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
          Length = 339

 Score = 38.5 bits (88), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 99  ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155


>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
 gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
          Length = 440

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 94/229 (41%), Gaps = 26/229 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+  G+ S W PYI  L +           PL + +  +  L  S  K +  ++ E  + 
Sbjct: 118 ERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQE 169

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV---HLQKVSLARRF 118
            ++   T +     LF Q     PTE  F+ +  + A+ +V +  V   H Q   L+R  
Sbjct: 170 LFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREK 224

Query: 119 ALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
            +  L P   LL +  + + +A         ++      K  +   +  GP  N +LL+ 
Sbjct: 225 DVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLE 284

Query: 175 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           YGFV   NP+     D   ++  L+ +D Q ++K +  + N  L    F
Sbjct: 285 YGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333


>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
           reading frame 18 (C21orf18)) [Danio rerio]
          Length = 440

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 94/229 (41%), Gaps = 26/229 (11%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           E+  G+ S W PYI  L +           PL + +  +  L  S  K +  ++ E  + 
Sbjct: 118 ERHHGEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQE 169

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV---HLQKVSLARRF 118
            ++   T +     LF Q     PTE  F+ +  + A+ +V +  V   H Q   L+R  
Sbjct: 170 LFSSSQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREK 224

Query: 119 ALVPLGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 174
            +  L P   LL +  + + +A         ++      K  +   +  GP  N +LL+ 
Sbjct: 225 DVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLE 284

Query: 175 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           YGFV   NP+     D   ++  L+ +D Q ++K +  + N  L    F
Sbjct: 285 YGFVAPCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333


>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
          Length = 480

 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           A+  P  +SE EL  L GS  +  +  +   +++E+  L           ++Y +D  T 
Sbjct: 125 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 183

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 144
             TF  +K      +S +V L +       A+VP +     A     KA+     D    
Sbjct: 184 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 239

Query: 145 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 180
           +QL   R  +A E + +  G + + S+L+ +YGF+DE
Sbjct: 240 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 276


>gi|403366800|gb|EJY83208.1| hypothetical protein OXYTRI_19172 [Oxytricha trifallax]
          Length = 869

 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 37/170 (21%), Positives = 75/170 (44%), Gaps = 18/170 (10%)

Query: 138 LAAVDDAVQLVVDRPYKAG---ESIVVWC-GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 193
           ++ V D    ++ R Y  G    S V  C G   N ++L  YGF    N Y+ + ++  L
Sbjct: 454 VSQVPDDFNFII-RTYNDGFPKGSQVFLCYGRMSNREMLKRYGFCLTYNKYNYIFIKLRL 512

Query: 194 NTEDPQYQDKRMVAQR-------NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 246
             +DP +  ++ V ++         K+ +   H     +K   + +L ++++ Y +   +
Sbjct: 513 EQQDPDFIYRKYVLRKFFSIEPETDKMDISSRHFRIYFQK-LNTKVLKFIKILYFNVQED 571

Query: 247 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 296
             S I     +   S  +E     +L D ++  L  +P T+ ED+ +L++
Sbjct: 572 DISCI-----VETRSLSLEYLAFQRLRDVYETFLKSFPTTIGEDKKILSE 616


>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
          Length = 199

 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 85/200 (42%), Gaps = 30/200 (15%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           + E+ +G  SFW PY+  L         +    L W+  E+  L    TK    +     
Sbjct: 6   LCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKA 57

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA-- 115
           +  +N L   +     L +Q P      AFT+++FK A+ +V +  V++   Q   L+  
Sbjct: 58  EESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPD 113

Query: 116 --RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQP 167
              + AL P    LL ++   +   A  DD+        L   +PY   + + +  GP  
Sbjct: 114 EEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHS 168

Query: 168 NSKLLINYGFVDEDNPYDRL 187
           N KLL+ YGF    NP++ +
Sbjct: 169 NEKLLLEYGFTLPCNPHNNI 188


>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 471

 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 25/48 (52%)

Query: 138 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
           L     AV L   R    GE +++  G   N  L ++YGF+ EDNPYD
Sbjct: 202 LEGGGGAVGLFARRAITEGEPLLLSYGQLSNDFLFMDYGFIVEDNPYD 249


>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 440

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 88/228 (38%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  V +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPD 224

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA         ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
           10762]
          Length = 381

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 183
           +  +++ +R Y AGE ++   G   N KLL++YGF+ E  P
Sbjct: 204 NGYEMLANRDYDAGEEVLFTYGAHSNDKLLVHYGFICESPP 244


>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
          Length = 498

 Score = 38.5 bits (88), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           A+  P  +SE EL  L GS  +  +  +   +++E+  L           ++Y +D  T 
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 144
             TF  +K      +S +V L +       A+VP +     A     KA+     D    
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257

Query: 145 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 180
           +QL   R  +A E + +  G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294


>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
          Length = 440

 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 50/228 (21%), Positives = 89/228 (39%), Gaps = 24/228 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E E+  L     KA+  E+   ++ 
Sbjct: 117 EKHAGDRSLWKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQE 168

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFA 119
            +      +     LF +    I    F++     A+  + +  V+L   Q+  L+    
Sbjct: 169 FFASSRDFFSSLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPD 224

Query: 120 LVPLGP--PLLAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S +   KA       + ++     ++  E + +  GP  N +L + Y
Sbjct: 225 TCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEY 284

Query: 176 GFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 218
           GFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 285 GFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
 gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
          Length = 429

 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 30/59 (50%)

Query: 151 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 209
           R YK GE I +  GP PN  L + YGF  E N  D + ++  +  +    + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322


>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 473

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)

Query: 29  AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 88
           A+  P  +SE EL  L GS  +  +  +   +++E+  L           ++Y +D  T 
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201

Query: 89  AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 144
             TF  +K      +S +V L +       A+VP +     A     KA+     D    
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257

Query: 145 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 180
           +QL   R  +A E + +  G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294


>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
          Length = 289

 Score = 38.1 bits (87), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 47  ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103


>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
           7435]
          Length = 846

 Score = 38.1 bits (87), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 178
           ++VP    L A ++ C A L    + + +   +P K GE +    G  PNS++L  YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313

Query: 179 D 179
           +
Sbjct: 314 E 314


>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 490

 Score = 38.1 bits (87), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 45/188 (23%), Positives = 81/188 (43%), Gaps = 27/188 (14%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K G  + W  YIR +         +V  P  +S  E   L G+  +  +  +   +++E
Sbjct: 120 QKIGISNPWTEYIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKE 172

Query: 64  YNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
           ++ L     +  W       Q++ +D  T  FTF+ +K      +S VV L +       
Sbjct: 173 FDHLRQATEEIPWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----H 222

Query: 119 ALVP---LGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLI 173
           A+VP   +       S K K       +AV QL   +  + GE + +  G + P S+++ 
Sbjct: 223 AIVPCVDMANHACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVF 282

Query: 174 NYGFVDED 181
           +YGFV+ +
Sbjct: 283 SYGFVENE 290


>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
 gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
          Length = 490

 Score = 38.1 bits (87), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 67/151 (44%), Gaps = 14/151 (9%)

Query: 41  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA-- 98
           LA L G+    EI++  + ++++Y+EL  +      L   +P     +  T++ F  A  
Sbjct: 266 LAALEGTLLFDEIMQAKQHLRQQYDELFPL------LCTNFPEIFRKDVCTWDNFLWACE 319

Query: 99  FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKA 155
                S +V L    L+    LVP+   L    S        VD+A + +   + RP  A
Sbjct: 320 LWYSNSMMVVLSSGKLST--CLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDA 377

Query: 156 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
           GE   +  G  P S L+  YGF+   DNPYD
Sbjct: 378 GEQCFLSYGKHPGSHLVTFYGFLPRGDNPYD 408


>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
 gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
          Length = 401

 Score = 38.1 bits (87), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 10/98 (10%)

Query: 1   MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 60
           M+E+  G+ S W  Y   L     RG+  +  P+ W+  +L +L G+     + E AE +
Sbjct: 59  MHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSAQLEHLRGTDLLRHVTEDAESM 113

Query: 61  KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 98
           + ++NE      +   L   +P   P    T E + +A
Sbjct: 114 RLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146


>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
           distachyon]
          Length = 532

 Score = 38.1 bits (87), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 77/172 (44%), Gaps = 22/172 (12%)

Query: 41  LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 100
           LA L G+    E+++  + + ++Y+EL  +      L  ++P     + +T++ F  A  
Sbjct: 262 LAALEGTLLFDELMQARQHLHQQYDELFPM------LCTKFPEIFTQDIYTWDNFLWACE 315

Query: 101 AVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLVVDRP 152
              S  + +   S      L+P+         P +L Y    +A       +++  + RP
Sbjct: 316 LWYSNSMMVVLSSGKLTTCLIPVAGLLNHSVYPHILNYGRVDQAT-----KSLKFPLSRP 370

Query: 153 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDR--LVVEAALNTEDPQYQ 201
            KAG+   +  G    S L+  YGF+  EDNPYD   L ++ +++ ED   Q
Sbjct: 371 CKAGQQCFLSYGKHSGSHLITFYGFLPREDNPYDVVPLDLDMSVDEEDGTAQ 422


>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
          Length = 447

 Score = 38.1 bits (87), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 72/355 (20%), Positives = 142/355 (40%), Gaps = 49/355 (13%)

Query: 2   YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------- 53
           +E  QG  S W  Y + L +Q         S + WS+ EL+ L G+    +I        
Sbjct: 93  WEYNQGGSSKWDAYFKILPKQ-------FTSLMFWSKEELSLLKGTTVVDKIGLEDIENE 145

Query: 54  LERAEGI-KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 112
            ER   I K+  N    +      LF++    I + +FT E +K      +       + 
Sbjct: 146 FERVRDIVKQNENVFGDIANYTLDLFKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDED 205

Query: 113 ------SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 166
                 +     A+VP+   L + +    A     ++ ++++  +  KAG+ I       
Sbjct: 206 EEIDLRTSVDDVAMVPMADILNSRTDSVNAHTEYEENCLRMISLQDIKAGDQIFNTYNDP 265

Query: 167 PNSKLLINYGFVD-----ED-----NPYD------RLVVEAALNTEDPQYQDKRM--VAQ 208
           PN+ L+  YG VD     +D     N  D       +++E AL      ++++R+  +  
Sbjct: 266 PNADLIRRYGHVDYSPLSQDPDFMGNKNDVVELPADILLELALPDAKESHKERRVEFLLD 325

Query: 209 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 268
             G+ S ++ H       + + ++L    L +    +E ++   S     P +    +  
Sbjct: 326 ECGEDSFELTH------DDLVPELLKICVLLFTESEAEFKTREKSRK--LPKASGFTKGK 377

Query: 269 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 322
            + L    K R+  Y +TL +D + L + +  P+      +V + E+++LN  ++
Sbjct: 378 AEFLIKAIKQRMEQYGSTLEDDISKLDNKDSLPENNFKALVVTVGERRILNKAIE 432


>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
          Length = 591

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 84/205 (40%), Gaps = 32/205 (15%)

Query: 1   MYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 59
           +YEK  QG  SFW P I  L    G       +   WSE EL  L     KAE +     
Sbjct: 99  LYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDESLKAEAMIVVAS 151

Query: 60  IKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
           +++ Y   L  +    G +F         + +T+E F+ A + V+S            RF
Sbjct: 152 MQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRTFG--------RF 196

Query: 119 ----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGESIVVWCGPQPNSK 170
               ++VP    L   + +        +     + D      ++ GE   +  GP+ N++
Sbjct: 197 LPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEEAFMSYGPRSNAE 256

Query: 171 LLINYGFVDEDNPYDRLVVEAALNT 195
           LL++YGF  + N Y+ + +   +NT
Sbjct: 257 LLLHYGFALQSNRYEAVELNFRINT 281


>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 513

 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 84/222 (37%), Gaps = 19/222 (8%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            A+VP+   L A      A L      + +V  +P + GE I    G  PNS LL  YG 
Sbjct: 240 IAMVPMADMLNAQYGSENAKLFYEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGH 299

Query: 178 VD----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 227
           VD          E NP D + + A L          R+ A+ + K  +  +    G +  
Sbjct: 300 VDLVPLPGVPWKEGNPADVVEIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVF 358

Query: 228 AIS------DMLPYLRLGYVSDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARL 280
            +       D++       +    E +   S   P  P +    +      +    + RL
Sbjct: 359 VLGTDLELPDVMISFLKLLLLSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRL 418

Query: 281 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACL 321
           A YP TL  DEA+L+     P       +VR+ EK +L  C+
Sbjct: 419 AKYPTTLEHDEALLSGQTSLPYNVRNAIIVRIGEKHILVGCM 460


>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
 gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
          Length = 490

 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 30/48 (62%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 187
           ++D +V++V ++  +   SI +  G  PN   L++YGFV   NPYD++
Sbjct: 274 SLDMSVKVVAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYDQV 321


>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
 gi|194701488|gb|ACF84828.1| unknown [Zea mays]
 gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
          Length = 495

 Score = 37.7 bits (86), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 140 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 196
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 253 ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 309


>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
          Length = 256

 Score = 37.7 bits (86), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 53/220 (24%), Positives = 89/220 (40%), Gaps = 23/220 (10%)

Query: 3   EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 62
           EK  G +S W PY+  L +       A   P+   E ++  L   P KA+  E+   ++ 
Sbjct: 20  EKHAGDQSLWKPYLDILPK-------AYTCPVC-LEPKVVNLFPEPLKAKAEEQRARVQG 71

Query: 63  EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFA 119
            ++     +     LF +   +I    F++     A+  V +  V   H Q+   +    
Sbjct: 72  FFSSSRDFFSSLQPLFSEAVENI----FSYSALLWAWCTVNTRAVYMKHGQRKCFSPEPD 127

Query: 120 LVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 175
              L P   LL +S   + KA         ++      +  E + +  GP  N +LL+ Y
Sbjct: 128 TYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFICYGPHDNQRLLLEY 187

Query: 176 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 215
           GFV   NP+  + V A L  +     DK+M    N K+S+
Sbjct: 188 GFVSIQNPHACVYVSADLLVKYLPSTDKQM----NKKISI 223


>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 37.7 bits (86), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 42/191 (21%), Positives = 80/191 (41%), Gaps = 33/191 (17%)

Query: 4   KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 63
           +K G  + W  YIR +         ++  P  +S  E   L G+  +  +  +   +++E
Sbjct: 120 QKIGVSNPWTEYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKE 172

Query: 64  YNEL-----DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 118
           +  L     D  W       Q++ +D  T  FTF+  K      +S VV L +   A   
Sbjct: 173 FEHLRQATEDIHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA--- 223

Query: 119 ALVPLGPPLLAYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQ-PNSK 170
               + P +   +  C+ ++ A  D        +QL   +  + GE + +  G + P S+
Sbjct: 224 ----IVPCVDMANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASE 279

Query: 171 LLINYGFVDED 181
           ++ +YGFV+ +
Sbjct: 280 MVFSYGFVENE 290


>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
 gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
          Length = 534

 Score = 37.7 bits (86), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 142 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 185
           D+ V +V+D+  KAG  I    GP  N  LL  +GF  E NP+D
Sbjct: 307 DECVDIVLDKDVKAGVEIFNSYGPLSNVFLLSRHGFAVEGNPHD 350


>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 541

 Score = 37.7 bits (86), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 63/151 (41%), Gaps = 21/151 (13%)

Query: 44  LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 103
           L G+    EI++  E ++  Y+EL         L   + +  P E +T+E +  A     
Sbjct: 258 LDGTLLLDEIMQAKELLRERYDELI-------PLLSNHRHVFPPEHYTWEHYLWACELYY 310

Query: 104 SCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 155
           S  + ++      +  L+P+         P ++ Y   C         +++  V RP   
Sbjct: 311 SNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVCVET-----SSLKFPVSRPCNK 365

Query: 156 GESIVVWCGPQPNSKLLINYGFVDE-DNPYD 185
           GE   +  G   +S LL  YGF+ + DNPYD
Sbjct: 366 GEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 396


>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 445

 Score = 37.7 bits (86), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 13/111 (11%)

Query: 145 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 204
           + ++  + Y+AG+ + +  G  PN++LL  YGFV   NP D   +    +   P ++ KR
Sbjct: 195 LSILAGKGYEAGDQVFIHYGSVPNNRLLRLYGFVIPSNPNDSYDLVLETHPLAPFFEQKR 254

Query: 205 MVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQSV 250
            +    G  S     +        ++D LP     YLR+   SD S++ +V
Sbjct: 255 KLWALAGLDSTSTISL-------TLTDPLPNNVLRYLRI-QRSDESDLAAV 297


>gi|156538697|ref|XP_001607787.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Nasonia
           vitripennis]
          Length = 403

 Score = 37.7 bits (86), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 33/181 (18%), Positives = 70/181 (38%), Gaps = 25/181 (13%)

Query: 34  LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 93
           L  S  ++  L  SP     L++   I R+Y+    ++  + +   +   D+    FT+E
Sbjct: 108 LYMSMDDMMELKESPALETALKQCRNIARQYSYFKKLFHNSKNSVSKLLADV----FTYE 163

Query: 94  IFKQAFVAV-----------QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 142
            ++ A   +           QS +VH          +L+P+         K       + 
Sbjct: 164 EYRWAVSTIMTRQNVIPSENQSAMVH----------SLIPMWDMCNHSEGKITTNFNEIS 213

Query: 143 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 202
           +  +    + +K  + I ++ G + N++  ++ GFV  DN  D   +   + + D    +
Sbjct: 214 NCCECYAMKSFKTDDQIFIYYGSRTNAEFFVHSGFVYPDNANDSYELHLGIGSSDKLRSE 273

Query: 203 K 203
           K
Sbjct: 274 K 274


>gi|388854849|emb|CCF51530.1| uncharacterized protein [Ustilago hordei]
          Length = 673

 Score = 37.7 bits (86), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 34/76 (44%)

Query: 118 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 177
            ++ P+   L A      A L      +++   +P  AGE I       PNS LL  YG 
Sbjct: 362 ISMTPMADMLNARFESDNARLFYKSHVLEMRATKPISAGEQIFNTYADPPNSDLLRRYGH 421

Query: 178 VDEDNPYDRLVVEAAL 193
           VDE N  D + ++A L
Sbjct: 422 VDEPNGSDVVELDAKL 437


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.135    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,264,592,823
Number of Sequences: 23463169
Number of extensions: 207579814
Number of successful extensions: 533380
Number of sequences better than 100.0: 702
Number of HSP's better than 100.0 without gapping: 353
Number of HSP's successfully gapped in prelim test: 349
Number of HSP's that attempted gapping in prelim test: 532294
Number of HSP's gapped (non-prelim): 866
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)