BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 009869
         (523 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 521

 Score =  819 bits (2116), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/510 (79%), Positives = 442/510 (86%), Gaps = 21/510 (4%)

Query: 21  HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
           H PL + S IS+S  R  +F +S   +RR N  S   SSS+TLVAGSR       E V+K
Sbjct: 24  HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81

Query: 74  KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           KE+D  GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82  KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141

Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
           AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTE
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261

Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
           AF+FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELV 321

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 372
           VDRPYKAGESI VW           CGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTED
Sbjct: 322 VDRPYKAGESIAVW-----------CGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTED 370

Query: 373 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 432
           PQYQDKRMVAQRNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGYV+  SEMQSVISS GP+
Sbjct: 371 PQYQDKRMVAQRNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPV 430

Query: 433 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           CPVSPCMERA+L+Q+ADYFK RLAGYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L
Sbjct: 431 CPVSPCMERAMLEQVADYFKRRLAGYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLL 490

Query: 493 NACLQVTADMIMLLPDVTVSPCPAPYAPLL 522
           ++CL+VT D I  LPD TVSPCPAPYAPLL
Sbjct: 491 HSCLEVTIDFINQLPDHTVSPCPAPYAPLL 520


>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Vitis vinifera]
          Length = 509

 Score =  819 bits (2116), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/473 (83%), Positives = 423/473 (89%), Gaps = 18/473 (3%)

Query: 58  SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
           S SDTLVAGSR       E   KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48  SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           LDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPP
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPP 287

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
           LLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVW           CGPQPNSKLL+NY
Sbjct: 288 LLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVW-----------CGPQPNSKLLLNY 336

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
           GFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+SDMLPY
Sbjct: 337 GFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPY 396

Query: 411 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 470
           LRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L 
Sbjct: 397 LRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLA 456

Query: 471 DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
           D NL+PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL 
Sbjct: 457 DSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509


>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Vitis vinifera]
          Length = 515

 Score =  812 bits (2098), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/479 (82%), Positives = 423/479 (88%), Gaps = 24/479 (5%)

Query: 58  SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
           S SDTLVAGSR       E   KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48  SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107

Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           HY+AASEDLQ      AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
           VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVW           CGPQPNS
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVW-----------CGPQPNS 336

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
           KLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+
Sbjct: 337 KLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAV 396

Query: 405 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
           SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SE
Sbjct: 397 SDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSE 456

Query: 465 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
           DE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL 
Sbjct: 457 DECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515


>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
 gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
          Length = 502

 Score =  800 bits (2065), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/529 (76%), Positives = 439/529 (82%), Gaps = 33/529 (6%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME +C    +K ISP        L++ S +SIS    P    S    RR   F+      
Sbjct: 1   MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45

Query: 61  DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
            TLV G R  EVVSK+    E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46  -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           WFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPPLLAY
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAY 284

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
           SS CKAML AVD AV+LVVDRPYKAGE IVVW           CGPQPNSKLL+NYGFVD
Sbjct: 285 SSNCKAMLTAVDGAVELVVDRPYKAGEPIVVW-----------CGPQPNSKLLLNYGFVD 333

Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 414
           EDNPYDR+ VEAALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLG
Sbjct: 334 EDNPYDRIAVEAALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLG 393

Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 474
           YVSD SEMQSVISS GP+CPVSPCME+AVLDQL  YF+ RLAGY  ++SEDE ML D NL
Sbjct: 394 YVSDPSEMQSVISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNL 453

Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
           +PKKRVATQLVR+EKKML ACLQ T D+I  LPD T+ PCPAPYAPLL 
Sbjct: 454 NPKKRVATQLVRLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLLK 502


>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
          Length = 428

 Score =  785 bits (2026), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/438 (85%), Positives = 401/438 (91%), Gaps = 11/438 (2%)

Query: 85  MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
           MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1   MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61  IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120

Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
           A+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIV
Sbjct: 181 AIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIV 240

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 384
           VW           CGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQR
Sbjct: 241 VW-----------CGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQR 289

Query: 385 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 444
           NGKL+VQ FHV  G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVL
Sbjct: 290 NGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVL 349

Query: 445 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIM 504
           DQL DYF+ RLAGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I 
Sbjct: 350 DQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLIN 409

Query: 505 LLPDVTVSPCPAPYAPLL 522
            LPD TVSPCPAPY PLL
Sbjct: 410 QLPDHTVSPCPAPYTPLL 427


>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
 gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
          Length = 501

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/487 (77%), Positives = 418/487 (85%), Gaps = 13/487 (2%)

Query: 36  RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
           R P+F S     RR+ R     S+SDTLVA + +   K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27  RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84

Query: 96  ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
           +LK+KPS ++  +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85  VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K 
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
           EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ 
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264

Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
           VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVW         
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVW--------- 315

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
             CGPQPN+KLL NYGFVDEDN  DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V
Sbjct: 316 --CGPQPNTKLLTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYV 373

Query: 396 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
           + G+E+EA+SDM+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF  RL
Sbjct: 374 YTGKEREAVSDMIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRL 433

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 515
           A YP TL+EDE+MLTD +L+PK+RVATQLVR+EKKML+ACLQ   D+I  LPD +VSPCP
Sbjct: 434 AAYPTTLAEDESMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCP 493

Query: 516 APYAPLL 522
           APYAP L
Sbjct: 494 APYAPSL 500


>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
 gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
 gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
          Length = 514

 Score =  769 bits (1987), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/532 (73%), Positives = 433/532 (81%), Gaps = 27/532 (5%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  +   S+ VSSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVA           +V SKKE +D  DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54  DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
           LAY S CKAML AVD AV+LVVDRPYKAG+ IVVW           CGPQPN+KLL+NYG
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW-----------CGPQPNAKLLLNYG 342

Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
           FVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYL
Sbjct: 343 FVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYL 402

Query: 412 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
           RLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D
Sbjct: 403 RLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLAD 462

Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
            +L P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 463 PSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514


>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
          Length = 514

 Score =  766 bits (1979), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/532 (72%), Positives = 432/532 (81%), Gaps = 27/532 (5%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  +   S+ VSSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVA           +V SKKE +D  DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54  DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
           LAY S CKAML AVD AV+LVVDRPYKAG+ IVVW           CGPQPN+KLL+NYG
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW-----------CGPQPNAKLLLNYG 342

Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
           FVDEDNPYDR++VEAALNTE PQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYL
Sbjct: 343 FVDEDNPYDRVIVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYL 402

Query: 412 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
           RLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D
Sbjct: 403 RLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLAD 462

Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
            +L P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 463 PSLSPRKRVATRLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSLK 514


>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score =  764 bits (1973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/532 (72%), Positives = 431/532 (81%), Gaps = 25/532 (4%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  ++   +  SSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVAG          +V SKKE +D  DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56  DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
           LAY S CKAML AVD AV+LVVDRPYKAG+ IVVW           CGPQPN+KLL+NYG
Sbjct: 296 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW-----------CGPQPNAKLLLNYG 344

Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
           FVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYL
Sbjct: 345 FVDEDNPYDRIIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYL 404

Query: 412 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
           RLGY+SD SEMQSVISS GP+C +SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D
Sbjct: 405 RLGYMSDPSEMQSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLAD 464

Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
            +L P+KRVAT+LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 465 PSLSPRKRVATRLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516


>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 1 [Glycine max]
 gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 2 [Glycine max]
          Length = 502

 Score =  761 bits (1964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/443 (81%), Positives = 397/443 (89%), Gaps = 11/443 (2%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           DLKSWMHK+GLPPCKV+LK+KP  N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70  DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
           LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
           G+ IVVW           CGPQPNSKLLINYGFVDE+N  DRL+VEAALNTEDPQYQDKR
Sbjct: 310 GDPIVVW-----------CGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKR 358

Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 439
           MVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCM
Sbjct: 359 MVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCM 418

Query: 440 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 499
           ERA LDQLADYFKARLAGYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T
Sbjct: 419 ERAALDQLADYFKARLAGYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQAT 478

Query: 500 ADMIMLLPDVTVSPCPAPYAPLL 522
            D I  LPD T+SPCPAPYAPLL
Sbjct: 479 TDFINQLPDHTISPCPAPYAPLL 501


>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Glycine max]
          Length = 503

 Score =  756 bits (1952), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/444 (81%), Positives = 396/444 (89%), Gaps = 12/444 (2%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           DLKSWMHK+GLPPCKV+LK+KP  N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70  DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
           LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
           G+ IVVW           CGPQPNSKLLINYGFVDE+N  DRL+VEAALNTEDPQYQDKR
Sbjct: 310 GDPIVVW-----------CGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKR 358

Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 439
           MVAQRNGKLSVQVFHV+AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCM
Sbjct: 359 MVAQRNGKLSVQVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCM 418

Query: 440 ERAVLDQLADYFKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQV 498
           ERA LDQLADYFKARLAGYP  L+EDE+MLTD  NL+PKKRVATQ VR+EKKML+ACLQ 
Sbjct: 419 ERAALDQLADYFKARLAGYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQA 478

Query: 499 TADMIMLLPDVTVSPCPAPYAPLL 522
           T D I  LPD T+SPCPAPYAPLL
Sbjct: 479 TIDFINQLPDHTISPCPAPYAPLL 502


>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
          Length = 507

 Score =  716 bits (1849), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/487 (71%), Positives = 403/487 (82%), Gaps = 18/487 (3%)

Query: 42  SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
           S +RL  R   F + V S+DTL A S      ++ +  KEE++ DLKSWMH++GLPPC+V
Sbjct: 32  SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90

Query: 96  ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
           +LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91  MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS  EL Y TGSP K 
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            +LER  GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ 
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQN 270

Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
           V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVW         
Sbjct: 271 VNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVW--------- 321

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
             CGPQPN++LL+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F++
Sbjct: 322 --CGPQPNARLLLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNI 379

Query: 396 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
           + GREKEA+ DMLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RL
Sbjct: 380 YIGREKEAVLDMLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERL 439

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 515
           AGYP T SEDE +L D   +PK++VATQLV +EKKMLN+CL    ++I  LPD+ V+PCP
Sbjct: 440 AGYPTTASEDEIVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCP 499

Query: 516 APYAPLL 522
           +PY+P+L
Sbjct: 500 SPYSPIL 506


>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
          Length = 537

 Score =  690 bits (1780), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/500 (71%), Positives = 392/500 (78%), Gaps = 45/500 (9%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  +   S+ VSSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVA           +V SKKE +D  DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54  DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL------------------ 273
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHL                  
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
           Q V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVW       
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVW------- 346

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
               CGPQPN+KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF
Sbjct: 347 ----CGPQPNAKLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVF 402

Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
            V  G+E+EA+ DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  
Sbjct: 403 QVRVGKEREAVQDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMR 462

Query: 454 RLAGYPATLSEDEAMLTDYN 473
           RL+GYP T  ED+A+    N
Sbjct: 463 RLSGYPTTPKEDDALEASCN 482


>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
          Length = 505

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/445 (71%), Positives = 369/445 (82%), Gaps = 14/445 (3%)

Query: 81  LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
              W+ ++GLPP KV + ++P       K  P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71  FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
           IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 251 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 310

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
           KAGE I+VW           CGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+
Sbjct: 311 KAGEPIIVW-----------CGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQE 359

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
           KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSP
Sbjct: 360 KRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSP 419

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           C ERAVLDQL  Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ
Sbjct: 420 CTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQ 479

Query: 498 VTADMIMLLPDVTVSPCPAPYAPLL 522
              + I  LPD TVSPCPAP+AP L
Sbjct: 480 AANEFINDLPDHTVSPCPAPFAPEL 504


>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 516

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/443 (72%), Positives = 366/443 (82%), Gaps = 14/443 (3%)

Query: 84  WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           W+  +GLPP KV + E+P   S   K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85  WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
           +E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFK
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAG
Sbjct: 265 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAG 324

Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
           E I+VW           CGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRM
Sbjct: 325 EPIIVW-----------CGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRM 373

Query: 381 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 440
           VAQRNGKL++Q F V  G+EKE IS+MLPYLRLGY+SD  EMQ ++SS G  CPVSPC E
Sbjct: 374 VAQRNGKLAIQKFQVCVGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSE 433

Query: 441 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
           RAVLDQL  Y K+RLAGYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ   
Sbjct: 434 RAVLDQLVVYLKSRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAH 493

Query: 501 DMIMLLPDVTVSPCPAPYAPLLN 523
           + I  LPD TVSPCPA YAP L 
Sbjct: 494 EFISALPDHTVSPCPALYAPNLK 516


>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
 gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
          Length = 509

 Score =  669 bits (1726), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/445 (71%), Positives = 369/445 (82%), Gaps = 14/445 (3%)

Query: 81  LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
              W+ ++GLPP KV + ++P       K  P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75  FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
           IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 255 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 314

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
           KAGE I+VW           CGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+
Sbjct: 315 KAGEPIIVW-----------CGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQE 363

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
           KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSP
Sbjct: 364 KRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSP 423

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           C ERAVLDQL  Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ
Sbjct: 424 CTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQ 483

Query: 498 VTADMIMLLPDVTVSPCPAPYAPLL 522
              + I  LPD TVSPCPAP+AP L
Sbjct: 484 AANEFINDLPDHTVSPCPAPFAPEL 508


>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 523

 Score =  669 bits (1725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/530 (63%), Positives = 395/530 (74%), Gaps = 34/530 (6%)

Query: 8   RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
           RSS+  +PP     +   HH L +   +     R P  GS     R  +R  +    +DT
Sbjct: 14  RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64

Query: 63  LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
           L++ S                 G    W+  NGLPP K+ + E+P   S   + RP+H+V
Sbjct: 65  LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
           AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ 
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           +WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 304

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
           Y S CKAML AVD +V+L+VDRPYKAGE I+VW           CGPQPNS+LL+NYGFV
Sbjct: 305 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVW-----------CGPQPNSRLLLNYGFV 353

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
           DEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V  G+EK+ IS+MLPYLRL
Sbjct: 354 DEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRL 413

Query: 414 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 473
           GY+SD  EMQ ++SS G  CPVSPC ERAVLDQL  Y K+RLAGYP  L EDEAML D +
Sbjct: 414 GYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGS 473

Query: 474 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
           L PKK VAT+LVR+EKKML+ CL+   + I  LPD TVSPCPA YAP L 
Sbjct: 474 LEPKKEVATRLVRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523


>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
 gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
          Length = 512

 Score =  664 bits (1713), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/442 (71%), Positives = 367/442 (83%), Gaps = 14/442 (3%)

Query: 84  WMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           W+   GLPP KV ++E+P     N K  P+ YVAA  DLQAGD AF VP SLVVTLERVL
Sbjct: 81  WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
           +E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFK
Sbjct: 201 TESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFK 260

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAG
Sbjct: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAG 320

Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
           E I++W           CGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRM
Sbjct: 321 EPIIIW-----------CGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRM 369

Query: 381 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 440
           VAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY+SD  EMQS++SS G  CP+SPC E
Sbjct: 370 VAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTE 429

Query: 441 RAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
           RAVLDQL  Y ++RLAGYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T 
Sbjct: 430 RAVLDQLVGYLESRLAGYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATN 489

Query: 501 DMIMLLPDVTVSPCPAPYAPLL 522
           + I  LPD TVSPCPAPYAP L
Sbjct: 490 EFINDLPDHTVSPCPAPYAPEL 511


>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 512

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/468 (69%), Positives = 374/468 (79%), Gaps = 14/468 (2%)

Query: 58  SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
           SSS+   A    V    E        W+   GLPP KV ++E+P     + K +P+ YV+
Sbjct: 55  SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A  DLQAGD AF V  SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTY 294

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
            S CKAML A  D+V+LVVDRPYKAGE I++W           CGPQ NS+L++NYGFVD
Sbjct: 295 RSNCKAMLTADGDSVRLVVDRPYKAGEPIIIW-----------CGPQTNSRLVLNYGFVD 343

Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 414
           EDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLG
Sbjct: 344 EDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLG 403

Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNL 474
           Y+S+  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP TL EDEAML D NL
Sbjct: 404 YISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNL 463

Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 522
            PKK VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPAPYAP L
Sbjct: 464 EPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511


>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
          Length = 536

 Score =  639 bits (1649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/405 (75%), Positives = 345/405 (85%), Gaps = 11/405 (2%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
           LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
           GSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S C
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNC 322

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
           KAML AV D+V+LVVDRPYKAGE I+VW           CGPQPNS+LL+NYGF+DEDNP
Sbjct: 323 KAMLTAVGDSVRLVVDRPYKAGEPIIVW-----------CGPQPNSRLLLNYGFIDEDNP 371

Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
           YDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY+SD
Sbjct: 372 YDRIVIEASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISD 431

Query: 419 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 478
             EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLA YP TL ED+AML D NL PKK
Sbjct: 432 PDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKK 491

Query: 479 RVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 523
            VAT+LVR+EKK+L+ CLQ   + I  LPD TVSPCPAP+AP L 
Sbjct: 492 EVATRLVRLEKKLLHGCLQAANEFINDLPDHTVSPCPAPFAPELK 536


>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 523

 Score =  598 bits (1542), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 305/492 (61%), Positives = 361/492 (73%), Gaps = 21/492 (4%)

Query: 38  PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
           P FG+    V  + R S   ++  T V    E     SKK+E     DLK WM + GLP 
Sbjct: 45  PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104

Query: 93  CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           CKV L E +PS  +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
           NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  EL  Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224

Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
           S  K  +LER  GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCV
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
           VHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VW    
Sbjct: 285 VHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVW---- 340

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
                  CGPQPNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++
Sbjct: 341 -------CGPQPNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTI 393

Query: 391 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 450
           Q F ++ G+E EA+ DMLPY+RL +++D  EM++V  + GP+CPVS C ERAVL+QL  Y
Sbjct: 394 QTFQIYKGKEMEAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQY 453

Query: 451 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 510
           F+ RLAGY ++ + +     D   + KKRVA +L+ +EK +L   L    ++I  LPD  
Sbjct: 454 FEKRLAGYKSSHATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSA 510

Query: 511 VSPCPAPYAPLL 522
           +SPC  PY P L
Sbjct: 511 ISPCIGPYLPNL 522


>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
 gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
          Length = 432

 Score =  585 bits (1508), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 293/443 (66%), Positives = 344/443 (77%), Gaps = 18/443 (4%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           WM + GLPPCKV LKE+  +    + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3   WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  
Sbjct: 60  TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119

Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           EL  Y TGS  K  +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A  D V+L VDR YK GE 
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 382
           I+VW           CGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ 
Sbjct: 240 ILVW-----------CGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIV 288

Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
           Q+N +L++Q F +  GREKEA+ +MLPYLRLG+VSD   M+SV S+ GP CPVS C ERA
Sbjct: 289 QKNNRLTIQNFQIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERA 348

Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTA 500
           VLDQLA YF+ R+A Y  T+ ED A+L D   +++PK+RVATQL+ +EK++L+  L V  
Sbjct: 349 VLDQLAQYFQERIAKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVN 408

Query: 501 DMIMLLPDVTVS-PCPAPYAPLL 522
                LPD +V+ PC   + P L
Sbjct: 409 GFRNQLPDGSVAPPCCGDFVPKL 431


>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
 gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
          Length = 432

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 293/443 (66%), Positives = 344/443 (77%), Gaps = 18/443 (4%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           WM + GLPPCKV LKE+  +    + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3   WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  
Sbjct: 60  TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119

Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           EL  Y TGS  K  +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A  D V+L VDR YK GE 
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 382
           I+VW           CGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ 
Sbjct: 240 ILVW-----------CGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIV 288

Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
           Q+N +L++Q F +  GREKEA+ +MLPYLRLG+VSD   M+SV S+ GP CPVS C ERA
Sbjct: 289 QKNNRLTIQNFQIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERA 348

Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTA 500
           VLDQLA YF+ R+A Y  T+ ED A+L D   +++PK+RVATQL+ +EK++L+  L V  
Sbjct: 349 VLDQLAQYFQERIAKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVN 408

Query: 501 DMIMLLPDVTVS-PCPAPYAPLL 522
                LPD +V+ PC   + P L
Sbjct: 409 GFRNQLPDGSVAPPCCGDFVPKL 431


>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 427

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 281/440 (63%), Positives = 330/440 (75%), Gaps = 16/440 (3%)

Query: 85  MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           M + GLP C V L E + +  +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1   MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  
Sbjct: 61  TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120

Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           EL  Y TGS  K  +LER  GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQA
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           FVAVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD  YK G+ 
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDP 240

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 382
           I VW           CGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V 
Sbjct: 241 IAVW-----------CGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVV 289

Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
           Q+N +L++Q F ++ G+E EA+ DMLPY+RLG+++D  E+++V  +  P+C VS C ERA
Sbjct: 290 QKNNRLTIQTFQIYKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERA 349

Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADM 502
           VL+Q+  +F+ RLAGY    S D     D     K+ VA +L+ +EK +L   L    ++
Sbjct: 350 VLNQIEHFFERRLAGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHEL 406

Query: 503 IMLLPDVTVSPCPAPYAPLL 522
           I  LPD  +SPC  PY P L
Sbjct: 407 IRELPDGAISPCIGPYLPNL 426


>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 425

 Score =  546 bits (1407), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 274/437 (62%), Positives = 326/437 (74%), Gaps = 34/437 (7%)

Query: 8   RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
           RSS+  +PP     +   HH L +   +     R P  GS     R  +R  +    +DT
Sbjct: 9   RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59

Query: 63  LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
           L++ S                 G    W+  NGLPP K+ + E+P   S   + RP+H+V
Sbjct: 60  LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
           AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ 
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           +WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 299

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
           Y S CKAML AVD +V+L+VDRPYKAGE I+VW           CGPQPNS+LL+NYGFV
Sbjct: 300 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVW-----------CGPQPNSRLLLNYGFV 348

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
           DEDNPYDR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V  G+EK+ IS+MLPYLRL
Sbjct: 349 DEDNPYDRIAIEASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRL 408

Query: 414 GYVSDTSEMQSVISSLG 430
           GY+SD  EMQ ++SS G
Sbjct: 409 GYISDPDEMQCILSSEG 425


>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 197/383 (51%), Positives = 264/383 (68%), Gaps = 20/383 (5%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+ DLQAG+ A  +P+ LV+TL+RV  +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49  AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
            W  +I+ELDR +GRGQ+  +SPLLW E ++  YL GSP  AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           VWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFALVPLGPPLL+
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFALVPLGPPLLS 228

Query: 294 YSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
           YSS  KAML    +A  VQL VDR Y  GE I  W           CGPQPN +LL+NYG
Sbjct: 229 YSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAW-----------CGPQPNRRLLLNYG 277

Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
            V ++NP+D++ +   L   DP +Q KR V Q+N   + Q F +   R+K     +LPYL
Sbjct: 278 IVTDNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYL 335

Query: 412 RLGYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
           RL + +D + ++  +++L   C  P+SP  ER VL QLA + + RL  Y  T  EDE ++
Sbjct: 336 RLAHCTDAASLK--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVII 393

Query: 470 TDYNLHPKKRVATQLVRMEKKML 492
                 P+++VA +L+R+EK +L
Sbjct: 394 RSTTAGPRQKVAARLLRIEKAIL 416


>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
 gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
          Length = 348

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 174/190 (91%), Positives = 182/190 (95%)

Query: 85  MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
           MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1   MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           + ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61  VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120

Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180

Query: 265 AVQSCVVHLQ 274
           A+QSCVVHLQ
Sbjct: 181 AIQSCVVHLQ 190



 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 130/159 (81%), Positives = 148/159 (93%)

Query: 364 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 423
           ++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248

Query: 424 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 483
           SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308

Query: 484 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 522
           LVR+EKK+LNACLQ TAD+I  LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPIL 347


>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
 gi|194693232|gb|ACF80700.1| unknown [Zea mays]
 gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
 gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 252

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 173/253 (68%), Positives = 206/253 (81%), Gaps = 11/253 (4%)

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
           ++  QKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++W   
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIW--- 66

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 389
                   CGPQ NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL+
Sbjct: 67  --------CGPQTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLA 118

Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
           +Q F+V+ G+EK+ +++MLPYLRLGY+S+  EMQS++SS G  CPVSPC ERAVLDQL  
Sbjct: 119 IQNFNVYVGKEKQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVG 178

Query: 450 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 509
           Y ++RLAGYP TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I  LPD 
Sbjct: 179 YLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDH 238

Query: 510 TVSPCPAPYAPLL 522
           TVSPCPAPYAP L
Sbjct: 239 TVSPCPAPYAPEL 251


>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
          Length = 435

 Score =  356 bits (913), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 191/415 (46%), Positives = 262/415 (63%), Gaps = 20/415 (4%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           +  W+ ++G P  KV L+       +   +    A+E LQ GD A  +P  L+VTL+RVL
Sbjct: 9   MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
            + T+AEL+TT KLSELACL LYL YEKK+GK+  W  +I+ELDR +GRG    +SPLLW
Sbjct: 66  EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125

Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
            E + A  L GSP   EI  R +GI++EY ELDTVW++AGSLF + P+  PTE F+F +F
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 317
           +QAF AVQS VVHLQ V+L +RFALVP+GPPLL YSS  KAML    ++  V+L VDR Y
Sbjct: 186 RQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAY 245

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
           + GE+++ W           CGPQPNS+LLINYG VDE NPYD+L +   + ++DP Y+ 
Sbjct: 246 QPGEAVLAW-----------CGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRL 294

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
           KR      G  + Q F + A     A   +LPYLRL + +  ++++ V        PV+P
Sbjct: 295 KRDRLAERGLSTQQTFQLQAAASLPA--QLLPYLRLVHSTREADVEGVKWE-EEAGPVAP 351

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
             E  VL+QL  + + R + Y  T+ EDEA++ D    P+  VA +L+++EK +L
Sbjct: 352 ENELTVLNQLITHLRLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406


>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
 gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 252

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 166/251 (66%), Positives = 203/251 (80%), Gaps = 11/251 (4%)

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++W   
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIW--- 66

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 389
                   CGPQ NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK +
Sbjct: 67  --------CGPQTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHA 118

Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
           +Q F+V+ G+EK+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  
Sbjct: 119 IQNFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGG 178

Query: 450 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 509
           Y ++RLAGYP TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I  LPD 
Sbjct: 179 YLESRLAGYPTTLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDH 238

Query: 510 TVSPCPAPYAP 520
           TVSPCPA YAP
Sbjct: 239 TVSPCPAQYAP 249


>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
 gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
          Length = 488

 Score =  338 bits (868), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 192/419 (45%), Positives = 255/419 (60%), Gaps = 25/419 (5%)

Query: 80  DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           +L  W+ +NG  +   +V   + PS     RP+  V A   L AG+ A SVP  L +TL+
Sbjct: 43  ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           R+  +E +AELLTT+KLSELACLALYLMYEKK  KKSFW PYI+ELD+Q+ RG  A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159

Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           LLW + EL + L GSP    + +R  GI++EY  LDTVWFMAGSLF +YP+D+PTE F+F
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVD 314
           E+F+QAF  VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS  K M+   +D  +V+LVV 
Sbjct: 220 ELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVS 279

Query: 315 RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTED 372
            P +AG  +  W           CGPQPNS+LL+NYG VDE NP+D+L       L T D
Sbjct: 280 GPVEAGRPVAAW-----------CGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSD 328

Query: 373 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI 432
           P +  KR V    G  + Q F V   R       +LPY+ L   +   ++ SV  S    
Sbjct: 329 PLFPAKRAVLSEAGLATQQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDT 384

Query: 433 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 491
                 +E A L  L  Y + R A Y   L  D  ++ D +  P+++VA +L ++EK +
Sbjct: 385 AGHDRELEAAALAALMAYVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443


>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 281

 Score =  338 bits (868), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 166/280 (59%), Positives = 203/280 (72%), Gaps = 40/280 (14%)

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++W   
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIW--- 66

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 389
                   CGPQ NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK +
Sbjct: 67  --------CGPQTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHA 118

Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
           +Q F+V+ G+EK+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  
Sbjct: 119 IQNFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGG 178

Query: 450 YFKARLAGYPATLSEDEAM-----------------------------LTDYNLHPKKRV 480
           Y ++RLAGYP TL+EDEAM                             L D +L PK+ V
Sbjct: 179 YLESRLAGYPTTLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEV 238

Query: 481 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 520
           AT+LVR+EKKML+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 239 ATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 278


>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
          Length = 130

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 108/128 (84%), Positives = 117/128 (91%)

Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
           ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL 
Sbjct: 1   ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
           YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61  YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120

Query: 267 QSCVVHLQ 274
           QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128


>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 638

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 203/455 (44%), Gaps = 49/455 (10%)

Query: 45  RLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHN 104
           R V  + R  +R  +  ++   S + ++     L  L  W+ K G P   VIL       
Sbjct: 54  RTVWNEGRRGLRGVARCSMSGNSMQSMA-----LHQLSEWLSKQGFPTQDVILT---GFG 105

Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           E+   +   AA  D + G+ A  +P +  VT   V+ +  +A         ++  L L+L
Sbjct: 106 EEGVGL---AAGRDFKEGEVALKIPENYTVTGVDVVNHPVVAAPAAGR--GDVIGLTLWL 160

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPTKAEILERAEG 223
           MYE+  G+KS W PY++              SP+LW+ E +   L GSP   E+ +R+  
Sbjct: 161 MYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPALEEVQQRSAA 213

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
           ++ EY +L        S F + P   P E F+ E FK AF  + S  V+L    L   FA
Sbjct: 214 LEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADL---FA 263

Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
           LVP    L  + +  +A L  +  D AV   VDR YK GE +   Y     G +     +
Sbjct: 264 LVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSY-----GRE-----R 312

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
            N+ LLI YGFVDE+N  D L +E  L   D     K+ + Q+    S Q F ++  R  
Sbjct: 313 SNADLLITYGFVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP 372

Query: 402 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 461
              + +L Y+RL  + D +    ++     +  +    E   L  L    + +L  Y   
Sbjct: 373 ---TQLLTYMRLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGG 427

Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
           + ++  +L +  +  ++RVA QL   EKK+L + +
Sbjct: 428 VDDEIRLLKNKKISQRERVAAQLRLCEKKILTSTM 462


>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Ectocarpus siliculosus]
          Length = 521

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 123/444 (27%), Positives = 198/444 (44%), Gaps = 50/444 (11%)

Query: 81  LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
           LK WM +NG+           P    +  +    NE       + A+ +++ GD  F++P
Sbjct: 92  LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
             L++T +     E  A+++T + LSE   +AL  ++EK +GK+SFW  YI  L      
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
               V    LW+E +LA L GSP  A        ++ EY  ++        L  ++P  +
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEIL 258

Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------PPLLAY-SSKCKAML 302
           P E  T+E F+ AF  + S  + L  +S     ALVP        P   +Y  ++ + + 
Sbjct: 259 PREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLF 318

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
            +  D V +  DR YK  E + + Y           GP+ NS LL+ YGF  + NPY+ +
Sbjct: 319 FSKTDEVVVYADRSYKKMEQVYISY-----------GPKGNSDLLLLYGFSLDRNPYNSV 367

Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 422
            V  +L+  D  Y+ K+      G    + F ++  R  +   ++L YLRL  ++     
Sbjct: 368 DVTVSLDENDELYERKKAFLSEAGLPPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLR 424

Query: 423 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YNLHPKK 478
              +  L      +   E  VLD L +  KA +AGYP T  +D  ++ D      L   +
Sbjct: 425 GRTLEDLSFEKKQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQ 484

Query: 479 RVATQLVRMEKKMLNACL-QVTAD 501
           R+A +  R EK +L   +  VT D
Sbjct: 485 RMAVKHRRQEKVILRRTIAAVTKD 508


>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
 gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
          Length = 481

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 125/419 (29%), Positives = 187/419 (44%), Gaps = 44/419 (10%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           D+  W+ + G P   +++    S  +K        A+ DLQAGDAA S+P +  VT   V
Sbjct: 49  DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
             +  I+         EL  LAL+LMYE+++ + S W PY++           +  SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153

Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFE 257
           W + E   L  GS   A++ ++   +++ ++ L DT+         +   D P E FTF 
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFS 204

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
            FK AF  V S  V+L    L   FALVP G  +   SS+        +  V+L VD+ Y
Sbjct: 205 AFKAAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRY 261

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
           K G+ +   Y  + T  DF          LI YGF+DE +  D + +E  L + D     
Sbjct: 262 KKGDQVFASYAQNLTSADF----------LIRYGFLDESDENDFIEIEVGLVSGDSLAPL 311

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
           KR + Q  G    Q F V+  R     + +L Y RL  + D+     +      I  V  
Sbjct: 312 KREILQEVGLTVPQKFPVYLNRFP---TQLLTYTRLARIQDSGLFAKITFEKDLI--VCQ 366

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
             E   L  L    + +L  +  T+ +D   L   NL  K+RVA QL   EK++L   +
Sbjct: 367 TNEYETLMLLMADCRTKLLSFSDTMEDDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425


>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
 gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
          Length = 600

 Score =  125 bits (314), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 182/415 (43%), Gaps = 42/415 (10%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           D+  W+ + G P   +++    S  +K        A+ DLQAGDAA S+P +  VT   V
Sbjct: 49  DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
             +  I+         EL  LAL+LMYE+++ + S W PY++            +   L 
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
             E +   L GS   A++ ++   +++ ++ L DT+         +   D P E FTF  
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSA 205

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
           FK AF  V S  V+L    L   FALVP G  +   SS+        +  V+L VD+ YK
Sbjct: 206 FKTAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYK 262

Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
            G+ +   Y  + T  DF          LI YGF+DE +  D + +E  L + D     K
Sbjct: 263 KGDQVFASYAQNLTSADF----------LIRYGFLDESDENDCIEIEVGLVSGDSLAPLK 312

Query: 379 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPC 438
           R + Q  G    Q F ++  R     + +L Y RL  + D+     +      I  VS  
Sbjct: 313 REILQEVGLTVPQKFPLYLNR---FPTQLLTYTRLARIQDSGLFAKITFEKDLI--VSQT 367

Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            E   L  L    + +L     T+ ++   L   NL  K+RVA QL   EK++L 
Sbjct: 368 NEYETLMLLMADCRTKLLSSSDTMEDEMQTLRRKNLSYKQRVAAQLRLKEKRILT 422


>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Galdieria sulphuraria]
          Length = 490

 Score =  125 bits (314), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 120/462 (25%), Positives = 209/462 (45%), Gaps = 63/462 (13%)

Query: 56  RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
           R S + +  +G   V      ++     W+ +NG+           P ++++ E+ + +E
Sbjct: 58  RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116

Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
                  + +S  +  G+    +P  L+ T  R L  ET    +  +   E   + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
           YEK +G  SF+ PY+  L        L   +PL LWS  +L  L GSPT +   +  + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
            REY       ++  ++  Q P +  ++   F+ F+ AF  + S  +       ++R AL
Sbjct: 229 LREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIAL 278

Query: 285 VPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
           VP    LL +S  C A +          V +AV + VDR Y+  E + V Y         
Sbjct: 279 VPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSY--------- 327

Query: 337 KCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 396
             GP+ N +LL+ YGF  E NP+D + +   L+  DP Y +K  + +  GK  +Q F ++
Sbjct: 328 --GPRSNQELLLLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLY 385

Query: 397 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 456
             R      +M  +LR   +   +++Q+   ++     VS   E + LD+L +Y   +L 
Sbjct: 386 MDRYP---VEMAEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLR 437

Query: 457 GYPATLSEDEAMLTD----YNLHPKKRVATQLVRMEKKMLNA 494
            YP +L +DE ++ D      L   +R+A +    EK++L+A
Sbjct: 438 KYPTSLEDDEKIIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479


>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 504

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 153/326 (46%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    VP SLV++
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P+  P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L  
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K    +++G  + Q F V 
Sbjct: 339 NDKCYKEKLDALKKHGLSTPQCFPVR 364


>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 504

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 152/326 (46%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    VP SLV++
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P   P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L  
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K    +++G  + Q F V 
Sbjct: 339 NDKCYEEKLDALKKHGLSTPQCFPVR 364


>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
          Length = 504

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 152/326 (46%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    VP SLV++
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P   P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFT 289

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L  
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K    +++G  + Q F V 
Sbjct: 339 NDKCYEEKLDALKKHGLSTPQCFPVR 364


>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
          Length = 504

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 150/326 (46%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    V  SLV+ 
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P   P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L  
Sbjct: 290 TDRPYQPGEQVFISY-----------GNKSNGELLLSYGFVPREGTNPSDSVELALSLRK 338

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K    +++G  + Q F V 
Sbjct: 339 NDKCYEEKLDALKKHGLSTPQCFPVR 364


>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
 gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 94/323 (29%), Positives = 146/323 (45%), Gaps = 38/323 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+  +GLPP K+ + +K    E+      + A ++++ G+    VP SLV+ 
Sbjct: 71  ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL  YL  S  +   +ER   +   YN+L         +F +YP+  P E F
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVF 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  R ALVP    +L +SS+ +  L        V   
Sbjct: 232 NMETFKWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSSEVETFLDYDKSSKGVVFT 288

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L  
Sbjct: 289 TDRPYQPGEQVFISY-----------GRKSNGELLLSYGFVPREGTNPSDSVELSLSLKK 337

Query: 371 EDPQYQDKRMVAQRNGKLSVQVF 393
            D  Y++K    +++G    Q F
Sbjct: 338 SDKCYKEKLEALKKHGLSVSQCF 360


>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 575

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 104/417 (24%), Positives = 185/417 (44%), Gaps = 65/417 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
           A  D+  GD    +P +L +T  +    + + + +  ++++E   +A +L+YE+  +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188

Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           S W PY+    ++D         V     W + +LA+L GSP  A        ++REY+ 
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240

Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           L     + G   L  +YP   P EAF F+ ++ AF  + S  + L+ +      ALVP  
Sbjct: 241 L-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYA 295

Query: 289 PPLLAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
             L+ +S   +A + A           D+ V L  DR Y+  E I + Y           
Sbjct: 296 -DLINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISY----------- 343

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQ 383
           GP+ N++LL+ YGF  E NP++ + V  ++                  DP  ++K    +
Sbjct: 344 GPKSNAELLLLYGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLE 403

Query: 384 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
           + G+ +   F  +A R      +ML YLRL  ++        ++       +S   E AV
Sbjct: 404 QVGRDATVDFPCYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAV 460

Query: 444 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 496
           L  +      +L+ YP +  +D A++ D +L       +R+A +  R EK++L   +
Sbjct: 461 LTSVITAVSRQLSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517


>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 499

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 148/326 (45%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLPP K+ + E+    E+      + A ++++ G+    VP SLV+T
Sbjct: 67  ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    + S W  YI  L RQ         
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W++ EL  YL  S  +   +ER   +   YN+L         +F +YP   P E F
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVF 227

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+    ALVP    +L +S   +  L        +   
Sbjct: 228 NIESFKWSFGILFSRLVRLP--SMGGNVALVPWA-DMLNHSCDVETFLDYDKTSKGIVFT 284

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV ++  NP D + +  +L  
Sbjct: 285 TDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGANPSDSVELSLSLKK 333

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K  + +  G  + Q F + 
Sbjct: 334 SDASYKEKLELLKNYGLSASQCFPIQ 359


>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 431

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 90/338 (26%), Positives = 155/338 (45%), Gaps = 42/338 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+ K GL   K++L    S          + A++ L+ G+    VP+ L++T +   
Sbjct: 16  LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G      ++    L E   LA++L+ E  + + S W PY   L +          S L W
Sbjct: 70  GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +E E+  +LT SP + + LE    +   Y +L        ++F ++P   P++ +T   F
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAF 176

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRP 316
           K AF  + S +V L  V    + ALVP    +L +S +  + L    +   +V  V DR 
Sbjct: 177 KWAFGILFSRLVRLPSVG---KLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRA 232

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQY 375
           Y++GE + + Y           G + + +L + YGF+  E N +D + +E  ++++DP +
Sbjct: 233 YQSGEQVFISY-----------GKRSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSF 281

Query: 376 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
           + K   A   G  S Q F V   R+    + +L Y RL
Sbjct: 282 EAKLRAANEQGLSSPQRFPV---RKDGFPAQLLAYARL 316


>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
 gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
 gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
          Length = 497

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 149/326 (45%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLP  K+ + +K    E+      + A  +++ G+    VP  LV+T
Sbjct: 65  ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L WS+ EL  YL  S  +   +ER   +   YN+      M   +F +YP   P E F
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 225

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
             E FK +F  + S +V L   S+  + ALVP    ++ +S + +  L     +  +V  
Sbjct: 226 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 282

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV ++  NP D + +  +L  
Sbjct: 283 TDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKK 331

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K  + ++ G    Q F + 
Sbjct: 332 SDESYKEKLELLKKYGLSGSQCFPIR 357


>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
 gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
          Length = 494

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 149/326 (45%), Gaps = 38/326 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLP  K+ + +K    E+      + A  +++ G+    VP  LV+T
Sbjct: 62  ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L WS+ EL  YL  S  +   +ER   +   YN+      M   +F +YP   P E F
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 222

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
             E FK +F  + S +V L   S+  + ALVP    ++ +S + +  L     +  +V  
Sbjct: 223 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 279

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DRPY+ GE + + Y           G + N +LL++YGFV ++  NP D + +  +L  
Sbjct: 280 TDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKK 328

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVH 396
            D  Y++K  + ++ G    Q F + 
Sbjct: 329 SDESYKEKLELLKKYGLSGSQCFPIR 354


>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 93/327 (28%), Positives = 152/327 (46%), Gaps = 40/327 (12%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+  +GLPP K+ + E+    E+      + A ++++ G+    VP SLV+T
Sbjct: 65  ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    + S W  YI  L RQ         
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL  YL  S  +   +ER   +   YN+L         +F ++P+  P E F
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVF 225

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
             E FK +F  + S +V L   S+  + ALVP    +L +S + +  L   D + Q VV 
Sbjct: 226 NMETFKWSFGILFSRLVRLP--SMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVF 281

Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 369
             DR Y+  E + + Y           G + N +LL++YGFV  +  NP D++ +  +L 
Sbjct: 282 TTDRTYQPSEQVFISY-----------GKKSNGELLLSYGFVPREGTNPNDKVELLLSLK 330

Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVH 396
             D  Y++K    +++G  + Q F + 
Sbjct: 331 KSDKCYKEKSEAMKKHGLSTSQCFPIQ 357


>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 508

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 95/322 (29%), Positives = 148/322 (45%), Gaps = 40/322 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+  NGLP  K+ + +K    E+      + A ++++ G+    VP SLV+T +   
Sbjct: 81  LQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 134

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
                 E+L    + +   LA+YL+ E    K S W  YI  L RQ         S L W
Sbjct: 135 SCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYW 187

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +  EL  YL  S  +   +ER   +   YN+L         +F +YP   P E F  E F
Sbjct: 188 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETF 241

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRP 316
           K +F  + S +V L   S+  + ALVP    +L +S + +  L   D + Q VV   DR 
Sbjct: 242 KWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQ 297

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 374
           Y+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L   D  
Sbjct: 298 YEPGEQVFISY-----------GKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKS 346

Query: 375 YQDKRMVAQRNGKLSVQVFHVH 396
           Y++K    +++G  + Q F V 
Sbjct: 347 YKEKLEALKKHGFSASQCFPVR 368


>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
 gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
          Length = 499

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 176/397 (44%), Gaps = 70/397 (17%)

Query: 9   SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
           S+  + PP+R P H    P S +S+ S                R + R  IR S++    
Sbjct: 4   STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47

Query: 65  AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
              RE  +        +  E    L+ W+  +GLP  ++ + ++    E+      + A 
Sbjct: 48  PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101

Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
           ++++ G+    VP SLV+T +   G   + E++  N + +   +A YL+ E      S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTV 234
             YI  L RQ         S L W+  EL AYL  SP +   ++R   +   YN+L D +
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           +     LF       P E +  E F  +F  + S +V L   S+  + ALVP    +L +
Sbjct: 215 FSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWA-DMLNH 264

Query: 295 SSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
           S + +  L   D + Q +V   DR Y+ GE + + Y           G + + +LL++YG
Sbjct: 265 SPEVETFL-DFDKSSQGIVFTTDRSYQPGEQVFISY-----------GKKSSGELLLSYG 312

Query: 352 FVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
           FV ++  NP D + +  +L+  D  Y++K    +RNG
Sbjct: 313 FVPKEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNG 349


>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 568

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 124/472 (26%), Positives = 195/472 (41%), Gaps = 81/472 (17%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           +DL  L+ W+ KNGL    +   E  ++  +      V A +D + G+    VP  L+ T
Sbjct: 66  DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122

Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
            +  + N  + +LL  +K    S   CLAL+L+ EK     SFW PYI+ L +  G    
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
              + L ++  EL  L GSPT    ++    +  +Y  +         LFQ     +   
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIH-------DLFQIRKDILHIN 226

Query: 253 AFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
           AFT++ F  A  AV   Q+ V      +L+  +AL+P            +       D+ 
Sbjct: 227 AFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YALIPAWDMCNHDHGDLQTFWDVNSDST 285

Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
           +    R YK GE + ++Y           GP+PNS LL++ GFV E+N +D L +   L 
Sbjct: 286 ESHAMRAYKKGEQVYIFY-----------GPRPNSDLLLHAGFVYENNRFDALAIRVRLA 334

Query: 370 TEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS 428
            +    +DK R++   N K+  Q +    G       D++ +LR+ +  +  E+Q V+ +
Sbjct: 335 PDAEHIKDKLRLLHLNNMKMDSQYYLYGLG----LAVDLMAFLRI-HAMNEQELQQVLGA 389

Query: 429 -------------------------LGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 463
                                      P   ++   E A L        + L+ YP TL 
Sbjct: 390 YDQQEAKVHNGEHPASNGEVVASGVFDPRVKLNDRNELAALQLAEAKCLSLLSLYPTTLQ 449

Query: 464 ---------EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 506
                    ED+A L   +L P  R  T L   EK++LN  L    D I LL
Sbjct: 450 VANGVELKQEDQAALRTTSLTPNMRAVTLLRLKEKEILNRTL----DAIRLL 497


>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
 gi|194699272|gb|ACF83720.1| unknown [Zea mays]
 gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 146/316 (46%), Gaps = 38/316 (12%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T
Sbjct: 71  ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +   G   + +++  N + +   +A YL+ E      S W+ YI  L RQ         
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL AYL  SP +   ++R   +   YN+L         +F ++P   P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E F  +F  + S +V L   S+  R ALVP    +L +S + +  L        +   
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DR Y+ GE + + Y           G + + +LL++YGFV ++  NP D + +  +L+ 
Sbjct: 289 TDRSYQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDK 337

Query: 371 EDPQYQDKRMVAQRNG 386
            D  Y++K    +RNG
Sbjct: 338 SDNCYKEKLQALKRNG 353


>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
           carboxylase/oxygenase large subunit N-methyltransferase,
           chloroplastic-like [Cucumis sativus]
          Length = 497

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/327 (28%), Positives = 151/327 (46%), Gaps = 40/327 (12%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLP  K+ ++     N   R +    A ++++ G+    VP SLV++
Sbjct: 65  ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            E         E+L  N + +   +A YL+ E    K S W  YI  L RQ         
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL  YL  S  +   +ER   +   YN+L         +F ++P   P E F
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVF 225

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
             E FK +F  + S +V L   S+  + ALVP    +L ++ + +  L   D A Q VV 
Sbjct: 226 NIETFKWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVF 281

Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 369
             DR Y+ GE + + Y           G + N +LL++YGFV ++  NP D + +  +L 
Sbjct: 282 TTDRAYQPGEQVFISY-----------GKKSNGELLLSYGFVPKEGSNPSDSVELLLSLK 330

Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVH 396
             D  Y++K    +++G  + Q F + 
Sbjct: 331 KSDKCYKEKLEALKKHGLRASQCFPIQ 357


>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
          Length = 648

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 107/425 (25%), Positives = 181/425 (42%), Gaps = 69/425 (16%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
           A   +  GD    +P  L +T +   R LG + + E      ++E   +A  L++EK  +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264

Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           G+ SF+  Y+    E+D         V     W + +LA+L GSP  A        ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316

Query: 228 YNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
           Y++L     + G   L  ++P   P E +TFE ++ AF  + S  + L+ + +  R A+V
Sbjct: 317 YDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMV 371

Query: 286 PLGPPLLAYSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
           P    L+ +S+  +A + A           ++ V L  DR Y+  E + + Y        
Sbjct: 372 PYA-DLINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISY-------- 422

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVA 382
              G + N++LL+ YGF  E NPY+ + V  ++               EDP   +K    
Sbjct: 423 ---GQKSNAELLLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFL 479

Query: 383 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA 442
              G+     F  +A R      +ML YLRL  ++        +S       +S   E +
Sbjct: 480 LSVGRDQTVDFPCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEAS 536

Query: 443 VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQV 498
           VL  +    K +L  +P T  +D A++ D  +       +R+A +  R EK++L   L  
Sbjct: 537 VLRSVVAAVKYQLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAA 596

Query: 499 TADMI 503
               I
Sbjct: 597 LEKQI 601


>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
          Length = 485

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 113/434 (26%), Positives = 190/434 (43%), Gaps = 51/434 (11%)

Query: 49  RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
           R  R   R+++   L+     AGS E+ +  E +  +LK+W+ + GLPP K+     P  
Sbjct: 26  RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83

Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
                    + A++ +  G++  S+P  LV+T    L    +  LL    L   + LAL+
Sbjct: 84  GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
           L  ++  G    W PY+R L  + G         L WSE E+ +L GS   ++ LE    
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
            +  + E+  V   A +  +   +     AF     + AF  + S +V L    L  + A
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEA 244

Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
           L+P    LL +     + L  +A + AV L  +R Y+AGE +++ Y           G +
Sbjct: 245 LLPWA-DLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISY-----------GQK 292

Query: 342 PNSKLLINYGFVDE--DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
            + +LL++YGF  +   NP+D  RL++E  L   D     K    +++G  + Q+F +  
Sbjct: 293 TSGELLLSYGFCPDLGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL-- 348

Query: 398 GREKEAISDMLPY--LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
            R   A  +++ Y       V    E + +   L     + P ++ A L+ +    KA L
Sbjct: 349 -RMAAAPFELVHYTAFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAAL 407

Query: 456 AGYPATLSEDEAML 469
           A YP +   D A L
Sbjct: 408 AAYPRSFDGDRAEL 421


>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 145/316 (45%), Gaps = 38/316 (12%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T
Sbjct: 71  ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +   G   + +++  N + +   +A YL+ E      S W+ YI  L RQ         
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL AYL  SP +   ++R   +   YN+L         +F ++P   P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E F  +F  + S +V L   S+  R  LVP    +L +S + +  L        +   
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNT 370
            DR Y+ GE + + Y           G + + +LL++YGFV ++  NP D + +  +L+ 
Sbjct: 289 TDRSYQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDK 337

Query: 371 EDPQYQDKRMVAQRNG 386
            D  Y++K    +RNG
Sbjct: 338 SDNCYKEKLQALKRNG 353


>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Spinacia oleracea]
          Length = 491

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 99/394 (25%), Positives = 174/394 (44%), Gaps = 46/394 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP    +  + V  +E  +     N L     +AL+LM EKK G 
Sbjct: 86  LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PYI  L         +  S + WSE EL+ L GS      L   E +  E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +     Q +P+D+  + F F  F        +C+         +   L+PL     
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCLE-------GQSLVLIPLADLAN 245

Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
             P + A  Y+ + + A L + +    L    P KAG+ +++ Y+++          + N
Sbjct: 246 HSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLN----------KSN 295

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
           ++L ++YG  +  +  +   +   +   D  Y DK  +A+ NG      F +    E+  
Sbjct: 296 AELALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPL 353

Query: 404 ISDMLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGY 458
            ++MLPYLRL  +   D   ++S+   S  G +  P+SP  E  +   + D   + L+GY
Sbjct: 354 PANMLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGY 413

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
             T++EDE +L + ++ P+  +A  +   EKK+L
Sbjct: 414 STTIAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 447


>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
           [Spinacia oleracea]
 gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase II [Spinacia oleracea]
          Length = 495

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 96/391 (24%), Positives = 176/391 (45%), Gaps = 36/391 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP    +  + V  +E  +     N L     +AL+LM EKK G 
Sbjct: 86  LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PYI  L         +  S + WSE EL+ L GS      L   E +  E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195

Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSLARRFALVP 286
               +     Q +P+D+  + F   F + + +AF  ++     ++ L  + + +      
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPD 253

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
           +  P  A+  +  A L + +    L    P KAG+ +++ Y+++          + N++L
Sbjct: 254 ITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLN----------KSNAEL 302

Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
            ++YG  +  +  +   +   +   D  Y DK  +A+ NG      F +    E+   ++
Sbjct: 303 ALDYGLTESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPAN 360

Query: 407 MLPYLRLGYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPAT 461
           MLPYLRL  +   D   ++S+   S  G +  P+SP  E  +   + D   + L+GY  T
Sbjct: 361 MLPYLRLVALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTT 420

Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           ++EDE +L + ++ P+  +A  +   EKK+L
Sbjct: 421 IAEDEKLLAEGDIDPRLEIAITIRLGEKKVL 451


>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 507

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 148/317 (46%), Gaps = 40/317 (12%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ L EK    E+      + A ++++ G+    VP +LV+T
Sbjct: 74  ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N  + +++    + +   LA YL+ E      S W  YI  L RQ         
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TE+ AYL  SP +   + R   +   YN+L         +F ++P   P + +
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRD------RIFSKHPDLFPEKVY 234

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
             E F+ +F  + S +V L+  S+  + ALVP    +L +S +  A L   D + Q +V 
Sbjct: 235 NMENFRWSFGILFSRLVRLE--SMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVF 290

Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALN 369
             DR Y+ GE + + Y           G + + +LL++YGFV ++  NP D +    +L 
Sbjct: 291 TTDRSYQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLK 339

Query: 370 TEDPQYQDKRMVAQRNG 386
             D  Y++K    +++G
Sbjct: 340 KSDECYKEKLQALKKHG 356


>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
 gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
          Length = 434

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 102/412 (24%), Positives = 172/412 (41%), Gaps = 57/412 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
           A   +  GD    +P  L +T  R    + + +    + ++E   +A  L++EK   G +
Sbjct: 50  ARRSINDGDELLKIPMDLCIT--RKSARKALGKDALQDGINEYLAIACQLIHEKYVLGDE 107

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W  Y+  L          V     W + +LA+L GSP  A        ++REY+ L  
Sbjct: 108 SEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-- 160

Query: 234 VWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
              + G   L  ++P   P E FT+E +  AF  + S  + L+ + +  R A+VP    L
Sbjct: 161 ---LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYA-DL 216

Query: 292 LAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
           + +S+   A + A           ++ V L  DR Y+  E + + Y           G +
Sbjct: 217 INHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISY-----------GQK 265

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTE-------------DPQYQDKRMVAQRNGKL 388
            N++LL+ YGF  E NPY+ + V  ++                DP  Q+K       G+ 
Sbjct: 266 SNAELLLLYGFALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVEFLASVGRD 325

Query: 389 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 448
               F  +A R      +ML +LRL  ++        ++       +SP  E AVL  + 
Sbjct: 326 QTVDFPCYADRYP---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTISPANEAAVLSSVV 382

Query: 449 DYFKARLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 496
           +  K +L  YP +  +D  ++ D  L       +R+A +  R EK++L   L
Sbjct: 383 EAVKYQLNLYPQSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLLKRTL 434


>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
 gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
          Length = 467

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 83/310 (26%), Positives = 137/310 (44%), Gaps = 37/310 (11%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+ + GLP  KV LK   +          + +   L  GD    +P +L +T E   
Sbjct: 41  LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
                 +++    L E   LA YL+ E   GK S W PYI  L R+ G       S LLW
Sbjct: 95  ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +  ++ A+L+ +  K   L+    ++  +N+L+   FM          + P E F  E F
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESF 201

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
           K AF  + S +V L   SL ++ AL+P G  +L + ++    L   +   ++   +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQ 376
           ++ + + + Y           G + N +LL+ YGFV    N  D + +   L+  D  Y+
Sbjct: 259 ESNKEVFISY-----------GKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYE 307

Query: 377 DKRMVAQRNG 386
            K    + +G
Sbjct: 308 AKLGALKEHG 317


>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
 gi|223949395|gb|ACN28781.1| unknown [Zea mays]
 gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 489

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 108/397 (27%), Positives = 170/397 (42%), Gaps = 47/397 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ DL  G+    VP  L +  + V  ++          L     +AL L+ E  +G 
Sbjct: 84  LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--NE 230
            S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+K EY  +E
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGVK-EYVQSE 191

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPL 287
            D+V     S  +    D+   + TF+ F  AF  ++S V   +   K++L     LV  
Sbjct: 192 FDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALIPFADLVNH 247

Query: 288 GPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
            P + +  S    K K +    +    L      K+G+ I + Y++           + N
Sbjct: 248 SPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLD----------KSN 296

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
           ++L ++YGFV+ +   D   V   ++  DP Y DK  +A+ NG      F V     +  
Sbjct: 297 AELALDYGFVESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPL 354

Query: 404 ISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLA 456
              MLPYLRL  +  T           SV   L    P+SP  E ++   + D  K+ LA
Sbjct: 355 PPQMLPYLRLLCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALA 412

Query: 457 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            Y  T+ EDE +    NL P+  +A  +   EKK+L 
Sbjct: 413 DYHTTIEEDEELSGRENLQPRLAIAIGVRAGEKKVLQ 449


>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
 gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
          Length = 445

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 111/444 (25%), Positives = 181/444 (40%), Gaps = 56/444 (12%)

Query: 68  REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           R      E+ L  L  WM + G L P   I+++             V A  D+  G+   
Sbjct: 2   RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54

Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
            +P + + TLER     +G    ++L   N   +   LA +L+ EK +G  SFW P++  
Sbjct: 55  QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
           L       +     PL +SE E A + GS      LER   ++R+  E +         +
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEVQRQSFEQE---------Y 151

Query: 243 QQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
            Q    +P  E F FE +  A +++ S +  L+        +LVPL   +  +      +
Sbjct: 152 AQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGP--SLVPLSD-MFNHRQPPDVL 208

Query: 302 LAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNP 358
            +  +D    +++  R   AG  I   Y           G + +   L++ GFV D +  
Sbjct: 209 WSTSEDGQTFRMIAQRAVPAGTEIHTHY-----------GAKSSDVFLLHSGFVPDGNEE 257

Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
            D + +   L   DP    K+ +       +   F V    +  A   +  +LR+ + S 
Sbjct: 258 NDEVYLSVGLPPGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWSVFSFLRMAHASP 317

Query: 419 TSEM---QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
              +     ++S    I PVS   E  VL  LA   + RL  +P TL EDE +L +  L 
Sbjct: 318 DEFLALSNRLLSGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLEEDERLLREGPLS 377

Query: 476 PKKRVATQLVRMEKKMLNACLQVT 499
           P +R    L R EK++L   L++T
Sbjct: 378 PNERSCVLLRRQEKRLLGDYLELT 401


>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
 gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
          Length = 505

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 109/421 (25%), Positives = 174/421 (41%), Gaps = 78/421 (18%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           DL  W+  NGL   K+ L+   +          + A+E+++ G+A   V  S ++T+ER 
Sbjct: 66  DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
           +    +       +L E + LA +L  +    + G    +  YIR L R+ G       S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170

Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            L W E E+   L GSP++    ER E +     E+ +           +P DI   A  
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALR 219

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
           +     AF  + S ++ L   ++    ALVP    +L +   C A +     AV L  DR
Sbjct: 220 W-----AFDILFSRLIRLD--AMGGELALVPWAD-MLNHKPGCAAFIDLNGSAVNLTTDR 271

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDP 373
            Y AGE +   Y           G +P+S+LLI+YGF  E  +NP D   +   ++  DP
Sbjct: 272 AYAAGEQVWASY-----------GQRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDP 320

Query: 374 QYQDKRMVAQRNGKLSVQVFHVH-AGREKEAI-------------SDMLPYLRLGYVSDT 419
             Q K  V +R G   V+ F +   G  ++ +             S++    R  +    
Sbjct: 321 YAQAKADVLRRMGLSPVETFPLRLNGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSA 380

Query: 420 SEMQSVISSL------------GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 467
           +  QS+  S+            G I    P  E AV + LAD     L+ YP +L +D+ 
Sbjct: 381 NFGQSIFDSVRGLAQGQARGKQGVILGGVPG-EIAVREMLADMCAEALSAYPNSLEKDKG 439

Query: 468 M 468
           +
Sbjct: 440 I 440


>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
          Length = 649

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 171/406 (42%), Gaps = 66/406 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ DL  G+    VP  L +  + V  ++ +   +    L     +AL L+ E  +G 
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+K EY +  
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 348

Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
                  S F+    +I +E         TF  F  AF  ++S V    +     + AL+
Sbjct: 349 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 398

Query: 286 PLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
           P    +           ++  K K +    D    L      K+GE I + Y++      
Sbjct: 399 PFADLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLD----- 452

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
                + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +
Sbjct: 453 -----KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDI 507

Query: 396 HAGREKEAIS-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQL 447
             G   E++   MLPYLRL  +  T           +V   L    PVS   E A+   +
Sbjct: 508 VLG---ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAICQVI 562

Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            +  K+ L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L 
Sbjct: 563 RNACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 608


>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
 gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
          Length = 520

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 106/433 (24%), Positives = 181/433 (41%), Gaps = 37/433 (8%)

Query: 76  EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           ED  +L +W+ +  G+    +  KE      +      V    D +AG  A  VP S  V
Sbjct: 48  EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           T   V  +  ++EL +     EL  LAL+L  E+ +G  S W PY++ L       +   
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152

Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
           ++PL W++  + A L GSP  A+ +ER++  + EY  +  V        +  P   P EA
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEV-------IKSDPSSYPPEA 205

Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML-AAVDDAVQLV 312
           + F    +   A+ +       +  A+ +ALVPL   +    +    +L  +  D V   
Sbjct: 206 YEFLTEARFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRC 265

Query: 313 VDRPYKAGESIVVWYNISFTGHD-----FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 367
               Y    + VV    +    +          + N +L +N G+VD+ +P D + ++  
Sbjct: 266 GPADYDVDTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTD 325

Query: 368 LNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 426
           + T D  +  K+ V +  G  +  Q F V+  R     + +  YLR   V D  EM +V 
Sbjct: 326 IQTSDRLFTAKKQVLEGMGFTAADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVS 382

Query: 427 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 486
                I  VS   E  +L  L    +  +A Y     ++  +L   +  P + +    +R
Sbjct: 383 FEEDKI--VSVMNEYEILQILMGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLR 440

Query: 487 M-EKKMLNACLQV 498
           M EKK++ A +  
Sbjct: 441 MSEKKLIGATMNA 453


>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Oryza sativa Japonica Group]
 gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/405 (25%), Positives = 169/405 (41%), Gaps = 64/405 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ DL  G+    VP  L +  + V  ++ +   +    L     +AL L+ E  +G 
Sbjct: 90  LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+K EY +  
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 194

Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
                  S F+    +I +E         TF  F  AF  ++S V    +     + AL+
Sbjct: 195 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 244

Query: 286 PLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
           P    +           ++  K K +    D    L      K+GE I + Y++      
Sbjct: 245 PFADLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLD----- 298

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 395
                + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +
Sbjct: 299 -----KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDI 353

Query: 396 HAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLA 448
             G  +     MLPYLRL  +  T           +V   L    PVS   E A+   + 
Sbjct: 354 VLG--ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIR 409

Query: 449 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
           +  K+ L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L 
Sbjct: 410 NACKSALGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 454


>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
          Length = 502

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 87/319 (27%), Positives = 146/319 (45%), Gaps = 40/319 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T +   
Sbjct: 75  LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G   +  +L  N + +   +A YL+ E      S W  YI  L RQ         S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           +  EL AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E 
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
           F+ +F  + S +V L   S+  R ALVP    +L +S + +  L        +    DR 
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 374
           Y+ GE + + Y           G + + +LL++YGFV ++  NP D + +  +LN  D  
Sbjct: 292 YQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKC 340

Query: 375 YQDKRMVAQRNGKLSVQVF 393
           Y++K    +RNG    + F
Sbjct: 341 YKEKLQALKRNGLSEFESF 359


>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
          Length = 502

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 87/319 (27%), Positives = 146/319 (45%), Gaps = 40/319 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T +   
Sbjct: 75  LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G   +  +L  N + +   +A YL+ E      S W  YI  L RQ         S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           +  EL AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E 
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
           F+ +F  + S +V L   S+  R ALVP    +L +S + +  L        +    DR 
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQ 374
           Y+ GE + + Y           G + + +LL++YGFV ++  NP D + +  +LN  D  
Sbjct: 292 YQPGEQVFISY-----------GKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKC 340

Query: 375 YQDKRMVAQRNGKLSVQVF 393
           Y++K    +RNG    + F
Sbjct: 341 YKEKLQALKRNGLSEFESF 359


>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
          Length = 516

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/354 (26%), Positives = 152/354 (42%), Gaps = 45/354 (12%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
            +  +E   +AL L+ E+ +G +SFW  YI  L      G     +   W   ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216

Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
           SP  +        ++ E+         A  L      D   E FTFE ++ AF  + S  
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265

Query: 271 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 320
           + L+        A+VP        P  +     + +  A      +D V L  DR YK  
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325

Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
           E + + Y           GP+ N+ LL+ YGF  + NP++ + +    + +D  Y  K  
Sbjct: 326 EQVFISY-----------GPKSNADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKER 374

Query: 381 VAQRNGK-LSVQVFHVHAGREKEAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPC 438
            A+  G+ +S   F ++A R  +   +++ +LR+   + D    + +      +  +S  
Sbjct: 375 FARGAGRDVSSAAFPLYADRFPD---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLD 431

Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
            E AVLD + D   A +A YPA   +D   + D  L   +R+A +LV  EK++L
Sbjct: 432 NELAVLDTIRDACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482


>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 389

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 78/280 (27%), Positives = 132/280 (47%), Gaps = 34/280 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  +++ G+    VP SLV++ +    N  + +++ +  + +   LA YL+ E     
Sbjct: 13  LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
            S W  YI  L RQ         S L W+ TE+ AYL  SP +   + R   +   YN+L
Sbjct: 73  SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    +F ++P   P E +  E F+ +F  + S +V L   S+  + ALVP    +
Sbjct: 126 ------RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWA-DM 176

Query: 292 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
           L ++ +  A L   D + Q +V   DR Y+ GE + + Y           G + + +LL+
Sbjct: 177 LNHNPEVDAFL-DFDKSSQGIVFTTDRSYQPGEQVFISY-----------GKKSSGELLL 224

Query: 349 NYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
           +YGFV ++  NP D +    +LN  D  Y++K    +R+G
Sbjct: 225 SYGFVPKEGTNPNDSVEFSVSLNKSDDCYREKLQALKRHG 264


>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
 gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
          Length = 467

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 81/310 (26%), Positives = 135/310 (43%), Gaps = 37/310 (11%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+ + GLP  KV LK   +          + +   L  GD    +P +L +T E   
Sbjct: 41  LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
                 +++    L E   LA YL+ E   GK S W PYI  L R+ G       S LLW
Sbjct: 95  ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +  ++  +L+ +  K   L+    ++  +N+L+   FM          + P E F  + F
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSF 201

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
           K AF  + S +V L   SL ++ AL+P G  +L + ++    L   +   ++   +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQ 376
           ++   + + Y           G + N +LL+ YGFV    N  D + +   L+  D  Y+
Sbjct: 259 ESNREVFISY-----------GKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYE 307

Query: 377 DKRMVAQRNG 386
            K    + +G
Sbjct: 308 AKLGTLKEHG 317


>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
          Length = 541

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/434 (22%), Positives = 185/434 (42%), Gaps = 46/434 (10%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
           S +E++      W+  N +    V ++        H  +  + + A+ D + G+   ++P
Sbjct: 71  SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
            S+++T +    N  +  L+  N++ +      LAL+++ E      SFWLPY++ L   
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                 +  SPL ++  +L  L  SPT +E++ +   I R+Y     + F    L  + P
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFNL-FQGHELASKLP 232

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAV 305
             I  +   ++ ++ A  +V +    +  +   R   AL+PL       + +     +  
Sbjct: 233 --IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCNHTNGQITTDFSLK 290

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           +D  +        AG  + ++Y           G + N++LLI+ GFV   N  DRL + 
Sbjct: 291 NDRSECFSLEGTVAGAQVFIFY-----------GSRSNAELLIHNGFVYPQNHSDRLTIR 339

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  DP +  K  V  R    + ++F +H G      SD L +LR+  +++  ++++ 
Sbjct: 340 LGISKNDPLFSMKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLRTA 397

Query: 426 ISSLGPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 478
           ++    I         VS   ER     LA      L  YP +  ED  +L   +L    
Sbjct: 398 LACRQQISKLRDFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLSTHA 457

Query: 479 RVATQLVRMEKKML 492
           R+A QL   EK +L
Sbjct: 458 RLAVQLRHCEKNIL 471


>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 582

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/466 (22%), Positives = 203/466 (43%), Gaps = 58/466 (12%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   +L  W  +NG       L E P           + A+ +++A +    VP  
Sbjct: 73  GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E   G+  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL ++E E+ YL  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FTF+ ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + +K+GE I ++Y           G + N++ +I+ GF  E+N +DR+ +
Sbjct: 291 EDDRCECVALQDFKSGEQIYIFY-----------GTRSNAEFVIHNGFFFENNLHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 415
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+         G+
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGH 398

Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTD 471
           +     +  + +      PVS   E  +  +L  + +AR    L  Y  T+ +D  +L  
Sbjct: 399 LIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQ 454

Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
            ++     +A +L R+EK++L   L+  +D   L    +    P P
Sbjct: 455 PDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 500


>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
           tropicalis]
 gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 581

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/466 (22%), Positives = 203/466 (43%), Gaps = 58/466 (12%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   +L  W  +NG       L E P           + A+ +++A +    VP  
Sbjct: 72  GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E   G+  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL ++E E+ YL  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 229

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FTF+ ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 230 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 289

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + +K+GE I ++Y           G + N++ +I+ GF  E+N +DR+ +
Sbjct: 290 EDDRCECVALQDFKSGEQIYIFY-----------GTRSNAEFVIHNGFFFENNLHDRVKI 338

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 415
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+         G+
Sbjct: 339 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGH 397

Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTD 471
           +     +  + +      PVS   E  +  +L  + +AR    L  Y  T+ +D  +L  
Sbjct: 398 LIGDHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQ 453

Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
            ++     +A +L R+EK++L   L+  +D   L    +    P P
Sbjct: 454 PDMTFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 499


>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
          Length = 497

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 196/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 72  GKREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRK 125

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L+ T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 126 LLXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 181

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 182 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 229

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 230 NKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLITTGYNL 289

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 290 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 338

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GY 415
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+          +
Sbjct: 339 KLGVSKSDRLYAXKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCXTEEELKEH 397

Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +   S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +++L 
Sbjct: 398 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLS 457

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + + A +L   EK++L   ++  A
Sbjct: 458 VRAKXAIKLRLGEKEILEKAVKSAA 482


>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
 gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
          Length = 595

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 106/445 (23%), Positives = 195/445 (43%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W   NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L   +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-- 246
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPQA 230

Query: 247 YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
           + +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 HKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEGELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+  L ++ L 
Sbjct: 399 LLGENAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLKNHGLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAVKLRLGEKEILEKAVESAA 483


>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
          Length = 512

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/423 (23%), Positives = 192/423 (45%), Gaps = 49/423 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
           A+ D++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+   
Sbjct: 28  ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS- 85

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y   
Sbjct: 86  PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY--- 135

Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
              +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
                  +          DD  + V  + ++AGE I ++Y           G + N++ +
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFV 240

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
           I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +
Sbjct: 241 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQL 299

Query: 408 LPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
           L +LR+  +++          S +  + +      PVS   E  +   L D     L  Y
Sbjct: 300 LAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTY 359

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPY 518
             T+ ED+++L +++L  + ++A +L   EK++L   ++  A    + P + +SP     
Sbjct: 360 KTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGK 414

Query: 519 APL 521
           APL
Sbjct: 415 APL 417


>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
          Length = 585

 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 106/445 (23%), Positives = 199/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L D++L 
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKDHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRAAMAIKLRLGEKEILEKAVKSAA 483


>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
          Length = 488

 Score = 89.7 bits (221), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 102/442 (23%), Positives = 196/442 (44%), Gaps = 50/442 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 80  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 346

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 347 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 405

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 406 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 465

Query: 476 PKKRVATQLVRMEKKMLNACLQ 497
            +  +A +L   EK++L   ++
Sbjct: 466 ARATMAIKLRLGEKEILERAVK 487


>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
          Length = 594

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 199/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +++L 
Sbjct: 399 LLGDSAIDRIFTLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
          Length = 594

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 103/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 231 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 399 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 ARATMAIKLRLGEKEILERAVKSAA 483


>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 507

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 106/463 (22%), Positives = 202/463 (43%), Gaps = 58/463 (12%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           E+   +L  W  +NG       L E P           + A+ +++A +    VP  L++
Sbjct: 1   EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54

Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           T+E   G+  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +     
Sbjct: 55  TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DI 249
              ++PL ++E E+ YL  +    ++  + +   R+Y      +F    + Q +P    +
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNANKL 158

Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDD 307
           P  ++FTF+ ++ A  +V +    +     +R   AL+PL       +S         DD
Sbjct: 159 PLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTGYNLEDD 218

Query: 308 AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 367
             + V  + +K+GE I ++Y           G + N++ +I+ GF  E+N +DR+ ++  
Sbjct: 219 RCECVALQDFKSGEQIYIFY-----------GTRSNAEFVIHNGFFFENNLHDRVKIKLG 267

Query: 368 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSD 418
           ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+         G++  
Sbjct: 268 VSKSDRLYAMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIG 326

Query: 419 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNL 474
              +  + +      PVS   E  +  +L  + +AR    L  Y  T+ +D  +L   ++
Sbjct: 327 DHAIDKIFTLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDM 382

Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
                +A +L R+EK++L   L+  +D   L    +    P P
Sbjct: 383 TFHSAMAIKLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 425


>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
 gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
 gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
 gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
          Length = 594

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 199/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +++L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
          Length = 805

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 105/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E     E+   +    A+ D++A +    VP  
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 409

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 410 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 469

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 470 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 518

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 519 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 577

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED + L +++L 
Sbjct: 578 LLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLRNHDLS 637

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 638 VRAAMAVKLRLGEKEILERAVKSAA 662


>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
          Length = 601

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 103/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 80  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 346

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 347 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 405

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 406 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 465

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 466 ARATMAIKLRLGEKEILERAVKSAA 490


>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
 gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
          Length = 463

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 138/338 (40%), Gaps = 68/338 (20%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L SW+   G      +LK  P           + A  D++AG+    V    ++T +R+
Sbjct: 39  ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
                  +LL++  +SE A LAL L++EK+ G+ S W PYI  L R        + S   
Sbjct: 92  --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W + ELA +  S    E + R   I+ E+NE+  +       FQ+Y + +     ++  F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASF 195

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD------------ 307
           K A+V    C     ++    + A+VP     + +     AML    D            
Sbjct: 196 KHAYVTATVC-SRAWRIDGLEKLAMVPFA-DFMNHDWSSNAMLTYDTDNGSTEVEEVKVY 253

Query: 308 ---------AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
                      QL  D+ Y AGE +     ISF       GP  N+ L +++GF    NP
Sbjct: 254 SDCLDIALFCAQLFADKNYAAGEQV----TISF-------GPLCNASLALDFGFTVPYNP 302

Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 396
           +D++ +   ++  D   ++K           +Q  H H
Sbjct: 303 WDKVQLWLGISRRDSLRKEK-----------LQYLHAH 329


>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3 [Ovis aries]
          Length = 596

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 103/445 (23%), Positives = 197/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 80  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 346

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 347 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 405

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 406 LLGDSAIDRIFTLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 465

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 466 ARATMAVKLRLGEKEILERAVKSAA 490


>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
 gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
          Length = 449

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 111/447 (24%), Positives = 191/447 (42%), Gaps = 53/447 (11%)

Query: 70  VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
             S   + L +L  W+ + G   P  +++ +E             V A   + AG+    
Sbjct: 10  AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61

Query: 128 VPNSLVVTLERVLGNET-----IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
           VP + ++TLE  L  E+     IAE L  +  +E   LA +L+ EK + + SFW PYI  
Sbjct: 62  VPRTHMLTLE--LARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDS 116

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
           L           + PL +   E A L G      +  +A+ ++ +Y           SL 
Sbjct: 117 LPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYL----------SLC 160

Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML 302
           Q  P     E FT   F  A ++V S +  L+K     +  LVP+   +L +      + 
Sbjct: 161 QNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TLVPMAD-MLNHRRPPDVLW 215

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
              +D    V+    KA  ++     +    HD   G + N  +L+++GFV +DN +D  
Sbjct: 216 ETTEDGESFVM----KANNAVAAGDEV----HD-SYGAKSNDLMLLHFGFVTDDNEHDEA 266

Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTS 420
            +   +   DP    K+M+       + + F +              +LR+     +D  
Sbjct: 267 FLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRIAAAVPNDIE 326

Query: 421 EMQS-VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 479
           ++ S V+S    + P+S   E  VL+ LA   +ARL+ +P +L++DE +L   +L P  R
Sbjct: 327 DISSRVMSGERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLRGESLSPNAR 386

Query: 480 VATQLVRMEKKMLNACLQVTADMIMLL 506
               + R EK+++   L++T   + LL
Sbjct: 387 NCVLVRRAEKQLIEDYLEMTRVCLKLL 413


>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 590

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/416 (24%), Positives = 189/416 (45%), Gaps = 71/416 (17%)

Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
           +++ +R +H   A   +   +    +P S ++TLE  +  ET +A+ +   KL+ L+   
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227

Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
             L+ +L+ E+K  K+S W PY+  L           + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
            E+   IKR+Y+++ +V       F +Y         TFE F  A +   S V  LQ ++
Sbjct: 281 REKKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-IN 326

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWYNISFT 332
             +  A VPL   L     + K      DD      +Q + D P   GE +   Y     
Sbjct: 327 EQKTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSY----- 377

Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQ 391
                 G + NS+  +NYGF++ DN  + + +    + EDP  + K+ +    G +   +
Sbjct: 378 ------GRKCNSRFFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFK 429

Query: 392 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL----------------GPICPV 435
           V+ +    +++ +S+ + YLR   + D S++  ++SSL                    P+
Sbjct: 430 VYRILENYQEQNVSEFMSYLRFILIRDNSKL-LMLSSLHEQQTENSENLSGYKPQKTPPI 488

Query: 436 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 491
           S   E  +  ++++  +  ++ Y  TL ED+ +L   NL   +R    L   EK++
Sbjct: 489 SIQNETDMWVRISNMCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544


>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
          Length = 507

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 95/390 (24%), Positives = 170/390 (43%), Gaps = 37/390 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +A   VP    +  + V  +E  +       L     +AL+L+ EK +  
Sbjct: 81  LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L            S + WSE EL  + G+      L   E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
               +  S    +P  +    + F I + +AF  ++      Q + L     L+   P +
Sbjct: 190 EEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPSI 244

Query: 292 LA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
               Y+ + K A L + D    L      KAGE +++ Y++           + N++L +
Sbjct: 245 TTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLD----------KSNAELAL 294

Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
           +YGF++     +   +   ++  DP + DK  +A+ NG   +  F +  G+   A   ML
Sbjct: 295 DYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--ML 352

Query: 409 PYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 463
           PYLRL  +   D   ++S+  +   G +  PVS   E  +   + D  K+ L+GY  T+ 
Sbjct: 353 PYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIE 412

Query: 464 EDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
           EDE +  + NLHP+  +A  +   EKK+L 
Sbjct: 413 EDEKLKEEGNLHPRLEIAVGVRTGEKKVLQ 442


>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 97/395 (24%), Positives = 172/395 (43%), Gaps = 47/395 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +A   VP    +  + V  +E  +       L     +AL+L+ EK +  
Sbjct: 81  LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L            S + WSE EL  + G+      L   E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +  S  Q +P+ +     T + F  AF  ++S      +    +   L+PL     
Sbjct: 190 EEVILPHS--QLFPFPV-----TLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLIN 239

Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
             P +    Y+ + K A L + D    L      KAGE +++ Y++           + N
Sbjct: 240 HSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLD----------KSN 289

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
           ++L ++YGF++     +   +   ++  DP + DK  +A+ NG   +  F +  G+   A
Sbjct: 290 AELALDYGFIESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPA 349

Query: 404 ISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGY 458
              MLPYLRL  +   D   ++S+  +   G +  PVS   E  +   + D  K+ L+GY
Sbjct: 350 A--MLPYLRLVALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGY 407

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
             T+ EDE +  + NLHP+  +A  +   EKK+L 
Sbjct: 408 LTTIEEDEKLKEEGNLHPRLEIAVGVRTGEKKVLQ 442


>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
          Length = 595

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
 gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
 gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
          Length = 595

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
 gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
          Length = 529

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 110/396 (27%), Positives = 169/396 (42%), Gaps = 41/396 (10%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           D++AG+    +P +L VT   V  +  +A L       EL  LAL+L  E+ +G  S W 
Sbjct: 83  DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
           PY+  L           + PLLW+  EL   L GSP + + + R E    EY  +     
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASI----- 189

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPLGPPLLAYS 295
                 +  P D P +A+ F + + AFV A+ + +     ++ A  +A+VPL   LL   
Sbjct: 190 --ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAMVPL-VDLLPLV 245

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS------FTGHDFKCG---------P 340
                 ++    A    V +P  A  + VV Y+ +       + +D +            
Sbjct: 246 GSPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLA 305

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGR 399
           +    L +  G VDE +  D L   A+    D  Y+ KR + +  G     Q F V A R
Sbjct: 306 RNAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADR 365

Query: 400 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 459
                  +L Y+R   V D  E+ SV      I  VSP  E  VL  L    +  LA Y 
Sbjct: 366 MP---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQDAREMLAEYE 420

Query: 460 ATLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 494
           ++  E E + L +  L  ++RVA +L   EK+++NA
Sbjct: 421 SSSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456


>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Nomascus leucogenys]
 gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Nomascus leucogenys]
          Length = 595

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
 gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
          Length = 595

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           troglodytes]
 gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
           troglodytes]
 gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
           paniscus]
 gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           paniscus]
 gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
 gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
 gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
 gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
          Length = 594

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Pongo abelii]
 gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Pongo abelii]
          Length = 595

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Callithrix jacchus]
          Length = 610

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 88  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 141

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 142 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 197

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 198 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 245

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 246 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 305

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 306 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 354

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 355 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 413

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 414 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 473

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 474 VRAKMAIKLRLGEKEILEKAVKSAA 498


>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
           gorilla gorilla]
          Length = 594

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKTVKSAA 483


>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
          Length = 597

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 75  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 185 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 341

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 342 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 400

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 401 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 460

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 461 VRAKMAIKLRLGEKEILEKAVKSAA 485


>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
          Length = 597

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 75  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 185 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 341

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 342 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 400

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 401 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 460

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 461 VRAKMAIKLRLGEKEILEKAVKSAA 485


>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 200/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   ++  A
Sbjct: 459 VRAKMAIKLRLGEKEILEKAVKSAA 483


>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
           novemcinctus]
          Length = 589

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 199/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQSLPGE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 399 LLGENAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
           porcellus]
          Length = 789

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 105/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 VIS--------SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           ++         +LG    PVS   E  +   L D     L  Y  T+ ED+A+L    L 
Sbjct: 399 LLGENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLKGPELP 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + ++A +L   EK++L   +Q  A
Sbjct: 459 TRMKMAVKLRLGEKEILERTVQSAA 483


>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
           ferrumequinum]
          Length = 594

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  S  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y   + ED++ L +++L 
Sbjct: 399 LLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
           africana]
          Length = 597

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ +L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+A L  ++L 
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKGHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 IRATMAVKLRLGEKEILERAVKSAA 483


>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
          Length = 459

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 106/450 (23%), Positives = 190/450 (42%), Gaps = 76/450 (16%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W  +NG     V +K++P  +      + + A E ++       +P  LV+T      ++
Sbjct: 50  WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103

Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
            +  LL  +     +S +A LA++L+ E   G+ SFW PYI  L         +  + L 
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           ++  EL  L+GS    E L+    I R+Y     + F    L +  P+    + FT++++
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLY 210

Query: 260 KQAFVAV---QSCVVHLQKVSL----------ARRFALVPL--------GPPLLAYSS-- 296
           + A  AV   Q+ V   +   L          A   ALVPL        G  L  Y S  
Sbjct: 211 RWAVSAVMTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSA 270

Query: 297 ---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
              +C AM             R +  GE + ++Y           G + N++  I+ GFV
Sbjct: 271 SMVRCYAM-------------RDFDKGEEVTIFY-----------GKRTNAEFFIHNGFV 306

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
            EDN YD + ++  ++ +DP +  K  + + +  LS+        R++    D+  +LR+
Sbjct: 307 FEDNRYDAVDIKLGVSKKDPLFAVKSKLCE-DHDLSLSGTFALVARDRPVSEDLSTFLRI 365

Query: 414 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 473
             + D S+ ++   S   I   S    R  L  L    +  L  +P +  E E ++ D  
Sbjct: 366 LVLKDASQPEAF--SAEHILTSSDSNARDALTFLVVRIELLLKAFPKSDEEYEDIIKDGA 423

Query: 474 LHPKKRVATQLVRMEKKMLNACLQVTADMI 503
            + + ++A +L  +E K+L + L+   + +
Sbjct: 424 SNARVKMAARLRLLESKVLASVLETLGNHV 453


>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 492

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 92/397 (23%), Positives = 171/397 (43%), Gaps = 47/397 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ +L  G+    VP  L +  + V  ++      +   L     ++L ++ E  +G 
Sbjct: 86  LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L RQ        +S + WSE EL  + G+   +  +   E ++ E++ ++
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
               + G     +P     +  TF+ F  AF  ++S V    +     + AL+P    L+
Sbjct: 199 AK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLALIPFAD-LI 247

Query: 293 AYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
            +S+           + K  L   D    L      K+GE + V Y++           +
Sbjct: 248 NHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLD----------K 296

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
            N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F V  G  +
Sbjct: 297 SNAELALDYGFTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--E 354

Query: 402 EAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLA 456
                M+ YLRL  +  T    ++++  +   G +  PVS   E ++   +    K+ L 
Sbjct: 355 SLPPQMITYLRLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALT 414

Query: 457 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            Y  T+ EDE +L   +L  + ++A ++   EKK+L 
Sbjct: 415 AYHTTIEEDEELLKREDLQSRHQIAVEVRAGEKKVLQ 451


>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
          Length = 595

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E    DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +++L 
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
          Length = 591

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 104/444 (23%), Positives = 195/444 (43%), Gaps = 58/444 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 305
             +P  +AFT+E ++   V   S  +    + L     +   G P +      +      
Sbjct: 231 NKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKPQITTGYNLE------ 281

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 282 DDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 330

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + +
Sbjct: 331 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEEELKEHL 389

Query: 426 --------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
                   I +LG    PVS   E  +   L D     L  Y  T+ ED+A L ++NL  
Sbjct: 390 LGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKNHNLSV 449

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           +  +A +L   EK++L   ++  A
Sbjct: 450 RATMAIKLRLGEKEILEKAVKSAA 473


>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
           garnettii]
          Length = 595

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E    DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +++L 
Sbjct: 399 LLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 588

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
          Length = 589

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
          Length = 492

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 98/429 (22%), Positives = 190/429 (44%), Gaps = 50/429 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD------ 418
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 419 ---TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLS 458

Query: 476 PKKRVATQL 484
            + ++A +L
Sbjct: 459 VRAKMAIKL 467


>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
           caballus]
          Length = 609

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 102/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 87  GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 244

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 245 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLITTGYNL 304

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 305 EDDRCECVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 353

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 354 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKDH 412

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               + +  + +      PVS   E  +   L D     L  Y  T+ ED A L + +L 
Sbjct: 413 LLGDNAIDRIFTLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLKNSDLS 472

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 473 VRATMAIKLRLGEKEILEKAVKSAA 497


>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
           latipes]
          Length = 606

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 102/447 (22%), Positives = 188/447 (42%), Gaps = 56/447 (12%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T+  GSRE      +   DL SW  +NG       +    +     R       + D++A
Sbjct: 69  TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116

Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
            +    VP  +++T+E    N  +  + + +++ +      LAL+L+ E+     SFW P
Sbjct: 117 EELFLWVPRKMLMTVESA-QNSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YIR L ++        ++PL + + ++  L G+    ++L + +   R+Y      +F  
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY-----AYFY- 221

Query: 239 GSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAY 294
             L Q +P    +P  + F+F+ ++ A  +V +    +  V  +R   AL+PL       
Sbjct: 222 -KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHT 280

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
           +          DD  + V  + YK  E I ++Y           G + N++ +I+ GF  
Sbjct: 281 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFY-----------GTRSNAEFVIHNGFFF 329

Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL- 413
           +DN +DR+ ++  ++  +  Y  K  V  R G  +  VF +H   +    + +L +LR+ 
Sbjct: 330 QDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVF 388

Query: 414 --------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
                    Y+     +  + +      PVS   E  +   L       L  Y  T  ED
Sbjct: 389 CMTEEELKDYLLGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEED 448

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
            ++L   +L    R+A QL   EK++L
Sbjct: 449 RSILEKPDLSLHTRLAVQLRLAEKQIL 475


>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
          Length = 585

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
          Length = 591

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 103/448 (22%), Positives = 194/448 (43%), Gaps = 56/448 (12%)

Query: 72  SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
            K+E+   +L  W   NG  +   +V+  E+            + A+ +++A +    VP
Sbjct: 73  GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
             L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                   ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHP 228

Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
           +   +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +       
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
              DD  + V  R + AGE I ++Y           G + N++ +I+ GF  ++N +DR+
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRV 337

Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 422
            ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+ +     E+
Sbjct: 338 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTEEEL 395

Query: 423 QSVISSLGPI----------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 472
           +  +   G I           PVS   E  +   L D     L  Y  T+ ED+A+L   
Sbjct: 396 REHLLGDGAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVLRSP 455

Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
            L  +  +A +L   EK++L   ++  A
Sbjct: 456 ALSARAAMAVKLRLGEKEILEKAVRSAA 483


>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 591

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 104/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEH 398

Query: 425 V--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L 
Sbjct: 399 LLGDNALDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
           boliviensis boliviensis]
          Length = 513

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/402 (23%), Positives = 183/402 (45%), Gaps = 44/402 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
           A+ D++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+   
Sbjct: 28  ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y   
Sbjct: 86  PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY--- 135

Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
              +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
                  +          DD  + V  + ++AGE I ++Y           G + N++ +
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFY-----------GTRSNAEFV 240

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
           I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +
Sbjct: 241 IHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQL 299

Query: 408 LPYLRLGYVSDTSEMQSV--------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGY 458
           L +LR+  +++    + +        I +LG    PVS   E  +   L D     L  Y
Sbjct: 300 LAFLRVFCMTEEELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTY 359

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
             T+ ED+ +L + +L  + ++A +L   EK++L   ++  A
Sbjct: 360 KTTIEEDKFVLKNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401


>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 524

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 135/320 (42%), Gaps = 51/320 (15%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L  W+    LP  K+ L+   +          + A+E+++ G+A   VP + ++T+ER 
Sbjct: 83  ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
           +    +       +L E + LA +L  +    + G    +  YIR L R+ G       S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187

Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            L W E E+   L GSP++    ER + +    +E+ +           +P        T
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------IT 231

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
               + AF  + S ++ L   ++    ALVP    +L +   C A +    DAV L  DR
Sbjct: 232 VGALRWAFDILFSRLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDR 288

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDP 373
            Y  GE +   Y           G +P+S+LLI+YGF  E  +NP D   +   ++  DP
Sbjct: 289 SYVKGEQVWASY-----------GQRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDP 337

Query: 374 QYQDKRMVAQRNGKLSVQVF 393
               K  V +  G   V+ F
Sbjct: 338 LADAKAQVLRDMGLSPVETF 357


>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
          Length = 585

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 101/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   +L  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 62  GKREDYFPNLMKWASENG---ASVEGFEMFNFKEEGFGLR---ATRDIKAEELFLWVPRK 115

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 116 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 171

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 172 -----YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 219

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 220 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 279

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  R ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 280 EDDRCECVALRDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 328

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 329 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEELKDH 387

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               + +  + +      PVS   E  +   L D     L  Y  T  ED++ L +++L 
Sbjct: 388 LLGDNAIDKIFTLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKNHDLS 447

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            + R+A +L   EK++L   +   A
Sbjct: 448 VRARMAIKLRLGEKEILEKAVTSAA 472


>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
           guttata]
          Length = 593

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 92/404 (22%), Positives = 185/404 (45%), Gaps = 44/404 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ +L  +    ++  + +   R+Y 
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
           PL       +          DD  + V  + +KAGE I ++Y           G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320

Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISA 379

Query: 406 DMLPYLRLGYVSDTSEMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LA 456
            +L +LR+  +S+    + +I   ++G I  +         D   +L  + +AR    L 
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLK 439

Query: 457 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
            Y  T+  D++ L  ++L P   +A +L   EK++L   ++  A
Sbjct: 440 TYKTTVEVDKSFLETHDLTPHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Strongylocentrotus purpuratus]
          Length = 682

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 105/449 (23%), Positives = 190/449 (42%), Gaps = 48/449 (10%)

Query: 64  VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
           VAG     S +E        W++ NG+    V +        K    + + A++D++   
Sbjct: 66  VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118

Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
              ++P  +++T +  + + TI +L+  ++L +      LA++++ EK +   SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L         +   PL ++  E+    GS    E L++ + I R+Y  L    F   +
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL----FKLLN 226

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLLAYSSKCK 299
           L +     I  E FT++ ++ A   V +    +  K       +L+PL       + + K
Sbjct: 227 LPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCNHANGEMK 285

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
                  D+   +  R +  GE I + Y           G + ++ LL+  GFV   N Y
Sbjct: 286 TDFIEERDSCVNMALRDFSVGEQIFICY-----------GRRSSADLLLYSGFVYPGNVY 334

Query: 360 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVS 417
           D + ++  L++ D  Y  K  +     KL V  Q +H+ AG+E   + ++L +LR+  + 
Sbjct: 335 DGMAIQLGLSSSDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQ 392

Query: 418 D---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 468
           D          +  Q++ S +     +S   E      LA         Y  ++ EDE  
Sbjct: 393 DLELRDRLLGDNRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEK 452

Query: 469 LTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           L D NL  ++R A QL+ +EK  L   L+
Sbjct: 453 LKDGNLSAQERSALQLLLIEKCTLENVLE 481


>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
 gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
          Length = 389

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 68/262 (25%), Positives = 126/262 (48%), Gaps = 34/262 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS  +  G+    V + L++T E++   E + +LL+ + +S  A LAL+L+  +K+ + S
Sbjct: 5   ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L         ++ S + W++ EL YL  SP   E ++R + ++ E+   +  
Sbjct: 62  AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
             +        P+   +     E FK A+  V S    ++ +   +  ALVP       +
Sbjct: 117 LLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLALVPF-VDFFNH 164

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
            + C+AML+  +D    ++V DR Y  G+ +V+ Y           G   N+ L +++GF
Sbjct: 165 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISY-----------GQLSNATLALDFGF 213

Query: 353 VDEDNPYDRLV-VEAALNTEDP 373
               NP+D++  +  +L+ +DP
Sbjct: 214 ALPFNPHDQVAGIWLSLSEKDP 235


>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
 gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
          Length = 430

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 171/392 (43%), Gaps = 58/392 (14%)

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           S+P +L + ++ V  +E I E      L     +ALYL++EK +   S W  YIR L R 
Sbjct: 37  SIPKTLWMDVDTVRRSE-IGECCAG--LRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                  ++SPL WSE ELA L G+   + I    E +KREY+++ T       + +  P
Sbjct: 92  ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMT------EVIEPRP 139

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
                  +T E F  AF  ++S           R F      PPL+  +     +   V+
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRS-----------RTF------PPLIGDNLALVPLADFVN 182

Query: 307 DAVQLVVDRPYKAGESIVVWY---NISFTGHDFKCGPQP-----------NSKLLINYGF 352
               L  + PY   +S  V+     ++       C  +            N++L  +YGF
Sbjct: 183 HGFGLTNEDPYWHVKSAGVFARQETLTLQAAA-NCAEKQEVLMQYGKKKGNAQLATDYGF 241

Query: 353 VDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
           VD D  N  D   +   ++  +    DK  +AQ  G  S   F+++  R +    DM+ Y
Sbjct: 242 VDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAY 299

Query: 411 LRLG--YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
           LRL   + SD+  ++++  +        P+S   E A+ + + +  +A L  Y +T+ ED
Sbjct: 300 LRLIALFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDED 359

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
             +L    L  +K++A  +   EK++L   LQ
Sbjct: 360 TMLLNSSELSTRKKMAVVVRLGEKRILQEQLQ 391


>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
 gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=Endothelial differentiation inhibitory protein D10;
           AltName: Full=SET domain-containing protein 3
 gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
 gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
 gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
 gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
 gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
 gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
          Length = 594

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 101/445 (22%), Positives = 196/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
          Length = 597

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 103/445 (23%), Positives = 198/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 76  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 129

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 130 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 185

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 186 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 233

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 234 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 293

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 294 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 342

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++    + 
Sbjct: 343 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEH 401

Query: 425 V--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L 
Sbjct: 402 LLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 461

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 462 VRATMAIKLRLGEKEILEKAVKSAA 486


>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 111/438 (25%), Positives = 172/438 (39%), Gaps = 55/438 (12%)

Query: 76  EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           ED  +L +W+ +  G+    ++ KE        R    VA   D+ AG    +VP    V
Sbjct: 47  EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           T   V  +  ++ L       EL  LAL+L  E+ +G  S W PY++ L           
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTLAAN-------P 151

Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
           ++PL W+E E  A L GSP   + +ER+   + EY  +  V        +  P   P EA
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEA 204

Query: 254 FTF---EIFKQAFVAVQSCVVHLQKVSLARRFALVPL-------GPPLLAYSSKCKAMLA 303
           + F   E F  A   V +    L   S    +ALVPL       G P+   S       +
Sbjct: 205 YEFFTEERFVDALATVCAKATWLPTASC---YALVPLLDVITIAGSPVPGVSPP-----S 256

Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDFKCGP--QPNSKLLINYGFVDEDNP 358
           A D   +   D  Y    + VV   +          +  P  + N +L +N G VD+ +P
Sbjct: 257 AKDGIARCAAD--YDVDSACVVLSAVVKAPANSRVVQLDPLQRNNGELFLNTGRVDQKHP 314

Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVS 417
            D L +   +   D  +  K+ V +  G     Q F V+   E    + +  YLR   V 
Sbjct: 315 GDYLYMRTEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY---EDRMPTQLYSYLRFARVQ 371

Query: 418 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
           D  EM +V      I  VS   E  +L  L    +  ++ Y     ++  +L   +    
Sbjct: 372 DPGEMMAVSFEEDKI--VSVMNEYEILQLLMGDCRELMSEYDTNEEDELNLLKLSDTMRV 429

Query: 478 KRVATQLVRMEKKMLNAC 495
           + +    +RM +K L  C
Sbjct: 430 REIEAAKLRMSEKKLIGC 447


>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
           griseus]
 gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
          Length = 577

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 101/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
 gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
          Length = 439

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 168/387 (43%), Gaps = 48/387 (12%)

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           S+P +L +  + V  +E I E      L     +ALYL++EK +   S W  YIR L R 
Sbjct: 37  SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                  ++SPL WSE ELA L G+   + +    E +KREY+++ T       + +  P
Sbjct: 92  ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMT------EVIEPRP 139

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------GPPLLAYSSKCKA 300
                  +T E F  AF  ++S         +    ALVPL      G  L       K 
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTF---PPLIGDNLALVPLADFVNHGFGLTNEDPGWKV 196

Query: 301 MLAAV---DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED- 356
             A V    + + L         + +++ Y             + N++L  +YGFVD D 
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLIQYG----------KKKGNAQLATDYGFVDSDE 246

Query: 357 -NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG- 414
            N  D   +   ++  +    DK  +AQ  G  S   F+++  R +    DM+ YLRL  
Sbjct: 247 KNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIA 304

Query: 415 -YVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 470
            + SD+  ++++  +        P+S   E A+ + + +  +A L  Y +T+ ED  +L 
Sbjct: 305 LFGSDSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLN 364

Query: 471 DYNLHPKKRVATQLVRMEKKMLNACLQ 497
              L  +K++A  +   EK++L   LQ
Sbjct: 365 SSELSTRKKMAVVVRLGEKRILQEQLQ 391


>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
 gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
          Length = 594

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 101/445 (22%), Positives = 196/445 (44%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAESVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
 gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 169/394 (42%), Gaps = 40/394 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
           + A++D++A +    +P  +++T+E    N  +  L + +++ +      LAL+L+ E+ 
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               S WLPYI+ L  +        ++PL + E E+ +L  +    ++L + +   R+Y 
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
                +F            +P  +AFTF+ ++ A  +V +    +     +R   AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
                  +          DD  + V  + YK GE I ++Y           G + N++ +
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFY-----------GTRSNAEFV 322

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
           I+ GF  EDN +DR+ ++  ++  +  Y  K  V  R G  +  +F +H   E    + +
Sbjct: 323 IHNGFFFEDNAHDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQL 381

Query: 408 LPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
           L +LR+          Y+     +  + +      PVS   E  +   L       L  Y
Sbjct: 382 LAFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTY 441

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
                ED +ML   +L    R+A +L   EK++L
Sbjct: 442 KTASEEDRSMLEKPDLSLHSRIAIKLRLAEKEIL 475


>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 100/445 (22%), Positives = 195/445 (43%), Gaps = 50/445 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT----- 419
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++      
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 420 ----SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L 
Sbjct: 399 LLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLS 458

Query: 476 PKKRVATQLVRMEKKMLNACLQVTA 500
            +  +A +L   EK++L   ++  A
Sbjct: 459 VRATMAIKLRLGEKEILEKAVKSAA 483


>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
           rubripes]
          Length = 598

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 176/403 (43%), Gaps = 54/403 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
           A+ D++A +    +P  +++T+E       +  L   +++ +      LAL+L+ E+   
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERA-N 167

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFWLPYIR L ++        ++PL + + E+  L G+    ++L +     R+Y   
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY--- 217

Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
              +F    L Q +P    +P  ++FTF+ ++ A  +V +    +     ++V+L    A
Sbjct: 218 --AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTL----A 269

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           L+PL       +          DD  + V  + YK  E I ++Y           G + N
Sbjct: 270 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFY-----------GTRSN 318

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
           ++ +I+ GF  ++N +D++ ++  ++  +  Y  K  V  R G     +F ++   E+  
Sbjct: 319 AEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPI 377

Query: 404 ISDMLPYLRLGYVSDTSEMQSV---------ISSLGPI-CPVSPCMERAVLDQLADYFKA 453
            + +L +LR+ +     E++           I +LG +  PVS   E  +   L      
Sbjct: 378 SAQLLAFLRV-FCMKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVAL 436

Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
            L  Y  T  ED + L    L P  R+A QL   EK +L   L
Sbjct: 437 LLKAYKTTSEEDSSTLEKSELSPHSRMAIQLRLAEKWILEKAL 479


>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 482

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 99/403 (24%), Positives = 178/403 (44%), Gaps = 47/403 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  E V  ++ I  L     L     +AL+L+ EK + +
Sbjct: 79  LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L +       + +S + WSE ELA L G+   +  L   E ++ E+ +L+
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +          D+ +   T + F  AF  ++S      +    +   L+PL     
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237

Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
             P +    Y+ + K A L + D    L      KAGE + + Y+++          + N
Sbjct: 238 HNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLN----------KSN 287

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
           ++L ++YGFV+ +   +   +   +   DP + DK  +A+ N       F V  G+   A
Sbjct: 288 AELALDYGFVESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQTLPA 347

Query: 404 ISDMLPYLRLGYV--SDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGY 458
              ML YLRL  +  SD   ++S+ ++   G +  PVS   E  +   + D  K+ L+G+
Sbjct: 348 --GMLQYLRLVALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSALSGF 405

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
             T+ EDE +L +  L P+  +A ++   EK++L    Q+  D
Sbjct: 406 STTIEEDEKLLEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKD 448


>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/285 (26%), Positives = 129/285 (45%), Gaps = 39/285 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  ++ G+    V   L++T  ++     + ELL T  ++E A LAL+++ E+  G+ S
Sbjct: 5   AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L         A+ S + W + EL  +  +    E ++R   I  E+  +  V
Sbjct: 62  QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                   Q+ P+ I  E      FKQA+   +S      + S   R   VP       +
Sbjct: 117 -------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILTVPF-VDFFNH 162

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
            S C+A+L+  ++    +++ D+ Y  GE +V+ Y           G  PN+ L +++GF
Sbjct: 163 DSNCRALLSYDEERACAEVIADKNYARGEQVVISY-----------GRLPNTTLALDFGF 211

Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
               NPYD++ V  AL+  DP  + K  +   +G  +V    VHA
Sbjct: 212 TISCNPYDQVEVWMALSHRDPLRKMKLALLHAHGMPTV----VHA 252


>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
           methyltransferase; Short=rbcMT; Flags: Precursor
 gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
 gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
          Length = 491

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 90/391 (23%), Positives = 171/391 (43%), Gaps = 40/391 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+  G+    VP    +  + V  +E I  + +   L     +AL+L+ EK +  
Sbjct: 87  LVAKRDIAKGETVLQVPKRFWINPDAVAESE-IGNVCS--GLKPWISVALFLLREKWR-D 142

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y+  L +       + +S + WSE EL+ + G+   +  +   + ++ E+ +++
Sbjct: 143 DSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVE 195

Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
               +     Q +P+ I  + F   F I + +AF  +++  ++ +    L    A V   
Sbjct: 196 EEVILRNK--QLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTE 253

Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
               A+  +  A L + D    L      KAG+ + + Y+++          + N+ + +
Sbjct: 254 DH--AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLN----------KSNADMAL 301

Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
           +YGF++  +  D   +   ++  D  Y DK  +A+ NG      F +  G+       M+
Sbjct: 302 DYGFIEPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT--MI 359

Query: 409 PYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 461
           PYLRL  +  T           SV   LG   PVS   E  +   + D  K+ L+GY  T
Sbjct: 360 PYLRLVALGGTDAFLLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSGYHTT 417

Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           + EDE ++ + NL  + ++A  +   EK++L
Sbjct: 418 IEEDEKLMEEGNLSTRLQIAVGIRLGEKRVL 448


>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/394 (22%), Positives = 168/394 (42%), Gaps = 40/394 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
           + A++D++A +    +P  +++T+E    N  +  L + +++ +      LAL+L+ E+ 
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               S WLPYI+ L  +        ++PL + E E+ +L  +    ++L + +   R+Y 
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
                +F            +P  +AFTF+ ++ A  +V +    +     +R   AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
                  +          DD  + V  + YK GE I ++Y           G + N++ +
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFY-----------GTRSNAEFV 322

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
           I+ GF  EDN +DR+ ++  ++  +  Y  K  V  R G  +  +F +H   E    + +
Sbjct: 323 IHNGFFFEDNAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQL 381

Query: 408 LPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
           L +LR+          Y+     +  + +      PVS   E  +   L       L  Y
Sbjct: 382 LAFLRVFCMTEEELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTY 441

Query: 459 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
                ED +ML   +L    R+  +L   EK++L
Sbjct: 442 KTASEEDRSMLEKPDLSLHSRITIKLRLAEKEIL 475


>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
          Length = 593

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/408 (23%), Positives = 185/408 (45%), Gaps = 52/408 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +         +PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
           PL       +          DD  + V  + +KAGE I ++Y           G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320

Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379

Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
            +L +LR+  +S+    + +I         +LG    PVS   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMSEEELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435

Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
             L  Y  T+ +D++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
 gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
          Length = 389

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 68/262 (25%), Positives = 126/262 (48%), Gaps = 36/262 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS  +  G+    V + L++T E++   E + +LL+ + +S  A LAL+L+  +K+ + S
Sbjct: 5   ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L         ++ S + W++ EL YL  SP   E ++R + ++ E+   + V
Sbjct: 62  AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L QQ    + +   T       ++ V S    ++ +   +  ALVP       +
Sbjct: 117 CM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLALVPF-VDFFNH 162

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
            + C+AML+  +D    ++V DR Y  G+ +V+ Y           G   N+ L +++GF
Sbjct: 163 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISY-----------GQLSNATLALDFGF 211

Query: 353 VDEDNPYDRLV-VEAALNTEDP 373
               NP+D++  +  +L+ +DP
Sbjct: 212 ALPFNPHDQVAGIWLSLSEKDP 233


>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 444

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 93/402 (23%), Positives = 174/402 (43%), Gaps = 52/402 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
           A+ D++A +    +P  +++T+E       +  L T +++ +      LAL+L+ E+   
Sbjct: 28  ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFWLPYIR L ++        ++PL + + ++  L G+    ++L +     R+Y   
Sbjct: 86  PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY--- 135

Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
              +F    L Q +P    +P  ++FTF+ ++ A  +V +    +     ++V+L    A
Sbjct: 136 --AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTL----A 187

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           L+PL       +          DD  + V  + YK  E I ++Y           G + N
Sbjct: 188 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFY-----------GTRSN 236

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
           ++ +I+ GF  ++N +D++ ++  ++  +  Y  K  V  R G     VF ++   E   
Sbjct: 237 AEFVIHNGFFYQENAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPI 295

Query: 404 ISDMLPYLRL---------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 454
            + +L +LR+          Y+     +  +++      PVS   E  +   L       
Sbjct: 296 SAQLLAFLRVFCMMEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALL 355

Query: 455 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
           L  Y  T  ED + L   +L P  R+A QL   EK +L   L
Sbjct: 356 LKAYKTTAEEDSSTLDKTDLSPHSRMAVQLRLAEKAILEKAL 397


>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
 gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
           gallus]
 gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
          Length = 593

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 93/408 (22%), Positives = 186/408 (45%), Gaps = 52/408 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
           PL       +          DD  + V  + +KAGE I ++Y           G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320

Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379

Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
            +L +LR+  +++    + +I         +LG    P+S   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435

Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
             L  Y  T+ +D++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTVEDDKSFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483


>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
           pulchellus]
          Length = 485

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/428 (22%), Positives = 176/428 (41%), Gaps = 46/428 (10%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W   NG     V +K+ P         +   A E ++  +    VP  L++T        
Sbjct: 80  WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132

Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
            +  LL  +     +S +A LA++L+ E   G+ SFW PYI  L         +  + L 
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +S  EL  L GS    E L+    I R+Y+    + F    L +  PY    + FT++++
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLY 240

Query: 260 KQAFVAVQS--CVVHLQKVSLARR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           + A  AV +    V L   +            A+VPL         K         + ++
Sbjct: 241 RWAVSAVMTRQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLR 300

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
               R ++ G+ + ++Y           G + N++  I+ GFV  +N +D + ++  ++ 
Sbjct: 301 CYAMRDFEKGQEVTIFY-----------GRRTNAEFFIHNGFVFPENRHDSVDIKLGISK 349

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 430
           +DP Y  K  +   +      +F +   RE+    D+  +LR+  + D S+  S      
Sbjct: 350 QDPLYAVKAKLCDDHELTPSGIFAL-VPRERPVCEDLSTFLRILVLKDASQAASFTDE-- 406

Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 490
            I   +    R  L+ L    +  L  +P +  E E ++ D   + + ++A QL  +E+K
Sbjct: 407 HIMVATDDNAREALNFLIVRIQLLLRAFPKSDQEYENIIADEGSNGRLKMAAQLRLLERK 466

Query: 491 MLNACLQV 498
           +L A L+ 
Sbjct: 467 ILTAVLET 474


>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 105/442 (23%), Positives = 182/442 (41%), Gaps = 70/442 (15%)

Query: 93  CKVILKEKPSHNEKHRPIHYVAASEDL--------QAGDAAFSVPNSLVVTLERVLGNET 144
            + I     SH  +   +     SE L         AGD   +VP S+ + L  V  N +
Sbjct: 43  VQTIWSWAQSHGIQGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ-NSS 101

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           + +      L     +AL+L++E      S W PY+  L +       +++SPL WS+ E
Sbjct: 102 LGK--ACEGLKPWVAVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEE 151

Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
           LA L G+     +    E ++ EYN L + V      +F    Y       TF+ FK AF
Sbjct: 152 LAELVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAF 204

Query: 264 VAVQSCVVHLQKVSLARRFALVPL------------GPP--LLAYSSKCKAMLAAVDDAV 309
             ++S              ALVP+            G P  +   +S+   +     D +
Sbjct: 205 GILRSRTFSPLT---GEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLL 261

Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDR 361
            +     + AGE +++ Y  +          + N+ L ++YGFV+ D           D 
Sbjct: 262 TVRASANFSAGEQVLMQYGAT----------KSNADLALDYGFVERDRGSQFSPGIERDS 311

Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DT 419
           L +   ++ +D    DK  + + NG      F +  G+      +M+ +LRL  +S  D+
Sbjct: 312 LALSLEISPDDRFVDDKADILEINGFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDS 369

Query: 420 SEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
             ++++    + G +  PVS   E A+   + +  KA L GY  T+ +D  +L   +L  
Sbjct: 370 FLLEALFRNEAWGHVSLPVSRDNEEALCTSMLEGLKAALDGYSTTVEQDMELLARGDLST 429

Query: 477 KKRVATQLVRMEKKMLNACLQV 498
           +  +A  +VR+ +K +   LQ 
Sbjct: 430 RMEIAV-VVRLGEKRVMQELQT 450


>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
 gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase, chloroplastic; AltName:
           Full=Aldolases N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase-like; Short=AtLSMT-L;
           Short=LSMT-like enzyme; Flags: Precursor
 gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
          Length = 482

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 174/404 (43%), Gaps = 49/404 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  E V  ++ I  L     L     +AL+L+ EK + +
Sbjct: 79  LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L +       + +S + WSE ELA L G+   +  L   E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+ +   T + F  AF  ++S      +    +   L+PL   + 
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
                     AY  K  A L + D    L      KAGE + + Y+++          + 
Sbjct: 238 HNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLN----------KS 286

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N++L ++YGFV+ +   +   +   +   DP + DK  +A+ N       F +  G+   
Sbjct: 287 NAELALDYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLP 346

Query: 403 AISDMLPYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAG 457
           A   ML YLRL  +   D   ++S+ ++   G +  PVS   E  +   + D  K+ L+G
Sbjct: 347 A--GMLQYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSG 404

Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
           +  T+ EDE +L    L P+  +A ++   EK++L    Q+  D
Sbjct: 405 FDTTIEEDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 448


>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
 gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
          Length = 306

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 116/259 (44%), Gaps = 26/259 (10%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T
Sbjct: 71  ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +   G   + +++  N + +   +A YL+ E      S W+ YI  L RQ         
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL AYL  SP +   ++R   +   YN+L         +F ++P   P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVY 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E F  +F  + S +V L   S+  R ALVP    +L +S + +  L        +   
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288

Query: 313 VDRPYKAGESIVVWYNISF 331
            DR Y+ G  I +W  I +
Sbjct: 289 TDRSYQPGIYI-LWEEIQW 306


>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
           3-like [Meleagris gallopavo]
          Length = 593

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/408 (22%), Positives = 185/408 (45%), Gaps = 52/408 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    +  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 N-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
           PL       +          DD  + V  + +KAGE I ++Y           G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320

Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISA 379

Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
            +L +LR+  +++    + +I         +LG    P+S   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMNEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARAS 435

Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
             L  Y  T+ +D+  L  ++L     +A +L   EKK+L   ++  A
Sbjct: 436 LLLKTYKTTVEDDKLFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483


>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
          Length = 474

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 95/398 (23%), Positives = 174/398 (43%), Gaps = 45/398 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  E V  ++ I  L     L     +AL+L+ EK + +
Sbjct: 79  LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L +       + +S + WSE ELA L G+   +  L   E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+ +   T + F  AF      +++ + ++    F  +   P + 
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIK 235

Query: 292 ---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
               AY  K  A L + D    L      KAGE + + Y+++          + N++L +
Sbjct: 236 TEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLN----------KSNAELAL 284

Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
           +YGFV+ +   +   +   +   DP + DK  +A+ N       F +  G+   A   ML
Sbjct: 285 DYGFVESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GML 342

Query: 409 PYLRLGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLS 463
            YLRL  +   D   ++S+ ++   G +  PVS   E  +   + D  K+ L+G+  T+ 
Sbjct: 343 QYLRLVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIE 402

Query: 464 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
           EDE +L    L P+  +A ++   EK++L    Q+  D
Sbjct: 403 EDEKLLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 440


>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
          Length = 592

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 95/408 (23%), Positives = 183/408 (44%), Gaps = 52/408 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ D++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATRDIKAEELFLWVPRKLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER- 165

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +         + L + E E+ YL  +    +I  + +   R+Y 
Sbjct: 166 YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
           PL       +          DD  + V  + +KAGE I ++Y           G + N++
Sbjct: 272 PLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFY-----------GTRSNAE 320

Query: 346 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 405
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 321 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 379

Query: 406 DMLPYLRLGYVSDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR-- 454
            +L +LR+  +++    + +I         +LG    PVS   E     +L  + +AR  
Sbjct: 380 QLLAFLRVFCMTEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARAS 435

Query: 455 --LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
             L  Y  T+ +D+ +L   +L     +A +L   EK++L   ++  A
Sbjct: 436 LLLKTYKTTIHDDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483


>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 166/401 (41%), Gaps = 49/401 (12%)

Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
           P+  + A  +L  G+    VP  L +  + V  +       +   L     ++L ++ E 
Sbjct: 39  PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +   E ++ E+
Sbjct: 99  ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           + ++     AG +      D+     TF+ F  AF  ++S V    +     + AL+P  
Sbjct: 152 DNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFA 201

Query: 289 PPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
             +            +  K K  L   D    L      K+GE I V Y++         
Sbjct: 202 DLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLD-------- 252

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
             + N++L ++YGF + ++  D   +   ++  DP Y+DK  +A+ NG      F V  G
Sbjct: 253 --KSNAELALDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG 310

Query: 399 REKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYF 451
             +     M+ YLRL  +  T            V   L    PVS   E ++   + +  
Sbjct: 311 --ESLPPQMITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNAC 366

Query: 452 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           K+ LA Y  T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 367 KSALAAYHTTIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407


>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
           harrisii]
          Length = 602

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/448 (22%), Positives = 196/448 (43%), Gaps = 58/448 (12%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           K+E+   DL  W  +NG       L      N K      + A+ +++A +    VP  L
Sbjct: 81  KREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRKL 134

Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
           ++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +   
Sbjct: 135 LMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE--- 189

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
                ++PL + E E+ +L  +    ++  + +   R+Y      +F    + Q +P   
Sbjct: 190 ----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNAN 238

Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
            +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +          
Sbjct: 239 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 298

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + +  GE I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 299 DDRCECVALQDFNVGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 347

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + +
Sbjct: 348 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 406

Query: 426 IS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDY 472
           I         +LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L  +
Sbjct: 407 IGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATH 462

Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
           +L     +A +L   EK++L   ++  A
Sbjct: 463 DLTFHATMAIKLRLGEKEILEKAVKSAA 490


>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Monodelphis domestica]
          Length = 595

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 103/449 (22%), Positives = 195/449 (43%), Gaps = 58/449 (12%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W   NG       L      N K      + A+ +++A +    VP  
Sbjct: 73  GKREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ +L  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 183 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + +  GE I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 291 EDDRCECVALQDFNVGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 424
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + 
Sbjct: 340 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEH 398

Query: 425 VIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTD 471
           +I         +LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L  
Sbjct: 399 LIGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLAT 454

Query: 472 YNLHPKKRVATQLVRMEKKMLNACLQVTA 500
           ++L     +A +L   EK++L   ++  A
Sbjct: 455 HDLTFHATMAIKLRLGEKEILEKAVKSAA 483


>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 475

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 111/434 (25%), Positives = 178/434 (41%), Gaps = 59/434 (13%)

Query: 83  SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           +W  K G    K      IL +KP           + AS D+Q G++   VP++  V++ 
Sbjct: 46  AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            V    T+ +L ++  L     LAL L+ E+    KS    Y   L    G       +P
Sbjct: 96  NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           LLWSE E   L G+     +       +  + +L         LF   P   P   FT  
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLP 201

Query: 258 IFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
            F  A  AV+S     +   K++LA    LV L     A ++K     + +    Q+ V 
Sbjct: 202 NFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVV 258

Query: 314 --DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
              R  + GE++         G D+  G + +  +L++YG +D  +P     +   L+  
Sbjct: 259 EATRAIRKGEAL---------GMDYAPG-KLDGPVLLDYGVMDTASPKPGYSLTLTLDES 308

Query: 372 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSL 429
           D    DK  + +  G      + +    +++   +M+ +LRL  +   D   ++S+  + 
Sbjct: 309 DKFVDDKADIVEGAGLRPSMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN- 365

Query: 430 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM- 487
                VS   E AV   LA+  +A LAGYP TL +D A L   +     R  A  LVR+ 
Sbjct: 366 ----EVSEGNEEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLG 421

Query: 488 EKKMLNACLQVTAD 501
           EK+ L+A  +   D
Sbjct: 422 EKESLDAVARFFED 435


>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
 gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
          Length = 540

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 104/466 (22%), Positives = 185/466 (39%), Gaps = 63/466 (13%)

Query: 67  SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
            +++  K+E D   + +   W+  +G    K  +K   +  E       + ++ D++ G+
Sbjct: 56  GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111

Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
               +P  L   ++T  +  G      LL  N +  +    LALYL+ E      S   P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           Y++ L +          +   W   +   L GSP     +    G  R+Y      +F  
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY- 217

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSS 296
             LF   P  + T  FT+E F  A   VQS       V   +  AL+P        ++  
Sbjct: 218 -QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGG 273

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
           K    +  V   +     + YK GE + ++Y           GP+PNS+  +  GF  + 
Sbjct: 274 KITTFIDPVKHVLTCSAAKSYKKGEQVYMYY-----------GPRPNSQFYLFQGFSLKT 322

Query: 357 NPYDRLVVEAALNTEDPQ--YQDK-RMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 412
           N  D    +  L+ ED +    DK  ++ +R G    Q   +      E + ++++P+ R
Sbjct: 323 NLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLRVGQTVSLSQNPSSEKLPAEIIPFYR 382

Query: 413 LGYVS--DTSEM-----------QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAG 457
           +  +S  +T ++                 + P     +S   E+     L D  KARL+G
Sbjct: 383 IAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAFNIISEENEKKAFKLLLDSLKARLSG 442

Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
           YP TL++DE  + + N   ++R    ++  EKK+L   ++    +I
Sbjct: 443 YPTTLAQDEQEMKN-NPTTQRRYVLYILINEKKILERNIKYVEQLI 487


>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
           variabilis]
          Length = 398

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 106/400 (26%), Positives = 173/400 (43%), Gaps = 48/400 (12%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           S+ +  G+  F+VP +  +T +    ++  + L     L     +AL+L++E+  G  S 
Sbjct: 4   SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PYI  L    G       SP+ W E +LA L GS     +       ++ +++L    
Sbjct: 61  WAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEV 113

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSLARRFALVPLGP 289
           F   S      +D P   F F+ F  A   V++         ++  V LA      P  P
Sbjct: 114 FGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWP 167

Query: 290 PLLA-YSSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWYNISFTGHDFKCGPQ-PNSK 345
           P  A +  K    L        LV++      AG++I +         DF  GPQ  + +
Sbjct: 168 PDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAM---------DF--GPQKSDGQ 216

Query: 346 LLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKE 402
           LL+++G +D   + P   L +E  L+ ED  Y DK  + + N +L+    H+  A R  +
Sbjct: 217 LLVDHGVIDPLVNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPD 273

Query: 403 AISDMLPYLRLGYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYP 459
           A    +  L     +D   ++S+  +        PVS   ER    QL D   A LA YP
Sbjct: 274 AGLLPVLRLLNLSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYP 333

Query: 460 ATLSEDEAMLTDYNLHPKKRVATQL-VRM-EKKMLNACLQ 497
            ++ ED A++   +L P  R  + + VR+ EK+ L+A L+
Sbjct: 334 TSIDEDLALMASGSLQPGSRRQSAVRVRLGEKEALDATLR 373


>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
 gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
          Length = 500

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 97/400 (24%), Positives = 176/400 (44%), Gaps = 50/400 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
           + A +D++A +   ++P  L++T E      ++  L+  +++ ++     LAL+++ EK 
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTE-TARESSLGPLIKKDRILQVMANVSLALHVLCEK- 178

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFW PYI               +PL + E E+ +L GS   +++L + + I R+Y 
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQY- 230

Query: 230 ELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV--SLARRF-- 282
                +F    LFQ  P    +P  E FTF+ ++ A   V + +    +V  S  R    
Sbjct: 231 ----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWA---VSTVMTRQNQVPTSDGRHLIT 281

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           AL+P+       + +         D+ + +  R +     + ++Y           G + 
Sbjct: 282 ALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFY-----------GMRS 330

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N++ LI+ GFV  +N +DR+ V+  ++  D  +  K  V  R G  +   F VH G++  
Sbjct: 331 NAEFLIHNGFVYPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP- 389

Query: 403 AISDMLPYLRL-----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKA 453
              ++L +LR+     G + D  TSE QS  +S LG   C V+   E      L      
Sbjct: 390 IPPELLVFLRVFTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSL 449

Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            +  Y  ++ + E  L   ++    R A QL   E ++L+
Sbjct: 450 LIRSYRTSIKDVETELQAPDMTYHSRAALQLKLAEMQILS 489


>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 463

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 120/266 (45%), Gaps = 30/266 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP  + +T + +L        L   ++  +A LA  ++ EKK G+ S
Sbjct: 65  ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L +Q   G+L   + + W+E+EL  +  S    E +++   I++++  +  +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           +  +   F    Y     A T  +F    V +    V  +        AL+P     L +
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLALIPFA-DFLNH 232

Query: 295 SSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
               +A++ + DD       ++Q++ DR Y  GE +++ Y           G   N+ L+
Sbjct: 233 DGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRY-----------GKFSNATLM 281

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDP 373
           +++GF    N YD++ ++  +   DP
Sbjct: 282 LDFGFTIPYNIYDQVQIQFDIPKHDP 307


>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
 gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
          Length = 371

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 70/262 (26%), Positives = 114/262 (43%), Gaps = 50/262 (19%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  + AG     +P   ++T E V     ++ LL+T+       L+L+L+ EK + ++S
Sbjct: 10  ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65

Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
            W PY+R L       QL  +ES + W + ELA+L  SPT  E +E  + IK E++ L+ 
Sbjct: 66  QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
             F        +  D+  E  +   F  A+   Q                 +P       
Sbjct: 120 NVF-------PWCRDVLGEV-SLTDFMHAYSTDQ-----------------IPFA-DFFN 153

Query: 294 YSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
           +   C+  L+     D    V D+ YKAG+ I + Y           G  PNS L ++YG
Sbjct: 154 HDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSY-----------GSTPNSILAVDYG 202

Query: 352 FVDEDNPYDRLVVEAALNTEDP 373
           F    NP++++ V   ++  DP
Sbjct: 203 FAVASNPHEQVEVPMGVSLTDP 224


>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
          Length = 571

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 102/405 (25%), Positives = 170/405 (41%), Gaps = 45/405 (11%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           S++++ GD   S+P    VT      +  +A L+   +  EL  LAL+L  EK + K S 
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTV 234
           W PY++ L+           S L ++E E    L G+    E  +R +  K EY  L   
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL------ 287
                    + P   P + + F + + AF+ A+       Q ++ A  +A+VPL      
Sbjct: 233 -------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPI 284

Query: 288 --GPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWYNISFTGH----D 335
              PP ++      A    + D      AV+           S+V+  N   +       
Sbjct: 285 CGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQ 344

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 394
                + NS+L +++G VD+ +P D       L+  DP Y  K+ V +  G     Q F 
Sbjct: 345 IDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQGFADKGQTFP 404

Query: 395 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 454
           V+  R      + L YLR   V+++ E+ +V  +   +  VSP  E   L  L    + R
Sbjct: 405 VYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQLLMADCRDR 459

Query: 455 LAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 498
           ++ Y  T  EDE +L    ++  K R A++L R EK+++   +  
Sbjct: 460 MSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503


>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
 gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 629

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/296 (28%), Positives = 124/296 (41%), Gaps = 48/296 (16%)

Query: 66  GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
             +E + K E    +L SW+  N      + LK   +HN        + +   +QA +  
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
            S+P   V+TL+ V  +    ++LT  K ++L     A  AL+L+ E+K+   S +  YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKILT-EKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L             P L+SE EL YL G+     + E+ E IK +Y  +  V      
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV------ 297

Query: 241 LFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                   IP     F+FE F+ AF+   S V  + KV   +   +VPL   L    S  
Sbjct: 298 --------IPEFKSEFSFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQ 348

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
           +      DDA         KA + I     I F+      G + NSKL +NYGFVD
Sbjct: 349 EDSEWVFDDATNCFT---VKALKKIQRNQQIHFS-----YGSKCNSKLFLNYGFVD 396


>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
          Length = 353

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 139/309 (44%), Gaps = 47/309 (15%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           + D  +L  WM +NG    K  L +  +  E  R +    A++  Q GD   S+P  L++
Sbjct: 28  DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81

Query: 135 TLERVLGNETIAELL---TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           T + VL +  + + +   T  KLS +  +  YL+ E+ + K SFW  YI+ L +      
Sbjct: 82  TTQNVLSS-YLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------ 134

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
            +  +P+ ++  E+ +L     K ++ +  E I   Y EL  ++ +  S F  +      
Sbjct: 135 -SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFK----- 187

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLA 303
             F +  F+ A+  V +  V++ +     +S+ R  +AL P    LL +++  + KA   
Sbjct: 188 GIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYN 246

Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
            V    ++      K  + + ++Y           GP  N KL I YGFV   N ++  V
Sbjct: 247 PVSKCYEIFTCTACKKYDQMFIYY-----------GPHDNVKLFIEYGFVLPQNQHN--V 293

Query: 364 VEAALNTED 372
           VE  L+ ED
Sbjct: 294 VE--LDFED 300


>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
          Length = 581

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 102/433 (23%), Positives = 180/433 (41%), Gaps = 83/433 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
           V A + + A +    +P S ++TLE  +  ET +A+ +   +L  L+     L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           K +   SFW PY+  L       Q     P+ ++  +L +L GSP   +I ++   +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           YN++  V       F QY         +F  F  A +   S +  +  +   +  A VPL
Sbjct: 276 YNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAFVPL 321

Query: 288 GPPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
              L           YS + +  +   D+ +    DR    G+ I       F  +  KC
Sbjct: 322 ADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMI-------FDSYGRKC 366

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQR 384
               NS+ L+NYGFV +DN  + + V  A    DP  Q K              R++   
Sbjct: 367 ----NSRFLLNYGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDT 422

Query: 385 NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSP 437
           +G   +  F       +  + + + Y+R   + D +++Q +++      + P  I P+  
Sbjct: 423 DGINEITHFL------EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGI 476

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
             E  + D +       L+ YP TL +D+ +L   +L   +R    L   EK++L    Q
Sbjct: 477 HNELDMWDLIRRICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQ 536

Query: 498 VTADMIMLLPDVT 510
            +  M  LL +  
Sbjct: 537 FSEKMKQLLSNFN 549


>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 66/270 (24%), Positives = 119/270 (44%), Gaps = 33/270 (12%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
           ++ G+    V   L++T  R+    +  E   +  ++E + LAL+ +  K  GK S W P
Sbjct: 70  IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YIR L   RG     +++ + W + EL  L  S    +   R   I  +++ +  V    
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFDLVQAV---- 177

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                +YP ++  E  T E FK A+    S    ++ +       +VP    +  + S  
Sbjct: 178 ---VNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITMVPF-VDMFNHDSSA 229

Query: 299 KAMLAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
           +A+LA  ++    ++V D+ Y  G  +V+ Y           G  PNS L +++GF   D
Sbjct: 230 RALLAYYEEEGYAEVVADKDYNQGSQVVITY-----------GTLPNSSLALDFGFTLPD 278

Query: 357 NPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
           NP+D + +     + DP   +K  + + +G
Sbjct: 279 NPHDEVQIWMEAPSGDPLRAEKLKLLRDHG 308


>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
 gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
 gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
 gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 463

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 129/284 (45%), Gaps = 36/284 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ + AGD    VP +  +T + +  +     +L +N++  +  LA  L+ EKK G+KS
Sbjct: 75  ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W+PYI  L +        + S + W E EL+ +  S    E +++   I+++++     
Sbjct: 132 RWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
            F+A +  Q  P  I TE    E F  A+  V S     +    ++R +L+P    +   
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIPFADFMNHD 233

Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
                 +L   D+ + ++  DR Y  G+ + + Y           G   N+ L++++GF 
Sbjct: 234 GLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKY-----------GEFSNATLMLDFGFT 282

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 394
              N +D + ++  +  +DP    K  + Q +   +V+   +FH
Sbjct: 283 FPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326


>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
          Length = 394

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/351 (22%), Positives = 154/351 (43%), Gaps = 45/351 (12%)

Query: 72  SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
            K+E+   +L  W   NG  +   +V+  E+            + A+ +++A +    VP
Sbjct: 73  GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
             L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                   ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV-------IQTHP 228

Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
           +   +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +       
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
              DD  + V  R + AGE I ++Y           G + N++ +I+ GF  ++N +DR+
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRV 337

Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
            ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+
Sbjct: 338 KIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387


>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 453

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 94/392 (23%), Positives = 162/392 (41%), Gaps = 49/392 (12%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           +L  G+    VP  L +  + V  +       +   L     ++L ++ E  +G  S W 
Sbjct: 52  NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L RQ        +S + WSE EL  + G+   +  +   E ++ E++ ++     
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE----- 159

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL------ 291
           AG +      D+     TF+ F  AF  ++S V    +     + AL+P    +      
Sbjct: 160 AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFADLINHDGDI 214

Query: 292 ----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
                 +  K K  L   D    L      K+GE I V Y++           + N++L 
Sbjct: 215 TSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLD----------KSNAELA 263

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
           ++YGF + ++  D   +   ++  DP Y+DK  +A+ NG      F V  G  +     M
Sbjct: 264 LDYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQM 321

Query: 408 LPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 460
           + YLRL  +  T            V   L    PVS   E ++   + +  K+ LA Y  
Sbjct: 322 ITYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHT 379

Query: 461 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           T+ EDE +L   +L  + ++A ++   EKK+L
Sbjct: 380 TIEEDEELLEREDLQSRHQIAVEVRVGEKKVL 411


>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 789

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 104/428 (24%), Positives = 182/428 (42%), Gaps = 80/428 (18%)

Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
           +R +H   A + ++ G+    +P   ++TLE  L  E  I +L+ +  +  L+     L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEIL 218
           +Y++ EKK   KSFW P++           L VE    P+L+++ EL +L GSP   ++ 
Sbjct: 428 IYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVK 477

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKV 276
           ER E I ++Y              Q     IP  A   T + F  A +   S +  L  +
Sbjct: 478 ERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FI 522

Query: 277 SLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
           +  R  A VPL         AY++      K   ML A +D          + G+ I   
Sbjct: 523 NKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIY-- 571

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 385
                    + CG + NS+ L+NYGFV ++N  + + +    + +D     K +M+ +R 
Sbjct: 572 ---------YSCGRKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR- 621

Query: 386 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ-----SVISSLGPIC--PVSPC 438
            K    +F +H   E++++ +   +LR   + D   ++     S      P+   P S  
Sbjct: 622 -KPESLIFRIHINYEEKSVLEFFGFLRFVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIE 680

Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 498
            E+ +  ++       +  YP TL ED+ +L    L   ++    L   EK++L   + +
Sbjct: 681 NEKQMWTEIHKICAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEILMYYITM 740

Query: 499 TADMIMLL 506
              M  LL
Sbjct: 741 ADRMKKLL 748


>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
          Length = 466

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 83/355 (23%), Positives = 159/355 (44%), Gaps = 40/355 (11%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 28  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
            + +   R+Y      +F    + Q +P+   +P  E+FT+E ++ A  +V +    +  
Sbjct: 80  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132

Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
              +R   AL+PL       +          DD  + V  + ++AG+ I ++Y       
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFY------- 185

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
               G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF 
Sbjct: 186 ----GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 241

Query: 395 VHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI-CPVSPCMERAVLD 445
           +H+  E    + +L +LR+  +++    + +        I +LG    PVS   E  +  
Sbjct: 242 LHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWT 300

Query: 446 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
            L D     L  Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 301 FLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/260 (23%), Positives = 120/260 (46%), Gaps = 33/260 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ + AGD    VP ++ +T + +  +  ++    T+++  +  LA  L+ EKK+G+KS
Sbjct: 75  ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W+PYI  L +        + S + W E E + +  S    E +++   I++E++     
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFS----- 181

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
            F+A +  Q YP  I  E    E F  A+  V S     +    ++  +L+P    +   
Sbjct: 182 -FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIPFADFMNHD 233

Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
                 +L+  D+ + ++  DR Y  G+ + + Y           G   N+ L++++GF 
Sbjct: 234 GLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKY-----------GEFSNATLMLDFGFT 282

Query: 354 DEDNPYDRLVVEAALNTEDP 373
              N +D + ++  +  +DP
Sbjct: 283 VPYNIHDEVQIQMDVPNDDP 302


>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
 gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
          Length = 374

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 116/267 (43%), Gaps = 57/267 (21%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  + AG     +P   ++T E V     ++ LL+T+  +    L+L+L+ EK + ++S
Sbjct: 10  ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTS--NPKTRLSLFLLSEKHKAQES 65

Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
            W PY+R L       QL  +ES + W   ELA+L  SPT  E +E  + IK E++ L  
Sbjct: 66  QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLT- 118

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL-----ARRFALVPLG 288
              +A    Q +P+                     C   L +VSL     A     +P  
Sbjct: 119 ---LANK--QVFPW---------------------CRDALGEVSLTDFMHAYSTDQIPFA 152

Query: 289 PPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
                +   C+  L+     D    V D+ YKAG+ I + Y           G  PNS L
Sbjct: 153 -DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSY-----------GSTPNSIL 200

Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDP 373
            ++YGF    NP++++ V   ++  DP
Sbjct: 201 AVDYGFAVASNPHEQVEVPMGVSLTDP 227


>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 612

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 90/317 (28%), Positives = 134/317 (42%), Gaps = 53/317 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA--ELLTTNKLSELACLALYLMYEKKQ 170
            AAS DL AG  A ++P+S ++T    L + T       T   L E   + L+L+YEK  
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254

Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
            G +S W P +  L           +   G L + +P  W +E   A L G+P   + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
             E   R++  L    F A  L + +P   PTE +T   F+ A  A  +  + +Q  ++ 
Sbjct: 314 ARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVG 367

Query: 280 RRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAGESIVVW 326
                    PP         A+L        AV      DDA+ L + R  +AGE I V 
Sbjct: 368 GASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVS 427

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR-- 384
           Y           G + N++LL+ YGF   DNPYD    +  L+ E PQ + + + A R  
Sbjct: 428 Y-----------GAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRER 472

Query: 385 ---NGKLSVQVFHVHAG 398
                KLS+    V  G
Sbjct: 473 VLHRAKLSLSPHSVRCG 489


>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Oreochromis niloticus]
          Length = 607

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 101/454 (22%), Positives = 186/454 (40%), Gaps = 73/454 (16%)

Query: 74  KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           +E+   +L SW  +NG   C+       +     +  + + A+ D++A +    +P  ++
Sbjct: 75  REDHFPELMSWAKENG-ASCECF-----TVANFGKEGYGLRATRDIKAEELFLWIPRKML 128

Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
           +T+E    N  +  L + +++ +      LAL+L+ E+     SFWLPYIR L ++    
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----------NELDTVWF---- 236
               + PL + + ++  L G+    ++L + +   R+Y            L +V      
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFA 239

Query: 237 -----MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                M G LF Q+       A +  + +Q  +  +       +V+LA    L+PL    
Sbjct: 240 SLTPVMGGKLFDQW-------AVSSVMTRQNQIPTEDG----SRVTLA----LIPLWDMC 284

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
              +          DD  + V  + YK  E I ++Y           G + N++ +I+ G
Sbjct: 285 NHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFY-----------GTRSNAEFVIHNG 333

Query: 352 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 411
           F  +D+ +DR+ ++  ++  +  Y  K  V  R G  +  VF +H   E    + +L +L
Sbjct: 334 FFFQDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPPISAQLLAFL 392

Query: 412 RLG---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
           R+          Y+     +  + +      PVS   E  +   L       L  Y  T 
Sbjct: 393 RVFCMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLKTYKTTS 452

Query: 463 SEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
            ED +ML   +L    R+A QL   EK++L   L
Sbjct: 453 EEDRSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486


>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
 gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
          Length = 489

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 165/398 (41%), Gaps = 49/398 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELACLALYLMYEKKQG 171
           + A+ DL  G+    VP  L +  + V  ++   A       L     +AL L+ E  +G
Sbjct: 83  LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGGLRPWVAVALLLLSEVARG 142

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--N 229
             S W PY+  L RQ              S    A L  S  + ++L    G+K EY  +
Sbjct: 143 ADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGVK-EYVQS 189

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVP 286
           E D+V     S  +    D+   + TF+ F  AF  ++S V   +   K++L     LV 
Sbjct: 190 EFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRGDKLALVPFADLVN 245

Query: 287 LGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
             P + +  S    K K +    +    L      K+G+ I + Y++           + 
Sbjct: 246 HSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDLD----------KS 294

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N++L ++YGFV+ +   D   V   ++  DP Y DK  +A+ N       F +    ++ 
Sbjct: 295 NAELALDYGFVESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL--DEP 352

Query: 403 AISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
               MLPYLRL  +  T           SV   L    P+SP  E ++   + D  K+ L
Sbjct: 353 LPPQMLPYLRLLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACKSAL 410

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
           A Y  T+ EDE +    NL P+  +A  +   EKK+L 
Sbjct: 411 AAYHTTIEEDEELSERENLQPRLTIAIGVRAGEKKVLQ 448


>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 91/404 (22%), Positives = 169/404 (41%), Gaps = 49/404 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           +A +++L   +    VP    +  + V  +E        + L     +AL+L+ E  +G 
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y+  L ++        +S + WSE ELA + G+   +  L   E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+     T + F  AF  ++S      +    +   L+P    + 
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
                     A+  K  A L + D    L      KAG+ + + Y++           + 
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYDLK----------KS 308

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N+ L ++YGF+++ +  +   +   +   D  + DK  +A+ NG      F +    E+ 
Sbjct: 309 NADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERP 366

Query: 403 AISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAG 457
               MLP+LRL  +  T    ++S+   S  G +  PVS   E  +   + +  +A L+G
Sbjct: 367 FPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSG 426

Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
           Y  T+ EDE  L + NL  + R+A  +   EK++L   +Q+  D
Sbjct: 427 YHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469


>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 91/404 (22%), Positives = 169/404 (41%), Gaps = 49/404 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           +A +++L   +    VP    +  + V  +E        + L     +AL+L+ E  +G 
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y+  L ++        +S + WSE ELA + G+   +  L   E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+     T + F  AF  ++S      +    +   L+P    + 
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
                     A+  K  A L + D    L      KAG+ + + Y++           + 
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYDLK----------KS 308

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N+ L ++YGF+++ +  +   +   +   D  + DK  +A+ NG      F +    E+ 
Sbjct: 309 NADLALDYGFIEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERP 366

Query: 403 AISDMLPYLRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAG 457
               MLP+LRL  +  T    ++S+   S  G +  PVS   E  +   + +  +A L+G
Sbjct: 367 FPPAMLPFLRLLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSG 426

Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
           Y  T+ EDE  L + NL  + R+A  +   EK++L   +Q+  D
Sbjct: 427 YHTTIEEDEK-LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469


>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
 gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
          Length = 403

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 135/313 (43%), Gaps = 44/313 (14%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEK----HRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           +  SW+ + G     + +    S + +    HRP         + AG+       +LV+T
Sbjct: 41  EFMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRP---------MCAGECMIKFSQNLVLT 91

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            E+ L  E IA L   N+ + ++   L +M EK++G+ S W PYI  L      G+  + 
Sbjct: 92  PEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYIECLP---SFGE--IH 142

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
           S + W   ELA L  SP      ER   ++ EY E+  V     S    Y  D+     +
Sbjct: 143 STIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----S 194

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-D 314
            E FK  +  V S        S      ++PL      +  + + + +  DD   +VV  
Sbjct: 195 LEQFKHEYATVSSRAWGQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVAS 250

Query: 315 RPYKAG-ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
           R Y+ G E+  +  +I +       G   N+ L ++YGFV  DNP+D   +   + +EDP
Sbjct: 251 RDYQTGDENFHLKVHICY-------GDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDP 303

Query: 374 QYQDK-RMVAQRN 385
             + K + +AQ N
Sbjct: 304 LREIKLQYMAQNN 316


>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
 gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
          Length = 336

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/282 (28%), Positives = 123/282 (43%), Gaps = 40/282 (14%)

Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
           HRP         + AG+        LV+T E+ L  E IA L   N+ + ++   L +M 
Sbjct: 72  HRP---------MCAGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
           EK++G+ S W PYI  L      G+  + S + W   ELA L  SP      ER   ++ 
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
           EY E+  V     S    Y  D+     + E FK  +  V S        S      ++P
Sbjct: 174 EYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTMIP 222

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAG-ESIVVWYNISFTGHDFKCGPQPNS 344
           L      +  + + + +  DD   +VV  R Y+ G E+  +  +I +       G   N+
Sbjct: 223 L-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICY-------GDHSNA 274

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 385
            L ++YGFV  DNP+D   +   + +EDP  + K + +AQ N
Sbjct: 275 VLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316


>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
          Length = 468

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 156/355 (43%), Gaps = 40/355 (11%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 28  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 80  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132

Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
              +R   AL+PL       +          DD  + V  + ++AG+ I ++Y       
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFY------- 185

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
               G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF 
Sbjct: 186 ----GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 241

Query: 395 VHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLD 445
           +H   E    + +L +LR+  +++          S +  + +      PVS   E  +  
Sbjct: 242 LHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWT 300

Query: 446 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
            L D     L  Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 301 FLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
          Length = 450

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 156/355 (43%), Gaps = 40/355 (11%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 10  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 62  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114

Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
              +R   AL+PL       +          DD  + V  + ++AG+ I ++Y       
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFY------- 167

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
               G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF 
Sbjct: 168 ----GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 223

Query: 395 VHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLD 445
           +H   E    + +L +LR+  +++          S +  + +      PVS   E  +  
Sbjct: 224 LHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWT 282

Query: 446 QLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
            L D     L  Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 283 FLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337


>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
          Length = 106

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)

Query: 42  SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
           S +RL  R   F + V S+DTL A S      ++ +  KEE++ DLKSWMH++GLPPC+V
Sbjct: 32  SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90

Query: 96  ILKEKPSHNEKHRPI 110
           +LKE+PS + KH+PI
Sbjct: 91  MLKERPSPDGKHKPI 105


>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
           norvegicus]
          Length = 416

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 80/349 (22%), Positives = 154/349 (44%), Gaps = 41/349 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 12  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 66  LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 169

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 170 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 229

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ +
Sbjct: 230 EDDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKI 278

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
           +  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+
Sbjct: 279 KLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326


>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
           74030]
          Length = 483

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 110/451 (24%), Positives = 181/451 (40%), Gaps = 68/451 (15%)

Query: 84  WMHKNGLPPC-KVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL-G 141
           W+ K G+    K+ LK+  S          V A+ D +  +  F +P + V+ +  V  G
Sbjct: 16  WLSKIGVRINPKMTLKDLKSEGRGR----GVVAAADFEEDEVVFCIPRTAVLNVNNVFAG 71

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            ++ A      ++     L   +M E +Q   S W PY+  L ++       ++S + WS
Sbjct: 72  QDSGASKEALLQMPNWLALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWS 123

Query: 202 ETELAYLTGSPTKAEILERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEA 253
           E ELA L  S    +I   +  E   +  + L    F         S+   Y +DIP E 
Sbjct: 124 EEELAELQASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE- 182

Query: 254 FTFEIFKQAFVAVQSCVVHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
              E  KQ     +     L       +K  L+    ++PL   L A + +  A +   +
Sbjct: 183 ---EPAKQENGGAEGETDDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYEN 235

Query: 307 DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED-NPYDRLVVE 365
           + +++   +P  AGE I   Y           G  P S LL  YG+V E+   YD + + 
Sbjct: 236 EDLEMRTIKPIMAGEEIFNDY-----------GQLPRSDLLRRYGYVTENYAQYDVVEIS 284

Query: 366 AA----LNTEDPQ---------------YQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
           +A    L TE PQ                +++  +A R G L        A  E+ AI D
Sbjct: 285 SASIKSLMTEKPQEIQSGQFLDPLTSAEAEERVALADREGILEDSYDVNIANAEERAIPD 344

Query: 407 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
            L  L   ++ D   ++++++S   +   S      V   L    + R A Y  TL EDE
Sbjct: 345 ELLALLYLFLLDNENLEAIVTSQSALPSRSKLATELVGKVLVKVLRHREAEYATTLEEDE 404

Query: 467 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
            +L   NL  +  +A Q+   EK++L   ++
Sbjct: 405 KLLQAANLPRRTAMAIQVRHGEKRVLRLAVE 435


>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
 gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
          Length = 421

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/299 (24%), Positives = 120/299 (40%), Gaps = 57/299 (19%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L SW+   G      +LK  P           + A  D++AG+    V    ++T +R+
Sbjct: 39  ELVSWLKIRGEHDACSLLKTGPDK-------RGLFAVRDIKAGECILRVSRDTMMTADRL 91

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
                  +LL++  +SE A LAL L++EK+ G+ S W PYI  L R        + S   
Sbjct: 92  --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W + EL  +  S    E + R   I+ E+NE+ +V F          +D  + A      
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDT 198

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
                 V+   V+   + +A   A                          QL  D+ Y A
Sbjct: 199 DNGSTEVEEVKVYSDCLYIALFCA--------------------------QLFADKNYAA 232

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
           GE +     ISF       GP  N+ L +++GF    NP+D++ +   ++  D   ++K
Sbjct: 233 GEQV----TISF-------GPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280


>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 492

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 98/441 (22%), Positives = 190/441 (43%), Gaps = 56/441 (12%)

Query: 74  KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           +E+   +L SW+ +NG     V +K     NE     + + A++DL+  +   ++P +++
Sbjct: 81  REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134

Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
           +TL+ V     +  L   N     +  +A LAL+L+ E  + + SFW  YI  L      
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD- 248
                 + L +   +   +  SPT     +  + I R+Y      +F   +LFQ    + 
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQY-----AYF--NNLFQNSNDEA 238

Query: 249 --IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAM 301
             I    FT+++++ A   V +    +   S +         L+PL   +  +++   + 
Sbjct: 239 SLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVENGINGLIPLWD-MCNHTNGYLST 297

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR 361
              VD +  L   +P+K GE ++++Y           G + NS  L++ GFV ++NP+D 
Sbjct: 298 QYKVDRSECLAC-KPFKKGEQVLIFY-----------GERSNSDFLVHNGFVYDENPHDS 345

Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY--VSDT 419
             +   ++  D  +  +  + +  G      F++++G E     ++L +LR+    V + 
Sbjct: 346 FRLRLGISKSDKLHGLRCELLKDLGIPDSGDFYLYSGSEP-VRENLLAFLRIFNMDVENL 404

Query: 420 SEMQSVISSLGPI----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
           +  +S  S L  +    C +   +E  V + L D     L  Y     E E  + D N  
Sbjct: 405 NHWKSHSSRLSDLMWKDCALDTKIESKVWNFLYDRINLLLKTYKG--DEVEVRVEDSNST 462

Query: 476 PKKRVATQLVRMEKKMLNACL 496
             +R+    ++ EKK L++ L
Sbjct: 463 ECRRLVRAQLKCEKKFLSSIL 483


>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           terrestris]
          Length = 484

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 102/449 (22%), Positives = 183/449 (40%), Gaps = 61/449 (13%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           K+ + +G   +W+ +NG       + E P ++        + A  +    +    +P  L
Sbjct: 77  KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130

Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           + ++         L N+ + +L+          LA+ L+ EK + + S W PY+  L   
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                    + L  +  ++  L GSPT    L++   I R+Y      +F    LFQ+  
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229

Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
             +     + FT+E +  A   V  +  ++  +  SL    AL+P+        SK    
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHEDSKITTD 288

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR 361
             A  +  +    R +K  E I + Y           GP+ NS   ++ GFV  DN  D 
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISY-----------GPRTNSDFFVHSGFVYMDNEQDG 337

Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTS 420
             +   ++  DP ++++  +  +    +V  F +  G   E ISD +L +LR+ +     
Sbjct: 338 FKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKE 394

Query: 421 EMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNL 474
           E+   I S  +  +  +   +E  V + +  +   RL    A YP TL ED  +L +  L
Sbjct: 395 ELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTL 453

Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMI 503
              K++A QL   EK++L   L+     I
Sbjct: 454 PRIKKLAIQLRVTEKRILQGALEYVQQWI 482


>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 524

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 83/320 (25%), Positives = 126/320 (39%), Gaps = 33/320 (10%)

Query: 52  RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           RF  R   S+         VS     +  L++W+ K     C + L+  P          
Sbjct: 49  RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
             A    +  G     VP   ++T E     + +  L+  + L+    + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNE 230
           + SFW PYI  L ++    +L    P+LWS+     +L GSP   ++ ER   I RE  E
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223

Query: 231 LDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------AR 280
              +   AG L     +  P  I   A     +    +  +S  ++LQ   L        
Sbjct: 224 A-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLRPGSFAED 279

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
             ALVP    L   SS  +      D       L   R Y  GE +       F  +   
Sbjct: 280 TIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQV-------FDSYGPS 332

Query: 338 CGPQPNSKLLINYGFVDEDN 357
           C P   S+LL++YGFVDE+N
Sbjct: 333 CSP---SRLLLDYGFVDEEN 349


>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 91/418 (21%), Positives = 173/418 (41%), Gaps = 62/418 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
           V A + + A +    +P S ++TLE +    T+A+ +   +L  L+     L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +   SFW PYI  L             P+ ++ ++L +L GSP   +I ++   ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           N++  V       F QY          F  F  A +   S +  +  ++  +  A VPL 
Sbjct: 277 NDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLA 322

Query: 289 -------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCG 339
                  P L +  YS + +  +   D+ ++         G+ I       F  +  KC 
Sbjct: 323 DMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMI-------FDSYGRKC- 366

Query: 340 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 399
              NS+  +NYGFV E N  + + +    +  DP  Q K    + + +   + F +    
Sbjct: 367 ---NSRFFLNYGFVVEGNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDT 422

Query: 400 EKEAISDMLPYLRLGYVSDTSEMQSVISSLG-------PICPVSPCMERAVLDQLADYFK 452
           ++ A+ D + ++R   + D ++++ +++             P+    E  +   +    K
Sbjct: 423 DETAVIDFMSHIRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICK 482

Query: 453 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 510
             L  YP T  +D+ +L    L   +R    L   EK++L    Q +  M  LL +  
Sbjct: 483 KTLKQYPTTFEQDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540


>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 431

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 106/265 (40%), Gaps = 48/265 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
           V A+ D+  G+   SVP SLVV             + R+L  E        N L      
Sbjct: 59  VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
              L+   K    S W  YI  L            + L +S+ EL+YL GS       +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162

Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
              I+ +Y+ + T       LF  YP     E F+ + +K A   +  +S VV   K  L
Sbjct: 163 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGL 216

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
                +  + P     + + K  + AVD  +      P K GE I V Y         + 
Sbjct: 217 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYG--------QS 264

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLV 363
               N++LL++YGFV E+NP+D +V
Sbjct: 265 RQMSNAQLLMDYGFVLENNPHDAVV 289


>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 567

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 117/455 (25%), Positives = 174/455 (38%), Gaps = 98/455 (21%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
           A  D+QAG+  F VP  L  T +   R      + EL    +   LA L LY        
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179

Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKA 215
            +  +G  SFW PY+  L       +G    ES    PL    LW+E E+ +L GSPT  
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQ 274
                   ++REY E          L++++P+    E AF  E F  AF  + S  V L 
Sbjct: 240 SARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLP 293

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VD 306
             +     ALVP    L  +S+ C + + A                             D
Sbjct: 294 AENGM--LALVPYA-DLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSD 350

Query: 307 DA------------------VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
           DA                  V    DR Y   E + V Y           G + N++LL+
Sbjct: 351 DAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSY-----------GQKSNAELLL 399

Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQ----YQDKR--MVAQRNGKLSVQVFHVHAGREKE 402
            YGFV + NPY+ + V  +L+  +         KR  ++A        + F ++A R   
Sbjct: 400 LYGFVSDRNPYNSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPL 459

Query: 403 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
            +  +L +  L      +   S +  +    PV+   E A    L    K  L  YP + 
Sbjct: 460 ELMQLLRFASL--TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSA 517

Query: 463 SEDEAMLTDYN----LHPKKRVATQLVRMEKKMLN 493
            ED+A L D +    L  K+R++ +L R EK++L 
Sbjct: 518 DEDDAALKDKSMAQLLSRKQRLSVRLRRSEKRILE 552


>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
 gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
          Length = 509

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 110/451 (24%), Positives = 188/451 (41%), Gaps = 46/451 (10%)

Query: 64  VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
           VA    V S+ + D   L +W+   G+   KV   ++   P      R    VAA ED+ 
Sbjct: 44  VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
            GDA  ++P SL +T++  L +   A        +    +AL L++E+  G+KS W  Y+
Sbjct: 99  GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L  Q       +++PL WS  E+A LTG+    ++L+ A G   +     T   +  S
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY--DSYARGTWARLKES 202

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKCK 299
            F   P   P++AF    F  AF  ++S C      V      ALVP     +A  S   
Sbjct: 203 AFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVDQGADIALVP--GLDMANHSGLS 257

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP--------QPNSKLLINYG 351
           +    +++     V    K+G S+++       G   K           + +++L ++YG
Sbjct: 258 SQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYG 317

Query: 352 FVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
           F D        V+   A+   DP   DK  V +  G      F + A  + E   ++  +
Sbjct: 318 FTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFVLRAFEDPE--PELRVF 375

Query: 411 LRLGYV--SDTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSED 465
           +RL  +   D   ++++    + G I  PVS   E+     + +  +  L GY   + +D
Sbjct: 376 MRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMINGCEEALRGYATRVEDD 435

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
             +  D  +  + R+A ++   EK+ L   L
Sbjct: 436 RRVAEDPGVGHRLRLAARVRMGEKQALADAL 466


>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
 gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
          Length = 425

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 73/288 (25%), Positives = 118/288 (40%), Gaps = 46/288 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
           + A+E+++ G++   +P S ++T+ER +    +        L E + LA +L  +     
Sbjct: 24  LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
                S +  Y+R L R+ G         L W E ++   L GSP++   +ER   +   
Sbjct: 82  AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
            +E+             +P   P         + AF  + S ++ L     A   ALVP 
Sbjct: 135 IDEIRA----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPW 176

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
              +L +   C A +     AV L  DR YK GE +   Y           GP+P+S+LL
Sbjct: 177 A-DMLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASY-----------GPRPSSELL 224

Query: 348 INYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           I+YGF     +NP D   V   ++  D     K    +R G   V+ F
Sbjct: 225 ISYGFAPAVGENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272


>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 273

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 105/262 (40%), Gaps = 48/262 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
           V A+ D+ AG+   SVP SLVV             + R+L  E        N L      
Sbjct: 45  VVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENAL------ 98

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
              L+   K    S W  YI  L            + L +S+ EL+YL GS       +R
Sbjct: 99  ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHYFARQR 148

Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
              I+ +Y+ + T       LF  YP     E F+ + +K A   +  +S VV   K  L
Sbjct: 149 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKSGL 202

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
                +  + P     + + K  + AVD  +      P K GE I V Y         + 
Sbjct: 203 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYG--------QS 250

Query: 339 GPQPNSKLLINYGFVDEDNPYD 360
               N++LL++YGFV E+NP+D
Sbjct: 251 RQMSNAQLLMDYGFVLENNPHD 272


>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
           atroviride IMI 206040]
          Length = 956

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 98/460 (21%), Positives = 176/460 (38%), Gaps = 95/460 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
           + A +D+ A    F+VP S +V +E            L  +T  E+    +    + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RA 221
            L+YE  +G +S W PY+  L         + E+P+ WS+ E+  L  S T+++I +  A
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNA 643

Query: 222 EGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
           E +                     +   EL  +    GS    Y +D          F+ 
Sbjct: 644 EEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQN 693

Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
                +       +   A+    +VP+   +L   ++  A +   DDA+ +   R  KAG
Sbjct: 694 EDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAG 752

Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAAL 368
           E I+ +Y           GP PNS+LL  YG+V   +        P+    D L     L
Sbjct: 753 EEILNYY-----------GPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGL 801

Query: 369 NTED----------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
           ++E            ++++  ++ + + + +      +  +  E   D+   L+      
Sbjct: 802 SSEQLDSAREHLDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK------ 855

Query: 419 TSEMQSVISSLGPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
              M   I  + P C V       V    L     A  + YP T+ EDE +L+  NL  +
Sbjct: 856 --SMLKAIRKVDPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSER 913

Query: 478 KRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 517
           ++ A  +   EK++L     + ++   +  D  +   PAP
Sbjct: 914 RKAAVTVRLGEKRLLQEARVLLSE---IASDAILDDAPAP 950


>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
          Length = 605

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/419 (22%), Positives = 174/419 (41%), Gaps = 62/419 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
           V A + + + +    VP S ++TLE  +  +T +A+ +   +L  L+     L+ +L+ E
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE--MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQE 247

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           KK  + SFW PY+  L +           P+ +++++L +L GSP   ++ ++   +K++
Sbjct: 248 KK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKD 300

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           Y ++  V   A    Q           +F+ F  A +   S +  +  +   +  A VPL
Sbjct: 301 YCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAFVPL 346

Query: 288 GPPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC 338
              L           YS + +  +   D+ ++         G+ I       F  +  KC
Sbjct: 347 ADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMI-------FDSYGSKC 391

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
               NS+ L+NYGFV +DN  + + V    +      Q K  +++   +   + F +   
Sbjct: 392 ----NSRFLLNYGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVID 446

Query: 399 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKA 453
               + SD + ++R   + +  E  +++     I P     +S   E A  + + +    
Sbjct: 447 PNDVSFSDFMSFIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIR 506

Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 512
            L  YP TL +D  +L    L   +R    L   EKK+LN   Q +  M  L  +   S
Sbjct: 507 ALNQYPTTLEQDLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565


>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 538

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 109/455 (23%), Positives = 176/455 (38%), Gaps = 62/455 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           ++ + D   L +W+ + G     V   ++   P           + A+ D+  GDAA  V
Sbjct: 73  ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127

Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           P +L +T E    ++ I   L          LAL L++EK  G  S W  YIR L R   
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----IKREYNELDTVWFMAG-SLFQ 243
               A+++PL WS  ELA L G+    ++L  A G    ++  +  L    F    +LF 
Sbjct: 186 ----ALDAPLFWSSEELAELAGT----QLLANAAGYDSYVRGTHAALKETTFKEHPALFG 237

Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
               D    AF+   F  AF  ++S    L  V      AL+P G  +  +   C     
Sbjct: 238 DAGDDDGGGAFSEREFLWAFGVLRSRA--LPPVDQGESIALIP-GIDMANHDGLCSQTWQ 294

Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFT-------GHDFKCGPQP---NSKLLINYGFV 353
             +  +  V      A     V   +  T       G + +C   P   +S+  ++YGFV
Sbjct: 295 LNNGGIAAVFGGRGGADGGGSVLLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFV 354

Query: 354 DEDNPYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 412
           D        V+   ++  +D    DK  V    G      F + A   ++   +M+ ++R
Sbjct: 355 DAFCSRPGYVLGPLSIPEDDVNAFDKMDVLSVAGLKESPAFTIRA--FEDPPPEMVVFMR 412

Query: 413 LGYVSDTS----------EMQSVISSLGPICPVSPCMERAVLDQLADY-FKARLAGYPAT 461
           L  + +            E  ++IS      PVSP       D  AD   +  L  Y   
Sbjct: 413 LLNLKNDDAFLLEAIFRQECWALISD-----PVSP-------DNEADAGCEEALGAYATK 460

Query: 462 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
           + +D  +  D +  P+ R+A ++   EK+ L   L
Sbjct: 461 IEDDRGVADDADASPRLRLAARVRMGEKQALEEVL 495


>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
          Length = 577

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 100/449 (22%), Positives = 198/449 (44%), Gaps = 67/449 (14%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            KSW+ ++G+    + ++E  S  E       V A +D++      ++P   ++T E   
Sbjct: 83  FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136

Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            +  +A L+  N++  +    CLALYL  E+     S + PYI  + ++         + 
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---F 254
           L +   E+ YL G+   +  + + + I R++  L  V+   GS  ++    +P +A   F
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAF 246

Query: 255 TFEIFKQAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKC 298
           TF+ ++    AV +       H+  V          AL+P+        GP   AY+   
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--- 303

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
                A+   ++ +  + +K GE + + Y           G + NS LLI+ GFV +++P
Sbjct: 304 -----ALTRGIECLAMQDFKTGEQVKICY-----------GARTNSDLLIHNGFVMKESP 347

Query: 359 YDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 417
           +D++ +   ++ +DP Y  K +++ + N ++S Q               +L +LR+ +++
Sbjct: 348 FDKVRIHLGVSQKDPLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMN 407

Query: 418 DT------SEMQSVISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 468
           +        + ++ +SSL  I     V    +  V + L +  K  L G+       E M
Sbjct: 408 EEELRSWLEKQKNELSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEM 467

Query: 469 LTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           + D +L  + ++A Q    E ++L+AC+ 
Sbjct: 468 MEDKSLTHRSKLALQFRIEEHRILSACVN 496


>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
 gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
          Length = 327

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 127/300 (42%), Gaps = 35/300 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           +D   L  W+ +NG     ++L + P           V ++ +L+ GD   S+P +L++T
Sbjct: 4   DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57

Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
              V+ N  + + + T K  L+    L+LYL+ EK +GK SFW PYI+ L         +
Sbjct: 58  TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTS 109

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
             +P  +S  E+  L     +A +  R + ++  Y  L T       LF  +        
Sbjct: 110 YTTPSYFSTAEVDALPALVREATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTV 163

Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM----LAAVDDAV 309
           FT + ++ A+  V +  V+  K      F L P  P + A +     +    L   D   
Sbjct: 164 FTLKSYRWAWATVYTRSVY--KRGPGWEF-LDPSDPDVYALAPFLDMLNHSPLVQTDTDF 220

Query: 310 QLVVDRPYKA-GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
             V  + Y+   E     Y   F  +D    P  N +LL+ YGFV   NP+  +   AA+
Sbjct: 221 N-VSSKCYEVKTEGACRKYRQVFINYD----PYDNGRLLMEYGFVMPRNPHSVVTFTAAV 275


>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
          Length = 428

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 69/293 (23%), Positives = 123/293 (41%), Gaps = 39/293 (13%)

Query: 81  LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           L +WM  NG     K+ L++ P           V A E L  G+    +P SL+++    
Sbjct: 31  LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84

Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
           L  +++     T    KL+ +  L L+++ +K  G+ S W P++  L R          +
Sbjct: 85  L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           P+    T    L     + E+  R   I+R + +L  +  + G + ++      +  FT+
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTW 192

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
             F  A+ AV +  +  Q  + +  +     AL P     L +  K     A V +  ++
Sbjct: 193 NNFVWAWTAVNTRCIFAQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEI 251

Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
           +  + + A E + + Y           GP  N +L ++YGFV  DNP D +VV
Sbjct: 252 LSHKSHDANEQVFISY-----------GPHSNRRLFLDYGFVLPDNPNDVVVV 293


>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
          Length = 430

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 99/429 (23%), Positives = 163/429 (37%), Gaps = 65/429 (15%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           H V A   L +G     VP  L +  E    ++    L    ++ +   LAL+LM+E+ +
Sbjct: 35  HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           G +SF+ P+I  +           + P+ W+E EL  L G+          + ++R++  
Sbjct: 95  GGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHLERDFE- 146

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                    ++ Q    D P      +IF           +    V  +R F +   G  
Sbjct: 147 ---------NIHQAVAADFP------DIFASLPTLTIDDYMWAMSVIWSRAFGVSKGGKY 191

Query: 291 LLAYSSKCKAM---------------LAAVDDAVQLV---VDRPYKAGESIVVWYNISFT 332
           L      C AM                 + ++  Q++   V     AG ++ + Y     
Sbjct: 192 LHVL---CPAMDMFNHDVTVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISY----- 243

Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 392
                 G   N+KLL +YGFV  +N    +     +   DP ++ K+ V   N     Q 
Sbjct: 244 ------GQYSNAKLLYSYGFVSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQT 297

Query: 393 FHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 451
           +  H       + + +L  LR+  +++    Q   +    I  V    E AV + L    
Sbjct: 298 YDFHGTLLSNDVDERLLATLRVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTC 355

Query: 452 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVT 510
           + +L+ Y  TL EDEA+L +     K R+A  + VRME K      QVT  +I  L    
Sbjct: 356 RRKLSNYATTLEEDEAILAETETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWK 409

Query: 511 VSPCPAPYA 519
            S    P A
Sbjct: 410 QSLASKPDA 418


>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
          Length = 449

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 98/434 (22%), Positives = 177/434 (40%), Gaps = 38/434 (8%)

Query: 77  DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           D  D+  W   NG    KV+L++    + +  PI  + A ED++AG+   S+P +L+   
Sbjct: 25  DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77

Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
            RV  +  +  ++    +  +  + LYL+ E+     S W P+++ L  +          
Sbjct: 78  TRVSDHSPVVHMIENTTIGRITAICLYLISERADSS-SHWKPWLQSLPPRFFHA------ 130

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIP 250
            L +SE ++ +   S  K     + + +++EY +           F         P ++ 
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVT 189

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDD 307
            E FT+E F+ A+  V +  +    +    R   VPL   GP   ++      +  + D 
Sbjct: 190 REDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDA 249

Query: 308 AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 367
                V   + A   +     IS        G   N +LL N GF+ ++N  + ++++  
Sbjct: 250 QEHRCV---FSALHKVAKNSPISI-----GVGMSSNMELLANRGFMMQNNGNNFVLMKFQ 301

Query: 368 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 427
           L+     +   R    +   LS  + +V   R  E    +L  LR+  +S   E  S   
Sbjct: 302 LDRNSDMHASARESMMKQLNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGK 358

Query: 428 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 487
           +L    PV+   E      L     + LA YP T+ EDE +LT        R A  L+R 
Sbjct: 359 ALA--TPVTLENEWRAYRLLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRR 415

Query: 488 EKKMLNACLQVTAD 501
           E+K++   ++  A+
Sbjct: 416 EEKLIYESIKTWAN 429


>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1118

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 96/438 (21%), Positives = 173/438 (39%), Gaps = 79/438 (18%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
            + A  D+ A    F++P   ++  +     E + EL  +         L   + L L +M
Sbjct: 694  IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753

Query: 166  YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
            YE   G +S W PYI  L        L  ++P+ WSE EL+YL  S T  +I +  AE +
Sbjct: 754  YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 806

Query: 225  KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
             R                       +L  +    GS    Y +D+  E    +     +V
Sbjct: 807  FRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGWV 866

Query: 265  AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
              +     +  V++A           +L   ++  A +   D+ + +   R  KAGE I+
Sbjct: 867  EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 916

Query: 325  VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQ 376
             +Y           GP PNS+LL  YG++ E +        P+D   V+ +L +E    Q
Sbjct: 917  NYY-----------GPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQ 963

Query: 377  DKRMVAQRNGKL------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL 429
            D  ++ +   K+       + V    +G      +   P +  G   D  E +++ I  L
Sbjct: 964  D--IMTETMDKMDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLL 1021

Query: 430  ----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 485
                G +       +  +   + +  +   + Y  T++EDE +L   +L  ++R+A ++ 
Sbjct: 1022 QKVDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVR 1081

Query: 486  RMEKKMLNACLQVTADMI 503
              EKK+L       ++M+
Sbjct: 1082 LGEKKLLQEAFDHFSEMV 1099


>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           impatiens]
          Length = 484

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 101/449 (22%), Positives = 182/449 (40%), Gaps = 61/449 (13%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           K+ + +G   +W+ +NG       + E P ++        + A  +    +    +P  L
Sbjct: 77  KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130

Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           + ++         L N+ + +L+          LA+ L+ EK + + S W PY+  L   
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                    + L  +  ++  L GSPT    L++   I R+Y      +F    LFQ+  
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229

Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
             +     + FT+E +  A   V  +  ++  +  SL    AL+P+       +SK    
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHENSKITTD 288

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR 361
             A  +  +    R +K  E I + Y           G + NS   ++ GFV  DN  D 
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISY-----------GARTNSDFFVHSGFVYMDNEQDG 337

Query: 362 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTS 420
             +   ++  DP  +++  +  +    +V  F +  G   E ISD +L +LR+ +     
Sbjct: 338 FKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKE 394

Query: 421 EMQSVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNL 474
           E+   I S  +  +  +   +E  V + +  +   RL    A YP TL ED  +L +  L
Sbjct: 395 ELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTL 453

Query: 475 HPKKRVATQLVRMEKKMLNACLQVTADMI 503
              K++A QL   EK++L   L+     I
Sbjct: 454 PRIKKLAIQLRVTEKRILQGALEYVQQWI 482


>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
 gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
          Length = 464

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 79/393 (20%), Positives = 171/393 (43%), Gaps = 51/393 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           + A + +Q G+    +P +L+++++R  + NE        + L+E   L ++L+ +   G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
            KS W  Y   L R+       +     W+  ++ +L GS T    L   E IK ++  L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
           +   F    L  +YP  I    F    ++ A   + S  + LQ +   ++ +LVP     
Sbjct: 212 EKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADFM 262

Query: 290 ---PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
              P        K +  + +  + +  D+ Y   + I   Y           G + N +L
Sbjct: 263 NHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTY-----------GQKTNLEL 311

Query: 347 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEA 403
           L+ YGF+ E NP+D + +  +L+ +D  ++ K+      ++  +++  +F+    +E   
Sbjct: 312 LLLYGFILERNPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE--- 368

Query: 404 ISDMLPYLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
              +  +LR   +S+  E+ S  +S        +  +E+ +   +    +  L  Y   +
Sbjct: 369 ---LYEFLRFC-ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKV 424

Query: 463 SEDE---AMLTDYNLHPKKRVATQLVRMEKKML 492
           SE++   ++ +++ +   +++A +  + EKK++
Sbjct: 425 SEEKILNSLNSNFLISKNQKMALKQSKCEKKII 457


>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Chroomonas mesostigmatica
           CCMP1168]
          Length = 464

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 82/389 (21%), Positives = 161/389 (41%), Gaps = 47/389 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A   +Q G+    +P +L+  L++ L  E  +E L+   L+E   LA+  + E+  G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W  Y   L +++    +       W  +++ +L GS          E IK ++  ++  
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP----- 289
            F    L        P + F  + ++ A   + S  + LQ +   ++ ALVP        
Sbjct: 210 IFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVPYADFINHN 260

Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           P        K +  + ++ + +  D+ Y   + I   Y           G + N +LL+ 
Sbjct: 261 PFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTY-----------GQKTNLELLVL 309

Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISD 406
           YGF+ E NP+D + +  AL+T+D  Y  K       ++  +++  VF+    +E      
Sbjct: 310 YGFIIERNPFDSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------ 363

Query: 407 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
           +  ++RL             S+L      +  +E+ +   +    K  L  Y  T++E++
Sbjct: 364 LYEFMRLCLSGPRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEK 423

Query: 467 AMLTDYN---LHPKKRVATQLVRMEKKML 492
            +    N   L   ++ + +  + EKK+L
Sbjct: 424 ILNNLSNIIVLTKNQKTSIKQRKCEKKIL 452


>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
 gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
          Length = 544

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 158/392 (40%), Gaps = 34/392 (8%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           D++AG+    +P +L VT   V  +  +A L       EL  LAL+L  E+ +G  S W 
Sbjct: 98  DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
           PY+  L          V+ PL W E+E+ + L GSPT  + + RA   + EY  +     
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREEYASIRAAIA 210

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--------- 287
                +    Y+  TE      F  A   V +  V L   ++   +ALVPL         
Sbjct: 211 ADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANV---YALVPLVDLLPVVGA 263

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP---QPNS 344
            PP +  ++      A   DA   VVD         VV  N +       C     +   
Sbjct: 264 PPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPVVCADALGRNAG 323

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEA 403
            L ++ G V+  +  D L    +    D  Y  K+ + +  G     Q F V A R    
Sbjct: 324 DLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQAFPVFADRMP-- 381

Query: 404 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 463
              +  Y+R   V + SE+ +V      I  VSP  E  +L  L    +  LA Y  +  
Sbjct: 382 -LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAREMLAEYENSSE 438

Query: 464 EDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 494
           E E + L + N+  ++  A +L   EK+++NA
Sbjct: 439 EFELLQLKETNISERQMTAAKLRLGEKRLINA 470


>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 478

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 97/444 (21%), Positives = 173/444 (38%), Gaps = 93/444 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
           + A++D+      F++P   ++ +E             GN+   E +    L     L L
Sbjct: 44  IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
            ++YE  QG  S W  Y   L  +        ++ + W   +L YL GS   ++I +   
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156

Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR- 281
                   L  +   AG  F Q     P+E+   ++  +    + +    L+      + 
Sbjct: 157 DEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQE 215

Query: 282 -------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYN 328
                          +VP+   +L   ++  A +   +D + +   RP KAGE I+ +Y 
Sbjct: 216 DEEWVEDREGKTMLGMVPMA-DILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYY- 273

Query: 329 ISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK-- 378
                     GP PNS+LL  YG+V   +        P+D  +V++ L TE  +  D+  
Sbjct: 274 ----------GPHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVW 320

Query: 379 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL--------- 429
           + VA+      ++   V      E  S+       G++   +++Q V + L         
Sbjct: 321 KQVAEHVDPEDLEDVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLK 373

Query: 430 ------GPICP-------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
                 G + P       V  C+  + L +L       LA YP T  EDEAML   N+  
Sbjct: 374 AIKKVRGDLIPDKRKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTS 426

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           ++++A ++   EK+++   LQV  
Sbjct: 427 RQKLAVEVRLGEKRLIKEALQVAG 450


>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
 gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 470

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 103/420 (24%), Positives = 170/420 (40%), Gaps = 68/420 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE----LACLALYLMYEK 168
           V A+ D+   +  FS+P + V      L  + +A L  + +L E       L   LM E 
Sbjct: 42  VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
            Q + S W PY+  L  +       ++S + WS++ELA L  S    +I ++        
Sbjct: 96  -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147

Query: 222 ----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQ 274
               +G+K    E+        S+   Y +DIP  +       +   A    V      +
Sbjct: 148 YIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDE 204

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
           K  L+    ++PL   L A + +  A L   ++ +++   +P   GE I   Y       
Sbjct: 205 KTILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDY------- 253

Query: 335 DFKCGPQPNSKLLINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQD 377
               G  P S LL  YG+V D  + YD       L+V    N          T+D   + 
Sbjct: 254 ----GQLPRSDLLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKT 308

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
           +  +A+R G        VH+  ++ +I D L       + D S ++++++S   I   S 
Sbjct: 309 RLELAEREGVYEDSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSK 368

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
                    LA   +AR   Y  TL EDE +L + +L  +  +A Q+   EKK+L A +Q
Sbjct: 369 LTTELAGQVLATLLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428


>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
 gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
          Length = 484

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 108/412 (26%), Positives = 178/412 (43%), Gaps = 47/412 (11%)

Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           +  RP+  + AS D Q GD  FSVP+S  ++ E V     + +L     L     +AL L
Sbjct: 65  QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           + ++    KS    Y   +          +++PLLWSE EL  L G+    ++L+   G 
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARR 281
              +    T   +   LF   P   P   FT   F  A  AV+S     +   K++LA  
Sbjct: 171 LTFFRS--TFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALA-- 226

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWYNISFTGHDFKC 338
             L  L     A +SK     A +    Q++V    R  + GE + + Y           
Sbjct: 227 -PLTELVSHRRAANSKLSVRSAGLFGRGQVLVLEATRAIRKGEPLSMDY----------- 274

Query: 339 GP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
           GP + +  +L++YG +D  +P     +   +   D    DK  + + N      V+++  
Sbjct: 275 GPGKLDGPVLVDYGVMDVTSPKPGYSLTLKMPDSDRFIDDKLDILESNDLPQSVVYNLTP 334

Query: 398 GREKEAISDMLPYLRLGYV--SDTSEMQSVISS--LGPIC-PVSPCMERAVLDQLADYFK 452
             +++   +ML +LRL  +  SD   ++S+  +   G +  PVS   E AV + L++  +
Sbjct: 335 --DEQPTIEMLAFLRLMQLKGSDAFLLESIFRNDVWGFMQEPVSEGNEEAVCNTLSEGAR 392

Query: 453 ARLAGYPATLSEDEAMLTDYNLHPK--KRVATQLVRM-EKKMLNACLQVTAD 501
           A L GY  T+ +D A L       K  +R A  L+R+ EK+ L+A  +   D
Sbjct: 393 AALGGYGTTIDQDLAELRAQGSRAKGSRREAALLIRLGEKEALDAVARFFED 444


>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
 gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
          Length = 447

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 76/291 (26%), Positives = 118/291 (40%), Gaps = 43/291 (14%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           EE+   L  W  +NG+    V  K +P+  +   R +    A E + + +   SVP  L+
Sbjct: 47  EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99

Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
           +T    LE  +GN     +    K S    L L+LMYEK   K SFW PYIR L      
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
                 +P  ++  EL +L     + +  E+   IK+ Y      +     + Q +  + 
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF 208

Query: 250 PTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AML 302
                 FE FK A+  V +  V H +    A+      AL PL   LL +  K +     
Sbjct: 209 -WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRF 266

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
            +     ++ V   Y+ G  + + Y           GP  N++L + YGFV
Sbjct: 267 NSSSKNYEINVITEYQKGTQVFINY-----------GPHDNTRLFLEYGFV 306


>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
 gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
          Length = 206

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 400
           + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +  G  
Sbjct: 10  KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67

Query: 401 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 454
            E++   MLPYLRL  +  T    ++++  +   G +  PVS   E A+   + +  K+ 
Sbjct: 68  -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126

Query: 455 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164


>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 451

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 118/272 (43%), Gaps = 36/272 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP SL +T + +       +   +  +  +A LA  L+  K  G+ S
Sbjct: 66  ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  Q       + + + W+E+EL  +  S    E + +   I++++ E+  V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                  FQ  P+      FT++ F  A   V S     +     +  +L+P     L +
Sbjct: 178 -------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSLIPFAD-FLNH 222

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
               +A++ + DD    ++  DR Y  GE +++ Y           G   N+ L++++GF
Sbjct: 223 DGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRY-----------GKFSNATLMLDFGF 271

Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 384
               N YD++ ++  +   DP    K  + Q+
Sbjct: 272 TIPYNIYDQVQIQYDIPKYDPLRHTKLELLQQ 303


>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/300 (24%), Positives = 127/300 (42%), Gaps = 62/300 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A++D++ GD  FS+P S++++       + ++EL   ++LS  + L L +MYE ++  
Sbjct: 41  VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            SFW PY   L R+         +P+ W++ +L  L G+   ++I  + E  +  +NEL+
Sbjct: 97  DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR--------- 281
            +        ++YP     +  T E+F    + +   S    LQK               
Sbjct: 149 PI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEE 201

Query: 282 ---------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
                           ++VP+   L   +    A L    D++Q+   +  K GE I   
Sbjct: 202 EEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIKEGEQIYNT 261

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 380
           Y           G   N+ LL  YGFVDE N +D       L+VE     +D   +++++
Sbjct: 262 Y-----------GDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310


>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 502

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 93/415 (22%), Positives = 168/415 (40%), Gaps = 74/415 (17%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           A  D+   +    +P  L +  + V    +GN         + L     +AL+L+ EK +
Sbjct: 84  AERDIARNEVVLEIPKKLWINPDAVAASDIGN-------VCSGLKPWISVALFLIREKLK 136

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------SETELAYLTGSPT 213
            + S W PY+  L            S + W                 SE ELA L G+  
Sbjct: 137 KEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRSEEELAELQGTQL 189

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
               L   E ++RE+ +++    +     + +P  I     T + F  AF  ++S     
Sbjct: 190 LRTTLGVKEYMQREFAKVEEEILLPHK--ELFPSPI-----TLDDFLWAFGILRSRAFSR 242

Query: 274 QKVSLARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
            +    +   L+PL   +           AY  K   + +  +    L      K+GE +
Sbjct: 243 LR---GQNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPISVKSGEQV 298

Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
           ++ Y+++          + N++L ++YGF+++    +   +   ++  DP + DK  +A+
Sbjct: 299 LIQYDLN----------KSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAE 348

Query: 384 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPC 438
            NG      F +  G        MLPYLRL  +  T    ++S+  +   G +  P+S  
Sbjct: 349 TNGSGETADFDIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRA 406

Query: 439 MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            E  +   + D  K+ L+GY  T+ EDE  L   +L+P+  +A  +   EKK+L 
Sbjct: 407 NEELICRVVRDACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQ 460


>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
           florea]
          Length = 483

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 96/446 (21%), Positives = 182/446 (40%), Gaps = 53/446 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
           SK+ + +G   +W+ +NG       + E P ++        + A  +    +    +P  
Sbjct: 75  SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128

Query: 132 LVVTLERVLGNETIAELLTTNK------LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
           L+ ++     +    EL+T         + ++A LA+ L+ E+ + + S W PY+  L  
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181

Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
                     + L  +  ++  L GSPT    L++   I R+Y+  + V+    +     
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234

Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAA 304
             D+    FT+E +  A   V +    +     +R   AL+P+       + +      A
Sbjct: 235 LRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNA 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
             +  +    R +K GE I + Y           GP+ NS   ++ GFV  +N  D   +
Sbjct: 291 TSNYCECYALRDFKKGEQIFISY-----------GPRTNSDFFVHSGFVYMENKQDGFKL 339

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQ 423
              ++  D   +++  +  +    +V  F +  G   E ISD +L +LR+ +    +E+ 
Sbjct: 340 RLGISKADSLQKERIELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FSMRKAELA 396

Query: 424 SVISS--LGPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPK 477
             I S  +  +  +   +E  V + +  +   RL    A YP TL ED  +L +  L   
Sbjct: 397 HWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPQI 455

Query: 478 KRVATQLVRMEKKMLNACLQVTADMI 503
           K++  QL   EK++L   L+     I
Sbjct: 456 KKLTIQLRVTEKRILQGALEYVEQWI 481


>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
          Length = 543

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/273 (26%), Positives = 112/273 (41%), Gaps = 49/273 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
           + A+E ++ G+    +P   ++T+E  L    + E     +L E + LA +L        
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
            E     K  +  Y++ L R  G       S L W E+++   L GSP+    LER   +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
                E+   +        Q+ +DI        +F        S ++ L+  SL    AL
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLAL 272

Query: 285 VPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           VP    +L +   C+A   L      V L  DR Y+ GE +   Y           G +P
Sbjct: 273 VPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASY-----------GQRP 320

Query: 343 NSKLLINYGFVDE--DNPYDRLVVEAALNTEDP 373
           +S+LLI+YGF     DNP D   +   ++ EDP
Sbjct: 321 SSELLISYGFAPAVGDNPDDEYALNLQIDEEDP 353


>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
 gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 353

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 107/244 (43%), Gaps = 38/244 (15%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA  L+ EKK G+KS W+PYI  L +        + S + W E EL+ +  S    E ++
Sbjct: 2   LAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVK 56

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           +   I+++++      F+A +  Q  P  I TE    E F  A+  V S     +    +
Sbjct: 57  QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWYNISFTG 333
           +R +L+P    +         +L   D+       +Q+  DR Y  G+ + + Y      
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKY------ 157

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ-- 391
                G   N+ L++++GF    N +D + ++  +  +DP    K  + Q +   +V+  
Sbjct: 158 -----GEFSNATLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDI 212

Query: 392 -VFH 394
            +FH
Sbjct: 213 NIFH 216


>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
 gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
          Length = 1037

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 107/458 (23%), Positives = 179/458 (39%), Gaps = 104/458 (22%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
            + A +D+ A    F++P   ++ +E     + + ++   +          +L   + L L
Sbjct: 610  IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669

Query: 163  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------A 215
             LMYE  QG+KS W PY   L         + ++P+ WSE+EL  L  S  +       A
Sbjct: 670  VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADA 722

Query: 216  EILERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
            E + R               G  R  ++L  +    GS    Y +D+  +    E     
Sbjct: 723  ESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDG 782

Query: 263  FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
            +V  +     +          +VP+   +L   ++  A +   ++++ +   RP KAGE 
Sbjct: 783  WVEDREGKSMM---------GMVPMA-DILNADAEFNAHVNHEEESLTVTSLRPIKAGEE 832

Query: 323  IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMV 381
            I  +Y           GP PNS+LL  YG+V E +  YD  VVE   +  +        V
Sbjct: 833  IFNYY-----------GPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------V 872

Query: 382  AQRNGKLSVQVFHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-------- 431
             + N  +S QV     H   E+E   D     R     +T E+ S  +  GP        
Sbjct: 873  MRLNFGISGQVLEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPE 927

Query: 432  ---------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDY 472
                     +  V      A+ D+          LA   +A  + YP + SED  +L   
Sbjct: 928  DLQEQLKTFLKGVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQ 987

Query: 473  NLHPKKRVATQLVRMEKKMLNACLQVTA--DMIMLLPD 508
            +L  + R+A ++   EKK+L   +  T+  D+ M + D
Sbjct: 988  DLSQRTRMAIEVRLGEKKLLQEAIASTSSVDVEMTVDD 1025


>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Vitis
           vinifera]
          Length = 480

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 133/303 (43%), Gaps = 46/303 (15%)

Query: 75  EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           ++D  D   W+  K G+    V+   K ++         + AS+ +Q GD    VP ++ 
Sbjct: 39  DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           ++ + V     I  LL  +++  +A LA+ +  E K G+ S W PYI  L  Q G     
Sbjct: 93  ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
           + S + WSE EL  +  S    E + +   I++++  +  V    + +LF+    DI  +
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 200

Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--V 309
            F           + +C +V  +     +  +L+P     + +     ++L   +D    
Sbjct: 201 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 248

Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
           +++ DR Y  GE +++ Y           G  PN+ LL+++GF    N YD++ ++  + 
Sbjct: 249 EVIADRNYAPGEQVLIRY-----------GKFPNATLLLDFGFTLPYNIYDQVQIQVNIP 297

Query: 370 TED 372
             D
Sbjct: 298 HHD 300


>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 70/309 (22%), Positives = 135/309 (43%), Gaps = 51/309 (16%)

Query: 75  EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           ++D  D   W+  K G+    V+   K ++  +      + AS+ +Q GD    VP ++ 
Sbjct: 39  DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           ++ + V     I  LL  +++  +A LA+ +  E K G+ S W PYI  L  Q G     
Sbjct: 94  ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
           + S + WSE EL  +  S    E + +   I++++  +  V    + +LF+    DI  +
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 201

Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---- 307
            F           + +C +V  +     +  +L+P     + +     ++L   +D    
Sbjct: 202 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 249

Query: 308 ----AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
                ++++ DR Y  GE +++ Y           G  PN+ LL+++GF    N YD++ 
Sbjct: 250 ESSSTLEVIADRNYAPGEQVLIRY-----------GKFPNATLLLDFGFTLPYNIYDQVQ 298

Query: 364 VEAALNTED 372
           ++  +   D
Sbjct: 299 IQVNIPHHD 307


>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 107/471 (22%), Positives = 187/471 (39%), Gaps = 61/471 (12%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T V+  R  +SK++    +L  W+  NG    K+ L+E   +  + R +H   + + L  
Sbjct: 18  TPVSPPRNGMSKEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVP 71

Query: 122 GDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFW 176
           G+    +P   ++T+E  +G +T I   L    +  +A     L ++L+ + +  + SF+
Sbjct: 72  GERILVIPKKCLITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFF 129

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y   L             P+ WSE EL++L GS    +I ER   I+++Y+ +  V  
Sbjct: 130 RNYYSTLPSTLS------NMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-- 181

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
                      D     F+ + F  A + V S    L  +   +  ALVP    L  Y  
Sbjct: 182 -----------DPSFARFSLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRP 229

Query: 297 K-CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE 355
           +          DA  +        G  +   Y           G + N + L+NYGF  E
Sbjct: 230 RETSWTFDQSIDAFTITSLGTIGTGAQVYDSY-----------GKKCNHRFLLNYGFAVE 278

Query: 356 DNPY------DRLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
           DN        + ++++  L+  D Q + DKR     +G  ++    +          +  
Sbjct: 279 DNTEEDGRNPNEVLIDFQLSPADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGF 337

Query: 409 PYLRLGYVSDTSEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSED 465
            + RL  +  T E  S +    P     P+S   E   L+ L +    +L+ Y  T+ ED
Sbjct: 338 SFARL--IVATEEEFSTMKMKSPAHSSPPISFDNEIRALEYLRNLMTHQLSLYDTTIEED 395

Query: 466 EAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
             +L    Y L   +  A   +R EK++     Q  AD ++ L  + ++ C
Sbjct: 396 NELLASKQYPLFSNRIQALFFIRGEKQVCRY-FQELADKVIPLFSLPLAEC 445


>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
          Length = 467

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 97/438 (22%), Positives = 173/438 (39%), Gaps = 79/438 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
           + A  D+ A    F++P   ++  E     + + +L  +         L   + L L +M
Sbjct: 43  ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
           YE   G +S W PYI  L        L  ++P+ WSE EL+YL  S T  +I +  AE +
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 155

Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
            R                       +L  +    GS    Y +D+  E    +     +V
Sbjct: 156 FRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGWV 215

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             +     +  V++A           +L   ++  A +   D+ + +   R  KAGE I+
Sbjct: 216 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 265

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQ 376
            +Y           GP PNS+LL  YG++ E +        P+D   V+ +L +E    Q
Sbjct: 266 NYY-----------GPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQ 312

Query: 377 D------KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL 429
           D      +RM   ++    + V    +G      +   P +  G   D  E +++ I  L
Sbjct: 313 DIMAETMERM--DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLL 370

Query: 430 ----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 485
               G +       +  +   + +  +   + Y  T++EDE +L   +L  ++R+A Q+ 
Sbjct: 371 QKLDGNLISDKRKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVR 430

Query: 486 RMEKKMLNACLQVTADMI 503
             EKK+L       ++M+
Sbjct: 431 LGEKKLLQEACDHFSEMV 448


>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
 gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
          Length = 439

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 102/447 (22%), Positives = 176/447 (39%), Gaps = 72/447 (16%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK      K  LK++   +   RP  +      +  ++ LQAG+   S+P   +VT   V
Sbjct: 28  HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           L N  + E +   K  +S L  L  +L+ EK  G++S W PY+  L +           P
Sbjct: 88  L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCP 139

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           +   E ++  L   P + +  E+   +   Y      +    SLF +    I    F + 
Sbjct: 140 VC-LEQDVVQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYS 194

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAV 309
             + A+  + +  +++ K S    F+L P    L P   LL +S   + KA         
Sbjct: 195 ALEWAWCTINTRTIYM-KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNY 253

Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
           ++  +   K  E + + Y           GP  N +LL+ YGFV  DNP+  + V +   
Sbjct: 254 EIQTNSQCKKYEEVFICY-----------GPHDNQRLLLEYGFVAVDNPHSSVYVSSDTL 302

Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 429
            +     DK    Q+N KLS+   H           D+L  L  G+   +  + + +  L
Sbjct: 303 LKYFPSLDK----QKNAKLSILKEH-----------DLLENLTFGWDGPSWRLLTALKVL 347

Query: 430 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM-----LTDYNLHPKKRVATQL 484
                   C  RA+   L D   AR        +E +A+     +  + +   + V  Q+
Sbjct: 348 SLGGDEFTCWRRAL---LGDVISAR--------NEQQALNITTKICHFLIEETQHVLLQI 396

Query: 485 VRMEKKMLNACLQVTADMIMLLPDVTV 511
            +M++   N   Q+T    + L D+ +
Sbjct: 397 SQMKRDKENLKEQLTLVEALRLEDLKI 423


>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
           NZE10]
          Length = 445

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 106/448 (23%), Positives = 176/448 (39%), Gaps = 70/448 (15%)

Query: 83  SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
           +W+  NG     K+ L +    N        + A EDL   +  FSVP S ++T E    
Sbjct: 16  NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E + + +    LS +  +AL  +     G +S W PY   L        ++ ++ + WS
Sbjct: 72  GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120

Query: 202 ETELAYLTGSPTKAEI----------------LERAEGIKREYN-ELDTVWFMAGSLFQQ 244
           + EL +L GS    +I                +ER    K   N EL  +    GS    
Sbjct: 121 DRELRHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMA 180

Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
           Y +D+ T +   +   + +         L K        +VPL   L A + +  A L  
Sbjct: 181 YGFDLETSSDQAKNDGEEWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFY 233

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN------- 357
            DD V +   +P KAGE +   YN      DF  G  P + LL  YG++  DN       
Sbjct: 234 EDDKVVMKTIKPVKAGEEL---YN------DF--GSLPRADLLRRYGYL-TDNYAQYDVV 281

Query: 358 --PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 415
             P D +   A L T+D    ++   A+  G L        A  E+    + L  L    
Sbjct: 282 EIPADLIKERAGLRTQD--VDERWQYAEEQGVLDDGYDVSRASSEEGQFPEELCVLLNLL 339

Query: 416 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 475
               +E + V +      P      + +L  +  Y   R A YP  + +   M +D +L+
Sbjct: 340 ALPRAEFEKVKNKDKIPKPDLTTNAKKLLRTILVY---RYAAYPGNVDQ---MHSDVSLN 393

Query: 476 PKKR-VATQLVRMEKKMLNACLQVTADM 502
            ++R +A  +++ EK++L   +   +++
Sbjct: 394 DRRRKMAIVVIQGEKQVLQEAVDAISEI 421


>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 679

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/220 (28%), Positives = 102/220 (46%), Gaps = 26/220 (11%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
             A +LM +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159

Query: 219 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 273
           ER +  +++Y+    +LD + F      +QY +++   A T  I  +AF A V S  V  
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
             +      AL+PL   L  +    K    A D+ + L+V   + AG+ I   Y      
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNY------ 268

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
                GP+ N +LLINYGF    NP D  +V   +  + P
Sbjct: 269 -----GPRNNEQLLINYGFCIAGNPTDYRIVLLGVKPDSP 303


>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
 gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 499

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 71/315 (22%), Positives = 123/315 (39%), Gaps = 61/315 (19%)

Query: 81  LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           L  W H +G    P  ++   EK   + + +P    +A+E L +   A + P S+ ++  
Sbjct: 12  LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67

Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
             L +  I   LT                N L        YL+ +  +GK S W PYI  
Sbjct: 68  NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
           L       + A+  P  W+E ++  L G+     I E  + +K EY +   +    GS  
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185

Query: 241 --LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--- 295
              + Q  Y+     FT   F+ + +  +S   ++++        L+P G  +  +S   
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LLPEGAKIDDFSILQ 237

Query: 296 -----------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
                      ++    L +   A +L+    Y+ G+ +   Y           G + NS
Sbjct: 238 PLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNY-----------GKKTNS 286

Query: 345 KLLINYGFVDEDNPY 359
           +LL+ YGFV E+N Y
Sbjct: 287 ELLLGYGFVTENNDY 301


>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
 gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
          Length = 377

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 96/220 (43%), Gaps = 41/220 (18%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LAL ++ E+ +G+ S W PYI  L +        +++  LW +TEL+YL  SP   +  E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           R E I  E+ ++     +   LF +          + E FK  +  V S     + +++ 
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214

Query: 280 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGH 334
               LV +  P+L +     +S  K     + +   +  DR Y   + I  W N      
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQI--WINY----- 265

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 374
               G   N++L ++YGF   +NPYD    E  L T+ P+
Sbjct: 266 ----GDLSNAELALDYGFTVPENPYD----ETDLLTQFPE 297


>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 519

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 143/363 (39%), Gaps = 69/363 (19%)

Query: 160 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
           L +YL++++K  G  SF+ PY   L             P+ WS  EL  L GS   ++I 
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
           +R + I+ +Y  +  V    G+L             T + FK A + V S    LQ +  
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216

Query: 279 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
            R  ALVP    L  Y  +  K     V     +   +  +AG  +   Y          
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSY---------- 266

Query: 338 CGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
            G + N + L+NYGF  EDN        + + +E  ++  D  +QDK     R       
Sbjct: 267 -GQKCNHRFLLNYGFAVEDNRELDGFCPNEVPLELYVDPADILFQDKLEFWTRG------ 319

Query: 392 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 451
                   E   IS        G V+     Q+V  S+G   P S   E      +    
Sbjct: 320 --------ETNQIS--------GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK-- 360

Query: 452 KARLAGYPATLSEDEAMLTDYNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIML 505
           + RLA YP T+S+D A L D   +P+   +R A   VR EK++L+      +   DM+  
Sbjct: 361 RVRLASYPTTISQDMADLQDEASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTF 420

Query: 506 LPD 508
           + D
Sbjct: 421 IED 423


>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
          Length = 509

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 60/271 (22%), Positives = 103/271 (38%), Gaps = 50/271 (18%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASE +Q GD    VP  + +TL+++         L  + + + + LA  L+ E+  G +S
Sbjct: 63  ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI+ L  +       + + +LW   EL  +  S    E +E  E  K+E+  L   
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEFLALKPA 174

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 LF +        A   +      V     +   QK                   
Sbjct: 175 LDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK------------------- 215

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
                       D  +++ DR Y  GE +++ Y           G   N+ L +N+GF  
Sbjct: 216 ------------DVCEIIADRNYAVGEQVMIRY-----------GKYSNATLALNFGFTL 252

Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 385
             N YD+ ++   +  +DP Y+ K  + Q++
Sbjct: 253 ARNIYDQALIRIDMPVQDPLYKKKLDIWQKH 283


>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 357

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 133/328 (40%), Gaps = 74/328 (22%)

Query: 78  LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
            G L  W  ++G    P  ++ L     ++ +  P    +A+E LQ G AA S P  + +
Sbjct: 10  FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65

Query: 135 TLERVLGNETIAELLTTNKLSELACL------------------------ALYLMYEKKQ 170
           +    L +  +     +++ ++ A L                          +L+ E  +
Sbjct: 66  SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           GK SFW PYI  L       Q+AV + P  W + ++AYL G+     I E  E +KRE+ 
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182

Query: 230 ELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLARRFAL 284
           +       A  L ++  + D+P  A+T  ++K AF    S        L   +  R  AL
Sbjct: 183 Q-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRRLSAL 233

Query: 285 VPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
           +P G          PLL  ++            +V D  +L+   PY+ G  +   YN  
Sbjct: 234 LPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQV---YN-- 288

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNP 358
                   G + NS+LL+ YGF+  + P
Sbjct: 289 ------NYGLKTNSELLLAYGFILPETP 310


>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
          Length = 486

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 102/452 (22%), Positives = 177/452 (39%), Gaps = 59/452 (13%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L  W+  NG    K+ L+E   +  + R +H   + + L  G+    +P   ++T+E  
Sbjct: 44  ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95

Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           +G +T I   L    +  +A     L ++L+ + ++ + SF+  Y   L           
Sbjct: 96  MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149

Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
             P+ WS+ EL +L GS    +I ER   I+++Y+ +  V             D     F
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARF 196

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVV 313
           + + F  A + V S    L  +   +  ALVP    L  Y  +          DA  +  
Sbjct: 197 SLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITS 255

Query: 314 DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY------DRLVVEAA 367
                 G  +   Y           G + N + L+NYGF  EDN        + ++++  
Sbjct: 256 LGTIGTGAQVYDSY-----------GKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQ 304

Query: 368 LNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS-- 424
           L+  D Q + DKR     +G  ++    +          +   + RL  V+   E  S  
Sbjct: 305 LSQADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL-IVATEDEFSSMK 362

Query: 425 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVAT 482
           + S      P+S   E   L  L D    +L+ Y  T+ ED  +L    Y L   +  A 
Sbjct: 363 MKSPAHSSPPISFDNEIRALQYLRDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQAL 422

Query: 483 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
             +R EK++     Q  AD ++ L  + ++ C
Sbjct: 423 FFIRGEKQVCRY-FQELADKVIQLFSLPLAEC 453


>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 247

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 92/207 (44%), Gaps = 34/207 (16%)

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
           +   R + AGE +++ Y           GP+ N  LL  YGFV++DNP D   +   ++ 
Sbjct: 40  VTTQRGWTAGEQVLISY-----------GPRSNDHLLRRYGFVEQDNPNDVYRITGLIDK 88

Query: 371 -EDPQYQDK-RMVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGY 415
             D   +D  R++ +  GKL           V  V  GR     EKE    ++P  RL  
Sbjct: 89  LSDVLGKDSVRVLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAV 147

Query: 416 VSDTS--EMQSVISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 472
           V D    E ++   SL      +SP  E A  D L         G+  TL+EDEA L+  
Sbjct: 148 VKDDQLPEGKAAGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSL 207

Query: 473 N--LHPKKRVATQLVRMEKK-MLNACL 496
              L  +KRVA    RMEKK +L+A +
Sbjct: 208 GNSLGAQKRVAFSF-RMEKKRVLDAAI 233


>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
 gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
          Length = 429

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 66/293 (22%), Positives = 123/293 (41%), Gaps = 35/293 (11%)

Query: 79  GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           G L +WM  NG     K+ L++ P           V A E L  G+    +P +L+++  
Sbjct: 30  GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83

Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
             L +   + ++  + KL+ +  L L+++ +K  G+ S W P++  L R          +
Sbjct: 84  TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 136

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           P+         L     + E+      I+R + +L  +  + G + ++      +  FT+
Sbjct: 137 PVFLRRKVFESLP-KDLREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTW 193

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
             F  A+ AV +  +  Q  + +  +     AL P     L +  K     A V +  ++
Sbjct: 194 NNFVWAWTAVNTRCIFAQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEI 252

Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
           +  + + A E + + Y           GP  N +L ++YGFV  DNP D +VV
Sbjct: 253 LSHKSHDANEQVFISY-----------GPHSNRRLFLDYGFVLPDNPNDVVVV 294


>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 471

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 90/436 (20%), Positives = 162/436 (37%), Gaps = 76/436 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
           + A++D+ A    F++P   ++ +E             GN+   E +    L     L L
Sbjct: 42  IIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 101

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
            ++YE  QG  S W PY   L  +         + + W  ++L  L GS   ++I  E A
Sbjct: 102 VMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEA 154

Query: 222 EGIKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           + + R                         EL  +    GS+   Y +D+  E    +  
Sbjct: 155 DEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEE 214

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
              ++  +     L  V +A           +L   ++  A +   DD + +   RP  A
Sbjct: 215 DDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVNHGDDELTVTALRPIPA 264

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALN 369
           GE I+ +Y           GP PNS+LL  YG+V   +        P+D  +  V   L 
Sbjct: 265 GEEILNYY-----------GPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLK 313

Query: 370 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISS 428
             D  ++  +          V V    +G  + E     +  +R        ++++V+ +
Sbjct: 314 IGDDVWKQVQEYVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKA 373

Query: 429 L----GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 484
           +    G + P     +      +    +  L+ YP +  EDEA+L   +L  ++R+A  +
Sbjct: 374 IKKINGDLIPDKRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHV 433

Query: 485 VRMEKKMLNACLQVTA 500
              EKK+L   L+   
Sbjct: 434 RLGEKKLLKEALEFAG 449


>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
          Length = 489

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 81/353 (22%), Positives = 143/353 (40%), Gaps = 71/353 (20%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           SW+H++G     V + E   +         + A++D   G    +VP  ++++ E+    
Sbjct: 88  SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140

Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
             ++E +  + L +      LAL+L+ EK     SFW PYI  L  +         + L 
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT---- 251
           ++  ELA L  SP     L+    I R+Y    N++ T+             D+P     
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNL 239

Query: 252 -EAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
            + FTF+ ++ A   V   Q+ +V     +L    A +PL          C      +  
Sbjct: 240 QDIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITT 290

Query: 308 AVQLVVDR-------PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
              L ++R        Y+  E I ++Y           G +PNS L ++ GFV  DN YD
Sbjct: 291 DFNLELNRGECYALQDYRRDEQIFIFY-----------GARPNSDLFLHNGFVYPDNDYD 339

Query: 361 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
            L +   ++  D     K  +  + G   V  F ++ G    ++ ++L ++R+
Sbjct: 340 SLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGASPISV-ELLAFIRI 391


>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
 gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
          Length = 490

 Score = 62.0 bits (149), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 59/411 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  + V  +E          +     +AL+L+ EK + +
Sbjct: 80  LVAQRDISRNEVVLEIPKKLWINPDVVAASEIGN---VCGGVKPWVSVALFLIREKLK-E 135

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L         +  S + WSE ELA L G+   +  L     ++RE+ +++
Sbjct: 136 DSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYLRREFLKVE 188

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +     Q +P  +     T + F  AF  ++S      +    +   L+PL     
Sbjct: 189 EEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR---GQNLVLIPLADLCN 238

Query: 289 -------------PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
                        P +      Y  K   + +  D    L      KAGE +++ YN++ 
Sbjct: 239 FLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSR-DLIFSLRSPISLKAGEQVLIQYNLNL 297

Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
           +          N++L ++YGF++  +  +   +   ++  DP + DK  +A+ NG   + 
Sbjct: 298 S----------NAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIA 347

Query: 392 VFH-VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQL 447
            F  V        +   L  + LG  +D+  ++S+  +   G +  PVS   E  +   +
Sbjct: 348 DFDIVLGNPLPPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVV 406

Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 498
            D  K+ L+GY  T+ EDE  L    L+P+  +A  +   EKK+L    ++
Sbjct: 407 RDACKSALSGYHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456


>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
          Length = 350

 Score = 62.0 bits (149), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 62/259 (23%), Positives = 115/259 (44%), Gaps = 22/259 (8%)

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           E+FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 12  ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
            V  + ++AGE I ++Y           G + N++ +++ GF  ++N +DR+ ++  ++ 
Sbjct: 72  CVALQDFRAGEQIYIFY-----------GTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSK 120

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--- 427
            D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + ++    
Sbjct: 121 SDRLYAMKAEVLARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENA 179

Query: 428 -----SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 481
                +LG    PVS   E  +   L D     L  Y  T+ ED+A L    L P+  +A
Sbjct: 180 IDRIFTLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMA 239

Query: 482 TQLVRMEKKMLNACLQVTA 500
            +L   EK++L   ++  A
Sbjct: 240 VKLRLGEKEILEKAVRSAA 258


>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
          Length = 667

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/237 (24%), Positives = 102/237 (43%), Gaps = 37/237 (15%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWYNISFTGH 334
           +AL+PL          C      +  A   V+DR        +KAGE + ++Y       
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFY------- 496

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
               G + N+ L ++ GFV E+N YD   +   ++  DP  Q +  +    GKLS+    
Sbjct: 497 ----GSRSNADLFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTC 549

Query: 395 VHAGREKEAISD--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQ 446
             + R+  +  D  +L +LR+ +  +  ++   I+S     LG I C +   +E      
Sbjct: 550 DFSIRKGASPIDGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRF 608

Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
           L    K  L+ Y  TL EDE +L +    P + +A ++   EK+++   L+     I
Sbjct: 609 LHARLKLLLSTYKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 665



 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
           V A+ D+       +VP  L++++E       + +L+  +K L  +    L+++L+ EK 
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
           +G  SFW PYI  L +          + L +S  EL  L GSPT    L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225


>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
          Length = 230

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 101/230 (43%), Gaps = 23/230 (10%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
           +AL+PL       +         V D  + +  + +KAGE + ++Y           G +
Sbjct: 14  YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFY-----------GSR 62

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
            N+ L ++ GFV E+N YD   +   ++  DP  Q +  +    GKLS+      + R+ 
Sbjct: 63  SNADLFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKG 119

Query: 402 EAISD--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKA 453
            +  D  +L +LR+ +  +  ++   I+S     LG I C +   +E      L    K 
Sbjct: 120 ASPIDGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKL 178

Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
            L+ Y  TL EDE +L +    P + +A ++   EK+++   L+     I
Sbjct: 179 LLSTYKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228


>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
          Length = 380

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/259 (22%), Positives = 117/259 (45%), Gaps = 22/259 (8%)

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           ++FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 23  DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 82

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
            V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ ++  ++ 
Sbjct: 83  CVALQDFRAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSK 131

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSE 421
            D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++          S 
Sbjct: 132 SDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSA 190

Query: 422 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 481
           +  + +      PVS   E  +   L D     L  Y  T+ ED+++L +++L  + ++A
Sbjct: 191 IDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMA 250

Query: 482 TQLVRMEKKMLNACLQVTA 500
            +L   EK++L   ++  A
Sbjct: 251 IKLRLGEKEILEKAVKSAA 269


>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
          Length = 963

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 92/421 (21%), Positives = 171/421 (40%), Gaps = 64/421 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
           + A +D+ A    F+VP S +++ E          +  ET  E+    +    + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           MYE  +G +S W PYI  L         + E+P+ WS+ EL  L  S T++++  +A   
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKV-GKASAE 648

Query: 225 KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 271
           +   +++  V      LF   Q Y  D   +          +++F+   +          
Sbjct: 649 EMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEE 708

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
            +++        +VP+   +L   ++  A +   DDA+ +   R  KAGE I  +Y    
Sbjct: 709 WVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIFNYY---- 763

Query: 332 TGHDFKCGPQPNSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMV 381
                  GP PNS+LL  YG+V  + + YD +          V A+L     Q  DK   
Sbjct: 764 -------GPHPNSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARE 815

Query: 382 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCM 439
              + +L           E      +    R   + +    +++S++ ++    P S   
Sbjct: 816 CLDSDELEDTFVLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVD 875

Query: 440 ERA-------VLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
           +R        +L +  D   +R   YP ++SEDE +L   ++  ++R A  +   EK+++
Sbjct: 876 KRKRDEIQHNILIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLI 932

Query: 493 N 493
            
Sbjct: 933 Q 933


>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
          Length = 1000

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 96/454 (21%), Positives = 180/454 (39%), Gaps = 90/454 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
           + A +D+ A    F++P   ++  E     + I ++   +K        L   + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
           +YE  QG  S W  Y   L         + ++P+ WSE EL  L  S  + +I +  AE 
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690

Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           +                    R   EL  +    GS    Y +D+  +    E  ++   
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             +   +            +VP+   +L   ++  A +   ++++ +   RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 370
            +Y           GP PNS+LL  YG+V E +        P+D  +VE+ L        
Sbjct: 800 NYY-----------GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 846

Query: 371 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 420
                     E+ +++D  ++ +  G+++         + +    D+   L+  ++    
Sbjct: 847 ILEQIRGEFEEEEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIK 905

Query: 421 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
           ++QS      P       + +AVL +  +   AR   YP ++SED+ +L   NL  + R+
Sbjct: 906 KLQS---DTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARM 959

Query: 481 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
           AT +   EKK+L   +   ++ + +  D    P 
Sbjct: 960 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 993


>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
          Length = 541

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 118/262 (45%), Gaps = 36/262 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V    D+ +     S P++L +   +     + A+ +T    ++ A L ++L  E  +GK
Sbjct: 43  VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +SFW PY+  L R+        ++PL +S+ +L +L G   +A  +E  + I RE  E  
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFE-- 150

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA----LVPLG 288
                A S+ Q+  YD  TE +T+E++  A     S     + +   R        +P+ 
Sbjct: 151 ----AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPIL 204

Query: 289 PPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
            PL+   +   A +      D +++++      AG  +   YN          GP+ N +
Sbjct: 205 FPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEV---YN--------NYGPKANEE 253

Query: 346 LLINYGFVDEDNPYDRLVVEAA 367
           LL+ YGF    NP+D  +++++
Sbjct: 254 LLMGYGFTLLQNPFDSFLLKSS 275


>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
 gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
          Length = 467

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 122/292 (41%), Gaps = 48/292 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  DL  G+   ++P +  +TL      + I        L     L + LMYE+ +GK
Sbjct: 34  VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
            S W  Y++ L RQ          P LWSE E+   L G+     + E    +K ++ E 
Sbjct: 90  GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 141

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
                    L ++ P + P + FTFE    +++A +S V     ++     + +VPL   
Sbjct: 142 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 191

Query: 291 LLAYSSKCKA-----MLAA-----------VDDAVQLVVDRPYKAGESI---VVWYNISF 331
              ++ K  A     ML A           +DD +     R   + +S+   V+  +++ 
Sbjct: 192 --LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAA 249

Query: 332 TGHDFKC-GPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 380
               F   G   N+ LL  YGF + +NP+D   L ++  L     ++Q KR+
Sbjct: 250 GSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301


>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 499

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 64/260 (24%), Positives = 111/260 (42%), Gaps = 50/260 (19%)

Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           ++ ASE ++AGD    VP       +SL + +  +LGNE          +  +A LA+ +
Sbjct: 77  FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           + E K G  S W PYI  L +        + + + W E+EL  +  S    E L +   I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           KRE+  +           + +P  I  +  + + F  A+  V S     +        +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           +P     L +    +AML   DD    ++V DR +  GE +++ Y           G   
Sbjct: 228 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRY-----------GKYS 275

Query: 343 NSKLLINYGFVDEDNPYDRL 362
           N+ L++++GF    N +D++
Sbjct: 276 NATLMLDFGFALPYNIHDQV 295


>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Sus
           scrofa]
          Length = 326

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 78/161 (48%), Gaps = 18/161 (11%)

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
           M   +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    ++ +S 
Sbjct: 73  MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSC 129

Query: 297 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
           + +  L     +  +V   DRPY+ GE + + Y           G + N +LL++YGFV 
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISY-----------GKKSNGELLLSYGFVP 178

Query: 355 ED--NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           ++  NP D + +  +L   D  Y++K  + ++ G    Q F
Sbjct: 179 KEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCF 219


>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 561

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 102/448 (22%), Positives = 181/448 (40%), Gaps = 70/448 (15%)

Query: 68  REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
           RE V+  E D+   +L  W+   G    K++L++   +  + R +H      +L  G+  
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158

Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
             +P + ++T+E  +G +T I + +  + +  +A     L LYL+ + ++   +F+  Y 
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L             P+ WS+ EL++L GS    +I ER   I+++Y+ +         
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA------ 264

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
                  D     F+ E F  A + V S    L  +   +  ALVP    L  Y  +  +
Sbjct: 265 -------DPSFSRFSLERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETS 316

Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
                  D   +       +G  +   Y           G + N + L+NYGF  EDN  
Sbjct: 317 WTFDQKLDGFTITSLESICSGAQVYDSY-----------GKKCNHRFLLNYGFAVEDNTE 365

Query: 360 ------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
                 + ++V+  L+  D Q     ++  +   L     +    R   + SD  P  R 
Sbjct: 366 EDGSNPNEIMVDFQLDPGDGQ-----LLYDKTAYLYESGIYTMNARLSCSHSD--PSTRE 418

Query: 414 GY-----VSDTSEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSED 465
           G+     ++ T +  S +    P     P+S   E A L+ L      +L  YP +L E 
Sbjct: 419 GFSFARLIAATEDEFSSMKMRSPAHASPPISFRNEIAALNLLKQLMDTQLDQYPTSLDEG 478

Query: 466 EAML--TDYNLHPKKRVATQLVRMEKKM 491
           EA+L   +Y L+  +  A   +R EK++
Sbjct: 479 EAILKSKEYPLYSNRIQALFFIRGEKQV 506


>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
          Length = 467

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 103/461 (22%), Positives = 183/461 (39%), Gaps = 86/461 (18%)

Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LS 155
           N     ++   A ED+ A    F++P   ++ +E     + I +    +K        L 
Sbjct: 38  NAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLD 97

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
             + L L ++YE  QG+ S W PY   L         + ++P+ WS+ EL  L  S  + 
Sbjct: 98  SWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRH 150

Query: 216 EILE-RAEGIKRE------------YN-------ELDTVWFMAGSLFQQYPYDIPTEAFT 255
           +I +  AE + ++            +N       EL  +    GS    Y +D+  +   
Sbjct: 151 KIGKADAENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLENDE-- 208

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
                +        V      S+     +VP+   +L   ++  A +   ++++ +   R
Sbjct: 209 -----EEEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLR 259

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAA 367
           P KAGE I+ +Y           GP PNS+LL  YG+V E +        P+D  +VE+A
Sbjct: 260 PIKAGEEILNYY-----------GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESA 306

Query: 368 L--NTEDPQYQDKRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
           L  N   P     +++ Q  G L          V     G      +   P        D
Sbjct: 307 LTSNFGIP----GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPED 362

Query: 419 TSE-MQSVISSLGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYN 473
             E +++ +  +    P  +    +R  + Q  LA   +A +A YP ++SEDE +L   +
Sbjct: 363 LQEQLKTFLKGIKKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLKQ-D 421

Query: 474 LHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
           L+ + R+A  +   EKK+L   +  ++  + +  D    P 
Sbjct: 422 LNQRTRMAIAVRLGEKKLLQEAITASSGDVEMTMDDESGPA 462


>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
           occidentalis]
          Length = 381

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 67/290 (23%), Positives = 119/290 (41%), Gaps = 52/290 (17%)

Query: 78  LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           +G+L SW+ + G  P  V+ L   P+          +    +++AGD    +P++L++T 
Sbjct: 21  IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74

Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
           + V       EL ++   LS    L ++++ E+  G+KS W PYI  +            
Sbjct: 75  DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI------------ 117

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            P ++   +       P +      A+ I R   E   V+      F+    D+      
Sbjct: 118 -PDVFDGLQCRKSVRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----N 166

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
           FE F  A+ AV +  ++++         L P    LL +  K     + V++   +  + 
Sbjct: 167 FETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNV 221

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
            Y+AG  + + Y           G   N  L +NYGFV ++NP D + VE
Sbjct: 222 GYEAGSEVFIGY-----------GSHDNRTLFLNYGFVLDENPNDCITVE 260


>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
 gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
          Length = 679

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 99/224 (44%), Gaps = 34/224 (15%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
             A +LM +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
           ER +  + +Y   D+ +   G+    +P     E +T+E++  A   + S     + +S 
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211

Query: 279 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
           A +          AL+PL   L  +    K    A D  + L+V   + AG+ I   Y  
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNY-- 268

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
                    GP+ N +LLINYGF    NP D  +V   +  + P
Sbjct: 269 ---------GPRNNEQLLINYGFCIAGNPTDYRIVHLGVKPDSP 303


>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
          Length = 495

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 97/443 (21%), Positives = 177/443 (39%), Gaps = 51/443 (11%)

Query: 88  NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
           + LPP    L+   +H+ K  P+  V          A++ +   +  FS+P  L+++ E 
Sbjct: 75  DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131

Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
              + TI   +  +  LS++  +AL + +  +    KSFW PY+  L         + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPYDIPT 251
            + ++  E+  L GSP   + L     I R+Y+         V     +L   + Y+   
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYR 243

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
            A +  + +Q  +  Q  +    K  L    AL+PL         +              
Sbjct: 244 WAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPLWDFCNHQDGQFSTEFQLESRRTVC 303

Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
              R +  GE + ++Y           G +  ++  I+ GFVD +N +D L ++  L+  
Sbjct: 304 QAGRDFGPGEQVFIFY-----------GTRTCAEQFIHNGFVDINNAHDALTLKVGLSKS 352

Query: 372 DPQYQDKRMVAQRNGKLSVQ------VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
           DP    +  +  +   LS +       F + AG  +     +L +LRL  ++  S  + +
Sbjct: 353 DPLAGQRATLLCKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLRLFCMTKDSLDRWL 411

Query: 426 ISSLGPICPVSPC-MERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRV 480
            S          C +E  V D+   + KAR    L  YP T   D  ML + +L   +R+
Sbjct: 412 QSDNASNLMHEECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLKMLEE-DLSSHRRM 470

Query: 481 ATQLVRMEKKMLNACLQVTADMI 503
              L   EK++L + ++  A  I
Sbjct: 471 CVLLRLAEKRILLSAIECAAQRI 493


>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 476

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 123/313 (39%), Gaps = 62/313 (19%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
           +W+ ++G      I  E   + +  R    V AS+D+   +  F +P + ++++E  +L 
Sbjct: 13  AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E  A  L+   L     L L ++YE   G  S W PY   L  +         + + W+
Sbjct: 70  TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120

Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
           E ELA L  S    ++                            ERA+   +E   L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180

Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             M   GSL   Y +D+     T E+ ++   A +     L K        +VPL   L 
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
           A + +C A L    D +++   +P +AGE I   Y           GP P S LL  YG+
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDY-----------GPLPRSDLLRRYGY 281

Query: 353 VDEDNPYDRLVVE 365
           V  DN     VVE
Sbjct: 282 VT-DNYAQYDVVE 293


>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 440

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 132/320 (41%), Gaps = 43/320 (13%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK+     K  LKEK  +  K RP  +      +  ++ LQ G+   S+P   ++T + V
Sbjct: 29  HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88

Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           L N  + E +   T  +S L  L  +L+ EK   +KS W PY+  L          + S 
Sbjct: 89  L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSC 139

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
            +  E ++  L   P + +  E+ + ++  +      +F    LF   P D+ +  F ++
Sbjct: 140 PVCLEQKIVNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQ 195

Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 314
            FK A+  + +  V   H Q+   +R      L P L   +      + A         +
Sbjct: 196 AFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKA-------GFN 248

Query: 315 RPYKAGESIVV----WYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
              K  E   V     YN  F  +    GP  N +LL+ YGFV  DNP+  + V      
Sbjct: 249 EKTKCYEITTVTQCHHYNEVFICY----GPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLL 304

Query: 371 EDPQYQDKRMVAQRNGKLSV 390
           ++   +DK    QR  KLS+
Sbjct: 305 KNVFPEDK----QRPKKLSI 320


>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
          Length = 465

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 95/454 (20%), Positives = 177/454 (38%), Gaps = 90/454 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
           + A  D+ A    F++P    + +E     + I ++   +K        L   + L L +
Sbjct: 43  IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
           +YE  +G  S W  Y   L         + ++P+ WSE EL  L  S  + +I +  AE 
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155

Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           +                    R  +EL  +    GS    Y +D+  +    E  ++   
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
                 +            +VP+   +L   ++  A +   ++++ +   RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT------ 370
            +Y           GP PNS+LL  YG+V E +        P+D  +VE+ L        
Sbjct: 265 NYY-----------GPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSK 311

Query: 371 ----------EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 420
                      + +++D  ++ +  G+++         + +    D+   L+    S   
Sbjct: 312 ILKQIRGEFEGEEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLK 367

Query: 421 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
            ++ V S   P       + +AVL +  +   AR   YP ++SED+ +L   NL  + R+
Sbjct: 368 GIKKVQSDTIPDKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARM 424

Query: 481 ATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 514
           AT +   EKK+L   +   ++ + +  D    P 
Sbjct: 425 ATVVRLGEKKLLQEAIATFSEDVEMTMDDESGPA 458


>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
          Length = 456

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 113/257 (43%), Gaps = 29/257 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
           A+ DL+ G+   ++P + ++T E VL +     + L    +S L  L  +L+ E+  G++
Sbjct: 64  ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGER 123

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PY+  +             P+ W E E+ +L  +P + + LE+    K E  EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KTEVQELHT 171

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
                 +  Q    D   + +T++  + A+  V +  V+++     R          LLA
Sbjct: 172 ESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDR----------LLA 221

Query: 294 YSSKCKA--MLAAVDDAVQLVVDRPY-KAGESIVVWYNISFTGHD--FKC-GPQPNSKLL 347
               C     L  ++ + ++ V+  + K      +  N     HD  F C GP  N +LL
Sbjct: 222 QQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLL 281

Query: 348 INYGFVDEDNPYDRLVV 364
           + YGFV  +NP+  + V
Sbjct: 282 LEYGFVAANNPHRSVYV 298


>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
           Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
           Precursor
 gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
           N-methyltransferase [Pisum sativum]
          Length = 489

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 92/387 (23%), Positives = 163/387 (42%), Gaps = 33/387 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   D    VP  L +  + V  +E I  + +  +L     + L+L+ E+ + +
Sbjct: 84  LVALKDISRNDVILQVPKRLWINPDAVAASE-IGRVCS--ELKPWLSVILFLIRERSR-E 139

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL  L GS      +   E +K E  +L+
Sbjct: 140 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 192

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +    L    A V     
Sbjct: 193 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 252

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
             AY  K  A L + D    L      KAGE + + Y+++          + N++L ++Y
Sbjct: 253 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLN----------KSNAELALDY 300

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
           GF++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPY
Sbjct: 301 GFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPY 358

Query: 411 LRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
           LRL  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D
Sbjct: 359 LRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQD 418

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
              L + NL  +  +A  +   EK +L
Sbjct: 419 RE-LKEGNLDSRLAIAVGIREGEKMVL 444


>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 456

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 122/260 (46%), Gaps = 38/260 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP S  +  + +L    +++LL  +++  +A LA+ L+ ++K G++S
Sbjct: 51  ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  Q G     + S + WS++EL  +  S    E +++   I++++  +  V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                   + +P    + + TF+ F  A+  V+S     +     +  +L+P     L +
Sbjct: 163 -------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNH 207

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
               +A++   +D    ++  DR Y   E +++ Y           G   N+ LL+++GF
Sbjct: 208 DGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRY-----------GKFSNATLLLDFGF 256

Query: 353 VDEDNPYDRLVVEAALNTED 372
               N +++  VE  +N  D
Sbjct: 257 SLPYNIHEQ--VEIQINIPD 274


>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 442

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 67/260 (25%), Positives = 118/260 (45%), Gaps = 31/260 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A+ DLQ G+   S+P+S ++T E VL +     + T +  +S L  L  +L+ E+   
Sbjct: 62  LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           ++S W PY+  L         +   P+ W E+E+  L  +P + + LE+   +K  + E 
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTE- 172

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
              W    SL   +  +I T+ +T+   + A+  V +  V   H ++  L+ +  +  + 
Sbjct: 173 --SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMA 229

Query: 289 PPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD--FKC-GPQPNS 344
           P L L   S    + AA ++  +    R            N     HD  F C GP  N 
Sbjct: 230 PYLDLLNHSPAVQVEAAFNEERRCYEIRT-----------NSGCRKHDQAFICYGPHDNQ 278

Query: 345 KLLINYGFVDEDNPYDRLVV 364
           +LL+ YGF+  +NP+  + V
Sbjct: 279 RLLLEYGFIAANNPHRSVYV 298


>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 426

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 93/422 (22%), Positives = 172/422 (40%), Gaps = 49/422 (11%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           H V A + L +G     +P  L + +E    ++    L    ++ +   LAL+LM+E+ +
Sbjct: 35  HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
              SF+ P+I  L           + P+ WSE+EL  L G+          + ++R++  
Sbjct: 95  RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQLQRDFEN 147

Query: 231 LDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           +        ++ + +P     +PT   T E +  A   + S    + +     R  L P 
Sbjct: 148 IHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR-VLCP- 197

Query: 288 GPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
              +  +    + +L    + D+  Q++   V +   AG ++ + Y           G  
Sbjct: 198 AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISY-----------GQY 246

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-E 400
            N+KLL +YGFV ++N    +     +   DP  + K+ V   N     Q +       E
Sbjct: 247 SNAKLLFSYGFVAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFE 306

Query: 401 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 460
            +    +L  LR+  +++  E++ +         +S   E AV + L +  + +LA +  
Sbjct: 307 NDVDERLLATLRVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFAT 364

Query: 461 TLSEDEAMLTDYNLHPKKRVATQL-VRMEKKM--------LNACLQVTADMIMLLPDVTV 511
           TL EDEA+L +       R++  + VR+E K         L    QV A  + + P  T 
Sbjct: 365 TLEEDEAILAEMATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTT 424

Query: 512 SP 513
            P
Sbjct: 425 RP 426


>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
           purpuratus]
          Length = 455

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 69/301 (22%), Positives = 123/301 (40%), Gaps = 37/301 (12%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           +E    L  WM ++G   CK    +    ++  R +      ++L+ GD+   +P  L+V
Sbjct: 40  DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95

Query: 135 TLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           T + +L  E +  ++     K +    +  +L+ E+ +GK SFW PYI  L +       
Sbjct: 96  TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148

Query: 193 AVESPLLWSETELAYLTGSPT--KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
              +P   S T+ A     PT  ++  + + + I+  +     ++      F QY     
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIFFS 206

Query: 251 TEAFTFEIF----KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--SKCKAMLAA 304
            ++F +  F    +  ++    C     K S    FAL P    LL +S  ++  A    
Sbjct: 207 LDSFVWAWFVINSRSVYIEPSGCEAFDPKAS--DDFALAPFLD-LLNHSPGAEVTAGFDP 263

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
           V +  ++     Y A + + + Y           GP  N  LL+ YGFV   NP+D +  
Sbjct: 264 VSNCYRIKTLDSYHAYDQVFIHY-----------GPHDNVNLLLEYGFVIPSNPHDAVSF 312

Query: 365 E 365
           E
Sbjct: 313 E 313


>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
          Length = 464

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 83/318 (26%), Positives = 135/318 (42%), Gaps = 70/318 (22%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
           V A  D+ AG    +VP +L++T E+   +ET  +L+T+  L  +EL+          L 
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           ++L+ E+++G  S+W PYI +L  QR R       PL ++E EL  L  SP   E   + 
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
             + R+Y           + ++QY           E+ + A +   S   H + +   RR
Sbjct: 282 RNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQRRR 323

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGE----SIVVW--YNISFTG 333
              +P+    L Y   C A  A      Q+VV   R  +A E     I  W   N +F G
Sbjct: 324 AQAMPVRYNELTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGG 383

Query: 334 ---------HDFKC-GPQPNSK---LLINYG------------FVDEDNPYDRLVVEAAL 368
                       +C    P +K   +L++YG            FV  D+P D+ ++  A+
Sbjct: 384 ASSFYDTQTRSLECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILLAV 443

Query: 369 NTEDPQYQDKRMVAQRNG 386
             +DP ++ K  + Q  G
Sbjct: 444 GKQDPLFKSKSTILQALG 461


>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
           heterostrophus C5]
          Length = 476

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 93/390 (23%), Positives = 156/390 (40%), Gaps = 66/390 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           VAA +D+   +  FS+P S ++++E  + +  I    T   L     L L ++YE   G 
Sbjct: 40  VAAKQDIAEHELLFSIPRSSILSVENSILSTEIPPT-TFALLGPWLSLILVMLYEYHNGS 98

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
            S W PY   L           ++ + W+E EL  L  S    +I               
Sbjct: 99  ASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLP 151

Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
                        ERA+ + +E    + +  M   GSL   Y +D+   A + +   +  
Sbjct: 152 VIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210

Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
            A +     L K        +VPL   L A + +C A L    D +++   +P +AGE I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEI 263

Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDK 378
              Y           GP P S LL  YG++ E+   YD + + A L ++    D  + +K
Sbjct: 264 FNDY-----------GPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEK 312

Query: 379 RMVAQRNGKLSVQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 435
           R+      ++    + + A      +E++S  L  L    +  + E +  + S G + P 
Sbjct: 313 RIEYLDEQEIVDTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PK 370

Query: 436 SPCMERAVLDQLADYFKARLAGYPATLSED 465
           +  M     + L    +AR+A YP TL +D
Sbjct: 371 AEKMTGKAAEILYKIVQARIAQYPTTLEQD 400


>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
          Length = 444

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 161/387 (41%), Gaps = 33/387 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   D    VP  L +  + V  +E        ++L     + L+L+ E+ + +
Sbjct: 40  LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 95

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL  L GS      +   E +K E  +L+
Sbjct: 96  DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 148

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +    L    A V     
Sbjct: 149 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 208

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
             AY  K  A L + D    L      KAGE + + Y+++          + N++L ++Y
Sbjct: 209 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLN----------KSNAELALDY 256

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
           GF++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPY
Sbjct: 257 GFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPY 314

Query: 411 LRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
           LRL  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D
Sbjct: 315 LRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQD 374

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
              L + NL  +  +A  +   EK +L
Sbjct: 375 RE-LKEGNLDSRLAIAVGIREGEKMVL 400


>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 463

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 69/312 (22%), Positives = 130/312 (41%), Gaps = 41/312 (13%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           + ++ +  L  W  KNG      IL     H  ++   + + A++++  GD   +VP +L
Sbjct: 81  RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134

Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
           ++T E +    L      +++  N  +    LA++++ E  ++ KKSFW  Y+  L    
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
               +   +P+ +   +L  L GSP     L+    I R+Y     ++ ++       P 
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSND-----PA 240

Query: 248 D-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
             I  + FT+E ++ A    ++ Q+ V      S     AL+PL       S +      
Sbjct: 241 SVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSALIPLWDMFNHRSGRLSTDFV 299

Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
              +      D  Y A E + ++Y           G + N+  L++ GFV  DN +D + 
Sbjct: 300 KSSNVCVCYADGDYAADEQVYIFY-----------GVRTNADFLVHNGFVYPDNEHDAVK 348

Query: 364 VEAALNTEDPQY 375
           +   ++  DP Y
Sbjct: 349 IRLGVSRSDPLY 360


>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
 gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
          Length = 456

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 111/257 (43%), Gaps = 29/257 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
           A+ DL+ G+   ++P + ++T E VL +     + L    +S L  L  +L+ E+  G  
Sbjct: 64  ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGDC 123

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PY+  +             P+ W E E+ +L  +P + + LE+    K E  EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KTEVQELHT 171

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
                 S  Q    D   + +T++  + A+  V +  V+++     R          LLA
Sbjct: 172 ESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDR----------LLA 221

Query: 294 YSSKCKA--MLAAVDDAVQLVVDRPY-KAGESIVVWYNISFTGHD--FKC-GPQPNSKLL 347
               C     L  ++ + ++ V+  + K      +  N     HD  F C GP  N +LL
Sbjct: 222 QQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLL 281

Query: 348 INYGFVDEDNPYDRLVV 364
           + YGFV  +NP+  + V
Sbjct: 282 LEYGFVAANNPHRSVYV 298


>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
          Length = 440

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 161/387 (41%), Gaps = 33/387 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   D    VP  L +  + V  +E        ++L     + L+L+ E+ + +
Sbjct: 36  LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 91

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL  L GS      +   E +K E  +L+
Sbjct: 92  DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 144

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +    L    A V     
Sbjct: 145 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 204

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
             AY  K  A L + D    L      KAGE + + Y+++          + N++L ++Y
Sbjct: 205 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLN----------KSNAELALDY 252

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
           GF++ +       +   ++  DP + DK  VA+ NG      F +   R       +LPY
Sbjct: 253 GFIEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPY 310

Query: 411 LRLGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
           LRL  +  T    ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D
Sbjct: 311 LRLVALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQD 370

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
              L + NL  +  +A  +   EK +L
Sbjct: 371 RE-LKEGNLDSRLAIAVGIREGEKMVL 396


>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
 gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
          Length = 454

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 117/286 (40%), Gaps = 54/286 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V ASED++  +  F +P S  +++E     + I ++    KL+    L LY+M   K G 
Sbjct: 42  VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
            + W PY   L  Q       ++S ++W++ EL  L GS    +I     G + +Y E  
Sbjct: 97  MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQEKL 147

Query: 231 -------------LDT---VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
                         DT    +   G L   Y +D P ++F+    +            L 
Sbjct: 148 KPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDEDIEHDDLY 202

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWYNISFTG 333
              L +  A+VPL   L A++  C A L A DD    +   +P K GE +   Y      
Sbjct: 203 NEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYNTY------ 254

Query: 334 HDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 378
                G  PN   L  YG+V +E   +D  +VE +++     Y +K
Sbjct: 255 -----GELPNCDFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293


>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 503

 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 110/446 (24%), Positives = 192/446 (43%), Gaps = 51/446 (11%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E L  L +W+ + GLP  K+ ++ +    +       +  S+  + G    +VP+S  +T
Sbjct: 50  ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            ++V+ + +I  L+    L     +AL+L++E+ +   + W  ++  +         A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            PL WSE EL+ L G+   + +    +  + +Y EL+        LF  +    P ++  
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQ 207

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAV 309
            + F  A   V+S V            ALVPL   L+ +     ++ +  LA      A 
Sbjct: 208 LDDFLWAVATVRSRV---HSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQ 263

Query: 310 QLVVD--RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY-DRLVVEA 366
            LVV+  R Y  GE + + +    T  D +   + +S++L++YG +D D P  D  VV+ 
Sbjct: 264 ALVVEAQRDYAEGEVVTMDFGAPLTEEDQE---KLDSQVLLDYGALDADRPQADPGVVQG 320

Query: 367 ------ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 420
                 AL  +D  Y DK  + + NG      F + A  E          L      D  
Sbjct: 321 GFILSLALPEDDKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAF 380

Query: 421 EMQSVISSLG---PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
            ++ +  +      + PVS   ERAV + + +  +A L GY  ++ +D   L D    P 
Sbjct: 381 LLEPLFRNEAWGHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPG 438

Query: 478 KRVATQ-LVRM-EKKMLNACLQVTAD 501
            R+    LVR+ EK+ L+A L    D
Sbjct: 439 TRLEKAILVRLGEKETLDATLAFFED 464


>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
          Length = 436

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 82/380 (21%), Positives = 157/380 (41%), Gaps = 62/380 (16%)

Query: 81  LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           L+ W+ + +G+   KV L+  P           V A+  L+ G+  F +P S  +  E V
Sbjct: 29  LRIWLEEEHGVDMSKVDLQRSPLEG------LGVFANRRLEPGETLFMIPKSCCIYPELV 82

Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
             +  + +  +  KL+       E+  LA +L  EK +G +S + P+I  L         
Sbjct: 83  FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
               PLLW++ E+  L G+    EIL   E ++      + V    G  ++Q+   I TE
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKG--WKQFFQTIETE 191

Query: 253 AFTFEIF----KQAFVAVQS----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
             T E F    + AF +V S      +      L  R  +    P L  ++         
Sbjct: 192 KMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSIT 247

Query: 305 VDDAVQLVVDRPY-----------KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
            D A++   ++ +           + GE +  +Y           G +PN  +L  YGFV
Sbjct: 248 FDTALERDNEKGFPVRVADKGKSIEEGEELFGFY-----------GDKPNWNMLTTYGFV 296

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLR 412
             +       +  +++ +DP +  K  + +  G ++V Q+F +    + + +  ++ Y R
Sbjct: 297 SPNPKCQETTLSVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR--HDTDPMGPLINYFR 354

Query: 413 LGYVSDTSEMQSVISSLGPI 432
           +  +S+ +++  V ++ G +
Sbjct: 355 IREISNEADLTKVQTNYGEM 374


>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
           carolinensis]
          Length = 311

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 98/208 (47%), Gaps = 29/208 (13%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + +KAGE I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 9   DDRCECVALQDFKAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 57

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D  Y  K  V  R G  +  VF +HA  E    + +L +LR+  +++    + +
Sbjct: 58  LGVSKSDRLYAMKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHL 116

Query: 426 IS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDY 472
           I         +LG    PVS   E     +L  + +AR    L  Y  T+SED+A L   
Sbjct: 117 IGEHAIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQ 172

Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
           +L     +A +L   EK++L   ++  A
Sbjct: 173 DLTCNATMAIKLRLGEKEILEKAIKSAA 200


>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
           FGSC 2508]
 gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 504

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 125/313 (39%), Gaps = 51/313 (16%)

Query: 72  SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           S  +E    L  W HK+G    P  +V   E    + + +P    +A+E L +G  A S 
Sbjct: 3   SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58

Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
           P S+ ++    L +  I    TT               N L        YL+ +  +GK 
Sbjct: 59  PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           SFW PYI  L       + A+  P  W+E ++  L G+     I E    +K EY +   
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176

Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARRFALV 285
           +    G      + Q  Y+     FT   F+ + V  +S   +++++    S    F+++
Sbjct: 177 ILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDDFSIL 236

Query: 286 PLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
               PL      ++ +     L +   A +L+ +  Y  G+ +       F  + FK   
Sbjct: 237 ---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQV-------FNNYGFKT-- 284

Query: 341 QPNSKLLINYGFV 353
             NS+LL+ YGF+
Sbjct: 285 --NSELLLGYGFI 295


>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
           magnipapillata]
          Length = 819

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 95/405 (23%), Positives = 159/405 (39%), Gaps = 70/405 (17%)

Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL---LTT-----NKLSELA 158
           HR  + + A+ED++ G+  F+VP  L++          + E    L T     N  S   
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKWLDTHGKSLNDSSGWL 184

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEI 217
            L + LM+E  Q K SFW  Y+  +      G      PL W E E      G P   +I
Sbjct: 185 PLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDI 238

Query: 218 LERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 270
           +   E I+ EY E   ++      + GSL          E ++ E FK+  AFV   S  
Sbjct: 239 IVDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFT 288

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
              +  S      +VP+   +L + S   A L      +Q++  R  K GE +   +   
Sbjct: 289 EDEESPS------MVPMA-DILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTF--- 338

Query: 331 FTGHDFKCGPQPNSKLLINYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDK 378
                   G   N++LL  YG+V+   N YD L++           +     +DP    K
Sbjct: 339 --------GKLGNTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAK 390

Query: 379 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS-- 436
             +  R G   V  F +          D++ +L++ + SD  E++ ++ +     P S  
Sbjct: 391 INLLNRTGIAEVDAFFMFDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFY 449

Query: 437 -PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
              + +  L +  +  K  L      ++ED+  +   N + +K V
Sbjct: 450 HKLLRKLRLSKKTE--KNSLGMTVIDITEDDTEMDIENFNKRKNV 492


>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
 gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
          Length = 504

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/318 (23%), Positives = 123/318 (38%), Gaps = 61/318 (19%)

Query: 72  SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           S  +E    L  W HK+G    P  +V   E    + + +P    +A+E L +G  A S 
Sbjct: 3   SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58

Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
           P S+ ++    L +  I    TT               N L        YL+ +  +GK 
Sbjct: 59  PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           SFW PYI  L       + A+  P  W+E ++  L G+     I E    +K EY +   
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176

Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
           +    G      + Q  Y+     FT   F+ + V  +S   ++++        L+P G 
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER--------LLPEGT 228

Query: 290 ---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
                    PL      ++ +     L +   A +L+ +  Y  G+ +       F  + 
Sbjct: 229 KIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQV-------FNNYG 281

Query: 336 FKCGPQPNSKLLINYGFV 353
           FK     NS+LL+ YGF+
Sbjct: 282 FKT----NSELLLGYGFI 295


>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 713

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/204 (27%), Positives = 90/204 (44%), Gaps = 18/204 (8%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +LM +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185

Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
             +K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++    
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
              L+P    +L +    K    A    V  +V     AG+ I   Y           GP
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNY-----------GP 293

Query: 341 QPNSKLLINYGFVDEDNPYDRLVV 364
           + N +L++NYGF   +NP D  +V
Sbjct: 294 RNNEQLMMNYGFCLPNNPCDYRIV 317


>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
 gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
          Length = 465

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 96/414 (23%), Positives = 165/414 (39%), Gaps = 57/414 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           +K+ E L + K W+  N     P   + L +K   +        + A + ++  D   S+
Sbjct: 34  TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85

Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
           P  ++++   + G      + I E + +  LS     A+++MY  K  +KSFW PY+  L
Sbjct: 86  PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142

Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
                    +  + L +S+ EL  L  S  K   + R +GI+R Y           S F 
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHYE----------STFS 185

Query: 244 QYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYS-SKCKA 300
           +    +P E     ++ Q  F    SCV   +  SLA     +VPL     A   SK K 
Sbjct: 186 RLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDGGMVPLADMFNAEDRSKSKV 243

Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPY 359
           +    D  +          GE I   Y +          P  +S++L++YGF+ DE    
Sbjct: 244 LPKVTDTTLDYYASDDIAEGEQIFTPYGV--------YKPLSSSQMLMDYGFIFDEGTVS 295

Query: 360 DRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 418
           D + +     + ++P    K+ + + N  ++ +VF +       A  D+L Y R+  +  
Sbjct: 296 DNVAITVPVFHNDEPNLSTKQEILEENDIIN-EVFLLQKTDPLPA--DLLLYARVKNLIA 352

Query: 419 TSEMQSVISSLGP---ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
               Q+    L P     P++   E+  L  L +     L  Y   L  D+ +L
Sbjct: 353 KECDQAKKHFLSPNTRNTPLNTRNEKVSLRFLENLIHRYLDSYGTNLESDKNLL 406


>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
          Length = 514

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/288 (25%), Positives = 116/288 (40%), Gaps = 35/288 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASE +  GD A  +P SL+++ E +  +E    L   N ++    L L+ M E+      
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    LA L G+    EI++  + ++++Y+EL  +
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +P     +  T++ F  A     S  + +   S      LVP+   L   
Sbjct: 265 ------LCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 318

Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
            S        VD+A   ++  + RP  AGE   + Y           G  P S LL  YG
Sbjct: 319 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSY-----------GKHPGSHLLTFYG 367

Query: 352 FVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
           F+   DNPYD + ++      D    D+ + AQ +   S Q  H+  G
Sbjct: 368 FLPRGDNPYDVIPLDL-----DTSADDEDITAQSSATTS-QTTHMVRG 409


>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
          Length = 703

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/204 (27%), Positives = 90/204 (44%), Gaps = 18/204 (8%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +LM +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175

Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
             +K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++    
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
              L+P    +L +    K    A    V  +V     AG+ I   Y           GP
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNY-----------GP 283

Query: 341 QPNSKLLINYGFVDEDNPYDRLVV 364
           + N +L++NYGF   +NP D  +V
Sbjct: 284 RNNEQLMMNYGFCLPNNPCDYRIV 307


>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 470

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 99/408 (24%), Positives = 164/408 (40%), Gaps = 64/408 (15%)

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
           FS+P S V      L  +    L  + +L+E   + LAL  ++  + Q   S W PY+  
Sbjct: 55  FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-----------EGIKREYNEL 231
           L  Q       + S + WS++ELA L  S    +I ++            +G++    E+
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
                   S+   Y +DIP  +       +   A    V      +K  L+    ++PL 
Sbjct: 162 ---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEKTILS----MIPLA 214

Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
             L A + +  A L   ++ +++   +P   GE I   Y           G  P S LL 
Sbjct: 215 DMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDY-----------GQLPRSDLLR 263

Query: 349 NYGFV-DEDNPYDRLVVEAAL----------NTEDPQY-QDKRMV----AQRNGKLSVQV 392
            YG+V D  + YD   + A L          +   P+  QDK  V    A+R G      
Sbjct: 264 RYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDESF 323

Query: 393 FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 452
             VH+  ++ +I D L       + D S +++++ S   +   S          LA   +
Sbjct: 324 DLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAILLQ 383

Query: 453 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 500
           AR   Y  T+ EDE +L + +L  +  +A Q+   EKK+L A ++  A
Sbjct: 384 ARENEYSTTVEEDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREAA 431


>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
 gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
          Length = 540

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/263 (25%), Positives = 106/263 (40%), Gaps = 45/263 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
           + AS+D+   +  F +P S ++       N T ++L       T KL EL+    L + +
Sbjct: 90  IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142

Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 215
           MYE K  + +S W  Y   L        L     + W++ EL++LT        G     
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197

Query: 216 EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
            +  R      E+NE D +    GS+    F   P  I   +F  EI             
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
             ++       +++PL   L A + KC A L    D+++++  +P K GE +   Y    
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTY---- 312

Query: 332 TGHDFKCGPQPNSKLLINYGFVD 354
                  G  PNS+LL  YG+V+
Sbjct: 313 -------GELPNSELLRKYGYVE 328


>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 97/440 (22%), Positives = 174/440 (39%), Gaps = 84/440 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  ++Q G+  F +P+ +V+ ++    N+ +A+ L    L     L + ++YE   G+
Sbjct: 50  VVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKN--LGPWLSLVVVMIYEYSLGE 107

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
           +S W  Y + L  +        ++ + WSE E + L  S           + +I E+   
Sbjct: 108 QSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLP 160

Query: 224 IKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           + R + +L            DT       +    GSL   Y +DI          +  ++
Sbjct: 161 LVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYL 220

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
                   L K        +VPL   L A + +  A L   + A+ +   +P KAG+ I 
Sbjct: 221 TDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIF 271

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQ 374
             Y           G  P S LL  YG+V  DN     VVE  L          N ED +
Sbjct: 272 NDY-----------GELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKE 319

Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLG 430
           Y   +++ Q   ++    + +     ++ + D+LP     L      D+ E+Q ++S   
Sbjct: 320 YPQLKLLDQL--EILEDGYCILRPSPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNK 377

Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY-------NLHPKKRVATQ 483
              P+    E  +   L D  ++++  Y  T+ ED+ +L  +           ++ +A Q
Sbjct: 378 HPKPILGAREARI---LLDAAQSKMGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQ 434

Query: 484 LVRMEKKMLNACLQVTADMI 503
           +   EK++L A L +  D +
Sbjct: 435 VRVGEKEILQALLMMLQDFL 454


>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 308

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 123/289 (42%), Gaps = 48/289 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
           + A ED++ G+    +P++ ++T+ER +    +       +L E + LA +L  +    +
Sbjct: 25  LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82

Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
            G +S  +  Y++ L R+ G         L W E ++   L GSP++    ER   +   
Sbjct: 83  NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
             E+            ++P   P         + AF  + S ++ L   +     ALVP 
Sbjct: 136 IEEIRA----------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPW 177

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
              +L +   C A +      V L  DR YK GE +   Y           G +P+++LL
Sbjct: 178 A-DMLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASY-----------GQRPSAELL 225

Query: 348 INYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVA-QRNGKLSVQVF 393
           I+YGF  E  +NP D   +   ++  D +Y D +  A ++ G   V+ F
Sbjct: 226 ISYGFAPEVGENPDDEYEITLGIDPND-RYADAKAAALEKIGLRPVESF 273


>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 411

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/185 (28%), Positives = 84/185 (45%), Gaps = 9/185 (4%)

Query: 315 RPYKAGESIVVWYNISFTGHDFKCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
           R   A + +VVW      G +      +PN +LL+  G + ++N  D L   A L   D 
Sbjct: 166 RAAGARKGVVVWDG---AGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADR 222

Query: 374 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 433
            Y  K  V +  G  + + F V+A R       +L YLRL  V+D + +     +     
Sbjct: 223 YYMMKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADV 277

Query: 434 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 493
            +S   E  +L  L    + RLA Y  +  ED  +    +L PK+R+A +L   EK+++N
Sbjct: 278 ELSQMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIIN 337

Query: 494 ACLQV 498
           A ++ 
Sbjct: 338 ATMEA 342


>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
          Length = 441

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 122/281 (43%), Gaps = 34/281 (12%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           +Q G    S+P S ++T   VL +  +   L + K   S L  L ++L+ E+ +G+ S W
Sbjct: 68  IQPGGMLVSLPESCLLTTSTVL-HSYLGPFLKSWKPRPSSLVALCVFLVCERHRGEASDW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDTVW 235
            PYI  L         +   P  +++T +A L +G   +AE  E+ EG++  Y  +   +
Sbjct: 127 FPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQHLY-AVHQDF 176

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS 295
           FM+      +P   P E  T+E  + A+     C ++ + V + R  +    GP   A +
Sbjct: 177 FMSLQPVLSHP---PEEVLTYEALRWAW-----CSINTRSVFMDRPSSSFLSGPDNYALA 228

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGE----SIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
                +    D  V+   +R     E    S V  Y+ +F  +    G   N +LL+ YG
Sbjct: 229 PFLDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINY----GSHDNQRLLLEYG 284

Query: 352 FVDEDNPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKL 388
           FV   NP+  + VE  L  E    D    +K    + NG L
Sbjct: 285 FVSSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325


>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
 gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
          Length = 455

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/406 (22%), Positives = 165/406 (40%), Gaps = 49/406 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P++ + T+++   +  +  +L   +  LS    LALYL++ K  
Sbjct: 28  VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                     R L  +  R  +A       + + +++ EL    GS   A   +  + + 
Sbjct: 87  ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
            +Y +L        +L  Q+    P + FT E +K A  ++ S  +       A    + 
Sbjct: 137 DDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSETASVRLVA 190

Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           PL   +L +S    +C A      D + ++  + Y+ G+ I ++Y           G  P
Sbjct: 191 PLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYY-----------GSVP 237

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N++LL  YGFV  DNP D   +    +   P Y+ K  +    G  S     +     K+
Sbjct: 238 NNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KD 294

Query: 403 AI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYP 459
            + +++L YLR+  + D S +  +   L  G    V+   E  VL  L D   + L G+ 
Sbjct: 295 PLPNNVLRYLRIQRL-DESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFG 353

Query: 460 ATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
             L + EA L   DY        A  +   E+++L    +   D++
Sbjct: 354 IPLEKLEAQLVAGDYPAGGNAWAAAHVSAGEQRVLTRAKKTAEDLL 399


>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
          Length = 658

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 26/143 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
           V+A  ++ A     ++PN L+++  +VL +E ++++  T+K          +E  CLALY
Sbjct: 48  VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
           + Y K QG KSFW PY+  +++          +   W   +L  L       E +     
Sbjct: 107 IFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLIDEFMYIQ-- 155

Query: 224 IKREYNELDTVWFMAGSLFQQYP 246
                +E+D  W+    L  +YP
Sbjct: 156 -----SEMDKSWYKFKGLMNKYP 173


>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 339

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 109/261 (41%), Gaps = 29/261 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
           + AS +++ G+    VP+  V+  E      VL  E + +     ++ E+  L + +M+E
Sbjct: 68  LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           + +G +S W PY+  L             PL W   E   L G+    ++L RA+     
Sbjct: 128 RWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDA 181

Query: 228 YNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
             ++  +W  + G    ++P   +P     +E+++ A  AV S    L      +  A+V
Sbjct: 182 PTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQAMV 238

Query: 286 PLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           P+   L   +      L   +    +Q++  R   AG  +V  Y           G   N
Sbjct: 239 PVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNY-----------GELSN 287

Query: 344 SKLLINYGFVDEDNPYDRLVV 364
           ++LL  YGFV+  N Y+ + V
Sbjct: 288 AELLRGYGFVERANRYNHIPV 308


>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
 gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
          Length = 403

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/405 (22%), Positives = 169/405 (41%), Gaps = 54/405 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGK 172
           A+ DLQ      +VP  L++++     +     +   + L  +    LAL+++ E+   +
Sbjct: 26  ATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLCERLH-E 84

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L R       +  + L +S  ++  L GSP+  E L++  GI ++Y    
Sbjct: 85  DSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY---- 133

Query: 233 TVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
            V+F            +P + +FTF+ F+ A   V +    ++  S     AL+    P+
Sbjct: 134 -VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALI----PM 188

Query: 292 LAYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
               + C        DD+   V+ +  +P +AG+ + ++Y           G + N+  L
Sbjct: 189 WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFY-----------GRRNNADRL 237

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-D 406
            + GFV  +   D + ++  ++  D  Y  K  +    G L          R  E IS +
Sbjct: 238 FHNGFVYTEAEEDWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPISPE 296

Query: 407 MLPYLRLGYVSDTSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR---- 454
           +  +LR+ +  +T E++          ++ L  +C     +      +L  +F  R    
Sbjct: 297 LRIFLRV-FSMNTGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLI 355

Query: 455 LAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 497
           L  Y  T  EDEA+L+  D  LH   R   +L   E+ +L + L+
Sbjct: 356 LGQYKTTKQEDEALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398


>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
 gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
          Length = 459

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 80/324 (24%), Positives = 132/324 (40%), Gaps = 57/324 (17%)

Query: 78  LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           L +   W+  N +   P  ++ + EK   +        + A + ++  +   SVP  +++
Sbjct: 35  LNEFNKWLINNKVYKNPKIEIKVLEKYGRS--------IVAKQSIKKNEKLISVPKLIIM 86

Query: 135 T----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
           +        L NE I E   +  +S     A++LMY  K   KSFW PY+  L ++    
Sbjct: 87  SNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKE---- 140

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
                + + +SE EL  L  S  K   + R +GI+R YN   T     G + +  P    
Sbjct: 141 ---FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHYNSTFTRLSNRG-IAEFSPTSTQ 196

Query: 251 T---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL-------VPLGPPLLAYS-SKCK 299
           T   + +T E+F  A     SCV        +R F+L       VPL     A   SK K
Sbjct: 197 TLQQKGYTLELFTWAL----SCV-------WSRAFSLSDSDGGMVPLADMFNAEEISKSK 245

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNP 358
                 D  +        + GE I   Y +          P  +S++L++YGFV D   P
Sbjct: 246 VQPKVTDSTLDYYASDDIEIGEQIFTPYGV--------YKPLSSSQMLMDYGFVFDHGTP 297

Query: 359 YDRLVVEAAL-NTEDPQYQDKRMV 381
            D + +   + + ++P  Q K+ +
Sbjct: 298 SDNVAISVPIFHPDEPNIQVKQSI 321


>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 455

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/406 (22%), Positives = 165/406 (40%), Gaps = 49/406 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P++ + T+E+   +  +  +L + +  LS    LA+YL++ +  
Sbjct: 28  VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                     R    +  R  +A       + + ++E EL    GS   A   +  + ++
Sbjct: 87  ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
            +Y +L         L  Q+    P + FT E +K A  ++ S  +    VS      LV
Sbjct: 137 DDYRQLLV------PLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDF-AVSGTTSVRLV 189

Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
                +L +S    +C A      D + ++  + Y+ G+ + ++Y           G  P
Sbjct: 190 APLADMLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQVFIYY-----------GSVP 237

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N++LL  YGFV  DNP D   +    +   P Y+ K  +    G  S     +     K+
Sbjct: 238 NNRLLRLYGFVLPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KD 294

Query: 403 AI-SDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYP 459
            + +++L YLR+  + D S +  +   L  G    VS   E  VL  L D   + L G+ 
Sbjct: 295 PLPNNVLRYLRIQRL-DESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGSLLEGFG 353

Query: 460 ATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
             L + EA L   DY        A  +   E+++L    +   D++
Sbjct: 354 IPLEKLEAQLAAGDYPAGGNAWAAAHVSAGEQRVLTRAKRTAEDLL 399


>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 567

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 97/471 (20%), Positives = 175/471 (37%), Gaps = 69/471 (14%)

Query: 65  AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           A S ++V   E  L  +   W+   G     CKV +    S          + A++D++ 
Sbjct: 56  ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109

Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
           G+    +P++L +T            +L  ++L +      L+++L+ E      S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELSN-PTSEWGP 168

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YI+ L +Q         +   W   E     GSP     +    G  R+Y       ++ 
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY------CYLY 215

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLGP--PL 291
             + +     +P  +FT++ F  A   VQS     Q    A        AL+P       
Sbjct: 216 SMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFWDFCNH 271

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
            +  SK  +      + +     + +K GE + ++Y           GP+ N++LL++ G
Sbjct: 272 SSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFY-----------GPRDNTQLLMHAG 320

Query: 352 FVDEDNPYDRLVVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
           F  + N +D    E  L     + ++    ++ +R  +  V V         E   +++P
Sbjct: 321 FATKTNLHDSYPFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIP 380

Query: 410 YLRLGYVSDTSEMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFK 452
           + R+ Y     E +++                  L P+    ++   E      L    K
Sbjct: 381 FYRI-YALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALK 439

Query: 453 ARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
            +LA YP TL EDE  L   N    +R        EKK+L+  ++    +I
Sbjct: 440 GKLASYPTTLEEDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489


>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
          Length = 440

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 78/334 (23%), Positives = 131/334 (39%), Gaps = 45/334 (13%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           SR V    + +  +LK W+         +I    P           + +   LQ G    
Sbjct: 22  SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
           S+P S +++ + V+    +   +T  K   S L  L  +L+ EK  G +S W PY+  L 
Sbjct: 76  SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
           +           P+ W E E+  L   P +A+  E+   ++  +      +     LF +
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186

Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSS-- 296
              +I    FT+     A+  V +  V+L+   L R F+  P    L P   LL +S   
Sbjct: 187 AVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEPDTCALAPYLDLLNHSPDV 241

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
           + KA         ++V     +  E + + Y           GP  N +LL+ YGFV   
Sbjct: 242 QVKAAFNEKTRCYEIVAVSSCRKHEEVFICY-----------GPHDNHRLLLEYGFVSTR 290

Query: 357 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
           NP+  + V   +  +     DK+M    N K+S+
Sbjct: 291 NPHACVYVSRDILVKYLPSTDKQM----NKKISI 320


>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 499

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 80/202 (39%), Gaps = 23/202 (11%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  +G+ SFW PYI  L +        +  P  W E ++ +L G+     I E  
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168

Query: 222 EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 276
             IKREY +   V     F     + Q  Y      FT   F+ + +  QS   ++   +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWYNISF 331
             AR      +  PL   ++         D     +  QL+    Y+ G+ +       F
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQV-------F 281

Query: 332 TGHDFKCGPQPNSKLLINYGFV 353
             + FK     NS+LL+ YGF+
Sbjct: 282 NNYGFKT----NSELLLAYGFI 299


>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 709

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 103/234 (44%), Gaps = 35/234 (14%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
           A +LM +    +  FW PYIR L      G+  + +PL + E E  L +L G  + A   
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161

Query: 219 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 271
           ER   I     +R Y  L  + F      + Y +D+   A T  I  +AF A  + S + 
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
            L+   + R   L+PL   + A + K   K    A  D++ LVV     AGE +      
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEV------ 268

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
              G+++  GP+ N +L++NYGF   DNP +  VV      + P  Q K    Q
Sbjct: 269 ---GNNY--GPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQ 317


>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
          Length = 689

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
           + A +D+    A   +PNS ++++ RV  +  + ++L+ ++        ++  CLA++LM
Sbjct: 74  IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
            E  QG++SFW PYI  ++          ES LL  W + E+  L       +  E  + 
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183

Query: 224 IKREYNELDTV 234
           I+ E+N+L  +
Sbjct: 184 IEDEWNKLSKI 194


>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 227

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 81/190 (42%), Gaps = 25/190 (13%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
           W   NG     + L+  P         +  AA +D+Q G     VP  +++T    +G  
Sbjct: 11  WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61

Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
              +  LL  +   K  E   L+++L+ E   G  SFW PYI  L R       +  + L
Sbjct: 62  KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114

Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
            +S  EL  LTGS    E L+    I R+Y     + F    L +  PY    + FT+++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDL 170

Query: 259 FKQAFVAVQS 268
           ++ A  AV +
Sbjct: 171 YRWAVSAVMT 180


>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
          Length = 510

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 21/159 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL+       +P S ++T        +I+  L   K+ +    ++ L+YE   G 
Sbjct: 59  VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  YI+ L        L+V+ P+LW++ +L  L G+  +  + E  E +   YN+  
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK-- 165

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
              ++   L   +P       F+ + FK+A     SC+V
Sbjct: 166 ---YIKSKLIANHPDVFNEHVFSLDNFKRA-----SCLV 196


>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
          Length = 318

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 56/241 (23%), Positives = 108/241 (44%), Gaps = 45/241 (18%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
             +VP+   +L   ++  A +   +D + +V  RP KAGE I+ +Y           GP 
Sbjct: 71  LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYY-----------GPH 118

Query: 342 PNSKLLINYGFVDEDN--------PYDRLVVEAALNTE---------------DPQ-YQD 377
           PNS+LL  YG+V   +        P+D  +V++ L  +               DP+ ++D
Sbjct: 119 PNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFED 176

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
             ++ + +G+   +       + +E  +++   L+       + ++++    G + P   
Sbjct: 177 VFVLERDSGEPDSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKR 229

Query: 438 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
             +      +A   +  LA YP T  EDEA+L   NL  ++R+A ++   EK++L   LQ
Sbjct: 230 KRDEVYQHVVAAALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQ 289

Query: 498 V 498
           +
Sbjct: 290 M 290


>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
 gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
          Length = 460

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 76/393 (19%), Positives = 167/393 (42%), Gaps = 66/393 (16%)

Query: 133 VVTLERVLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLP 178
           ++    +L NE I E+              + +N     + LA+ L+ E  + KKSFW P
Sbjct: 102 LIASRNILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFP 161

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YI  L  +     L       W   EL ++ GS          + +K +Y  ++      
Sbjct: 162 YIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNK----- 211

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
             +FQ+     P++ F ++ ++ +   + S  + LQ+    ++  L+P    LL ++   
Sbjct: 212 -EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQE---TKKVVLIPY-IDLLNHNPFS 266

Query: 299 KAMLAA----VDDAVQLVV--DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
            + ++     + D+ ++VV  D+     + + + Y           G + N +LL  YGF
Sbjct: 267 SSFISYRKIPLSDSKEIVVYSDKNCNKFDQLYISY-----------GQKSNLELLNLYGF 315

Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 412
           + E NPYD +++  +++ +D  +++K+     N K     + +   +  +   +M+ +++
Sbjct: 316 IAERNPYDSVIIRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIK 372

Query: 413 L----GYVSDTSEMQSVISSLGPICPVSPC----MERAVLDQLADYFKARLAGYPATLSE 464
           +      ++D +   + I +      +  C    +E+++     DY   R       L E
Sbjct: 373 ICLFNTNINDKNFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDYENLR----NIMLKE 428

Query: 465 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           +   ++D      ++++ +   +EKK+LN  L+
Sbjct: 429 NLLHISD-----NQKISIKYNALEKKILNRFLE 456


>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Galdieria sulphuraria]
          Length = 546

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 88/383 (22%), Positives = 146/383 (38%), Gaps = 63/383 (16%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           E   +L++W+  NG+P     +K KP    HN          A   L+ G+   ++P   
Sbjct: 71  EKTEELENWLFDNGVPS----IKGKPVLSPHN-----CRTFRAKIPLKLGEEVLAIPERF 121

Query: 133 VVTL---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
            +T    E++LG          + LS+   +A  L+ E  + + SFW P+I  L      
Sbjct: 122 WLTKQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDEL 174

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-D 248
                   L+WS  E  YL  S T  +IL   E     + EL+T       LF ++ Y  
Sbjct: 175 HHF-----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT------ELFPKFLYPQ 223

Query: 249 IPTEAFTFEIFKQAFVAVQSCVVHLQKVS-----------LARRFALVPLGPPLLAYSSK 297
              + FT   F  A   VQS  ++    S           L  +++L+        Y   
Sbjct: 224 YDVKYFTLPYFTWALSIVQSFGLYDIMDSCPLVIVPGLEWLTYKYSLITEESFFRQYFHI 283

Query: 298 CKAMLAAV------DDAVQLVVDRPYKAGESIVVWY--NISFTGHDFKCGPQPNSKLLIN 349
               L  V      +  +++      K GE + + Y  N+S      + G +      ++
Sbjct: 284 SNVSLIRVGPFFTQERRLKITASEDLKVGEPVSLVYEGNVSLIDTFCRWGWK------LD 337

Query: 350 YGFVDEDN--PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
            G +DE+         +  A+ T D  + DK  +        +Q F +     KE +  +
Sbjct: 338 LGALDEEQLLKMGSYEISFAVTTTDQFFDDKEDILDAQRLELLQTFELRYDMSKELLQRI 397

Query: 408 LPYLRLGYVSDTSE--MQSVISS 428
           LP+LRL  + D     ++SV  S
Sbjct: 398 LPFLRLICLKDKDSFILESVFRS 420


>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1033

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 71/306 (23%), Positives = 118/306 (38%), Gaps = 68/306 (22%)

Query: 70  VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           VV +   D  D   SWM  NG            S +EK    H +A    L  G      
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562

Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
               VV L+ +   ET+  L           +   ++      A  L+ EK +G  S W 
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
            YI  L +        +  P+L  + EL  +   P   E+++  + I+  ++ L +V  +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670

Query: 238 AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           AG+ F++Y +    + + AFT  +F     A    + ++  ++    +    +  P+   
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG- 729

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
                         V++V  R  K GE +      SF       GP+ N  L + YGFV 
Sbjct: 730 ------------GKVEIVARRDIKKGEELFA----SF-------GPRANDNLFLYYGFVL 766

Query: 355 EDNPYD 360
           +DNP+D
Sbjct: 767 KDNPFD 772


>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
           str. Neff]
          Length = 408

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 94/406 (23%), Positives = 155/406 (38%), Gaps = 56/406 (13%)

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
           A +    VP SL++     L  E +  +    K +  A      LAL++++E ++   SF
Sbjct: 4   ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PY   L  +       V  P+ W++ ++  L GSP  A +L + +  +  + E     
Sbjct: 63  WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTE----- 110

Query: 236 FMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
                + ++YP  + +  +      +++   F+     + S       +       +VP+
Sbjct: 111 -HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPM 169

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
              +    +         DD  Q     V +PY  GE + + Y          C    N 
Sbjct: 170 ADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITY----------CT-DSNF 218

Query: 345 KLLINYGFVDEDN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
           +LL  Y  + EDN         D   +      E  +    R + QR   L+ Q + V  
Sbjct: 219 ELLKTYAMMVEDNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV-- 274

Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 457
            + +E   D++  LRL ++  T       S+     PVS   E  V D +A   K  L+ 
Sbjct: 275 -KSEEFPLDLVQALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQ 331

Query: 458 YPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 502
           YP T  ED AML  D  L    R+A    R +K  L     V A+M
Sbjct: 332 YPITAQEDAAMLAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377


>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 705

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 63/262 (24%), Positives = 117/262 (44%), Gaps = 25/262 (9%)

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           ++FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
            V  + ++AGE I ++Y           G + N++ +I+ GF  ++N +DR+ ++  ++ 
Sbjct: 406 CVALQDFQAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSK 454

Query: 371 EDPQYQDKRMVAQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV-- 425
            D  Y  K  V  R G    +   VF +H   E    + +L +LR+  +++    + +  
Sbjct: 455 SDRLYAMKAEVLARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLG 513

Query: 426 ------ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKK 478
                 I +LG    PVS   E  +   L D     L  Y  T  ED+A+L + +L  + 
Sbjct: 514 ENAIDRIFTLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPART 573

Query: 479 RVATQLVRMEKKMLNACLQVTA 500
           ++A +L   EK++L   +Q  A
Sbjct: 574 KMAIKLRLGEKEILEKAVQSAA 595


>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
 gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
          Length = 604

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 113/242 (46%), Gaps = 46/242 (19%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           T++ +    L+L+L+ EK +GK SFW PYIR L         +  +P+ ++E+EL  L+ 
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282

Query: 211 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
           S     + E+A  +K+E    +N+L+   F+   L +          FTF+ F+ A+  +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329

Query: 267 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 317
           ++  ++ +      L+ +      LVP+   L+ +S   KA           ++ V  PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQ 376
           +  + + + Y           G + N++L++ +GF   +NP D + +  +   E  PQ  
Sbjct: 389 RKYDQVFISY-----------GFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQIN 437

Query: 377 DK 378
           D+
Sbjct: 438 DE 439


>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
 gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
          Length = 529

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 92/396 (23%), Positives = 156/396 (39%), Gaps = 64/396 (16%)

Query: 48  RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
           RR+ R+    +S            +    D   L  W+  NG     V   +  + +E  
Sbjct: 24  RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83

Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
                    A+  L  G  A  VP SL +T E  + ++ + + L        L+    LA
Sbjct: 84  GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           L L+ E+++G++S +  Y++ L          + SPL W+  EL+ + G+    ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192

Query: 222 EG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            G    ++  Y  L T       +F ++   +D+   AF+ + F+ AF  ++S       
Sbjct: 193 AGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMA 242

Query: 276 VSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPY 317
                  ALVP G  L+ +SS  +A       +  AV         DD V   V  DR  
Sbjct: 243 PCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRAL 301

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 377
              E + V YN   T          ++   +++GFVD   P     +  ++  +DP   D
Sbjct: 302 NVNEPLYVNYNPEGT----------DTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFD 351

Query: 378 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
           K  V    G      F + A  + +   D+  +LRL
Sbjct: 352 KLDVLDVCGLGETPTFTLRAYSDPD--PDLRTFLRL 385


>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
          Length = 1078

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 103/475 (21%), Positives = 188/475 (39%), Gaps = 108/475 (22%)

Query: 61  DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
           D  + G    ++ +++D  ++++W   +  P   +   EK            + +S D++
Sbjct: 12  DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
            G+   S+P    +++ +V   + +  L   NK+ +L   A        LY  Y  +   
Sbjct: 61  EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
              S W   + E+          + S L +S+ E   L GSP   +++ +    K  +  
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPL 287
           L        SLF++  +     A T++  K A+  V S  ++ +  +L      F  V L
Sbjct: 167 L-------KSLFKEQQFS--KCAMTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVL 217

Query: 288 GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
            P  L Y +  +   AA D      A+++V  +P K GE I + Y           G Q 
Sbjct: 218 AP-FLDYFNHAEDAQAAYDFDYDESAIKVVALQPIKKGEQIFLNY-----------GNQD 265

Query: 343 -NSKLLINYGFVDEDNPYDRLV---VEAALNT---EDPQYQDKRMVAQRNGKLS--VQVF 393
            NS LLI+YGF+D+ +     V   VE  LNT    DPQ  +K  +  +  + +  +++F
Sbjct: 266 CNSDLLIHYGFIDQSSTAKHCVNVLVEELLNTIPASDPQLIEKTELLTKAFEQNERMKLF 325

Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
                 E   IS  L Y                               ++L  L      
Sbjct: 326 KDSLTEELLKISKYLSYKNF----------------------------SLLPYLKSLIDM 357

Query: 454 RLAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEK----KMLNACLQVTAD 501
           ++  YP T+ ED A++   T++    ++   + ++R+++    K + A +QV  D
Sbjct: 358 KMKAYPTTMEEDRAIIEATTEFEKLSQRSKMSIIMRLQEKETLKEIGALIQVKID 412


>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 492

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 120/291 (41%), Gaps = 66/291 (22%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+  G+  FS+P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G++S W PY + L +       + ++ + WS +EL  L GS   ++I    EG +    
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151

Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
             D++  M   + +  P   P+         EA +  + + A +    + +    ++KV 
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209

Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
                                    +VPL   L A + +  A L   DD++ +   +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269

Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
            GE I   Y           G  P + LL  YG+V  DN     VVE +L+
Sbjct: 270 VGEEIFNDY-----------GELPRADLLRRYGYV-TDNYAQYDVVELSLD 308


>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 495

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 99/426 (23%), Positives = 168/426 (39%), Gaps = 59/426 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQG 171
           V A  DL AG+    VP SL++ +E    +  +  +L   +LS+   +A +L+YE  +  
Sbjct: 79  VFALRDLAAGETVLRVPLSLLLNVEHASAS-PLGGILDDFRLSDAEAMAFWLIYELTRPE 137

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           + S WLPY+  L       QL     + +   E+  L  SP       R   ++ ++ + 
Sbjct: 138 RASPWLPYLESL--PASIKQLT----MFYDPFEMKRLQASPVAEFTSRRTVKMRNKFGKY 191

Query: 232 DTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQAFVAVQSC------VVHLQKVSLAR 280
                    + +  P      + P E  T + F  A +AVQ        V H       R
Sbjct: 192 RE------QISKHRPAHLAEIEFPVELITVDDFLWA-MAVQFTRLITVQVKHPADGEWER 244

Query: 281 RFALVPLG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              LVPL       P    + +C   L +     +    RP   G+ ++  Y        
Sbjct: 245 TKCLVPLADLLNTAPADQINVECATNLDSTH--FECATIRPVAEGQELLTPYG------- 295

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQDKRMVAQRNGKLS 389
                  N +L+++YG    +NP D + +      E A+  +        M   R  +L 
Sbjct: 296 -GAEQLSNGQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQ 354

Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPICPVSPCMERAVLDQ 446
           + V        KE    +L + R+ YVS   D S+++ V+  +     ++P  ER  L+ 
Sbjct: 355 LPVLDHFESIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHRAINPSNERRALEL 409

Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EKKMLNACLQVTADM 502
           L       +  Y  T+ EDE ML + +    P       +V R+ EK++L++  Q+    
Sbjct: 410 LLQLTNEMILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEKRILSSLWQLLDSA 469

Query: 503 IMLLPD 508
           I  LP+
Sbjct: 470 IEALPE 475


>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 492

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 120/291 (41%), Gaps = 66/291 (22%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+  G+  FS+P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G++S W PY + L +       + ++ + WS +EL  L GS   ++I    EG +    
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151

Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
             D++  M   + +  P   P+         EA +  + + A +    + +    ++KV 
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209

Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
                                    +VPL   L A + +  A L   DD++ +   +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269

Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
            GE I   Y           G  P + LL  YG+V  DN     VVE +L+
Sbjct: 270 VGEEIFNDY-----------GELPRADLLRRYGYV-TDNYAQYDVVELSLD 308


>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
 gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 121/313 (38%), Gaps = 62/313 (19%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
           +W+ K+G      I  E   + +  R    V AS+++   +  F +P + ++++E  +L 
Sbjct: 13  AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E  A  L+   L     L L ++YE   G  S W PY   L  +         + + W+
Sbjct: 70  TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120

Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
           E ELA L  S    +I                            E+A+   +E      +
Sbjct: 121 EDELAELQASAVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNL 180

Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             M   GSL   Y +D+     T E+ ++   A +     L K        +VPL   L 
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
           A + +C A L    D +++   +P +AGE I   Y           GP P S LL  YG+
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDY-----------GPLPRSDLLRRYGY 281

Query: 353 VDEDNPYDRLVVE 365
           V  DN     VVE
Sbjct: 282 VT-DNYAQYDVVE 293


>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
 gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
          Length = 266

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/253 (24%), Positives = 104/253 (41%), Gaps = 40/253 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK-- 172
           AS  ++AG+    +   L++    + G  +     T    ++LA + L   Y K Q K  
Sbjct: 25  ASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERY-KGQAKLQ 83

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I  E+ ++ 
Sbjct: 84  SSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQ 138

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
               +   LF +          + E FK  +  V S     + +++     LV +  P+L
Sbjct: 139 NALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLVMI--PML 182

Query: 293 AY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
            +     +S  K     + +   +  DR Y   + I  W N          G   N++L 
Sbjct: 183 DFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQI--WINY---------GDLSNAELA 231

Query: 348 INYGFVDEDNPYD 360
           ++YGF   +NPYD
Sbjct: 232 LDYGFAVPENPYD 244


>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 85/386 (22%), Positives = 162/386 (41%), Gaps = 31/386 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP  L +  + V  +E I ++     L     +AL+L+ E+ +  
Sbjct: 82  LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCIG--LKPWLAVALFLIRERSR-S 137

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL+ L G+          + ++ EY  L+
Sbjct: 138 NSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVENEYRRLE 190

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +         A V     
Sbjct: 191 EEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDH 250

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
             AY  K  A L + D    L      KAG+ + + Y+++          + N++L ++Y
Sbjct: 251 --AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLN----------KSNAELALDY 298

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLP 409
           GF++ +   +   +   ++  DP + DK  +A+ NG      F + ++      +   L 
Sbjct: 299 GFIEPNADRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLR 358

Query: 410 YLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
            + LG  +D   ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+ ED+
Sbjct: 359 LVALG-GTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQ 417

Query: 467 AMLTDYNLHPKKRVATQLVRMEKKML 492
             L +  L  +  +A  +   EK++L
Sbjct: 418 K-LKEAKLDSRHAIAVGIREGEKQLL 442


>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
          Length = 504

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 20/176 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +DL+  +    +P   V++ +      +IA +L   +L E    ++ LMYE  +G 
Sbjct: 44  IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  YI+ +          ++ P+LW +  + YL G+  +  ++E  E ++ +Y E  
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE-- 150

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
                   + + +P       FT E FK A   V S   ++ +       +LVPL 
Sbjct: 151 ----DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SLVPLA 199


>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 161/387 (41%), Gaps = 33/387 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP  L +  + V  +E I ++ +   L     +AL+L+ E+ +  
Sbjct: 82  LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCSG--LKPWLAVALFLIRERSR-S 137

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL+ L G+          + ++ E+  L+
Sbjct: 138 DSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLE 190

Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
               +     + +P  I  + F   F I + +AF  +++  +V +    L    A V   
Sbjct: 191 EEIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTD 248

Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
               AY  K  A L + D    L      KAG+ + + Y+++          + N++L +
Sbjct: 249 DH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLN----------KSNAELAL 296

Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
           +YGF++ +   +   +   ++  DP + DK  +A+ NG      F +   R         
Sbjct: 297 DYGFIEPNTDRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPY 356

Query: 409 PYLRLGYVSDTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
             L     +D   ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+ ED
Sbjct: 357 LRLVALGGTDAFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEED 416

Query: 466 EAMLTDYNLHPKKRVATQLVRMEKKML 492
           +  L +  L  +  +A  +   EK +L
Sbjct: 417 QK-LKEAKLDSRHAIAVGIREGEKNLL 442


>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 232

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/228 (24%), Positives = 95/228 (41%), Gaps = 22/228 (9%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           AL+P+       + +      A  D  +    R +K GE + + Y           GP+ 
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISY-----------GPRT 66

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           NS   ++ GFV  DN  D   +   ++  D   +++  +  +    SV  F +  G   E
Sbjct: 67  NSDFFVHSGFVCMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TE 124

Query: 403 AISD-MLPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARL 455
            ISD +L +LR+ +    +E+   + S          C +   +E  V   L    +  +
Sbjct: 125 PISDTLLAFLRV-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLI 183

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
           A YP TL ED  +L +  L   K++A QL   EK++L+  L+     I
Sbjct: 184 ANYPTTLKEDLELL-ETTLPQMKKMAVQLRVTEKRILSGALEYVEQWI 230


>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 475

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 114/292 (39%), Gaps = 63/292 (21%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           V A+++++  +  F +P S V+++E  +L  E      T + L     L L ++YE   G
Sbjct: 40  VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------------- 217
             S W PY   L  +         + + WSE ELA L  S   A+I              
Sbjct: 98  DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANEAFLGQLV 150

Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
                             ++AE ++ E N   T+    GSL   Y +DI           
Sbjct: 151 PVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT------P 202

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           +  V  +      +  +L +   ++PL   L A + +C A L      +++   +P KAG
Sbjct: 203 RKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKALKPIKAG 260

Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE 371
           E I   Y           GP P S LL  YG+V E+   YD + V   L +E
Sbjct: 261 EEIFNDY-----------GPLPRSDLLRRYGYVTENYAQYDVVEVPMELVSE 301


>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
 gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
          Length = 400

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 112/266 (42%), Gaps = 35/266 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  ++AG+    +P+ LV+T E++  ++ + +LL+T    +   L L ++ E+ +G+ S
Sbjct: 14  AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PY+  L            S + W + EL +L  +       ER E I  EYN +  V
Sbjct: 70  RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                      P+    +   F+ F  A+  V   V      +L+   ++ P       +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRPF-VDFCNH 172

Query: 295 SSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
                A ++     DA  ++ +R Y  GE + + Y           G + N+ L ++YGF
Sbjct: 173 DPVSHATVSHDTCKDAT-IIAERDYTKGEEVFISY-----------GKRSNAVLAVDYGF 220

Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDK 378
           V  +N  D+  +   +   DP  + K
Sbjct: 221 VLPNNLSDQAELWMEIPWNDPLREKK 246


>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 474

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 98/421 (23%), Positives = 166/421 (39%), Gaps = 70/421 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-LALY-LMYEKKQ 170
           + A  D+   +  F++P   V+     LG+     +L      E+ C LAL  ++  + Q
Sbjct: 42  LVAQSDIGEDEVLFTIPRDAVLNTTTALGSADNPAIL------EMPCWLALTAIILTEGQ 95

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY- 228
            + S W PY+  L  +       ++S + WSE+EL  L  S    +I    AE +  E+ 
Sbjct: 96  QEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEHI 148

Query: 229 ------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-----CVVHLQKVS 277
                 N    +     S+   Y +DIP         K+     +S      +V   +  
Sbjct: 149 SPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPESPEDGDDLVSDNEEE 200

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
                +++PL   L A +    A L   ++ +++   +P   GE I+  Y          
Sbjct: 201 ENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDY---------- 250

Query: 338 CGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDP-------------QYQ 376
            G  P S LL  YG++ D+   YD  V E       A+L+TE P             + +
Sbjct: 251 -GQLPRSDLLRRYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLE 307

Query: 377 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 436
            +  +AQR G         H G +  +I D L  L    + D   + ++ +S   +   S
Sbjct: 308 QRVELAQREGVYEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRS 367

Query: 437 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
                 V   L    ++R   Y  T+  D+A+L   NL  +KR+A ++   EK +L   +
Sbjct: 368 KLATSLVGQILTKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAI 427

Query: 497 Q 497
           Q
Sbjct: 428 Q 428


>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
          Length = 903

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 21/200 (10%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T V G+R    +  +   +L  W+H  G+     I +   S  +       V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427

Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
           G     +P  L++ +   L  +  I  +L+   ++  +   L LY+++EK   G  S W 
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           P+   L            SPLL+  TEL  L G+    E  E  +G++  +  L      
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHESL------ 534

Query: 238 AGSLFQQYPYDIPTEAFTFE 257
            G L + YP   PT+AFT+E
Sbjct: 535 -GPLAEAYPALFPTDAFTYE 553


>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
          Length = 536

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 94/204 (46%), Gaps = 21/204 (10%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 234 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 282

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------ 419
             ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++       
Sbjct: 283 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHL 341

Query: 420 ---SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
              S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L  
Sbjct: 342 LGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSV 401

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           +  +A +L   EK++L   ++  A
Sbjct: 402 RATMAIKLRLGEKEILEKAVKSAA 425


>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
          Length = 458

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/204 (24%), Positives = 96/204 (47%), Gaps = 21/204 (10%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 156 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 204

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +
Sbjct: 205 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHL 263

Query: 426 --------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
                   I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  
Sbjct: 264 LGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSV 323

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           +  +A +L   EK++L   ++  A
Sbjct: 324 RATMAIKLRLGEKEILEKAVKSAA 347


>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
          Length = 486

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 77/159 (48%), Gaps = 18/159 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ + AGD    VP +  +T + +  +     +L +N++  +  LA  L+ EKK G+KS
Sbjct: 75  ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W+PYI  L +        + S + W E EL+ +  S    E +++   I+++++     
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVV 271
            F+A +  Q  P  I TE    E F  A+   +   C+V
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALGEKVLCIV 217


>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
          Length = 433

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 94/402 (23%), Positives = 164/402 (40%), Gaps = 66/402 (16%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
           A+ +L++    FS+P SLV+++      +++   +E+ T   +  + CL    MYE+   
Sbjct: 39  ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNE 230
             S W  Y+  +           +S + WS+ EL  L GS    +I  E AEG    Y+ 
Sbjct: 94  PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSI 144

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------------ 278
           L         +F+      P EA++  ++ +    + S   H+                 
Sbjct: 145 LVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAA 198

Query: 279 ---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
                   +VP+   L A S    A L    DA+ +   +   AGE I   YN       
Sbjct: 199 YHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQIFNTYN------- 251

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFH 394
                 PN+ LL  YG VDE N  D + + A L   +D          +R   L  ++  
Sbjct: 252 ----DPPNADLLRRYGHVDEVNLNDNVEISADLIGCKD---------LERVDWLLDRLDD 298

Query: 395 VHAGREKEAI-SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
           V+   + E +  D +  +++   +  SE + +  +     P    ++ A   ++ +  + 
Sbjct: 299 VYTLTQAEDLPEDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQM 354

Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 494
           RLA Y +T+ EDE++L    +   +  A  LVR+ EK++L A
Sbjct: 355 RLAQYSSTIEEDESLLASSTMLTSRSRAALLVRLGEKRILAA 396


>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 45/269 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
           V A++D+  G+    VP+  V+  E    +E + +   TN       E   L L LM EK
Sbjct: 32  VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR-- 226
           K GK S W  Y+  L +           PL W   +L  L G+     ++E+  G K   
Sbjct: 92  KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGCKAMP 141

Query: 227 --------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
                   ++N +  + F+  +   + P++  +    + ++  A V+  S  +   +   
Sbjct: 142 DRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGEDRFQ- 198

Query: 279 ARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
               A+VP+   L   +      L   A   A++++       GE ++  Y         
Sbjct: 199 ----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSY--------- 245

Query: 337 KCGPQPNSKLLINYGFVDED-NPYDRLVV 364
             G  PNS+LL  YGFV+ D NP+D L V
Sbjct: 246 --GDLPNSELLRRYGFVETDPNPHDCLEV 272


>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 232

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/228 (24%), Positives = 95/228 (41%), Gaps = 22/228 (9%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           AL+P+       + +      A  D  +    R +K GE + + Y           GP+ 
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISY-----------GPRT 66

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           NS   ++ GFV  +N  D   +   ++  D   +++  +  + G  SV  F +  G   E
Sbjct: 67  NSDFFVHSGFVYMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TE 124

Query: 403 AISD-MLPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARL 455
            ISD +L +LR+ +    +E+   + S          C +   +E  V   L    +  +
Sbjct: 125 PISDTLLAFLRV-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLI 183

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
           A YP TL ED  +L +  L   K++A QL   EK++L   L+     I
Sbjct: 184 ANYPTTLKEDLELL-ETTLPQIKKMAVQLRVTEKRILLGALEYVEQWI 230


>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
           C-169]
          Length = 685

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/182 (29%), Positives = 77/182 (42%), Gaps = 29/182 (15%)

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
           +E E++ L G+P     +E  + I+ +Y     V     +L   YP DI  +  T + F 
Sbjct: 65  TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPV---LQALTAAYPDDITPDLVTEDKFI 121

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 312
            A     S  + ++ V  A R  LVP+         P ++ Y       L A  D+++L 
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRY-----GRLDAATDSLRLR 176

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNT 370
             R   AGE   + Y           GP PN KLL+ YGF   DNP+D   +  EA  N 
Sbjct: 177 AFRHCAAGEQCFLSY-----------GPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNE 225

Query: 371 ED 372
            D
Sbjct: 226 GD 227


>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
          Length = 408

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 50/204 (24%), Positives = 96/204 (47%), Gaps = 21/204 (10%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 106 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 154

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +
Sbjct: 155 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHL 213

Query: 426 --------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
                   I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  
Sbjct: 214 LGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSV 273

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           +  +A +L   EK++L   ++  A
Sbjct: 274 RATMAIKLRLGEKEILEKAVKSAA 297


>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
           972h-]
 gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
 gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
           pombe]
          Length = 547

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 107/258 (41%), Gaps = 42/258 (16%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L  +L  E  +G +S W  YI  L +          +PL ++E + A+L  +   +   E
Sbjct: 82  LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSL 278
           R    K EY E       A SL     +  PTE FTF+++   A V    C        +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179

Query: 279 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWYNISFT 332
            +     P+  PL+ + + K K  +    D     +VQL+       G  +   Y     
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNY----- 234

Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSV 390
                 GP+ N +LL+ YGF   DNP+D + ++ A++ + P    K  + + +   +LS 
Sbjct: 235 ------GPKGNEELLMGYGFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSN 288

Query: 391 QVFHVHAGREKEAISDML 408
            VF +    +KE    +L
Sbjct: 289 LVFFLPKSPDKEIFQKIL 306


>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
 gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
          Length = 692

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/206 (23%), Positives = 90/206 (43%), Gaps = 24/206 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++++E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+       +A S+ +   Y +  E++T++IF  A+  + S      +   +
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDF 336
           R         P L    + + ML  VD +    + +     E+  +   +   +FTG + 
Sbjct: 209 RVLDAYLSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEV 268

Query: 337 --KCGPQPNSKLLINYGFVDEDNPYD 360
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
          Length = 353

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 85/357 (23%), Positives = 147/357 (41%), Gaps = 47/357 (13%)

Query: 80  DLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
           DL  WM  NG     ++ ++E     E  R +   A  + + AG+    VP  L++T   
Sbjct: 28  DLLEWMIANGFELHVQLCVRE---FTETGRGL---ATLQKVTAGETFLRVPTCLLITTTT 81

Query: 139 VLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            L +     L+  ++ L+ +  L L+L+ EK +G  S W  +I  L        ++  +P
Sbjct: 82  ALSSSLHGFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTP 134

Query: 198 LLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           +      LA L  +   KAE   +   I+R +  L  +  +  +L         +E FT+
Sbjct: 135 VFLGSKLLARLPETMCRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTW 190

Query: 257 EIFKQAFVAVQS-CVVHLQKVSLARRFA---LVPLGPPLLAYSSKCKAMLAAVDDA--VQ 310
            +F  A+ AV + C+    K      F       L P L   +   KA +    +    +
Sbjct: 191 HLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFE 248

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
           +V +  Y+  + +     IS+  HD       N KLL+ YGFV  DNP D + +     T
Sbjct: 249 IVTNNNYEPNDQVF----ISYGSHD-------NKKLLLEYGFVLADNPNDVVAI-----T 292

Query: 371 EDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 427
           +   Y   ++ +Q+N  +      +    EK+ ISD   +   G   +   +  V+S
Sbjct: 293 KGHLY---KLNSQQNDTVLYFATKLSFLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346


>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
 gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
 gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
          Length = 489

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/319 (24%), Positives = 122/319 (38%), Gaps = 65/319 (20%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           +W+   G P  K+  K + +    H     V A  DL  G+  F++P + V++++    N
Sbjct: 22  TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             +  LL+ N   L     L + ++YE  QG +S W  Y R L R         ++ + W
Sbjct: 76  SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128

Query: 201 SETELAYLTGSP---------TKAEILERAEGIKRE----YNELDTV------------- 234
           S +EL  L GS           +  ILE    I R     +  +D V             
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEESILETIAPIVRANPALFPPIDGVASYDGDAGTQALL 188

Query: 235 --WFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                 GSL   Y +DI  P +       +  ++  +                +VPL   
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEE--------EEQSSKGMVPLADL 240

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
           L A + +  A L   ++ + +   +P KAGE I   Y           G  P S LL  Y
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDY-----------GEIPRSDLLRRY 289

Query: 351 GFVDEDNPYDRLVVEAALN 369
           G+V  DN     VVE +L+
Sbjct: 290 GYV-TDNYAQYDVVELSLD 307


>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
           B]
          Length = 510

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 99/237 (41%), Gaps = 42/237 (17%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           A+VP+   L A      A L   +  +++V  +P  AGE I   Y           G  P
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTY-----------GDPP 308

Query: 343 NSKLLINYGFVD----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 392
           NS LL  YG VD          E NP D + + A L          R   +  G L V+V
Sbjct: 309 NSDLLRRYGHVDVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRV 363

Query: 393 -FHVHAGREK--------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
            F +    +         E   ++L ++RL  +S T +  + + + G +      +E  +
Sbjct: 364 DFWLEEADDDTFVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELEL 419

Query: 444 LDQLADYFKARLAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           L  + D  K RL  YP T+ EDE++L   +  NL   KR A  +   EK++L   LQ
Sbjct: 420 LPAIVDVLKERLKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476


>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
 gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
          Length = 518

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 21/151 (13%)

Query: 56  RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
           R+ +S T++  +     K+ ED G          W    G+  C   L   P        
Sbjct: 9   RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68

Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
            S    H P      +AA  DL+ G+    VP S+++T + +L +E +   +  N  S L
Sbjct: 69  HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128

Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDR 185
           +    LA+ L+YE  +GK S+W PY+  L R
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR 159


>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
          Length = 549

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/252 (25%), Positives = 107/252 (42%), Gaps = 33/252 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A +DL+ GD A  +P S++++ E V   +    L   + +S    L L+ M EK      
Sbjct: 178 ARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNCDSK 237

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L  +   G       L +S   +  L G+    EI++  + +  +Y+EL   
Sbjct: 238 FKI-YFDTLPEKFNTG-------LSFSIQAITMLDGTLLLEEIMQARQHLHAQYDEL--- 286

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
            F A  L   +P   P E +T+E F  A     S  + +       R  L+PL   L   
Sbjct: 287 -FPA--LCNNFPDIFPPELYTWEKFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFL--N 341

Query: 295 SSKCKAML--AAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
            S C  ++    VD   ++++  + RP ++GE   + Y           G   +S L+  
Sbjct: 342 HSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECCLSY-----------GNFSSSHLITF 390

Query: 350 YGFVDE-DNPYD 360
           YGF+ + DN YD
Sbjct: 391 YGFLPQGDNSYD 402


>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
 gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
          Length = 588

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 124/306 (40%), Gaps = 47/306 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
            AA+  + AGD A ++P   + T+   L   G    A  +    L E    AL+L+ E+ 
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAYLTGSPTKAEILERAEG 223
            G+ S W   I  L    G G+ +       +P+ W  E   A L G+P  A+ +  +E 
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
           + R++  L    F A     ++  D+ P  A+T + F++A  A  S  + +Q        
Sbjct: 306 LARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASPGEPAA 358

Query: 283 ALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
             +P         L P ++ YS          D  ++L V R   AGE + V Y      
Sbjct: 359 TCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSY------ 405

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQV 392
                G + N++LL+ YGF    NPYD + +   L   E       R  A     L++  
Sbjct: 406 -----GAKSNAELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLSP 460

Query: 393 FHVHAG 398
             V AG
Sbjct: 461 HAVRAG 466


>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
 gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
          Length = 588

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 88/399 (22%), Positives = 153/399 (38%), Gaps = 64/399 (16%)

Query: 49  RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
           R +R      S+   V G  +      E    L  W+ + G    +V+     +  N+  
Sbjct: 12  RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71

Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
           R    V A   L+ G  A  +P  + +   R   +  +   L   +   +      +AL 
Sbjct: 72  RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
           L+ E+++G    +  Y+  L +        ++SPL WS  EL  + G+    ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180

Query: 224 ----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQS-CVVHLQKV 276
               ++  Y EL         +F +Y   +D+   AF    F+ AF  ++S  +  L   
Sbjct: 181 YDAYVRAVYEEL------KNGVFVEYASTFDVDG-AFDEASFRWAFGILRSRTMAPLDGA 233

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGE 321
           ++A    LVP G  L+ +SS   A                      A  +  DR Y  G 
Sbjct: 234 NVA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGA 288

Query: 322 SIVVWYNISFTGHDFKCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 380
            I V Y+           P+  +SK  ++YGF+D  NP     +  ++  +D    DK  
Sbjct: 289 EIFVNYD-----------PEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLD 337

Query: 381 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 419
           V +  G      F +    + +   ++  +LRL +  DT
Sbjct: 338 VLETQGLPEAPTFTLRPYSDPD--RELRTFLRLLHCKDT 374


>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
          Length = 442

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/200 (23%), Positives = 79/200 (39%), Gaps = 39/200 (19%)

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 244
           + +PL WS+ E   L GS                YN LD  W M         A  L Q 
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205

Query: 245 YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 300
             + D+P   ++ + FK A+  + +    +   S     R+  + P+         K   
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265

Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
              A     +L   + +  GE I + Y+            + N++ L+ YGFV E NP+D
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYD-----------SKNNAEFLLQYGFVIESNPHD 314

Query: 361 RLVVEAALNTEDPQYQDKRM 380
            + + A++  + P Y+DK +
Sbjct: 315 YVGIAASIGNDQPFYRDKSL 334


>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
          Length = 538

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 93/204 (45%), Gaps = 21/204 (10%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + ++AG+ I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 234 DDRCECVALQDFQAGDQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 282

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------ 419
             ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++       
Sbjct: 283 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 341

Query: 420 ---SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
              S +  + +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L  
Sbjct: 342 LGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSV 401

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           +  +A +L   EK++L   ++  A
Sbjct: 402 RATMAIKLRLGEKEILEKAVKSAA 425


>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
 gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
          Length = 464

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 90/386 (23%), Positives = 151/386 (39%), Gaps = 73/386 (18%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L L L+YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 276
           R   + +  +EL        +  +  P +I   A  F        A  AV S    +  V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 326
           +     A++    PL+   +      A +++ +          ++V  R  + G +I + 
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QR 384
           Y           GP  N  LL++YGFV  DNP+DR+     L  +    ++ RM+A   R
Sbjct: 266 Y-----------GPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSR 310

Query: 385 NGK------LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EM 422
            G        S QV                V  G  +E    +L  LR+ +       E 
Sbjct: 311 TGSPPFSSPASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLER 370

Query: 423 QSVIS--SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 480
           + ++S  + G    VS   E  VL  L          +  T+ EDEA L+D +L    R+
Sbjct: 371 RELVSLQAWGVESMVSSDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRI 430

Query: 481 ATQLVRMEKKMLNACLQVTADMIMLL 506
           A Q    +K+++   L+     +M L
Sbjct: 431 AVQFRLTKKRLVVRVLESLKKRLMDL 456


>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
          Length = 440

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 86/375 (22%), Positives = 148/375 (39%), Gaps = 60/375 (16%)

Query: 97  LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
           LK++   +   RP  +      +  ++ LQ   D   S+P   ++T + VL +  + E +
Sbjct: 39  LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97

Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
              K  +S L  L  +L+ EK  G+KS W PY+  L +           P+   E ++  
Sbjct: 98  MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVS 149

Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
           L   P + +  E+   +   Y      +     LF +    I    F +   + A+  + 
Sbjct: 150 LLPEPLRKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTIN 205

Query: 268 SCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKA 319
           +  +++ K S  + F+L P    L P   LL +S   + KA       + ++  +   K 
Sbjct: 206 TRTIYM-KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKK 264

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
            E + + Y           GP  N +LL+ YGFV  DNP+  + V +A   +     DK 
Sbjct: 265 YEEVFICY-----------GPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK- 312

Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCM 439
              QRN K+S+   H           D+L  L  G+   +  + + +  L        C 
Sbjct: 313 ---QRNAKVSILKDH-----------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCW 358

Query: 440 ERAVLDQLADYFKAR 454
            R +   L D   AR
Sbjct: 359 RRTL---LGDVISAR 370


>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 245

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 57/228 (25%), Positives = 93/228 (40%), Gaps = 22/228 (9%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           AL+P+       + +      A  D  +    R ++ GE I + Y           GP+ 
Sbjct: 31  ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISY-----------GPRT 79

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           NS   ++ GFV  DN  D   +   ++  D   +++  +  +    SV  F +  G   E
Sbjct: 80  NSDFFVHSGFVYMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TE 137

Query: 403 AISDM-LPYLRLGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARL 455
            ISDM L +LR+ +    +E+   + S          C +   +E  V   L    +  +
Sbjct: 138 PISDMLLAFLRV-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLI 196

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
           A YP TL ED  +L +  L   K++  QL   EKK+L   L+     I
Sbjct: 197 ANYPTTLKEDLELL-ETTLPQVKKMTVQLRVTEKKILLGALEYVEQWI 243


>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
           8797]
          Length = 496

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 62/265 (23%), Positives = 112/265 (42%), Gaps = 43/265 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYE-KK 169
           V A ED++  +  F VP + ++ +E    ++   E+       + +   L + L++E K 
Sbjct: 41  VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
            G+KS W PY++ L ++    QL     + W++ EL  L         G+    E+ E  
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENV 155

Query: 222 EGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
             I  K    E D+      W    +  S+   Y +D+  + +  E  K+     +    
Sbjct: 156 VDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDN 211

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
             + V   +   ++PL   L + + KC A L    + +++   +  K GE I   YNI  
Sbjct: 212 ESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQI---YNIY- 265

Query: 332 TGHDFKCGPQPNSKLLINYGFVDED 356
                  G  PNS++L  YG+++ D
Sbjct: 266 -------GDHPNSEILRRYGYIEPD 283


>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
 gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
          Length = 536

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 115/288 (39%), Gaps = 35/288 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASE +  GD A  +P  L+++ E +  +E    L   N ++    L L+ M E+      
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    LA L G+    EI++  + ++++Y+EL  +
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +P     +  T++ F  A     S  + +   S      LVP+   L   
Sbjct: 296 ------LCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 349

Query: 295 SSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
            S        VD+A + +   + RP  AGE   + Y           G  P S L+  YG
Sbjct: 350 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSY-----------GKHPGSHLVTFYG 398

Query: 352 FVDE-DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 398
           F+   DNPYD + ++      D    D+ + AQ +   S Q  H+  G
Sbjct: 399 FLPRGDNPYDVIPLDL-----DTSVDDEDIAAQSSATTS-QTTHMVRG 440


>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 563

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 130/309 (42%), Gaps = 41/309 (13%)

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 255
           P+ W+E E+  L GS    ++ ER + I+ +Y          G +   YP   P  +  T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
            E FK A + V S    L  ++  R  ALVP    L  Y  +            +   D 
Sbjct: 259 LEEFKWARMCVCSRNFGLD-INGLRTSALVPYADMLNHYRPR----------ETKWTYDN 307

Query: 316 PYKAGESIVVWYNISFTGHDFKC-GPQPNSKLLINYGFVDEDNPY------DRLVVEAAL 368
             + G +I   + I      +   G + N + L+NYGF  E+N        + + +   L
Sbjct: 308 -NRGGFTITTLHRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRL 366

Query: 369 NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VI 426
           +  DP  Q K    + +G    +V  +  G + +A+   L  LR+  V+D +EM +  + 
Sbjct: 367 DARDPLRQKKARFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMY 423

Query: 427 SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQ 483
            ++  +  P+S   E A +++L       L  YP TL ED A L +  L P   +R A  
Sbjct: 424 RTVKDVRFPLSVRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALI 483

Query: 484 LVRMEKKML 492
            V  EK +L
Sbjct: 484 QVYGEKVVL 492


>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 94/476 (19%), Positives = 184/476 (38%), Gaps = 74/476 (15%)

Query: 59  SSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASED 118
           S   L   S+ +  +  +   +L  W+        KV ++ K   +E +R +    AS+ 
Sbjct: 21  SESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQF 74

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSFW 176
           ++ G+    VP +  ++LE V  +  I  +++  N + + +    +  + ++ + + SFW
Sbjct: 75  IRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSFW 134

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PYI  L +           P  +   + A L GSPT   ++ + +  + EY+ L     
Sbjct: 135 KPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNLKE--- 185

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----------ALV 285
            A   FQ+Y Y       T+  F           V  + ++++R F            LV
Sbjct: 186 -AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPVYIGENEQQQLLV 226

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           PL    + + +          DA    +   R  + GE +   Y           G   N
Sbjct: 227 PLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNY-----------GQWSN 274

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 403
               +NYGF    NP ++   +  L+  D  +   +M  +  G        +    + + 
Sbjct: 275 KYFFMNYGFASLTNPMNQFDFDICLDRNDRMF---KMKVELTGGNICWGNRLVNETDHDT 331

Query: 404 ISDMLPYLRLGYVS---DTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYFKAR 454
               L  +R   +S   D  +++  + +     P    +P    +E+A    L D   + 
Sbjct: 332 FRQSLATVRFAQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLLVSE 391

Query: 455 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 510
           LA + +T+ +D+  L D +    +R    L   EK+++   ++V  DM++ + D T
Sbjct: 392 LANFASTIEDDQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446


>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
 gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
          Length = 287

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 46/195 (23%), Positives = 87/195 (44%), Gaps = 20/195 (10%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           EE       W+H+NG   C+ +  +        R +    A++ L+  +    +P  L++
Sbjct: 18  EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71

Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           T++ ++ +     +   + +L+    LA++LM EK + +KSFW PYI  L  +       
Sbjct: 72  TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
              P  ++E +   L  S  + +   +     +EY EL   + M   LF         +A
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLFPD-----QEDA 178

Query: 254 FTFEIFKQAFVAVQS 268
           F F+ FK A+ A+++
Sbjct: 179 FNFKDFKWAWSAIKT 193


>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
 gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
          Length = 481

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 82/205 (40%), Gaps = 29/205 (14%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  +G+ SFW PYI  L +       A+  P  W E ++AYL G+     I E  
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164

Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHL 273
             +K E+ +    L    F A   + Q  Y      FT   F+ + V    A Q     L
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWYN 328
                   F+++    PL   ++       A D A      QLV    Y+ GE +   YN
Sbjct: 225 PPGCQLDDFSIL---QPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQV---YN 278

Query: 329 ISFTGHDFKCGPQPNSKLLINYGFV 353
                     G + NS+LL+ YGF+
Sbjct: 279 --------NYGLKTNSELLLAYGFI 295


>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
 gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
          Length = 1709

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 37/148 (25%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
            +AA +D+       ++PN L+++ ++V G        +   +++  N+  EL     C  
Sbjct: 954  IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013

Query: 160  -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
                 LALYLMYEK +G++SFW PY  EL+++         + L WS  ELA    S   
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063

Query: 215  AEILERAEGIKREYNELDTVWFMAGSLF 242
                     I +E N+ + ++F+  S+ 
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082


>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 489

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 77/319 (24%), Positives = 125/319 (39%), Gaps = 65/319 (20%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           +W+   G P  K+  K + +    H     V A  DL  G+  F++P + V++++    N
Sbjct: 22  TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             +  LL+ N   L     L + ++YE  QG +S W  Y R L R         ++ + W
Sbjct: 76  SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128

Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
           S +EL  L GS           +  I+E      RA        +G+     +  T   +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEGSIIESIAPIVRANPALFPPIDGVASYDGDAGTQALL 188

Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                 GSL   Y +DI  P +       +  ++  +      +         +VPL   
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGDRDGEDGYLTDEEEEQSSK--------GMVPLADL 240

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
           L A + +  A L   ++ + +   +P K+GE I   Y           G  P S LL  Y
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKSGEEIFNDY-----------GEIPRSDLLRRY 289

Query: 351 GFVDEDNPYDRLVVEAALN 369
           G+V  DN     VVE +L+
Sbjct: 290 GYV-TDNYAQYDVVELSLD 307


>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
          Length = 350

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 75/164 (45%), Gaps = 25/164 (15%)

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED----NPYDRLVVEAALNTED 372
           ++ GE +++ Y ++          + N +L ++YGFV+ +    +  D   +   ++  D
Sbjct: 136 FRTGEQVLMQYGMN----------KSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESD 185

Query: 373 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSV 425
           P + DK  +A+ NG  +   F +  G  +     ML +LRL  +  T           SV
Sbjct: 186 PFFADKLDIAELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSV 243

Query: 426 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
              L    PVS   E A+   + D  ++ L+GY  T+ EDEA+L
Sbjct: 244 WEHLS--LPVSQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285


>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 524

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 132/340 (38%), Gaps = 70/340 (20%)

Query: 77  DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           +L  L SW  ++G    K+ L++             +   + L  G+   S+P SL +T+
Sbjct: 27  ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81

Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
           + VL                    AL+LM E+++G  SFW  Y+R L          V++
Sbjct: 82  DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114

Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIP 250
           PL W    +E E   L G  T   +L R     +++++ E          L + +P  + 
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG 166

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPL-----GPPLLAYSSKCKAM 301
               TFE +  A  ++ S     Q+     S   R A+VP+       P  A S      
Sbjct: 167 --GVTFEDYLWAMSSIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIE 224

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC--------GPQPNSKLLINYGFV 353
             A +  + + +  P +A  ++ V       G D+          G   N+KLL +YGFV
Sbjct: 225 FQAQEGGLSMGIGEPGRARGTLRV-----SAGRDYAAREQFFILYGRYSNAKLLYSYGFV 279

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
              NPY  L     +   DP +  K+ +   +   + Q +
Sbjct: 280 LASNPYGGLDYWVRVPQTDPGFAWKQALLDEHPLTAAQAY 319


>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 269

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 90/200 (45%), Gaps = 35/200 (17%)

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 376
           Y+ GE I ++Y           G + N+  L++ GFV  DN    + +  +LN  + Q++
Sbjct: 74  YRKGEQIFIYY-----------GNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFE 122

Query: 377 DKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVIS 427
            ++ + ++ G  S   F+V  G     IS ++L + R        LG+      +QS + 
Sbjct: 123 QRKQLLEKLGLASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL- 181

Query: 428 SLGPICPVSPCMERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPK 477
            L P C   P +E ++ +++  Y   RL         TL +DEA+L +        L   
Sbjct: 182 -LEPDC---PGLEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHI 237

Query: 478 KRVATQLVRMEKKMLNACLQ 497
           K +  Q   +EKK+L+  L+
Sbjct: 238 KSMLVQFRVVEKKILSEALE 257


>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
          Length = 583

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 39/260 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+EDL+ GD A  +P S+V++ E V  ++    L   + +S    L L+ M EK      
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F   Y   L         A  + L +    +  L G+    EI+E  + +  +Y EL   
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 319

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                +L + +P   P E +T E F  A     S  + +       R  L+P+       
Sbjct: 320 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 376

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
             P ++ Y       + +  ++++  V +P   GE   + Y           G   +S L
Sbjct: 377 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSY-----------GNFSSSHL 420

Query: 347 LINYGFVDE-DNPYDRLVVE 365
           +  YGF+ + DN YD + +E
Sbjct: 421 VTFYGFIPQGDNLYDTIPLE 440


>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
          Length = 1233

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 88/386 (22%), Positives = 145/386 (37%), Gaps = 93/386 (24%)

Query: 111  HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
            H   A+ D+  G+    VP SL +T   V G +   E+L   + SEL  LAL+LM E+ +
Sbjct: 871  HGFVAARDVGQGEVLLQVPGSLAITAVDV-GKDAQLEVLARGR-SELVGLALWLMQERAK 928

Query: 171  GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
                                  A  +P+LW + E    L GSP   E   R + +++E+ 
Sbjct: 929  ----------------------ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQ 966

Query: 230  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLG 288
            ++  +        + YP  +  E        QAF+   S V+ H   +  A+ FAL+PL 
Sbjct: 967  DIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVLAHAAYLPKAQCFALLPLV 1018

Query: 289  PPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
              L    S   A+L      +AV +V  R    G+ + ++                    
Sbjct: 1019 GGLCRTGSSSGALLDYDLEREAVTVVAQR--TPGQEVALYC------------------- 1057

Query: 347  LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
                           L + A+L   D  Y  KR + +  G      F +    ++ A   
Sbjct: 1058 ---------------LFMAASLVAADRLYTTKREILEELGLGVKAEFPIF--EDRLATQQ 1100

Query: 407  MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
            ++ +          E  ++I         SP  E  +L  L    + R+  Y     +D 
Sbjct: 1101 LINF----------EQDTII---------SPENEYEILQLLMGDLRDRIQAYATEFDDDI 1141

Query: 467  AMLTDYNLHPKKRVATQLVRMEKKML 492
              L   +L P++R+A QL   EK++L
Sbjct: 1142 KDLQRTDLTPRQRLAAQLRLGEKRIL 1167


>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
 gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
          Length = 558

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 59/255 (23%), Positives = 110/255 (43%), Gaps = 39/255 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+EDL+ GD A  +P S++++ E V  ++    L   + +S    L L+ M E+      
Sbjct: 190 ATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERHNCNSK 249

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
             + Y   L ++   G       ++ S+  L +        EI++  E ++ +Y+EL   
Sbjct: 250 SKI-YFDTLPKEFNTGLSFGVDAIMASDGTLLF-------DEIMQAKEHLRVQYDEL--- 298

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
                +L   YP   P E +T+E F  A     S  + ++ +    R  L+P        
Sbjct: 299 ---VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKFLDGKLRTCLIPIAGFLNHS 355

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
           L P ++ Y       + ++ + ++  + RP + GE   + Y  +F+G          + L
Sbjct: 356 LHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSYG-NFSG----------AHL 399

Query: 347 LINYGFVDE-DNPYD 360
           +  YGF+ + DN YD
Sbjct: 400 ITFYGFLPQGDNRYD 414


>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 541

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 60/255 (23%), Positives = 101/255 (39%), Gaps = 40/255 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L   + ++    + L+ M EK      
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       + +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDELI-- 282

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                 L   + +  P E +T+E +  A     S  + ++      +  L+P+       
Sbjct: 283 -----PLLSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 337

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
             P ++ Y   C         +++  V RP   GE   + Y           G   +S L
Sbjct: 338 IYPHIVKYGKVCVET-----SSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHL 381

Query: 347 LINYGFVDE-DNPYD 360
           L  YGF+ + DNPYD
Sbjct: 382 LTFYGFLPKGDNPYD 396


>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
          Length = 417

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 93/410 (22%), Positives = 174/410 (42%), Gaps = 59/410 (14%)

Query: 76  EDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           +DL   K WM   G+   P   ++  E    +        + A+  ++ GD    VP ++
Sbjct: 9   DDLVTFKQWMDDEGIYLNPSLDIVKLEDYGRS--------IIANTLIKEGDVLIRVPRNV 60

Query: 133 VVT---LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
           +++   +E  +  E I  ++ +N+  +      A+YLMY     K S+W  Y   L +Q 
Sbjct: 61  MMSRTGIELHIPKE-IRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQ- 117

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
                   + + + + E+  L  S  +     R  GI+R YN    ++    SL  ++  
Sbjct: 118 ------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYN---VIFKKLSSLNDEFK- 167

Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
               + +TFE+FK A   + S    L     +    +VPL     A   K K+ +     
Sbjct: 168 ---KKEYTFELFKWALSCIWSRAFSLS----SDDGGMVPLADMFNAIE-KAKSKVRPDSR 219

Query: 308 AVQLV--VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           A QL+    +  + GE +   Y +  T      G   N+++L++YGF   D+P +   ++
Sbjct: 220 ADQLIYYASKDIERGEQVFTPYGVYKT-----IG---NAQMLMDYGFA-FDDPSEGDTIQ 270

Query: 366 AALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 423
             L+  ++D  Y D ++       + V+ F++   +  +   ++L Y R+  + + +E+Q
Sbjct: 271 LTLDNFSDDELYIDTKIDLLEQLDI-VREFNL---KRNQLPQELLIYARVKNLKE-NELQ 325

Query: 424 ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 469
                  +      PVS   E+  L  L++Y    L  Y  TLS+D  +L
Sbjct: 326 LAKEHYRNDDNRNKPVSRRNEKTALRYLSNYLSRYLDSYETTLSDDLELL 375


>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 673

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 62/240 (25%), Positives = 97/240 (40%), Gaps = 18/240 (7%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 77  DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
                +S  +   L+P+   +  +    K    A  D V  VV     AG+ I   Y   
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNY--- 247

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
                   GP+ N +L++NYGF    NP D  +V        P Y  K    Q    L+V
Sbjct: 248 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299


>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
          Length = 327

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 96/218 (44%), Gaps = 31/218 (14%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           +W+  +G    K+   E PS+    + I    A +D+ + +   S+P  L++T +  L +
Sbjct: 74  AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             I ++   N    S+   L + LM+E+ +G+ SF+ PY+  L R         ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TE 252
           +E  L++L       E+  R   +   Y  L      AG   LF + P D         +
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGAD 242

Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPL 287
            ++ E F+ A++ +Q+        +  RR    AL+PL
Sbjct: 243 PYSLENFRFAWLTIQA-------RAFGRRLPYSALIPL 273


>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 813

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 63/256 (24%), Positives = 101/256 (39%), Gaps = 31/256 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A  D   G+    +P     TL   L ++ +  + +     +   +AL++  E+ +G+K+
Sbjct: 26  ALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKA 84

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
               ++  L R       + ++   WSE EL  LTG+    E +   E  K +Y  L   
Sbjct: 85  TRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKK 137

Query: 235 W--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
                 G   +++  D       +E +  A   + S    L      R  A+VP    + 
Sbjct: 138 MEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAMVPT-FDIF 189

Query: 293 AYSSKC----KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
            +S+K        L A  + V +     YKAGE   + Y           G   NSKLL 
Sbjct: 190 NHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYG---------SGEAANSKLLT 240

Query: 349 NYGFVDEDNPYDRLVV 364
            YGF  +DNPY+ L V
Sbjct: 241 WYGFCIDDNPYEELDV 256


>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
 gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
          Length = 447

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 72/294 (24%), Positives = 122/294 (41%), Gaps = 50/294 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  DL  G+   ++P +  +TL      + IA       L     L + +MYE+ +GK
Sbjct: 10  VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
            S W  Y++ L  Q          P LWSE E+   L G+     + E    +K ++ E 
Sbjct: 66  GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 117

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
                    L ++ P + P + FTFE    +++A +S V     ++     + +VPL   
Sbjct: 118 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 167

Query: 291 LLAYSSKCKA-----MLAA-------------VDDAVQLVVDRPYKAGESI---VVWYNI 329
              ++ K  A     ML A             +DD +     R   + +S+   V+  ++
Sbjct: 168 --LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKDV 225

Query: 330 SFTGHDFKC-GPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 380
           +     F   G   N+ LL  YGF + +NP+D   L ++  L     ++Q KR+
Sbjct: 226 AAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKKRV 279


>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
          Length = 566

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 39/260 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+EDL+ GD A  +P S+V++ E V  ++    L   + +S    L L+ M EK      
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F   Y   L         A  + L +    +  L G+    EI+E  + +  +Y EL   
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 302

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                +L + +P   P E +T E F  A     S  + +       R  L+P+       
Sbjct: 303 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 359

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
             P ++ Y       + +  ++++  V +P   GE   + Y           G   +S L
Sbjct: 360 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSY-----------GNFSSSHL 403

Query: 347 LINYGFVDE-DNPYDRLVVE 365
           +  YGF+ + DN YD + +E
Sbjct: 404 VTFYGFIPQGDNLYDTIPLE 423


>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
          Length = 376

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 67/262 (25%), Positives = 117/262 (44%), Gaps = 49/262 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ED++AG+   SVP + ++T      NE++ +L  T+ LS    LAL+L+   +  K
Sbjct: 1   MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S+W PY   L        +   S L      L++L  S  K E +++ + I  +Y    
Sbjct: 54  QSWWKPYTDLLPMHFNTMPVNYPSEL------LSHLPNS-LKQETMQQKDNIHTDY---- 102

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
            V  +     +Q P DI     T E FK A++ V +  +H+       +   + L P L 
Sbjct: 103 -VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLD 156

Query: 292 -LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWYNISFTGHDFKCGPQPN 343
            L ++++ K     ++    +   R        YK GE + + Y           GP  N
Sbjct: 157 FLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINY-----------GPHDN 200

Query: 344 SKLLINYGFVDEDNPYDRLVVE 365
             +L  YGFV  +N Y+ ++++
Sbjct: 201 LAMLKEYGFVLNENIYNFVLLD 222


>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 692

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 47/206 (22%), Positives = 86/206 (41%), Gaps = 24/206 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++ +E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+   ++    G L          E+++++IF  A+  + S      +   +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDF 336
           R         P L    + + ML  VD +    + +     E+  +   +   +FTG + 
Sbjct: 209 RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268

Query: 337 --KCGPQPNSKLLINYGFVDEDNPYD 360
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
          Length = 1234

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 57/263 (21%), Positives = 106/263 (40%), Gaps = 29/263 (11%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQG 171
            +  ++ ++  +    VP   ++ ++    +  +  +    + L++   L L+++YEK + 
Sbjct: 767  IVTTKKVEENEVIIKVPRKFLINVQVAREHPILGRIFEEFSGLNDDTILFLFVIYEK-EN 825

Query: 172  KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
              SFW P+   L            + + ++ TEL  L G+   AE L+    +K     +
Sbjct: 826  PNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----VKEHLQSI 874

Query: 232  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
              + F    L +QYP   P   F++E F  A     S  + L K+       LVP+   +
Sbjct: 875  RDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCLVPMADMI 931

Query: 292  LAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
              + +    +      D   ++V          I + Y           G   N +L + 
Sbjct: 932  NHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHY-----------GALQNRELALY 980

Query: 350  YGFVDEDNPYDRLVVEAALNTED 372
            YGFV +DNPYD +++   L  ED
Sbjct: 981  YGFVIQDNPYDSMLIGFDLPDED 1003


>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 487

 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 95/429 (22%), Positives = 166/429 (38%), Gaps = 69/429 (16%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+  F++P  ++ T+E    +  +   L + +  LS    LA+Y+++ +        
Sbjct: 52  FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  LA       S + +++ EL    GS   A        I+ +Y  L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
                    L  Q+    P + FT E +K A   V S  +       + + L   FA   
Sbjct: 161 VV------RLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA--- 211

Query: 287 LGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
               +L +SS+ +   A   +   + ++  + Y+AG+  V +    + G        PN+
Sbjct: 212 ---DMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYGS------IPNN 262

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
           +LL  YGFV   NP D   +    +   P ++ KR +    G  S     +        +
Sbjct: 263 RLLRLYGFVMPGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISI-------TL 315

Query: 405 SDMLPYLRLGYV----SDTSEMQSVISSLGPICP----VSPCMERAVLDQLADYFKARLA 456
           +D LP   LGY+    SD S++ S+      I P    +S   E  VL  L + F   L 
Sbjct: 316 TDPLPKNVLGYLRIQRSDESDLASIARQ--RIDPKYEKISDSNEVEVLQSLIESFCGLLD 373

Query: 457 GYPATLSEDEAMLTDYNLHPKKR---VATQLVRMEKKMLNACLQVTADMIMLLPDVTVS- 512
            +   L   E  L +  ++P +     A  +   E+++L    +   DM+  +   + + 
Sbjct: 374 SFGTQLESLEKQLAE-GVYPSRGNAWAAAHVSLGEQQVLRLARKRAEDMLAAVESGSGNE 432

Query: 513 ----PCPAP 517
               P PAP
Sbjct: 433 KGSLPAPAP 441


>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 739

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 66/241 (27%), Positives = 104/241 (43%), Gaps = 40/241 (16%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEIL 218
             +LM +  + ++ FW PYI+ L      G   + +PLL+ E   +LA+L  +   A   
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 268
            R +  K  Y +       A S+ Q         Y +D+   A T  I  +AF A  + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244

Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
            +  LQ      R ++  L P + A + K   K    A  D++ LVV    +AG+ +   
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEV--- 299

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
                 G+++  GP+ N +L++NYGF   DNP +  VV      + P  Q K    Q   
Sbjct: 300 ------GNNY--GPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCS 351

Query: 387 K 387
           K
Sbjct: 352 K 352


>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
          Length = 255

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 103/253 (40%), Gaps = 19/253 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
           V     L  GD   ++P SL++T   VL +     I + L   +LS    L ++L+ E+ 
Sbjct: 8   VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           +G  SFW PY+  L         +    L W+  E+  L    TK    +     +  +N
Sbjct: 67  KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
            L   +     L +Q P      AFT+++FK A+ +V +  V++ +    +   L P   
Sbjct: 119 RLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQ---PQNSVLSPDEE 171

Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
              A +     +   VD  V    D   K+ +   +     +       GP  N KLL+ 
Sbjct: 172 DKSALAPFLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLE 231

Query: 350 YGFVDEDNPYDRL 362
           YGF    NP++ +
Sbjct: 232 YGFTLPCNPHNNI 244


>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 476

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 110/482 (22%), Positives = 191/482 (39%), Gaps = 79/482 (16%)

Query: 83  SWMHKNGLPPC-KVILKEKPSHNEKHRP--IH---YVAASEDLQAGDAAFSVPNSLVVTL 136
           +W+  NG     K+ L+   + N   R   +H     +A       +  FS+P  L+++ 
Sbjct: 17  AWLRANGATVSPKLTLQATAAFNADSRTQVLHRRVIASAEAGFDKEEELFSIPRKLLLSA 76

Query: 137 ERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
                  +IAELL  NK    A      L + +MYE    K SFW PY+  L        
Sbjct: 77  ----STSSIAELLLENKKEACALVGWMPLVVAMMYEIT-NKDSFWRPYLDLLPE------ 125

Query: 192 LAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKRE----------------------Y 228
             +++P+ W++ +L  L G+ T + +  E AE I  E                      Y
Sbjct: 126 -TLDTPMFWNDDDLELLEGTSTLSHLGKEDAETIFTEQIVPFMKLHPTHFDLKVHNMALY 184

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           + + +V  MA S  +    D   E    E       A   C    ++  +  + A+VPL 
Sbjct: 185 HRVASV-IMAYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRM-EKIAMVPLA 242

Query: 289 PPLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
             +L + + C  A L      + +    P  AG  +   Y           G   NS+LL
Sbjct: 243 D-MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGHELYNTY-----------GDLSNSELL 290

Query: 348 INYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----RE 400
             YGF+D+ N ++ + +   +     E   + ++ M A       +  FH+ A     +E
Sbjct: 291 RKYGFIDDVNEHNSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQE 350

Query: 401 KEAISDML-----PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
            EA   +L         L  + D  E++S +++L     V+ C  R V + L  + + R 
Sbjct: 351 LEASIALLFQSPKQVRALRALDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRA 404

Query: 456 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 515
             Y  T  EDE  L + +L  ++++A ++   E+ +L+  +    + +   P    +  P
Sbjct: 405 EEYTTTREEDEERLKESDLTHRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEP 464

Query: 516 AP 517
           AP
Sbjct: 465 AP 466


>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
          Length = 692

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/206 (22%), Positives = 86/206 (41%), Gaps = 24/206 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++ +E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+   ++    G L          E+++++IF  A+  + S      +   +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTGHDF 336
           R         P L    + + ML  VD +    + +     E+  +   +   +FTG + 
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268

Query: 337 --KCGPQPNSKLLINYGFVDEDNPYD 360
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
          Length = 539

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/204 (23%), Positives = 92/204 (45%), Gaps = 21/204 (10%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + ++ GE I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 234 DDRCECVALQDFRPGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 282

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------ 419
             ++  D  Y  K  V  R G  +  VF +H   +    + +L +LR+  +++       
Sbjct: 283 LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHL 341

Query: 420 ---SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
              S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L   +L  
Sbjct: 342 LGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSV 401

Query: 477 KKRVATQLVRMEKKMLNACLQVTA 500
           +  +A +L   EK++L   ++  A
Sbjct: 402 RATMAIKLRLGEKEILERAVRSAA 425



 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 38/160 (23%), Positives = 73/160 (45%), Gaps = 18/160 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ +++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
                 ++PL + E E+ YL  +    ++  + +   R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217


>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 480

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 89/397 (22%), Positives = 157/397 (39%), Gaps = 67/397 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  D+Q G+  F +P  +V+ ++    NE +A+ L    L     L + ++YE   G+
Sbjct: 50  VVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADEL--KNLGPWLSLVVVMIYEYSLGE 107

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
           +S W  Y + L  +        ++ + WS  EL+ L  S           + +I E+   
Sbjct: 108 RSNWNQYFQVLPTK-------FDTLMFWSGEELSQLQASAVIHKIGKKDAEEDIFEKIIP 160

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEA--------------FTFEIFKQAFVAVQSC 269
           + R + +L    F   +    Y  D   +A              + F+I K      +  
Sbjct: 161 LVRSHPDL----FPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEGE 216

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
             +L          +VPL   L A + +  A L   D A+ +   +P K G+ I   Y  
Sbjct: 217 DGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDY-- 274

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKR 379
                    G  P S LL  YG+V  DN     VVE  L          N E  +Y   +
Sbjct: 275 ---------GELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLK 324

Query: 380 MVAQRNGKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPV 435
           ++ +   ++    + +     +++++D+LP     L      +  E+Q + S   P  P+
Sbjct: 325 LLHEL--EILEDGYCILRPSAEDSLTDILPDELLALLKSLTLEREELQRLQSKQKPPKPI 382

Query: 436 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDY 472
               E  +   L D  K++L+ Y  T+ +D+A+L  +
Sbjct: 383 LAAREARI---LLDSVKSKLSQYGTTVEQDKAILQQF 416


>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
 gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
 gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
 gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
          Length = 491

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 82/407 (20%), Positives = 159/407 (39%), Gaps = 67/407 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
           + A+ +L+ G+    VP   ++T E ++  +  +++ +   N LS    L++ L+YE  +
Sbjct: 51  LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110

Query: 171 GKKSFWLPYI----RELDRQRGRGQ-----LAVESPLLWSETELAYLTGSPTKAEILERA 221
            KKSFW PY+    R+ D     G      L VE  +  +E   A       +A  L + 
Sbjct: 111 EKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKE 170

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
             +K ++      W  A +        +P ++                V  L        
Sbjct: 171 LELKPKFRSFQ-AWLWASATISSRTLHVPWDS----------AGCLCPVGDLFNYDAPGD 219

Query: 282 FALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWYNI 329
           ++  P GP          L+  +   +      ++ V    L   R Y+ GE +++ Y  
Sbjct: 220 YSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCY-- 277

Query: 330 SFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGK 387
                    G   N +LL +YGF+ E+N  D++ +  E +L +    +    +   ++GK
Sbjct: 278 ---------GTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGK 328

Query: 388 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 447
           LS                 ++  LRL  +  +   +SV+  +     +S   E  V+  +
Sbjct: 329 LSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQISVKNEILVMKWM 373

Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 494
           ++   + L   P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 374 SEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415


>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 488

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 91/440 (20%), Positives = 169/440 (38%), Gaps = 78/440 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           + A +D+   +  F++P  LV++ +    N  + +L+  N+  L +  CL L ++YE  Q
Sbjct: 50  IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKRE- 227
           G  S W PY + L           ++ + W++ EL  L GS     I + A  E   R+ 
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158

Query: 228 -------------------YNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                              YN  D      ++    GSL   Y +D+  +       +  
Sbjct: 159 LPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVKNDEAEEVEGEGG 218

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           +V         ++  L +   ++PL   L A + +  A L   D  + +   +  + GE 
Sbjct: 219 YVTDD------EERQLPK--GMIPLADLLNADADRNNACLFQEDGYLAMKSIKSIRKGEE 270

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQ 374
           I   Y           G  P ++LL  YG+V ++         P   +   A L +  P 
Sbjct: 271 IFNDY-----------GELPRAELLRRYGYVTDNYAQYDEAEVPIQTICKVAGLKSSTPG 319

Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 434
             + R+    + ++    + +        +++ LP   L  ++         + L     
Sbjct: 320 PDEPRLEFLDDLEVLDDGYGIPRPDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSK 379

Query: 435 V-SPCMERAVLDQLADYFKARLAGYPATLSEDEAML-----TDYNLHPKK----RVATQL 484
           V  P +  A    L +  +  L  YP T+++D+ +L        +  PK     ++A Q+
Sbjct: 380 VPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANNQGSTSPKSAGRLKMALQV 439

Query: 485 VRMEKKMLNACLQVTADMIM 504
            + EK++LNA L    D I+
Sbjct: 440 RKGEKEILNAVLSELEDFIV 459


>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
 gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
          Length = 464

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 87/378 (23%), Positives = 154/378 (40%), Gaps = 57/378 (15%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L L L+YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 277
           R   + +   EL  +  +   +       + + A  + +   A V+ ++  +H    K+ 
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWYNISFTGH 334
            A    L+ +       ++  +  L+  A D + +++V  R  + G +I + Y       
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNY------- 266

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK----- 387
               GP  N  LL++YGFV  DNP+DR+     L  +    ++ RM+A   R G      
Sbjct: 267 ----GPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSS 318

Query: 388 -LSVQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--S 428
             S QV                V  G  +E    +L  LR+ +       E + ++S  +
Sbjct: 319 PASWQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQA 378

Query: 429 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 488
            G    VS   E  VL  L          +  T+ EDEA L+D +L    R+A Q    +
Sbjct: 379 WGVESMVSSDNEERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTK 438

Query: 489 KKMLNACLQVTADMIMLL 506
           K+++   L+     +M L
Sbjct: 439 KRLVVRVLESLKKRLMDL 456


>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 306

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 17/150 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL  G     +P + V++++    N  IA++L  +++     L + +MYE   GK
Sbjct: 4   VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +SFW  Y+ EL ++          PL W+E E + L G+  +    E  E  + ++    
Sbjct: 60  ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFET-- 110

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                   L +Q+   +  ++FT E F+ A
Sbjct: 111 ----HVPPLVEQHADRLRADSFTLESFRVA 136


>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 486

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 93/403 (23%), Positives = 156/403 (38%), Gaps = 73/403 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+   +  F++P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILER 220
            G +S W  Y + L R+        ++ + W+ +EL  L GS              ILE 
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADESILEM 157

Query: 221 AEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAFTFEIF 259
              I R +  L                  +  +A   GSL   Y +DI  P +       
Sbjct: 158 IAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEDEEGDG 217

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           +  ++  +      ++  L++   +VPL   L A + +  A L   ++A+ +   +P   
Sbjct: 218 EGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAIKPIAK 269

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTE 371
           GE I   Y           G  P + LL  YG+V D   PYD + V        A L+  
Sbjct: 270 GEEIFNDY-----------GEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGLSDS 318

Query: 372 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVISSLG 430
           DP+ Q           L          +E + ++D+LP  L +   + T   + +     
Sbjct: 319 DPEKQPPLEFLDELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQQRS 378

Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 473
              P  P    A    LA   + + A Y  T+++D+ +L+  N
Sbjct: 379 KNKPPKPAFAEAEATILAKAIQLKQAQYATTIAQDQEILSQLN 421


>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
 gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
          Length = 660

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 92/439 (20%), Positives = 176/439 (40%), Gaps = 89/439 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
           + A+ D+ A    F++P + ++ +E     R+L     G    AE      L   A L L
Sbjct: 41  IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
            ++ E   G++S+W PYI  L         + ++P+ W++ EL  L G+   AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153

Query: 223 ------------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
                                    I     +L  +    GS    Y +D+  +    + 
Sbjct: 154 SDEMLRTHVLPIVTQNPTAFCPKGAIPLNEEDLLALAHRIGSTIMSYAFDLDDDKEESDA 213

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD---R 315
            ++ +V  +  +  L          +VP+   L A +       A V+   +L V     
Sbjct: 214 EEEGWVEDRDGLTML---------GMVPMADVLNANAD----FNAHVNHGEKLEVTSLRS 260

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYD---------RLVVE 365
             +AG  I+ +Y           GP P+S+LL  YG+V  E + YD         R  + 
Sbjct: 261 DIRAGTEILNYY-----------GPLPSSELLRRYGYVTPEHHRYDVAEVSWELVRSTLV 309

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHV---HAGREKEAISDMLPYLRLGYVSDTSE- 421
           A L   D   Q    V  + G+  ++ + V    +G   +    + P       ++ SE 
Sbjct: 310 AHLELSDGILQ---AVETQLGEDELEDYFVLERDSGEPSDEGRLVQPPQTCEVPTELSEQ 366

Query: 422 MQSVISSLGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPK 477
           +++++ +L    P  +   + R  +    + +    +L+ Y  ++ EDE +L + +L  +
Sbjct: 367 LKTILKALKKQQPELIGSAVRRDEILHAVIGEALNRKLSEYATSVEEDEELLENSSLTKR 426

Query: 478 KRVATQLVRMEKKMLNACL 496
            R+A ++   EK++L+  L
Sbjct: 427 HRLAIEVRLGEKRLLHELL 445


>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
 gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
          Length = 487

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 120/307 (39%), Gaps = 73/307 (23%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  ++  G+  FS+P ++V+T++    N  +  LL  N   ++     L L ++YE  
Sbjct: 50  VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------------ 217
           QG+KS W PY R L  +        ++ + WS  EL  L  S    +I            
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQASTIVEKIGRSNAEESIRDS 158

Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFE 257
                            L   EGI  +   L  V  + GSL   Y +DI     +    E
Sbjct: 159 IAPILAKRPDLFPPPPGLASWEGIAGDA-ALIQVGHVMGSLIMAYAFDIEKAEDDDDEGE 217

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
           +  ++++        L K        +VPL   L A + +  A L   + A+ +   +P 
Sbjct: 218 VNDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP-YDRL------VVEAA-LN 369
           + G+ I   Y           G  P + LL  YG+V ++   YD L      + EAA L 
Sbjct: 271 QKGDEIFNDY-----------GEIPRADLLRRYGYVTDNYAVYDVLELSLETICEAAGLA 319

Query: 370 TEDPQYQ 376
             DP+ Q
Sbjct: 320 NADPESQ 326


>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
          Length = 2165

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/344 (22%), Positives = 136/344 (39%), Gaps = 76/344 (22%)

Query: 97  LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
           L++  SH EK +  +Y A      A+ D++ G+    VP   ++TLE  +    G +   
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213

Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
           + L    +S + + L+ Y+M EK++ + S W  YI  L +           P+ ++E E 
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRKPE-SQWQIYIDILPKNFSN------FPIFFTEEER 266

Query: 206 AYLTGSPTKAEILERAEGIKREYN---------------ELDTVWFMAGSLFQQYPYDIP 250
            +L GSP   +ILE+ E IK +Y+               E   +  M  S      + I 
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYDLICKEVPEYVQFPIREYSEIRMMVSSRI----FGIQ 322

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
            E     +    FVA    + H +           P          K   ++ A++D   
Sbjct: 323 IEG----VKTDGFVAYADMLNHKR-----------PRQTSWTYTDEKQGFIIEAMEDI-- 365

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
                  + GE +   Y           G + NS+  +NYGF++ +N  + + ++   +T
Sbjct: 366 -------QRGEQVYDSY-----------GKKCNSRFFLNYGFINLNNDANEVPIKVYYHT 407

Query: 371 EDPQYQDKR-MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 413
           +D   Q K+ M+   +     + F V    E   + +   +LR 
Sbjct: 408 DDQLKQVKQDMIVDHS---EFKKFRVVENLEDRVMQEFFSWLRF 448


>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
 gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
          Length = 490

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 66/254 (25%), Positives = 104/254 (40%), Gaps = 33/254 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + ASE +  G+ A  +P SL+++ E +  +E    L   N ++    L L+ M E+    
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
             F  PY   L      G       L +    LA L G+    EI++  + ++++Y+EL 
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPP 290
            +      L   +P     +  T++ F  A       S +V L    L+    LVP+   
Sbjct: 294 PL------LCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLST--CLVPVAGL 345

Query: 291 LLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
           L    S        VD+A   ++  + RP  AGE   + Y           G  P S L+
Sbjct: 346 LNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSY-----------GKHPGSHLV 394

Query: 348 INYGFVDE-DNPYD 360
             YGF+   DNPYD
Sbjct: 395 TFYGFLPRGDNPYD 408


>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
           [Ostreococcus tauri]
          Length = 446

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 100/417 (23%), Positives = 164/417 (39%), Gaps = 64/417 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL-ACLALYLMYEK-KQ 170
           VA + D+  G+   +VP    V+      + T+   L+    + L   LA +++ E    
Sbjct: 25  VATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLDGILAAHVLREAFGL 84

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           G++S + P++R L  +        ++ + W E EL  L GS   A      +  + EY+ 
Sbjct: 85  GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 282
           LD         F     D P EAF        TFE F  A   V S  + L+  S +   
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185

Query: 283 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
            ++ +  P+L     A S K      A  +AV++     +K    +   Y+         
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDT-------- 237

Query: 338 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
              +P+   L+ YGF+ E NP + + V   L+  D   + K  + +R+G    +      
Sbjct: 238 ---KPSQYFLLQYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWK 294

Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFK 452
            R  +   D+L   R+    D SE+    S     S   +   +    +AVL +      
Sbjct: 295 VRGLD--YDLLAAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLI 348

Query: 453 ARLAGYPATLSEDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 504
             L GY  TL ED + +  +N      PKKR     L+RM +K     L  +AD + 
Sbjct: 349 TSLDGYGTTLGEDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402


>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
 gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
          Length = 496

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 70/281 (24%), Positives = 108/281 (38%), Gaps = 66/281 (23%)

Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
           + A  D+ A    F++P +S++ T    L NE                +  E  T++   
Sbjct: 49  IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
               L L L+YE  QG  S W PY+  L         A ++P+ WS TELA L  S    
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161

Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
           ++  E A+ + R          D V+F  G   Q+   D       FE+  +   A+ + 
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFELAHRMGSAIMAY 214

Query: 270 VVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
              L+K   A                     +VP+   +L   ++  A +    D++   
Sbjct: 215 AFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAYINHGADSLTAT 273

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
             R  KAGE I+ +Y           GP PN +LL  YG+V
Sbjct: 274 ALRTIKAGEEILNYY-----------GPLPNGELLRRYGYV 303


>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
          Length = 476

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 104/440 (23%), Positives = 167/440 (37%), Gaps = 83/440 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +D+   +  FS+P S ++ +E  + +  I    T   L     L L ++YE   G 
Sbjct: 40  VVAKQDIAEHELLFSIPRSSILGVENSILSTEIPPA-TFAHLGPWLSLILIMLYEYHNGS 98

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
            S W PY   L           ++ + W+E ELA L  S    +I               
Sbjct: 99  ASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANEVFIEQLLP 151

Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
                        ERA+   +E    + +  M   GSL   Y +D+   A + +   +  
Sbjct: 152 VIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210

Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
            A +     L K        +VPL   L A   +C A L    D +++   +P +AG+ I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEI 263

Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTE 371
              Y           GP P S LL  YG++ D    YD            L  +   + E
Sbjct: 264 FNDY-----------GPLPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEE 312

Query: 372 DPQYQDKRMVAQRNGKLSVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 430
             +Y D++ +      ++  + F +     +E++S  L  L    +    E +  + S G
Sbjct: 313 RIEYLDEQEIVDTGYDIAASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKG 366

Query: 431 PICPVSPCMERAVLDQLADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQL 484
            + P +  M       L    +AR+A YP TL +D     E          ++RVA  + 
Sbjct: 367 RL-PKAEKMTGKAAKFLYKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARA 425

Query: 485 VRM-EKKMLNACLQVTADMI 503
           VR+ EKK+L    +  AD I
Sbjct: 426 VRIGEKKLLVQTEERLADKI 445


>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
           n-methyltransferase i [Nannochloropsis gaditana CCMP526]
          Length = 385

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 20/202 (9%)

Query: 78  LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           LG+   WM      G+PP  ++L  +     E       +     +  G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           +  E+ L     A       ++E   +AL L+ E+  G  SFW PYI  L          
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTE 252
           V   L+W E +LA L  SP  A        +  E+  L+  +  A S +F          
Sbjct: 227 VNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMRARSDVFD-------PS 279

Query: 253 AFTFEIFKQAFVAVQSCVVHLQ 274
            FTFE +  AF+ + S  + ++
Sbjct: 280 VFTFEAYLWAFINIFSRAIRVK 301


>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
          Length = 457

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 94/403 (23%), Positives = 148/403 (36%), Gaps = 78/403 (19%)

Query: 81  LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
           LKS+M      HK      K  LKE+   +   RP  +      +  ++ LQAGD   S+
Sbjct: 17  LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76

Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
           P   ++T    L   LG               +   L    L  L C    L   + EK 
Sbjct: 77  PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G+KS W PY+  L +       A   P    E ++  L   P + +  E+   I+  + 
Sbjct: 137 AGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQELFQ 188

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--- 286
                +     LF +   +I    F F   + A+  V +  +++ K      F+L P   
Sbjct: 189 SSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEPDVY 243

Query: 287 -LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
            L P   LL +S   + KA       + ++  D   K  + +++ Y           GP 
Sbjct: 244 ALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICY-----------GPH 292

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
            N +LL+ YGFV  DNP+  + V A    +     DK    QR  K+S+   H       
Sbjct: 293 DNQRLLLEYGFVATDNPHSSVYVSADTLLKYFSSLDK----QREAKVSILKDH------- 341

Query: 402 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVL 444
               D L  L  G+   +  + + +  L        C  R +L
Sbjct: 342 ----DFLENLTFGWEGPSWRLLTALKVLSLAADEFACWRRILL 380


>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
          Length = 1114

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 147/384 (38%), Gaps = 65/384 (16%)

Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
           DA     + LV + ER   +      L      E   LAL L+YE+++G KS W P+I  
Sbjct: 64  DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHIAL 123

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTK--------------AEILERAEGIKREY 228
           L            + L WSE ELA L GS                 +EI++++     E 
Sbjct: 124 LP------ATPPHALLRWSEAELAELAGSDALELANRWRSQVSSDFSEIVDKSRAAVEES 177

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           +    +   A     ++P+ +  E F++ +           ++  + VS++R+ A     
Sbjct: 178 DPGKQL-SAAVKASLRFPW-LDLEGFSWAV----------SMIWSRCVSVSRKGA----- 220

Query: 289 PPLLAY--------SSKCKAMLAAVDDAVQLVVDR---PYKAGESIVVWYNISFTGHDFK 337
           PP+ A+                   DDA    V R     K G+ + + Y+         
Sbjct: 221 PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYD--------- 271

Query: 338 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 397
               PN+ LL+ YGF  +   +    + A L+ E P Y+ KR   +   KL +      A
Sbjct: 272 --GLPNAWLLLLYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALE---KLGLGATADGA 326

Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 457
              + A  D LP  RL  ++     ++ +  L  +   S    RA    L     A LA 
Sbjct: 327 APFRLAADDALPE-RL--LTALMAQRATLDELPGLPATSEATARAAAGDLVAACDALLAA 383

Query: 458 YPATLSEDEAMLTDYNLHPKKRVA 481
           Y  +  ED A L D    P+ R+A
Sbjct: 384 YRGSEDEDAAALADPATPPRLRLA 407


>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 487

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 75/294 (25%), Positives = 118/294 (40%), Gaps = 68/294 (23%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  ++  G+  FSVP ++V+T++    N  +  LL  N   ++     L L ++YE  
Sbjct: 50  VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA---EGIKR 226
           QG+KS W PY R L  +        ++ + WS  EL  L  S T  E + R+   E I+ 
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157

Query: 227 ----------------------EYNELDT----VWFMAGSLFQQYPYDI---PTEAFTFE 257
                                 E +  D     V  + GSL   Y +DI     +    E
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
              ++++        L K        +VPL   L A + +  A L   + A+ +   +P 
Sbjct: 218 ANDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNT 370
           + GE I   Y           G  P + LL  YG+V ++   YD  V+E +L T
Sbjct: 271 QQGEEIFNDY-----------GEIPRADLLRRYGYVTDNYAVYD--VLELSLET 311


>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
 gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
          Length = 527

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/237 (25%), Positives = 91/237 (38%), Gaps = 46/237 (19%)

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           T ER L    +  +L  N       +  +L+ E+ +GK+SFW PYI  L       +L++
Sbjct: 90  TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143

Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIP 250
             PLLW +ET    L GSP    I  R +  + ++  L T     F+ G           
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG---------- 193

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLA 303
                 E   +  V   S V+  +  SL            LVP    L   SS  +    
Sbjct: 194 ------ETLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCL 247

Query: 304 AVDD---AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
             D       L   + Y  G+ +   Y  + TG          S+L ++YGFVD++N
Sbjct: 248 IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTG----------SQLFLDYGFVDDEN 294


>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
          Length = 543

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 99/250 (39%), Gaps = 30/250 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +    P E +T+E +  A     S  + ++      +  L+P+   L   
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344

Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
                     VD    +++  V RP   GE   + Y           G   +S LL  YG
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHLLTFYG 393

Query: 352 FVDE-DNPYD 360
           F+ + DNPYD
Sbjct: 394 FLPKGDNPYD 403


>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 82/419 (19%), Positives = 160/419 (38%), Gaps = 52/419 (12%)

Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELA 158
            +E HR +    A++ ++ G+    +P +  ++LE V    L N  + ++    K + + 
Sbjct: 63  QSEGHRTLR---ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQI--NYKPNNIQ 117

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
              +  + ++ + K SFW PYI  L +           P  +   + A L GSPT   ++
Sbjct: 118 TYFVNHLLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVI 171

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
            + +  K EY  L      A   FQ+Y Y       T++ F +  +   S    +Q    
Sbjct: 172 NQRKVFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEK 220

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWYNISFTGHDF 336
            ++  LVPL    + + +          DA    +   R  + GE +   Y         
Sbjct: 221 EQQQLLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNY--------- 270

Query: 337 KCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVF 393
             G   N    +NYGF    NP ++  ++  LN  D  +  K  + + N   G   V   
Sbjct: 271 --GQWSNKYFFMNYGFASLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNET 328

Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQL 447
                R+  A    + + ++  + D  +++  + +     P       +  +E+A    L
Sbjct: 329 DHDTFRQSLA---TVRFTQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKAL 385

Query: 448 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 506
                  L  + +T+ +DE  L D      ++    L   EK+++   + +   M+ ++
Sbjct: 386 KGILVTELGNFASTIEDDERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444


>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
           206040]
          Length = 481

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 89/204 (43%), Gaps = 25/204 (12%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           L L+ E  +G++SFW PYI+ L +       A+  P  W E E   L G+  +  + +  
Sbjct: 90  LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147

Query: 222 EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
           + +KRE+ E   +   +          L  +  Y+     F+   F+ + V  ++    L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207

Query: 274 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI 329
            + VS+     L+PL   G   +A   + +   A    A QL V R ++ G+ I   Y+ 
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYS- 266

Query: 330 SFTGHDFKCGPQPNSKLLINYGFV 353
                     P+ N++LL+ YGF+
Sbjct: 267 ----------PKTNAELLLGYGFM 280


>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
 gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
 gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 543

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 99/250 (39%), Gaps = 30/250 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +    P E +T+E +  A     S  + ++      +  L+P+   L   
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344

Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
                     VD    +++  V RP   GE   + Y           G   +S LL  YG
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHLLTFYG 393

Query: 352 FVDE-DNPYD 360
           F+ + DNPYD
Sbjct: 394 FLPKGDNPYD 403


>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
 gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
          Length = 493

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 91/408 (22%), Positives = 163/408 (39%), Gaps = 65/408 (15%)

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
           AG+   +VP S+   +        + EL+    +SE     LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168

Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           Y+  L DR          +PL ++  ++  L  +      L+  + I R+Y  +      
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVPL----- 287
               F Q   D   + FT+++F+ A   V +      V+L +   +    AL+PL     
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMAN 272

Query: 288 -GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
              P  A  ++C A     A ++ ++  + R      S+ ++  I +       G + ++
Sbjct: 273 HAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIF--IVY-------GTRTDA 323

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
           + L++ GFV   NP+  +     L    P Y+++  + +  G  +   F     RE  A 
Sbjct: 324 EFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAA 383

Query: 405 SDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVLDQ 446
           +   P     + L  VS  +              +  + +   + P   C   ER     
Sbjct: 384 TTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER----W 439

Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 494
           LA   K  L  YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 440 LATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486


>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
          Length = 485

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 75/356 (21%), Positives = 125/356 (35%), Gaps = 83/356 (23%)

Query: 97  LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
           LK    H + H  IH            + AS D+   +  F +P+ L+++++       +
Sbjct: 24  LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83

Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
              L   +L     L + ++YE  QG++S W PY R L         + ++ + W++ +L
Sbjct: 84  G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134

Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
           + L GS    +I + A          DT+      L Q  P+  P               
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPHHFPPR------------- 173

Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
                               P  PPL +  S+   +  A            ++ G SI++
Sbjct: 174 --------------------PNMPPLNSPDSQNALLCLA------------HRMG-SIIM 200

Query: 326 WYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 385
            Y         K            Y   DED P   +V  A +   D Q  + R+  +  
Sbjct: 201 AYAFDIE----KADEADEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEG 256

Query: 386 GKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 436
             +   + ++H+G E       LP      R GYV+D      V+  SL  IC V+
Sbjct: 257 SFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDGICKVA 312


>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 536

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 99/250 (39%), Gaps = 30/250 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +    P E +T+E +  A     S  + ++      +  L+P+   L   
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344

Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
                     VD    +++  V RP   GE   + Y           G   +S LL  YG
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSY-----------GNYSSSHLLTFYG 393

Query: 352 FVDE-DNPYD 360
           F+ + DNPYD
Sbjct: 394 FLPKGDNPYD 403


>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
 gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
          Length = 472

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 69/335 (20%), Positives = 134/335 (40%), Gaps = 68/335 (20%)

Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           ++ L + E E+ YL GSP   +I+   E   + Y++L    F    + +       +   
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220

Query: 255 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 306
            ++ F+ A   + +  +++       S  ++  L P+ PP++ Y +      A +D    
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280

Query: 307 -DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF----VDEDNPYDR 361
             +V +   +  K GE I V Y+  + G D          LL++YG+    +D+ +  + 
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCGSD----------LLVDYGYLPNQIDDKSCVNV 330

Query: 362 LVVE--AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 414
           L+ E    +N +DP   DK     +++  ++ KL +                        
Sbjct: 331 LMEELLETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------- 367

Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDY 472
                  M S+   L  I       + ++L+ L      ++  YP T+ +D+  L   +Y
Sbjct: 368 -------MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEY 420

Query: 473 N-LHPKKRVATQLVRMEKKMLNAC---LQVTADMI 503
           N L  + ++A  L   EK++L+     LQ   D I
Sbjct: 421 NQLSARSKLAFNLAFQEKQILSNVYTKLQENIDTI 455


>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
          Length = 915

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 80/370 (21%), Positives = 154/370 (41%), Gaps = 62/370 (16%)

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
           + L + +M+E  +G +S W PY+  L         + E+P+ WS  EL  L  S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595

Query: 218 LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 264
             +A+  +  + ++  V      +F   Q Y  D   +          ++ F+   +   
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             +     ++         +VP+   +L   ++  A +   DDA+ +   R  KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYD------RLV---VEAALNTEDPQ 374
            +Y           GP PNS+LL  YG+V  + + YD      +LV   + A+L   + Q
Sbjct: 714 NYY-----------GPHPNSELLRRYGYVTPKHSRYDVVELPWKLVENSLAASLGLSEQQ 762

Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-----MQSVISSL 429
             + R     +      V      RE +  +    +      SD  E     ++ ++ ++
Sbjct: 763 LDNAREYLDMDEIEDTFVL----DRESDEPNPDGTFTGSARFSDIPEDLRDQLKLLLKAV 818

Query: 430 GPICPVSPCMER-------AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 482
             + P S   +R       +VL +  D   ++   YP T+ EDE +L+  ++  ++R A 
Sbjct: 819 RKVDPSSVADKRKRDEIQHSVLIKALDALASQ---YPTTIPEDERILSGSDISERQRAAV 875

Query: 483 QLVRMEKKML 492
            +   EK+++
Sbjct: 876 TVRLGEKRLI 885


>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
 gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
          Length = 688

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 33/200 (16%)

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
           +K Q + S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I  
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
           E+ ++     +   LF +          + E F   +  V S     + +++     LV 
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQL---VVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           +  P+L + +   A  A +     L   VV       E+  +W N         CG   N
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWIN---------CGDLSN 652

Query: 344 SKLLINYGFVDEDNPYDRLV 363
           ++L ++YGF   +N YD ++
Sbjct: 653 AELALDYGFTVPENRYDEVM 672


>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
 gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
          Length = 453

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 93/204 (45%), Gaps = 22/204 (10%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           +R V    EE       W+H+NG   C+ +  +     E  R +    A++ L+  +   
Sbjct: 19  TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72

Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
            +P  L++T++ ++ +  +A  +    ++L+    LA++LM EK + +KSFW PYI  L 
Sbjct: 73  VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
            +          P  ++E +   L  S  + +   +     +E+ EL   + M   LF  
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNS-LRGKAKAKKYECHKEFMELAPFFKMLADLFPD 183

Query: 245 YPYDIPTEAFTFEIFKQAFVAVQS 268
                  +AF F+ FK A+ A+++
Sbjct: 184 -----QEDAFNFKDFKWAWSAIKT 202


>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 648

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 79/331 (23%), Positives = 128/331 (38%), Gaps = 52/331 (15%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L  W   +G     V +KE  S +++ +      AS+D+        +P++++++   V 
Sbjct: 36  LVDWGRMHGANIENVEIKETASDDDR-KLTRGAYASKDIPPNSEICFIPSTILLSESDVR 94

Query: 141 GNETIAELLT--------TNKLSE---------LACLALYLMYEKKQ-GKKSFWLPYIRE 182
            +E    +LT          K+S+         L  +A +++++       S WLPY+  
Sbjct: 95  ASEIGKAILTYIDEHQDAKQKISDKIKHPHAEILLAMAAFIVHQVSLPTADSHWLPYLAS 154

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYNELDTVWFMAGSL 241
           L +           PL+W+   +  L G  +   ++ ER E I+   N    V    G  
Sbjct: 155 LPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTKVVENACGHY 204

Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
           F       PT A T +  + A  ++ S      K SL  +          +  S  C  +
Sbjct: 205 F-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWIGLSEIC--L 255

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKC---------GPQPNSKLLINYGF 352
              +D        R Y+  E  +    +SF   D  C         GP+ N  LL NYGF
Sbjct: 256 FPILD---MFNHKRGYRV-EWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSNYGF 311

Query: 353 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
           V E+NP D   V   L  EDP Y  K+ V +
Sbjct: 312 VIENNPEDYFKVFLGLQQEDPLYTAKKAVLE 342


>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
 gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
          Length = 525

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 69/156 (44%), Gaps = 16/156 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + +++DL+  +    +P  +++++       +I+ +LT   +      A+ L+YE   G+
Sbjct: 82  IISNKDLKVNNIVAKIPKDIILSIH----TSSISNILTKYTMERNIATAIALIYEASIGE 137

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           KS W  YI  L        L V+ P+LW +     L G+  +  I +    I   Y ++ 
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADI- 189

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
               +   L + +P     E F+FE FK A   V S
Sbjct: 190 ----VESLLIKNHPEYFSKEIFSFENFKIANSIVSS 221


>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 72/294 (24%), Positives = 119/294 (40%), Gaps = 72/294 (24%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+  G+  FS+P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G++S W PY + L +       + ++ + WS +EL  L GS   ++I +  EG +    
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKIGK--EGAE---- 151

Query: 230 ELDTVWFMAGSLFQQYPYDIPT----EAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
             D++  M   + +  P   P+     ++  E    A + +   +  L    +A  F   
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGLAHIMGSL---IMAYAFDIE 206

Query: 283 ---------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
                                       +VPL   L A + +  A L   +D++ +   +
Sbjct: 207 KAEDEDDEDNDEEEGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEEDSLVMKAIK 266

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
           P  AGE I   Y           G  P + LL  YG+V  DN     VVE +L+
Sbjct: 267 PIHAGEEIFNDY-----------GELPRADLLRRYGYV-TDNYAHYDVVELSLD 308


>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
          Length = 390

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 65/322 (20%), Positives = 125/322 (38%), Gaps = 46/322 (14%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           EE+  +L  W+ + G    KV +       +       + A+ +   G+    +P + ++
Sbjct: 24  EEEYDELVDWLKQCGATVDKVAVDHFNGMGQG------LKATAEAAPGETLLRIPEACML 77

Query: 135 --------TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
                   TL   + ++T+ +L+    L+       + +  +     SFW PYI  L   
Sbjct: 78  SEESARRSTLGAYMDSDTMLKLMPNVTLA-------FHLLLELHDLDSFWRPYIACL--- 127

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQY 245
                ++   PL W   +L  L GS    E +   + + R+Y  L + +   A      +
Sbjct: 128 ----PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCF 183

Query: 246 P--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVPLGPPLLAYSSKCK 299
           P    +  EAFTFE ++ A   V +    + +      +    AL+PL   +   +    
Sbjct: 184 PLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS 243

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
               +  + ++ V   P K G  I +WY           G + N + L++ GF    +  
Sbjct: 244 TQFDSERECLEFVCPAPAKPGSQITMWY-----------GDRNNGQFLLHQGFFFAGHAN 292

Query: 360 DRLVVEAALNTEDPQYQDKRMV 381
           D + V  +L+  D  Y+ K ++
Sbjct: 293 DYVNVPFSLDETDSLYKIKALL 314


>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
          Length = 271

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 66/144 (45%), Gaps = 18/144 (12%)

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 314
           E FK +F  + S +V L   S+  + ALVP    +L +S   +  L        +    D
Sbjct: 2   ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58

Query: 315 RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTED 372
           RPY+ GE + + Y           G + N +LL++YGFV  +  NP D + +  +L   D
Sbjct: 59  RPYQPGEQVFISY-----------GKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSD 107

Query: 373 PQYQDKRMVAQRNGKLSVQVFHVH 396
             Y++K  + ++ G    Q F + 
Sbjct: 108 GSYKEKLELLKKYGLSGSQCFPIR 131


>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 677

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 93/214 (43%), Gaps = 38/214 (17%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  QG+  FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 97  FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151

Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 270
           E +K  Y   + V  +  + FQ   +Y +D        I + AF+ ++  +AF  +    
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
           +    VS+     L+P    L+ +    K    A    V  +V     AG+ I   Y   
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNY--- 257

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
                   GP+ N +L++NYGF   DNP D  +V
Sbjct: 258 --------GPRNNEQLMMNYGFCLPDNPCDYRIV 283


>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
           Pb03]
          Length = 488

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 92/439 (20%), Positives = 165/439 (37%), Gaps = 78/439 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           + A +D+   +  F++P  LV++ +    N  + +L+  N+  L +  CL L ++YE  Q
Sbjct: 50  IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
           G  S W PY + L           ++ + W++ EL  L GS     I + A         
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158

Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                         G    YN  D      ++    GSL   Y +D+  +        + 
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
                  V   ++  L +   ++PL   L A + +  A L   D  + +   +  + GE 
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLAMKSIKSIRKGEE 270

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQ 374
           I   Y           G  P ++LL  YG+V D    YD   V        A L +  P 
Sbjct: 271 IFNDY-----------GELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPG 319

Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 434
             + R+    + ++    + +        +++ LP   L  ++         + L     
Sbjct: 320 PDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSK 379

Query: 435 V-SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT---DYNLHP------KKRVATQL 484
           V  P +  A    L +  +  L  YP T+++D+ +L    +Y          + ++A Q+
Sbjct: 380 VPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQV 439

Query: 485 VRMEKKMLNACLQVTADMI 503
            + EK++LNA L    D I
Sbjct: 440 RKGEKEILNAVLSELEDFI 458


>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 437

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 118/282 (41%), Gaps = 48/282 (17%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+ ++ T+E    +  +  +L  T+  LS    LA+Y+++ + + K  + 
Sbjct: 34  FKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFVRSR-KSGYD 92

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
            P          R  +A       S + + E +L    G+       +  + I+ +Y  L
Sbjct: 93  GP----------RNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRIEDDYRGL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
             V  M G    QYP   P + FT E +K A   V S  +       + + L   FA   
Sbjct: 143 --VVRMLG----QYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193

Query: 287 LGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
               +L +SS+ K   +  A    + ++  + Y+AG+ + + Y           GP PN+
Sbjct: 194 ---DMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINY-----------GPMPNN 239

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
           +LL  YGFV   NP D   +  A +   P ++ K+ +    G
Sbjct: 240 RLLRLYGFVVPGNPNDSYDLVLATHPMAPFFKQKQKLWASAG 281


>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 403

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
           A LA+ LM +   G  + W  Y   L         AV+S ++WS+ EL  L GS  +   
Sbjct: 47  ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 276
           + R +  KREY+ L    F A  L +  P      EA++F++F+ A+  V          
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 321
            +AR F L  L         +C A+L  +D      DA + VV+R   A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184


>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 699

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 93/228 (40%), Gaps = 18/228 (7%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I   Y   
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY--- 273

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
                   GP+ N +L++NYGF    NP D  +V        P Y  K
Sbjct: 274 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313


>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 488

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 93/439 (21%), Positives = 166/439 (37%), Gaps = 78/439 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           + A +D+   +  F++P  LV++ +    N  + +L+  N+  L +  CL L ++YE  Q
Sbjct: 50  IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--------TKAEILERA- 221
           G  S W PY + L           ++ + W++ EL  L GS         T  E+  R  
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSTAEEVFLRDL 158

Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                         G    YN  D      ++    GSL   Y +D+  +        + 
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
                  V   ++  L +   ++PL   L A + +  A L   D  + +   +  + GE 
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLSMKSIKSIRKGEE 270

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQ 374
           I   Y           G  P ++LL  YG+V D    YD   V        A L +  P 
Sbjct: 271 IFNDY-----------GELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPG 319

Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP 434
             + R+    + ++    + +        +++ LP   L  ++         + L     
Sbjct: 320 PDEPRLEFLDDLEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSK 379

Query: 435 V-SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT---DYNLHP------KKRVATQL 484
           V  P +  A    L +  +  L  YP T+++D+ +L    +Y          + ++A Q+
Sbjct: 380 VPKPALGIAEATLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQV 439

Query: 485 VRMEKKMLNACLQVTADMI 503
            + EK++LNA L    D I
Sbjct: 440 RKGEKEILNAVLSELEDFI 458


>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
          Length = 653

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
           AA  +++  D    VP  +++T+ER L +      +  A +    +  +   L ++L+YE
Sbjct: 56  AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
            ++G +SFW PY   +D     G L    P  WS+  +  L  S  K +I +  +  + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167

Query: 228 YNEL 231
           ++ L
Sbjct: 168 WDML 171


>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
          Length = 762

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 118/316 (37%), Gaps = 71/316 (22%)

Query: 66  GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
           GS  VV+       +  +W+   G    K+ +K    H    R +   AA E    G+  
Sbjct: 11  GSSAVVTS------EFVAWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETL 57

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPY 179
            SVP +L++T+++      +A  L   +   +        LAL+L  ++ +     W PY
Sbjct: 58  LSVPEALLLTVDKASRRADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPY 113

Query: 180 IRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG 239
              + R           P  W   + A L GSP   +++ R + I+R+   L        
Sbjct: 114 RNVISRS------VSHLPCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDR 167

Query: 240 SLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQK-VSLA-RRFALVPLGPPLLAYS 295
             F      + + AF F   +    F  + +   H ++ V  A  R A V          
Sbjct: 168 QAFAFAEAQVLSRAFAFNGTRAMVPFADLMNTARHHERHVDFAFERGAFV---------- 217

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDE 355
                 + AV         R   AGE +   Y           GP+ N++ L+NYGF   
Sbjct: 218 ------MRAV---------RRGAAGEPVTDSY-----------GPKSNARYLLNYGFAMA 251

Query: 356 DN--PYDRLVVEAALN 369
           DN     RL+ +AAL+
Sbjct: 252 DNRDEAGRLLDDAALD 267


>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
          Length = 1753

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 87/405 (21%), Positives = 157/405 (38%), Gaps = 47/405 (11%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
            V A    + G+   ++P++ + T E    +  +  +L + +  LS    LA++L++ K  
Sbjct: 912  VKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFVKS- 970

Query: 171  GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                      R    +  R  +A       + + ++E EL    GS       +  + ++
Sbjct: 971  ----------RTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLHTLTTQLEQRVQ 1020

Query: 226  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
             ++ +L         L  Q+    P + FT E +K A   + S  +            + 
Sbjct: 1021 DDFRQLLV------QLLSQHRDLFPLDQFTIEDYKWALCTIWSRAMDFAVSDTTSVRLVA 1074

Query: 286  PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
            PL   +L +S    +C A      D + ++  + Y+ G+ I ++Y           G  P
Sbjct: 1075 PLAD-MLNHSLDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYY-----------GSVP 1121

Query: 343  NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
            N++LL  YGFV  DNP D   +    +   P Y+ K  +    G  S     + A  +  
Sbjct: 1122 NNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTIPLTA--KHP 1179

Query: 403  AISDMLPYLRLGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPA 460
               ++L YLR   + D +++  +   L  G    V+   E  VL  L D   + L G+  
Sbjct: 1180 LPKNVLRYLRTQRL-DAADVADMTLQLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFGI 1238

Query: 461  TLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
             L + EA L    Y        A Q+   E+ +L    +   DM+
Sbjct: 1239 PLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRAKKTAEDML 1283


>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
           1015]
          Length = 668

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 93/228 (40%), Gaps = 18/228 (7%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 77  DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I   Y   
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY--- 247

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
                   GP+ N +L++NYGF    NP D  +V        P Y  K
Sbjct: 248 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287


>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
          Length = 643

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 96/463 (20%), Positives = 185/463 (39%), Gaps = 68/463 (14%)

Query: 71  VSKKEEDLGDLKSWM-HKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           +S   EDL   + W+ +KN  L P   I+   P           + A+ +++  +    +
Sbjct: 202 ISTTPEDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRS------MVANTNIKKDEILVEI 255

Query: 129 PNSLVVTLERVLGN--ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           P  +++T + ++ N    I + +   K+S     A+ ++Y     + S+W  Y+  L +Q
Sbjct: 256 PKGIMMTPKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ 314

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN-------------ELDT 233
                    + + ++  E+  L  SP       R  G+ R Y+             E D+
Sbjct: 315 -------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDS 367

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
                 S       +   + +T + FK A   V S    L +        +VPL     A
Sbjct: 368 TKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSLSE----EDGGMVPLADMFNA 422

Query: 294 YS----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
            +    SK    ++A   ++     +  +AGE I   Y +  T          + ++L++
Sbjct: 423 DTVISRSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKT--------LGSGQMLMD 474

Query: 350 YGFVDED-NPYDRLVVEAA-LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
           YGF+ ED +  D  +V  A +   +P Y  KR + Q NG +  + F +    + +   ++
Sbjct: 475 YGFIHEDGSSADSTIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNKLAKEL 530

Query: 408 LPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 465
             + R+  ++  ++ +  +   S      ++P  E+A L  L++     L  Y  T+ +D
Sbjct: 531 FLFARIKSINKKESDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQD 590

Query: 466 EAML-------TDYNLHPKKRVAT----QLVRMEKKMLNACLQ 497
             +L       T+ N H      T    +L  MEK +LN+ L+
Sbjct: 591 NQILKEIEKDKTNTN-HSSVTFNTINAIKLRLMEKNILNSFLK 632


>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
          Length = 496

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 112/265 (42%), Gaps = 33/265 (12%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
            + IK+   E  T      S F  + Y    I + +F  E+   +    +      +++ 
Sbjct: 156 IKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELE 215

Query: 278 LARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
             R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI       
Sbjct: 216 NERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY------ 266

Query: 337 KCGPQPNSKLLINYGFVDED-NPYD 360
             G  PNS+LL  YG+V+ D + YD
Sbjct: 267 --GEHPNSELLRRYGYVEWDGSKYD 289


>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 340

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 98/223 (43%), Gaps = 39/223 (17%)

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
           LL +SS+ +++       +++V ++  + GE++V+ Y           GP  N  LL++Y
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNY-----------GPLSNDILLLDY 166

Query: 351 GFVDEDNPYDR--------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVF 393
           GFV   NP DR              LV +  +++ +DP      ++ + N  G  S Q+ 
Sbjct: 167 GFVMPKNPNDRVELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM- 225

Query: 394 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLAD 449
            V  G  +     +L  +R+ +  D  E+  V    + +     P+    ER  +  L  
Sbjct: 226 -VTLGGTELVEGRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIG 284

Query: 450 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 492
                LA +P  + ED++ L   ++    R+A Q   ++K++L
Sbjct: 285 LGMLALASFPTEIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327


>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
          Length = 478

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 86/380 (22%), Positives = 144/380 (37%), Gaps = 62/380 (16%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           +  L     L L ++YE  QG+ S W  Y+  L +       A E+P+ W+  EL  L G
Sbjct: 100 SEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEG 152

Query: 211 SPTKAEILERAEGIK--RE--------------------YNELD--TVWFMAGSLFQQYP 246
           +    E + + E  +  RE                     NE D  ++    GS    Y 
Sbjct: 153 TSLTTEKIGKKESDRMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
           +D+  E    E  +  ++  +         SL     +VP+   +L  +++  A +   D
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGD 262

Query: 307 DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRL--- 362
                 +     AG  I+ +Y           GP P+S+LL  YG+V  E + YD     
Sbjct: 263 QLQVTSLRESIPAGSEILNYY-----------GPLPSSELLRRYGYVTSEHHRYDVAEIS 311

Query: 363 --VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS----DMLPYLRLGYV 416
             +V  AL  E    +D     +R  +  ++ F V      E  S       P LR    
Sbjct: 312 WSLVRTALAEELKLSEDTIADIERKLESELEEFFVIERDAGEPSSYGTLTQPPVLREIST 371

Query: 417 SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
               + ++ + +L    P     E      L    + RL  YP +  +DE++L+   L  
Sbjct: 372 ELEEQTKAFLKALKKRDPKRKRSETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSK 431

Query: 477 KKRVATQLVRMEKKMLNACL 496
           + R+A ++   EK++L   L
Sbjct: 432 RHRMAVEVRLGEKRLLQEAL 451


>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
          Length = 347

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 15/118 (12%)

Query: 252 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 308
           E   F+ F+ A++ V S CV + L   +    F LVPL   +  +SS C       D A 
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180

Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEA 366
            +L + RP K GE IV      + GHD        + L   YGF++  NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIV----FEYGGHD-------QATLWAEYGFIESSNPHERIDLTA 227


>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
          Length = 625

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 93/228 (40%), Gaps = 18/228 (7%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 29  DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 84  SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNIS 330
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I   Y   
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNY--- 199

Query: 331 FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 378
                   GP+ N +L++NYGF    NP D  +V        P Y  K
Sbjct: 200 --------GPRNNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239


>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
 gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
          Length = 536

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 87/227 (38%), Gaps = 43/227 (18%)

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
           +  +L  N       +  +L+ E+ +GK+SFW PYI  L       +L++  PLLW +ET
Sbjct: 97  VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153

Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFK 260
               L GSP    I  R +  + ++  L T     F+ G                 E   
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLS 197

Query: 261 QAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQ 310
           +  V   S V+  +  SL            LVP    L   SS  +      D       
Sbjct: 198 EGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTAS 257

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
           L   + Y  G+ +   Y  + TG          S+L ++YGFVD++N
Sbjct: 258 LEAHKSYSKGDEVFDSYGPALTG----------SQLFLDYGFVDDEN 294


>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
 gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
 gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
           nidulans FGSC A4]
          Length = 484

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 109/288 (37%), Gaps = 60/288 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
           V A  D+   +  F++P  LV++      N  + +LL+ +  +L     L L +++E  Q
Sbjct: 50  VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
           G KS W PY + L +         ++ + WS  EL  L GS    +I ++          
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGSAVVEKIGKQGAEESILKLI 158

Query: 222 --------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQA 262
                          G+     ++     +      GSL   Y +DI T     E   + 
Sbjct: 159 IPVVRANPALFPPINGLASYDGDVGAQALLGLAHTMGSLIMAYAFDIETPENEDEREGED 218

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
                       K        +VPL   L A + +  A L   ++++ +   +P +AGE 
Sbjct: 219 GYLTDEEEEQSSK-------GMVPLADMLNADAYRNNARLFQEEESLVMKAIKPIRAGEE 271

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
           I   Y           G  P S LL  YG+V  DN     V+E +L+T
Sbjct: 272 IFNDY-----------GEIPRSDLLRRYGYV-TDNYASYDVIELSLDT 307


>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
 gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 59/232 (25%), Positives = 99/232 (42%), Gaps = 46/232 (19%)

Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
            A +   L++ K+QG +S   P+I +L    G        PL WS+ +LA L      A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190

Query: 217 ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 258
           + E+    +RE+  L DT+                 W+  G + +   +  P    T   
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 316
             +    V + VV L + SL +++A+ PL   L  ++S  ++ ++     D+  +V  R 
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302

Query: 317 YKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
           +K GE + + Y           G Q N  L+  YGF + DNP D  V+   L
Sbjct: 303 FKKGEQVFITY-----------GAQSNDSLMQYYGFAEADNPQDTYVISDVL 343


>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 345

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 95/208 (45%), Gaps = 29/208 (13%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           DD  + V  + + AGE I ++Y           G + N++ +I+ GF  ++N +DR+ ++
Sbjct: 42  DDRCECVALQDFTAGEQIYIFY-----------GTRSNAEFVIHSGFFFDNNSHDRVKIK 90

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++    + +
Sbjct: 91  LGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHL 149

Query: 426 IS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDY 472
           I         +LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L   
Sbjct: 150 IGDHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETP 205

Query: 473 NLHPKKRVATQLVRMEKKMLNACLQVTA 500
           +L     +A +L   EK++L   ++  A
Sbjct: 206 DLTFHATMAIKLRLGEKEILEKAVKSAA 233


>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
           distachyon]
          Length = 532

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 115/275 (41%), Gaps = 41/275 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + ASE++  G  A  +P SL+++ E +  ++    L   N ++    L L+ M E+    
Sbjct: 178 MVASENIGVGHIALEIPESLIISEELLCQSDMFLALKDLNSITTETMLLLWSMRERHNPS 237

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            +F + +   L      G       L +    LA L G+    E+++  + + ++Y+EL 
Sbjct: 238 SNFKM-FFETLPSNFNTG-------LNFGIGALAALEGTLLFDELMQARQHLHQQYDELF 289

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
            +      L  ++P     + +T++ F  A     S  + +   S      L+P+     
Sbjct: 290 PM------LCTKFPEIFTQDIYTWDNFLWACELWYSNSMMVVLSSGKLTTCLIPVAGLLN 343

Query: 289 ----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
               P +L Y    +A       +++  + RP KAG+   + Y           G    S
Sbjct: 344 HSVYPHILNYGRVDQAT-----KSLKFPLSRPCKAGQQCFLSY-----------GKHSGS 387

Query: 345 KLLINYGFV-DEDNPYDR--LVVEAALNTEDPQYQ 376
            L+  YGF+  EDNPYD   L ++ +++ ED   Q
Sbjct: 388 HLITFYGFLPREDNPYDVVPLDLDMSVDEEDGTAQ 422


>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
 gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
          Length = 488

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 77/356 (21%), Positives = 124/356 (34%), Gaps = 83/356 (23%)

Query: 97  LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
           LK    H + H  IH            + AS D+   +  F +P+ LV++++       +
Sbjct: 24  LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83

Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
              L   +L     L + ++YE  QG++S W PY R L         + ++ + W++ +L
Sbjct: 84  E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134

Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
             L GS    +I + A          DT+      L Q  P                   
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANP------------------- 167

Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
                         R F   P  PPL +  S+   +  A            ++ G SI++
Sbjct: 168 --------------RHFPPRPNMPPLNSSDSQNALLCLA------------HRMG-SIIM 200

Query: 326 WYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 385
            Y         K            Y   DED P   +V  A +   D Q  + R+  +  
Sbjct: 201 AYAFDIE----KTDEVDEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEG 256

Query: 386 GKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 436
             +   + ++H+G E       LP      R GYV+D      V+  SL  IC V+
Sbjct: 257 SFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDSICKVA 312


>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
          Length = 410

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 96/222 (43%), Gaps = 32/222 (14%)

Query: 153 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 212
           +L E   L+L+L+ EK + ++S W  +IR +         + ++   WSE  +A L   P
Sbjct: 19  QLCERQLLSLHLLVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKP 71

Query: 213 TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
             A    R   +  E+++L  +        W    +  +     +   +F+ E +  A  
Sbjct: 72  FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131

Query: 265 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 318
            V +   H ++ S  + R   V    P+L + +   A +AA     DA+  ++   R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191

Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
            GE +++ Y           G   N+ LL +YGFV EDNP D
Sbjct: 192 EGEQVMIHY-----------GNWSNAGLLEHYGFVLEDNPLD 222


>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
          Length = 761

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 23/141 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
           V A++D+ A  A   VP  L+++ E+  L + +I      EL   N+ S  E   L  YL
Sbjct: 46  VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
             EKK+G+KSF+ PY++ +         +  + + WS+ EL Y+       E       I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152

Query: 225 KREYNELDTVWFMAGSLFQQY 245
            RE  +L  +W  A  +F ++
Sbjct: 153 VRE--DLKDLWNQAKEIFNEF 171


>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
           domestica]
          Length = 437

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 66/294 (22%), Positives = 120/294 (40%), Gaps = 39/294 (13%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK      +  LK++   +   RP  +      + A + LQ G+   S+P   ++T + V
Sbjct: 30  HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           +    + + +T     +S L  L  +L+ EK  G KS W PY+  L +          + 
Sbjct: 90  I-RSYLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L+  E E+  L   P + +  E+   +++ +      +    SLF +   D+    F + 
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYH 196

Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--Q 310
            F  A+  + +  V   H QK  L+    +  L P   LL +S +     A  ++    +
Sbjct: 197 AFLWAWCTINTRTVYMKHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYE 256

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
           +      K  E + + Y           GP  N +LL+ YGFV  +NP+  + +
Sbjct: 257 IRTTSHCKKFEELFICY-----------GPHDNHRLLLEYGFVASNNPHSAVYI 299


>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 477

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 73/280 (26%), Positives = 108/280 (38%), Gaps = 62/280 (22%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
           V A++DL   +  FS+P + ++T      NET    L  N   EL      L L +++E 
Sbjct: 45  VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
            +G KS W PY   L           +S + WS+ EL  L GS    +I + +       
Sbjct: 97  LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKESADSTFTE 149

Query: 222 -------------EGIKREYNELDTVWFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAV 266
                        +   R  +EL ++    GS    Y +D+  P  +             
Sbjct: 150 QLIPLIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEE 209

Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
           +   + L K        ++PL   L A +    A L   DD V +      +AGE +   
Sbjct: 210 EESAISLPK-------GMIPLADMLNANADHNNAKLFYQDDKVVMKTLHAVRAGEELF-- 260

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVE 365
                  +DF  GP P S LL  YG+V D+   YD  VVE
Sbjct: 261 -------NDF--GPLPRSDLLRRYGYVTDQYAKYD--VVE 289


>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
          Length = 403

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 110/280 (39%), Gaps = 44/280 (15%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+ +NG      +  E  S++++ R +H   A+ DL+  +    VP   ++T+E  +G  
Sbjct: 37  WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89

Query: 144 TIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
           T           EL       L L+++ +++    +F+ PY   L             P+
Sbjct: 90  TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDIL------PSTLSNMPI 142

Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
            W   EL +L GS    +I ER   IK +Y  +  +W          P  I  +  T E 
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTLEE 190

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPY 317
           FK A + V S    +  V+ AR  A+VP    L  +  +  K        A  +   +  
Sbjct: 191 FKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKI 249

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
             G  I   Y           G + N + L+NYGF  EDN
Sbjct: 250 SVGSQIYDSY-----------GQKCNHRFLLNYGFAIEDN 278


>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 363

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
           A   +P +L VT   V  +E +A L       EL  LAL+LM E+++G++S W P++  L
Sbjct: 2   ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59

Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYN 229
                    A  SP+LW  E +   L  SPT  E   R   +++E++
Sbjct: 60  PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD 99



 Score = 38.5 bits (88), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 90/219 (41%), Gaps = 24/219 (10%)

Query: 297 KCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWYNISFTGHDFKCGP--------Q 341
           +C+A  AA+     ++  R        +  G+ + +W  +   G               +
Sbjct: 86  ECRARRAALQQEWDVIAQRIATGDARRFSGGDELKLWITLGSPGWGGTSDKLLMAIYDGR 145

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
           PN +L +  G V++DN  D L V   L   D  +  K+ + +  G   VQ F +   R  
Sbjct: 146 PNGELAMATGRVEDDNASDCLTVRVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR-- 203

Query: 402 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP-- 459
              + +L YLRL  ++D + +  V  S      ++P  E  VL  L    + RL  Y   
Sbjct: 204 -MPTQLLAYLRLARLTDPALLAKV--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGM 260

Query: 460 --ATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 496
              +  ED  +L    L  ++R+A +L + EK +L   L
Sbjct: 261 HMGSAEEDVKLLQRPGLTAQERLAARLRKAEKAILQGTL 299


>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
           chloroplastic-like, partial [Cucumis sativus]
          Length = 923

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 92/219 (42%), Gaps = 39/219 (17%)

Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           ++ ASE ++AGD    VP       +SL + +  +LGNE          +  +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           + E K G  S W PYI  L +        + + + W E+EL  +  S    E L +   I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           KRE+  +           + +P  I  +  + + F  A+  V S     +        +L
Sbjct: 839 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 884

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 321
           +P     L +    +AML   DD    ++V DR +  GE
Sbjct: 885 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922


>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
          Length = 732

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 49/170 (28%), Positives = 82/170 (48%), Gaps = 20/170 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
           + A+ +++ GD    VP  L ++ E   + +G++   E L   K+     L ++L+ E +
Sbjct: 98  LIANTEIKEGDEFIKVPLKLFMSQETAFKSIGDKVSREPLF--KMLPNMLLVIHLIQETQ 155

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           + ++SFW PYIR L +       + ++ L ++  E   L GSP    +LE  E I    N
Sbjct: 156 KQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE--ESINTYRN 202

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
            L    F+    F + P  + T  FT+E F+Q  +A    +V L K  LA
Sbjct: 203 TLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250


>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 664

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 87/365 (23%), Positives = 149/365 (40%), Gaps = 66/365 (18%)

Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
           +T+ +++  A +AL+L++E   Q +KS W P++  L R        VE+PLLW+  ELA 
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241

Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
           L GS     I  R   +K    + D ++     L Q++P   P E F  E +  A   V 
Sbjct: 242 LEGSNL---IGFRDAVLKGWTTQRDALF---PKLTQKFPSLFPEEHFRTERWAWAMAIVW 295

Query: 268 SCVVHLQKVSLARRFALVPLGP----------PLL-------------------AYSSKC 298
           S       V + R  A+ P G           PL                          
Sbjct: 296 SRAA---DVPVPRPEAIFPSGDDKSRELRVIVPLFDMINHGYDHAPVTPGGVKGGGGEGR 352

Query: 299 KAMLAAVDDAVQLVVD-RPYKAGESIVVWYNISFTGHD----FKCGPQPNSKLLINYGFV 353
           +     VDD+  L+    P +    + +   + F G +    F  G +P+  +L+ YGFV
Sbjct: 353 EKGGVGVDDSPALIPSWDPSR--RMVAIRAGVPFPGPNYEVRFNYGAKPSQHVLLQYGFV 410

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAISDMLPYL 411
             +NP + + V     + D + +  +    R  +LS   + F  +  R     +D+L   
Sbjct: 411 PMNNPDESVEVAMHAGSRD-KLKSLKSELLRTHELSPRERNFQFYPRRLD---ADLLAAT 466

Query: 412 RLGYVS----DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 467
           R+  +S    +++   S   +  P+   +    RA+L + A      L  YP TL ED+ 
Sbjct: 467 RVQMMSELEINSAAAISAAVAGAPVSDRNEAKTRAMLLKAAHDM---LRRYPTTLWEDKK 523

Query: 468 MLTDY 472
           ++ ++
Sbjct: 524 IVQEH 528


>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
          Length = 440

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 70/294 (23%), Positives = 120/294 (40%), Gaps = 39/294 (13%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK      +  LKE+   +   RP  +      + A + LQ G+   S+P   ++T + V
Sbjct: 29  HKLEFIELRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTV 88

Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           + +  + + +T  T  +S L  L  +L+ E   G KS W PY+  L +          + 
Sbjct: 89  IKS-YLGDYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TC 139

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L+  E ++  L   P K +  E+   ++  +      +    SLF +   D+    F + 
Sbjct: 140 LVCLEPQVVRLLPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTE---DVK-HIFHYH 195

Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVV 313
            F  A+  + +  V   H QK  L+    +  L P L L   S    + AA ++  +   
Sbjct: 196 AFLWAWCTINTRTVYMKHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYE 255

Query: 314 DRPY---KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
            R     K  E + + Y           GP  N +LL+ YGFV  +NP+  + V
Sbjct: 256 IRTTSSCKKYEELFICY-----------GPHDNHRLLLEYGFVAINNPHSAVYV 298


>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 595

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 68/299 (22%), Positives = 130/299 (43%), Gaps = 47/299 (15%)

Query: 113 VAASEDLQAGDA------AFSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
           + A  DL++ +A        ++P+ LV++ E V      +    +LL    + S    + 
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202

Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           LYL+ +  Q  +      + W  YIR L R        +  P +W+E E   L G+  +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            +  +   + +E++ L  V        + +P+         E+  + +V V +     + 
Sbjct: 256 ALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRC 307

Query: 276 VSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWYNISF 331
           + L R   A+VP G  ++ +SSK  A     D D V L++    P ++GE + + Y  + 
Sbjct: 308 LELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDA- 365

Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
                    +P S++L +YGF+D +N  D+L +      +DP  + K  ++     L++
Sbjct: 366 ---------KPASEMLFSYGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415


>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
 gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
          Length = 510

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 95/397 (23%), Positives = 155/397 (39%), Gaps = 81/397 (20%)

Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
            A +A  L++ K+QG +S   P+I +L    G        P+LW E ++A L      A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187

Query: 217 ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
           + E+    ++ Y +L               D  W M  S  +   +  P    T +   +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245

Query: 262 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 294
                A +A  + V+ L   QK   A                   +++A+ PL   L  +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304

Query: 295 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
           SS  ++ +A     D+  +V  R +K GE + + Y           G Q N  L+  YGF
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISY-----------GAQSNDSLMQYYGF 353

Query: 353 VDEDNPYDRLVVEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 410
            + +NP D  V+   L   T        R+ A +   L+  +  V   R     S+ L  
Sbjct: 354 AEANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQA 412

Query: 411 LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 470
           +R    +D SE  + +SS       SP  E  + + +A+  +  L    ++L ED A+L+
Sbjct: 413 VRFLLAAD-SEAGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLS 469

Query: 471 DYNLHPKKR---------VATQLVRMEKK-MLNACLQ 497
                   R         VA    R+EKK +L A LQ
Sbjct: 470 STGASAGGRKGGTAAAAAVAAVAFRVEKKRLLTAVLQ 506


>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
          Length = 424

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 79/357 (22%), Positives = 140/357 (39%), Gaps = 64/357 (17%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK---LSELACLALYLMYEKKQG 171
           A+  + +G+    +P  L+++ +    +  +  +   N+     +   LAL+L+ E    
Sbjct: 43  AAAAVASGEPMLCIPRRLLISEDLCWRDPQLGRVFQDNRDVFTRDDPVLALFLVRELLLA 102

Query: 172 KKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTKAEILER-AEGIKR 226
            +SF+ PY            LAV    ES   W++ EL  L          ER  +   R
Sbjct: 103 DRSFFHPY------------LAVLPYPESVQDWTQAELGELHD--------ERLVDAAAR 142

Query: 227 EYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
             +E+D  +       Q +YP + P   +TF+ FK A+  +Q+        +  RR    
Sbjct: 143 RTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA-------RTFGRRLPWT 195

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           ALVP    L   +   K      D+ +     R Y +G +        F  +    G + 
Sbjct: 196 ALVPFADCLNHTNVATKYDFDVNDNGLF----RLYPSGATSFAQGAEVFNSY----GRRS 247

Query: 343 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 402
           N +LL++YGF   DN +D + VE   +   P          R  KL      V   R+  
Sbjct: 248 NFQLLLDYGFALPDNEWDYVDVEIGKDRAGP----------RGRKLRFMKRVVRIDRQS- 296

Query: 403 AISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
           ++ ++ P   L  ++D       SE  + +S    +C     +   +++ +AD+  A
Sbjct: 297 SLDELFPPSFLAGLADPVPDEEQSEAAAELSERTALCDALEWLRSILIETIADWGTA 353


>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
          Length = 437

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 101/415 (24%), Positives = 174/415 (41%), Gaps = 65/415 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
           V  +ED+      FS+P   V++++ +  N  +  +    +L+   E   LA+ L+YEK 
Sbjct: 47  VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +G KS W  +I  L R          + L +   EL  L GS       +  E +  +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159

Query: 229 NELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 282
             L     +   LF+  P     D+  E F+ E +K A   + S       V +A++   
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVAKQSFK 214

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWYNISFTGHDFKCG 339
           A+VP+   +L +  + + M    D + Q   LV  + + AG  + + Y           G
Sbjct: 215 AMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINY-----------G 261

Query: 340 PQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVH 396
           P  N KLL  YGFV   NP+D   VE  L  ++     +Q+K  +   NG         H
Sbjct: 262 PLSNHKLLALYGFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DH 312

Query: 397 AGREKEAISD-----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLAD 449
           A    E ++D     +L   R+  +      E + + +       +S   E+  L +L  
Sbjct: 313 ATNPFELVADESNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIY 372

Query: 450 YFKARLAGYPATLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 498
             +  L  +P ++ ED+ +L      TD NL+  +R+A  + R +K +L+  + +
Sbjct: 373 TLEKMLESFPTSIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425


>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
 gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
          Length = 499

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 66/296 (22%), Positives = 122/296 (41%), Gaps = 56/296 (18%)

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE------LLTTNKLSELACLALYLM 165
           ++ A+ED++  +  F +P   ++    VL +  ++E      +L    +     L + ++
Sbjct: 42  FILATEDIKTDELLFEIPRESILN---VLTSSLVSEYPAWENILLDGDVGHWEGLIICML 98

Query: 166 YEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAE 216
           +E K  K  S W PY   L        L     + W+  EL  L         G+    +
Sbjct: 99  FEIKVKKNMSKWAPYFDVLPESTDLNSL-----MYWTAEELEALKPSLVLDRIGNDGAHQ 153

Query: 217 ILERAEGIKREYNELDTV-----------WFMAGSLFQQYPYDI---PTEAFTFEIFKQA 262
           + E+   + R + +  +V           +    S+   Y +D+   PT A   E   + 
Sbjct: 154 MHEKVMELIRTFEKDHSVDLSFGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEV 213

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
              V+  V +   +      +++PL   L + ++KC A L   +D++++      KAGE 
Sbjct: 214 EEDVEQTVRNEGSLK-----SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQ 268

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYD--RLVVEAALNTEDPQY 375
           +   YNI         G  PN+++L  YG+V+ E + YD   L +E  + T   QY
Sbjct: 269 V---YNIY--------GNHPNAEILRRYGYVEWEGSKYDFGELPLEVIIETLHEQY 313


>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 719

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 72/323 (22%), Positives = 134/323 (41%), Gaps = 47/323 (14%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+  +G     +   E PS +E    +    A  D+  GD    +P++L+++      + 
Sbjct: 26  WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81

Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
               +  L T  L     LALY+M E  + ++SF+ PY+R L           ES LL  
Sbjct: 82  KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141

Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
           + +L   T + ++ ++L     + RE     T+ F++ S    YP     + +TFE+F  
Sbjct: 142 DHKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDF 187

Query: 262 AFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DR 315
           A+  +Q+        +  +R    ALVP    L   + + K       +    +    + 
Sbjct: 188 AWRTIQA-------RAFGKRLKSSALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNN 240

Query: 316 PYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTED 372
            Y     ++  Y           G + N  LL++YGF   DN +D   V  +L   + + 
Sbjct: 241 RYPRNSEVLNSY-----------GRRANDNLLLDYGFAMLDNEWDAAEVICSLPPSHDQS 289

Query: 373 PQYQDKRMVAQRNGKLSVQVFHV 395
           P  + ++   + +G+ +V++  V
Sbjct: 290 PLDRRRKACLRASGQHTVRILRV 312


>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 984

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 60/275 (21%), Positives = 107/275 (38%), Gaps = 54/275 (19%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET---IAELLTTNKLSEL------ACLALYLM 165
           A+ED+  G+   S+P  LV+T E    +E    +A  L   +   L      A L  YL+
Sbjct: 32  ATEDILPGEELCSIPVRLVLTTEIARKSEVGRLVAAHLNAVQGERLRVSAGRAILCAYLI 91

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
           +++   + +FW PY+R L +   R            + ++ +L G+     + E+ + I+
Sbjct: 92  HQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQIR 139

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
             ++       +  +L   +P   P + FT++ F   F A  S           R F   
Sbjct: 140 ESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSS-----------RSFPQT 182

Query: 286 PLGPPLLAYSSKCKAM-LAAVDDAVQLVVDRPYKAGESIVVWY------NISFTGHDF-- 336
            +  P    S+      L  +D+ +   +D         + W        + F   D   
Sbjct: 183 LVQQPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVE 242

Query: 337 -------KCGPQPNSKLLINYGFVDEDNPYDRLVV 364
                    GP+ N +LL+ YGF  EDN  D +++
Sbjct: 243 KGTEAFNNYGPKGNEELLMGYGFCIEDNEQDYVMI 277



 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
           V A++ + AG A  ++P  L++T++  L    + E L      L E   L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555

Query: 171 GKKSFWLPYI 180
           G+ S W P++
Sbjct: 556 GQASPWYPFL 565


>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
 gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
          Length = 511

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 76/373 (20%), Positives = 143/373 (38%), Gaps = 49/373 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A++D+ A      VP   + + E++   E  +    T   +    LA  L+ EK +G  S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  +         + L ++  ++  L G+   +  L +   I R+Y +L   
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------FALVPL 287
            +   S  +            +E+++ A   V +   +L    +A +        AL+P 
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNSPISALIPC 285

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
                    K  +   +    ++       KAG    ++Y           G +PN+ LL
Sbjct: 286 WDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYY-----------GDRPNADLL 334

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----E 402
           ++ GFVD +N  D + +   L+  D        +A++  +L  ++   H G  +     E
Sbjct: 335 VHNGFVDPNNNKDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPE 387

Query: 403 AIS-DMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 455
            IS  +L ++R+  +S        S+++  +  L   C +   +E          FK  L
Sbjct: 388 YISGQLLAFVRVFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLL 447

Query: 456 AGYPATLSEDEAM 468
               ATL E + +
Sbjct: 448 GVLEATLREADEL 460


>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 492

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 108/289 (37%), Gaps = 62/289 (21%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A   +  G+  FS+P  LV++ E    N  +  LL+ + L EL     L L ++YE  
Sbjct: 50  VVAQSAIVEGEELFSIPRDLVLSTE----NSKLKSLLSQD-LGELGPWLSLMLVMIYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 221
             ++S W PY R             ++ + WS  EL  L GS    +I  +         
Sbjct: 105 LREQSAWAPYYRIFPEN-------FDTLMFWSPAELQELQGSAIVDKIGRQGAEESILQM 157

Query: 222 ---------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQ 261
                          +G+     E  T   +      GSL   Y +DI       +   +
Sbjct: 158 IAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHVMGSLIMAYAFDIEKVNDEDDEDNE 217

Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
                 +     Q         +VPL   L A + +  A L   +D++ +   +P  AG+
Sbjct: 218 GEDGYMTDEEEDQSSK-----GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGD 272

Query: 322 SIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 369
            I   Y           G  P S LL  YG+V D   PYD  V+EA+L+
Sbjct: 273 EIFNDY-----------GELPRSDLLRRYGYVTDNYAPYD--VIEASLD 308


>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
 gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
          Length = 474

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 76/322 (23%), Positives = 123/322 (38%), Gaps = 66/322 (20%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
           +W+ ++G+     I  E   + +  R    V A++D+   +  F +P + ++++E  +L 
Sbjct: 13  AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E  A   T   L     L L ++YE   G  S W PY   L  +         + + WS
Sbjct: 70  TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120

Query: 202 ETELAYLTGSPTKAEI-------------------------------LERAEGIKREYNE 230
           E ELA L  S    +I                                +RAE ++ E N 
Sbjct: 121 EDELAELQASAVLNKIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNV 180

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           L  +    GSL   Y +D+   A + +   +   A +     L K        ++PL   
Sbjct: 181 L--LMHKMGSLIMAYAFDVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADM 230

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
           L A +    A L   +  +++   +P +AGE +   Y           GP P S LL  Y
Sbjct: 231 LNADADCNNARLFYEEKYLEMKALKPIRAGEEVFNDY-----------GPLPRSDLLRRY 279

Query: 351 GFV-DEDNPYDRLVVEAALNTE 371
           G+V D    YD + +   L TE
Sbjct: 280 GYVTDNYAQYDVVEINMDLVTE 301


>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
          Length = 823

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 92/421 (21%), Positives = 161/421 (38%), Gaps = 100/421 (23%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL---VVTLE 137
           + +W+ ++G     V +K  P +         + +S++   GD   S+P  L    + L 
Sbjct: 4   ITTWVKEHGGHIDGVYVKNFPVYGNG------LCSSKEFHEGDTLLSIPYHLQLNTIELH 57

Query: 138 RVL-----GNET--IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
            V      G E   + E    N+  E + + LYL    K  +K F  PYI  L       
Sbjct: 58  NVFESMVPGFEVPRLGEG-AKNRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT----- 110

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
                 PL +SE EL  L G+    ++L   E  K    +L   +    +L  QYP    
Sbjct: 111 --TFSCPLSYSENELKMLKGT----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP---- 157

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV---PLGP-----PLLAYSSKCKAML 302
           T    F+ F Q  V          +V  +R F ++   P+G      P   +S+      
Sbjct: 158 TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYPDPIGDVASLIPFADFSNH----- 205

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ--------PNSKLLINYGFVD 354
              +  V  V +R  +        +++        CG Q        PN K+L+ YGFV 
Sbjct: 206 -NTETKVTYVSNRQTQT-------FSLQTNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVI 257

Query: 355 EDNPYDRLVV-----EAALNTEDPQYQDKRM--------------------VAQRNGKLS 389
            +NPYD +++     E     +  + ++ +M                    + Q   +  
Sbjct: 258 SENPYDEVLLRINFKERHFEKQVEESEESKMEVENKENERMEVEEEDNEDEITQILKREG 317

Query: 390 VQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC-MERAVLDQ 446
           V  F  +  REKE  +D+L  LR+  +S  + ++    +  L  + P++     R++++Q
Sbjct: 318 VDRFDYYLTREKELPTDLLRVLRIVNLSLVEANQYSQALLDLSYVSPINEIKATRSLMEQ 377

Query: 447 L 447
           +
Sbjct: 378 I 378


>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
 gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
          Length = 508

 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 94/414 (22%), Positives = 151/414 (36%), Gaps = 64/414 (15%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           + +SW+   GL    ++L+         R    + AS  L  G+    +P+ LV+T ER 
Sbjct: 24  EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
            G  ++  LL       LA        ++     + W PY+  L ++ G       + L 
Sbjct: 78  AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPG-------TLLD 124

Query: 200 WSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           W   E+   L GSP +             + EL+ +    G      P  +P      E 
Sbjct: 125 WPAKEVQQLLRGSPLQRLADSITSAASASWRELEPL-IAQGRADGLVPEHVPLSKGDLEW 183

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---LAYSSKC----------------K 299
              AF  + S  + L   S      L P    L   ++    C                 
Sbjct: 184 ---AFGVLLSRCIRLP--SRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGD 238

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV--DEDN 357
               A   A+ L  DRPY AG+ + V Y           GP+ + +LL++YGF      N
Sbjct: 239 RAGGATKGALVLRADRPYAAGQQVYVSY-----------GPKSSGELLLSYGFCPPPASN 287

Query: 358 PYDRLVVEAALNTE-DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLG 414
           P+    +  A++ + DP    K     R+G  S   F +      E +   L +L  R  
Sbjct: 288 PHQDCRLRVAVDRQGDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPK 347

Query: 415 YVSDTSEMQSVISSLGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 466
              +T E+ SV+   G   P+    +  V  L  L++   A L  YP ++  D+
Sbjct: 348 VAQETFELASVLFESGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400


>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
          Length = 254

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 43/201 (21%), Positives = 96/201 (47%), Gaps = 21/201 (10%)

Query: 310 QLVVDRPYKAGESIVVWYNISFTGHDFKC----GPQPNSKLLINYGFVDEDNPYDRLVVE 365
           Q+  D   ++GE  +++Y++ FT  + +     G + +++ L+  GFV   NP++ + + 
Sbjct: 32  QVTTDFDPESGE--LIFYSMEFTPKNTQILMDYGKRTSAEFLMFSGFVPATNPHNNVRIV 89

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 425
             ++  D     +  + +     S  + H+    +  ++SD + + R+ +V D+ ++ + 
Sbjct: 90  LGVSKSDQLSSKREQLLELIALQSPLILHITG--DLSSLSDAIAFARV-FVMDSDQLDAH 146

Query: 426 IS---------SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
           +S            P+CP  P  ++A+   L   F+  ++ Y   +SEDE      NL P
Sbjct: 147 LSMTTSALHALRTSPLCPGDPIDDQAIA-FLIMRFELLVSAYGPMVSEDEVGYE--NLTP 203

Query: 477 KKRVATQLVRMEKKMLNACLQ 497
            +R   +L   E ++L +C++
Sbjct: 204 IQRYCERLRVQEVQILRSCIE 224


>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
 gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
          Length = 593

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 86/411 (20%), Positives = 156/411 (37%), Gaps = 59/411 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+ D++AG+   SVP  L+ + E +   +     L  N  + L  +   L+ EK +G  S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEELLPEKQ---RQLFRNFPTHLK-VTYTLIMEKLRGADS 261

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W P+I  L  +         + L ++  ++  L G+   +  +     I R Y  +   
Sbjct: 262 PWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARLYASMYKC 314

Query: 235 WFMA---------GSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
            FM           +LF  Y   Y++   A +    +Q  V  Q        + ++   A
Sbjct: 315 AFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDAANLPIS---A 371

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           L+P        S K  +        ++      YK+GE   ++Y           G + N
Sbjct: 372 LIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYY-----------GDRSN 420

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAG 398
           +  L++ GFVD  NP D + +   L+  D   + + ++      +R  +L V     H  
Sbjct: 421 ADRLVHNGFVDMQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHIS 480

Query: 399 REKEAISDMLPYLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 452
            E      +L ++R+  +S        S+++  +  L   C +   +E      L    K
Sbjct: 481 GE------LLAFVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLK 534

Query: 453 ARLAGYPATLSED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
             L    ATL E       EA+    +      +  Q  R+E+++L+  LQ
Sbjct: 535 LLLGVLEATLKETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585


>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
           23]
          Length = 462

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 83/403 (20%), Positives = 162/403 (40%), Gaps = 45/403 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P+ L  T++    +  +   L + +  LS    LA+++++ +  
Sbjct: 28  VKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFVRS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
            ++S +      ++R       +  S + +++ EL    G+       +  + I+ +Y +
Sbjct: 87  -RESGYDGLRSHVERLPA----SYSSSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRD 141

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           L         +  QYP   P + FT   +K A  AV S  +  Q    +    L P    
Sbjct: 142 LVV------RVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD- 194

Query: 291 LLAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
           +L +SS   +C    A+  D + ++  + Y+AG+ + + Y           G  PN +LL
Sbjct: 195 MLNHSSESKQCHVYDASSGD-LSVLAGKDYEAGDQVYIHY-----------GSIPNHRLL 242

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 407
             YGF+   NP D   +  A +   P ++ K+ +    G  S     +        ++D 
Sbjct: 243 RLYGFIIPGNPNDSYDLVLATHPLAPFFELKQKLWALAGLDSTCTISL-------TLTDP 295

Query: 408 LP-----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 462
           LP     YLR+  + D S++ S+         +S   E  VL  L +   + L  +   L
Sbjct: 296 LPKNVIRYLRIQRL-DESDLASIALGQAADEKISNSNEVQVLQSLVESIASLLGSFGTRL 354

Query: 463 SEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
            + E  L    Y +      A  +   E+++L    +   D++
Sbjct: 355 EKLEEQLATGVYPVGGNAWAAAHVSLGEQRVLKLAKKKAEDLL 397


>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
           98AG31]
          Length = 509

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 59/227 (25%), Positives = 94/227 (41%), Gaps = 35/227 (15%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           A+VPL   L A S    A L      + +   +  + GE I   YN             P
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQI---YNT--------YADPP 308

Query: 343 NSKLLINYGFVDEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 392
           N+ LL  YG VD++NP+D          RL  E +L+  DPQ Q+     +   K +++V
Sbjct: 309 NADLLRRYGHVDDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEV 367

Query: 393 FHV--------HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
             +         + RE KE + D L  +    +S   E Q+   S G +    P M   +
Sbjct: 368 SDIDEIFMLPTKSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPI 424

Query: 444 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 490
                     RL  Y  T+  D+ +L D +L  ++++ +  VR+ +K
Sbjct: 425 AQLAIQILSNRLNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 61/282 (21%), Positives = 110/282 (39%), Gaps = 48/282 (17%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+ ++ T+E    +  +   L + +  LS    LA Y+++ +        
Sbjct: 34  FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  LA       S + ++E EL   TG+   A   +    I+ +Y  L
Sbjct: 87  ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
                    L  Q+    P   FT E +K A   V S  +       + + L   FA   
Sbjct: 143 VV------RLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193

Query: 287 LGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
               +L +SS  +   A   +   + ++  + YKAG+ + ++Y           G  PN+
Sbjct: 194 ---DMLNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIYY-----------GSIPNN 239

Query: 345 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
           +LL  YGF+   NP D   +    +   P ++ K  + +  G
Sbjct: 240 RLLRLYGFIIPSNPNDNYELVLETHPMAPFFEQKHKLWESAG 281


>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
          Length = 470

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 80/197 (40%), Gaps = 32/197 (16%)

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
           S + +SE EL    G+       +  + IK +Y +L      A  LF Q+P   P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160

Query: 256 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 302
            E           +K A   V S  +       +    L P    +L +SS   +C A  
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
               D + +   + Y+ G+ + ++Y           GP PN++LL  YGFV  DNP D  
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYY-----------GPIPNNRLLRLYGFVIPDNPNDSY 267

Query: 363 VVEAALNTEDPQYQDKR 379
            +    +   P Y+ K+
Sbjct: 268 DLVLTTHPMAPFYEQKQ 284


>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
 gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
          Length = 504

 Score = 47.4 bits (111), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 90/419 (21%), Positives = 160/419 (38%), Gaps = 76/419 (18%)

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
           AG+   +VP S+   +        + EL+    +SE     LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168

Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           Y+  L DR          +PL ++  ++  L  +      L+  + I R+Y  +      
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCVVHLQKV--------SLARR 281
               F Q   D   + FT+++F+            AV + +    KV         +   
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEFDGMDHT 272

Query: 282 FALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
            AL+PL        P  A  ++C A     A ++ ++  + R      S+ ++       
Sbjct: 273 LALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIF------- 325

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
                G + +++ L++ GFV   NP+  +     L    P Y+++  + +  G  +   F
Sbjct: 326 --IVYGTRTDAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTF 383

Query: 394 HVHAGREKEAISDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSP 437
                RE  A +   P     + L  VS  +              +  + +   + P   
Sbjct: 384 SFGPAREPAAATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAEL 443

Query: 438 C--MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 494
           C   ER     LA   K  L  YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 444 CARTER----WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 497


>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 966

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 71/330 (21%), Positives = 127/330 (38%), Gaps = 71/330 (21%)

Query: 71  VSKKEEDLGDLKS---WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
            ++K + L  L+S   W+H NG+    + +K+     +    I    ++  +  G+    
Sbjct: 548 TAEKLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIF---STRQIHKGECLVK 604

Query: 128 VPNSLVVTLERVLGNETIA-----ELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYI 180
           +P      L+ +L N+T A      ++ +N L  ++ + + +  + ++     S W PY 
Sbjct: 605 IP------LKLILSNDTSAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYF 658

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L R           P+L S  +LA  TG+    E++     + R+Y  L  +      
Sbjct: 659 DLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQHI------ 705

Query: 241 LFQQYPY-DIPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL------- 287
            F+  P   IP   FTF  F  A   V +     C  +     + +   L+PL       
Sbjct: 706 -FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHK 764

Query: 288 -GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
            G     + +K  C   +A+ D             GE I + Y           G + N 
Sbjct: 765 PGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHY-----------GKRSNQ 803

Query: 345 KLLINYGFVDEDN-PYDRLVVEAALNTEDP 373
           ++L+  GFVD  N  YD + +  ++   DP
Sbjct: 804 EMLLYSGFVDPTNIEYDHIKLSVSIPQSDP 833


>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
 gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
          Length = 394

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 68/274 (24%), Positives = 109/274 (39%), Gaps = 57/274 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV------------------LGNETIAELLTTNKL 154
           V A E ++AG+    VP   ++ +E+                   +G++ I  +  T  L
Sbjct: 4   VRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTAAL 63

Query: 155 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
               C      YE+       W P +R L  +         S L W+  +L  + G    
Sbjct: 64  DAFEC-GDASAYEE-------WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLA 108

Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHL 273
             + E    +K +Y+ L    F A  L +Q P   P  AF  +  F++A+    S  + +
Sbjct: 109 NRLREYRRSVKVQYDAL----FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKV 162

Query: 274 QK-VSLARRFALVP----LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYN 328
           Q   SL  R  +VP        L A+S +  ++      A +L + R    GE+I + Y 
Sbjct: 163 QDPDSLQIREVIVPGVFLCNHSLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISY- 220

Query: 329 ISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRL 362
                     G   N+ LL+ YGF  E+NPYDR+
Sbjct: 221 ----------GRLDNADLLMFYGFSLENNPYDRV 244


>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 753

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 39/238 (16%)

Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
           S+P S ++T+E  +G  T I   + T+ L   A     L +Y+++++K  G+ SF+ PY 
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
           + L       +     P+ W+  EL  L GS    +I +RAE IK +Y  + ++    G 
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICSIAPEFGD 252

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CK 299
           +             T E F+ A + V S    L  ++  R  ALVP    L     +  K
Sbjct: 253 I------------ATLEEFQWARMIVCSRNFGLL-INGHRTSALVPHADMLNHLRPRETK 299

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
              +    +  +   +    GE +       F  +  KC    N + L+NYGF  E N
Sbjct: 300 WTFSEESQSFTITTLQEIGMGEQV-------FDSYGQKC----NHRFLLNYGFCVERN 346


>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
 gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
          Length = 440

 Score = 47.0 bits (110), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 73/330 (22%), Positives = 136/330 (41%), Gaps = 50/330 (15%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
           L+ W+++ G     +I            P+++      + A++ ++A ++  S+P   ++
Sbjct: 37  LRRWLNERGFTSQSLI------------PVNFHDTGRGLMATQTIKAKNSVISLPEECLL 84

Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           T   VL +  +A+ +      +S L  L  +L+ E+  G+ S W PYI  L +       
Sbjct: 85  TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
               PL + +  +  L  S  K +  ++ E  +  ++   T +     LF Q     PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190

Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 308
             F+ +  + A+ +V +  V   H Q   L+R   +  L P  L   + C  +       
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAP-YLDLLNHCPNV------Q 243

Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY-----DRLV 363
           V+   ++  +  E   V     F       GP  N +LL+ YGFV   NP+     D   
Sbjct: 244 VEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLET 303

Query: 364 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           ++  L+ +D Q ++K +  + N  L    F
Sbjct: 304 LKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333


>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
           2508]
 gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 459

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 60/272 (22%), Positives = 113/272 (41%), Gaps = 42/272 (15%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P  ++ T++    +  +   L + +  LS    LA Y+++ K        
Sbjct: 34  FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S +L++E +L    G+       +  + I+ ++  L
Sbjct: 87  ----RESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
                    LF Q+P   P + FT E +K A   V S  +      LA   ++  L P  
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193

Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
            +L ++S+ K   +       + ++  + Y+AG+ + + Y           GP PNS+LL
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGDQVFINY-----------GPVPNSRLL 242

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
             YGFV   NP D   +  + + + P ++ K+
Sbjct: 243 RLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274


>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 466

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 59/276 (21%), Positives = 110/276 (39%), Gaps = 38/276 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P++L  T++    +  +   L + +  L+    LA+Y+++ +  
Sbjct: 28  VKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLAVYVLFVRS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                     RE      R  +A       S + ++E EL    G+       +  + I+
Sbjct: 87  ----------RESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRIE 136

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
            +Y +L         +    P   P   FT   +K A   V S  +  +    +    L 
Sbjct: 137 DDYKDL------IARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMDFELYDGSSMRLLA 190

Query: 286 PLGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
           P    +L +SS+ K   +  A    + ++  + Y+AG+ + + Y           G  PN
Sbjct: 191 PFAD-MLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHY-----------GSIPN 238

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
           S+LL  YGFV  DNP D   +  A +   P ++ K+
Sbjct: 239 SRLLRLYGFVIPDNPNDSYDLVLATHPMAPFFEQKQ 274


>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
 gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
          Length = 517

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 46/210 (21%), Positives = 87/210 (41%), Gaps = 24/210 (11%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           +++ E+  L + +MYEK +G++S W PY+  +             PL W   E   L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232

Query: 212 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 269
               +++ + +       ++  +W  +     Q++P  ++P     +++++ A  AV S 
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWY 327
              L         A+VP+   L   + +    L   A    + ++  R    GE +V  Y
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349

Query: 328 NISFTGHDFKCGPQPNSKLLINYGFVDEDN 357
                      G   N++LL  YGFV+  N
Sbjct: 350 -----------GELSNAELLRGYGFVEARN 368


>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 229

 Score = 47.0 bits (110), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           D   W+ K G     +   E  +  E  R    V A  +++ G    +VP  L+++    
Sbjct: 5   DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58

Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
             +   AE LL TNK  +     LA +L++E  +G++SFW PY+  L RQ
Sbjct: 59  KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108


>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
 gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 459

 Score = 47.0 bits (110), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 60/272 (22%), Positives = 112/272 (41%), Gaps = 42/272 (15%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P  ++ T++    +  +   L + +  LS    LA Y+++ K        
Sbjct: 34  FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S +L++E +L    G+       +  + I+ ++  L
Sbjct: 87  ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
                    LF Q+P   P + FT E +K A   V S  +      LA   ++  L P  
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193

Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
            +L ++S+ K   +       + +   + Y+AG+ + + Y           GP PNS+LL
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINY-----------GPVPNSRLL 242

Query: 348 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
             YGFV   NP D   +  + + + P ++ K+
Sbjct: 243 RLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 274


>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 485

 Score = 47.0 bits (110), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 70/317 (22%), Positives = 120/317 (37%), Gaps = 56/317 (17%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +  SW+ +   P  KV  K K +          + A +D+   +  F++P +LV++ +  
Sbjct: 19  EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
             N ++ +LL  N+       CL + ++YE  QG  S W  Y + L           ++ 
Sbjct: 75  --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
           + W++ EL  L+GS     +L +      E N L  +  +       +P       +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSP 181

Query: 251 TE----------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                             A+ F+I K      +    ++          +VPL   L A 
Sbjct: 182 EGKAALLSLAHRMGSLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSKGMVPLADLLNAD 241

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV- 353
           + +  A L   D  + +   +P + GE I   Y           G  P + LL  YG+V 
Sbjct: 242 TDRNNARLFQEDCYLSMRSIKPIRKGEEIFNDY-----------GELPRADLLRRYGYVT 290

Query: 354 DEDNPYDRLVVEAALNT 370
           D    YD   VE ++ T
Sbjct: 291 DNYAQYDE--VEISMRT 305


>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 502

 Score = 47.0 bits (110), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 104/474 (21%), Positives = 174/474 (36%), Gaps = 127/474 (26%)

Query: 118 DLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSEL----ACLALYLMYEK 168
           D+  G   F++P +L ++     L  +LG       L   K  EL    A L L +M+E+
Sbjct: 37  DIPEGHTLFTLPRNLTLSTRTSALPGLLG-------LDEWKQHELHIGWAGLILCMMWEE 89

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI----------- 217
            QG  S W  Y+  L         + ++P+ WS  +L  L G+    +I           
Sbjct: 90  AQGASSRWSTYLASLPS-------SFDTPMFWSPDDLEELKGTSVVDKIGRDGAEEDYRS 142

Query: 218 -----------LERAEGIKREYNELDTVWFMAGSLF------------------------ 242
                      L   E + R Y+ L+    M   +                         
Sbjct: 143 KVVPTLQSRPDLFAPEALSRHYS-LENYHLMGSRILSRSFSVERWEGHAADKQEDSASSP 201

Query: 243 -----QQYPYDIPTEAFTFE-------IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                +    D+ TEA T         + + +FV         +        A+VP+   
Sbjct: 202 VADTGRDEAMDVDTEAVTATAPEAEDGVDEPSFVVDDENDSDDEDEEDPANVAMVPMADM 261

Query: 291 LLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L A Y S+   +    +D ++++  +P   GE I   Y           G  PNS LL  
Sbjct: 262 LNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTY-----------GDPPNSDLLRR 309

Query: 350 YGFVDE--------DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSV 390
           YG VD          NP D  +VE     A  +  +   Q     A+R       G   V
Sbjct: 310 YGHVDLVPLPNGDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDV 367

Query: 391 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 450
            +   +     E   +++P+ RL   S +   ++   S  P   V    + ++L  +A+ 
Sbjct: 368 FILETN----HELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANA 419

Query: 451 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
            + RLA YP ++ ED+ +LT+  +L+ K  V  +L   EK++L+  L    + +
Sbjct: 420 LERRLAEYPTSVEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471


>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 797

 Score = 46.6 bits (109), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 13/88 (14%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
           R AL+P+   L   ++ C    +   +A  +  DR Y+AGE +   Y           G 
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSY-----------GA 639

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAAL 368
             N  LL  YGFV  DNP+D+L ++  L
Sbjct: 640 HSNDFLLAEYGFVLPDNPWDQLCLDKVL 667


>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
          Length = 469

 Score = 46.6 bits (109), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 84/382 (21%), Positives = 144/382 (37%), Gaps = 76/382 (19%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           L     L L ++YE  QG  S W PY   L +Q        ++P+ WS+ EL  L G+  
Sbjct: 91  LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
            AE + + E      +++  +     ++F       PTE     +  +    + +    L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203

Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           +                       +VP+   L A +++  A +   +      +    KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDK 378
           G+ I+ +Y           GP P S+LL  YG+V  E + YD + V   L  E       
Sbjct: 263 GDQILNYY-----------GPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE------- 304

Query: 379 RMVAQRNGKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSE--------MQSVISSL 429
             V   +  LS + +  V +  + E I D     R     D+ E          +V+  +
Sbjct: 305 --VIVSSLSLSAEAWKQVESQIDDEEIEDYFVIER-----DSGEPGPDGRFTAPAVLREV 357

Query: 430 GP---------ICPVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLT 470
            P         +  V       + D+          +A+  K RLA YP ++  DE +L 
Sbjct: 358 SPELVEQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLA 417

Query: 471 DYNLHPKKRVATQLVRMEKKML 492
           + +L  ++R+A  +   EKK+L
Sbjct: 418 EADLPARRRMAVVVRLGEKKLL 439


>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
 gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
          Length = 692

 Score = 46.6 bits (109), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 48/217 (22%), Positives = 83/217 (38%), Gaps = 46/217 (21%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++++E+ +G+ S W PY+  L R          S L + + +L +L G+        
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               ++ EY+   ++    G L          E++ ++IF  A+  + S      +   +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
           R         P L    + + ML  VD +                + L V  P  +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268

Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
                     H+   GP  N +L+  YGF   DNP D
Sbjct: 269 ----------HN-NYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 82/401 (20%), Positives = 150/401 (37%), Gaps = 52/401 (12%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+S++ T+E    +  +   L + +  LS    L  YL++ +        
Sbjct: 34  FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S + ++E EL    G+       +  + I+ ++  L
Sbjct: 87  ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF Q+    P + F+ E +K A   V S  +  Q         L P    +
Sbjct: 143 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 195

Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +SS+ K   +       + ++  + Y+ G+ + + Y           G  PNS+LL  
Sbjct: 196 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINY-----------GSVPNSRLLRL 244

Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
           YGFV   NP D   +  + + + P Y+ K  +    G  S     +        ++D LP
Sbjct: 245 YGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLP 297

Query: 410 -----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
                YLR+     +      + +      VS   E  +L  L + F   L G+   L +
Sbjct: 298 KNVLRYLRIQRADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEK 357

Query: 465 DEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 503
            E  L    Y+       A  +   E+++L    +   D++
Sbjct: 358 LEEQLAQGVYSPGGNAWAAAHVSLGEQRVLRLAKKRAEDLL 398


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score = 46.6 bits (109), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 77/365 (21%), Positives = 137/365 (37%), Gaps = 50/365 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+S++ T+E    +  +   L + +  LS    L  YL++ +        
Sbjct: 43  FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 95

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S + ++E EL    G+       +  + I+ ++  L
Sbjct: 96  ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 151

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF Q+    P + F+ E +K A   V S  +  Q         L P    +
Sbjct: 152 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 204

Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +SS+ K   +       + ++  + Y+ G+ + + Y           G  PNS+LL  
Sbjct: 205 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINY-----------GSVPNSRLLRL 253

Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
           YGFV   NP D   +  + + + P Y+ K  +    G  S     +        ++D LP
Sbjct: 254 YGFVIPGNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLP 306

Query: 410 -----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
                YLR+     +      + +      VS   E  +L  L + F   L G+   L +
Sbjct: 307 KNVLRYLRIQRADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEK 366

Query: 465 DEAML 469
            E  L
Sbjct: 367 LEEQL 371


>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
 gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
          Length = 484

 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL+ G     +P S + +      N +IA LL  +++  +  L +  +YE    K
Sbjct: 40  VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95

Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            KS W PY++ +      G L V  P  WSETE   L GS
Sbjct: 96  EKSHWFPYLKSIRIYNDDG-LLVLPPSHWSETEKLLLKGS 134


>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
          Length = 228

 Score = 46.6 bits (109), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 5   GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 58

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 59  LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 114

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 115 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 162

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 163 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 205


>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
 gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
          Length = 448

 Score = 46.2 bits (108), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 63/293 (21%), Positives = 124/293 (42%), Gaps = 57/293 (19%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWL 177
           ++ G     +P  ++++ + VL +     + +   +L+ +  +  +L+Y+K  G+ SFW 
Sbjct: 65  IKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQAITTFLIYQKHIGETSFWK 124

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L  +          P+ + E +  YL  S  +A I  + +   + Y EL   +  
Sbjct: 125 PYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQECIKSYEELKPFFPS 176

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF--------ALVPL 287
              L   +        FTF+ ++ A+  V  +S  V  +  ++ R          +LVP+
Sbjct: 177 LEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTSLVPM 231

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ------ 341
              LL +S   +           L++ +  K G+    +Y ++    D+K G Q      
Sbjct: 232 -VDLLNHSHSART---------GLLIKKSCKNGD---YFYTVT-AEDDYKRGDQVLFCYR 277

Query: 342 --PNSKLLINYGFVDEDNPYDRL----------VVE-AALNTEDPQYQDKRMV 381
              N  LL+NYGFV  DN  D +          ++E      EDP+++ ++++
Sbjct: 278 RADNQTLLLNYGFVLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVL 330


>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
 gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
          Length = 986

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 67/292 (22%), Positives = 115/292 (39%), Gaps = 70/292 (23%)

Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
           V A+E++      GD  FS+P + ++T      + T   + EL   ++ + +   L  +L
Sbjct: 45  VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
             E+++G  S W PYIREL            +PL WS  E   L G+        R  G 
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT--------RLGGA 149

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
            +         F   +L Q     +P  AF   +  Q  ++  +  +    +SLA+    
Sbjct: 150 TK---------FHDCALLQLTEVCVP--AFIAILRAQLILSANTKAIASGAISLAQD--- 195

Query: 285 VPLGPPLLAYSSKCKA---------------------MLAAVDDAVQLVVDRPYKAGESI 323
             L P  LA+S  C +                     ML    DA Q+       AG+ +
Sbjct: 196 -ALSPDRLAWSHSCVSSRAFSLFLNGQRTIALVPLGDMLDHSPDA-QIEWRTDDTAGQFL 253

Query: 324 VVWYN------ISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
           ++ ++      I F  +    G + N +L++ YGF  + +  + L V  A++
Sbjct: 254 IISHDRLPAGSIMFNNY----GAKSNEELILGYGFFMKSSVLETLYVRLAVD 301


>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
 gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
          Length = 607

 Score = 46.2 bits (108), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 59/238 (24%), Positives = 95/238 (39%), Gaps = 41/238 (17%)

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           +A  +   +L+  A LAL++++E     +S    Y+  L    G+   +V  PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193

Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           +A L G+PT   +L RA+ +   +  L                 +P E F +     A  
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAAL---------FGSGGGGGVPLEKFAW-----ALS 239

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG-ESI 323
           +V S      ++     +A +P G  LL +          VD   +L   +    G E  
Sbjct: 240 SVLSRAASGDRMP----YAFLP-GVDLLNHGG--------VDANCELSAVKLAPGGNEEN 286

Query: 324 VVWYNISFTG-HDFKCGPQ---------PNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
           V W ++  T   D   G Q          N +LL  YGF    N +DR  +E  L  +
Sbjct: 287 VTWGDVEVTCVKDTPAGEQLTISYGDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344


>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
          Length = 386

 Score = 46.2 bits (108), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 60/254 (23%), Positives = 108/254 (42%), Gaps = 22/254 (8%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSF 175
           +++ GD   S+P S ++T   VL N  +   + + K  LS L  L ++L+ E+ +G+ S 
Sbjct: 65  NVKPGDMLISLPESCLLTTSTVL-NSYLGSFIKSWKPHLSPLLALCVFLVCERHRGEASD 123

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PYI  L +       +   P  +++  +A L  S  + +  E+ E + RE +  +  +
Sbjct: 124 WFPYIDVLPK-------SYTCPAYFTDEVMALLPPS-VQRKAREQREAV-REIHSSNKAF 174

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAY 294
           F +       P +   +  T+E  + A+ +V +  V  L   +   R   V    P L  
Sbjct: 175 FRSLQPVLTQPAE---DVLTYEALRWAWCSVNTRSVFMLHSSNDFLRGQDVYALAPFLDL 231

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
            + C       D  V+   +   K  E   V   + +       G   N +L++ YGFV 
Sbjct: 232 LNHCP------DVQVKASFNEETKCYEIRSVSRMLQYQQAFINYGSHDNQRLMLEYGFVA 285

Query: 355 EDNPYDRLVVEAAL 368
             NP+  + V+  L
Sbjct: 286 PCNPHSVVYVDKDL 299


>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
          Length = 440

 Score = 46.2 bits (108), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 66/266 (24%), Positives = 108/266 (40%), Gaps = 23/266 (8%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
           LQ G    S+P S ++T + V+ +   A +      LS L  L  +L+ EK  G +S W 
Sbjct: 68  LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L   P KA+  E+   ++  +      +  
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK 297
              LF +    I    F++  F  A+  V +  V+++     RR       P   A +  
Sbjct: 180 LRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYMK----PRRRRCFSAEPDTYALAPY 231

Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD--FKC-GPQPNSKLLINYGFVD 354
              +  + D  V+   +   +  E   V    S   H+  F C GP  N +LL+ YGFV 
Sbjct: 232 LDLLNHSPDVQVRAGFNEETRCYEIRTV---SSCRKHEEVFICYGPHDNQRLLLEYGFVS 288

Query: 355 EDNPYDRLVVEAALNTEDPQYQDKRM 380
             NP+  + V   +  +     DK+M
Sbjct: 289 IHNPHACVYVSKDILVKYLPSTDKQM 314


>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
          Length = 320

 Score = 46.2 bits (108), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 70/154 (45%), Gaps = 17/154 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +  G+    VP  L++   + L   ++  LL   +L +  C+ L LM E      S
Sbjct: 32  ASKRISPGETFLKVPRHLLLGPHQ-LRASSLDRLLEGWQLPD--CMLLLLMCESVN-SSS 87

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F+ PY+  L          V++P+ WS+ E   L GSP     ++    + R + E    
Sbjct: 88  FFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE---- 136

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
             M   +F +YP   P   F++E ++ A+  ++S
Sbjct: 137 --MKDKVFDKYPDRFPPLLFSYERYQWAYSILRS 168


>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 316

 Score = 46.2 bits (108), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 47/192 (24%), Positives = 75/192 (39%), Gaps = 33/192 (17%)

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PY+R L R        + + + W   EL  +  S    E +ER E   +E++ +
Sbjct: 38  KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF +            E F  A     S +V  +    +R  +L+P     
Sbjct: 93  KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFA-DF 137

Query: 292 LAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +     ++L      D  +++ DR Y  GE ++V Y           G   N+ L +N
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRY-----------GKYSNAMLALN 186

Query: 350 YGFVDEDNPYDR 361
           +GF    N YD+
Sbjct: 187 FGFTLPRNIYDQ 198


>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
 gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
          Length = 513

 Score = 46.2 bits (108), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 59/267 (22%), Positives = 109/267 (40%), Gaps = 39/267 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A++DL+ GD A  +P S++++ E V  ++    L   + ++    L L+ M E+      
Sbjct: 149 ATKDLKVGDIALEIPVSIIISEEHVHKSDMYHILEKIDGITSETMLLLWSMKERHNCSSK 208

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L  +   G       L +    +  L G+    EI++  E ++ +Y+EL   
Sbjct: 209 FKI-YFDTLPEEFKTG-------LSFGVDAIMALDGTLLLEEIMQAKEHLRVQYDEL--- 257

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                 L + YP     E +T+E F  A     S  + +  V    R  L+P+       
Sbjct: 258 ---VPPLCKNYPDVFLPELYTWEQFLWACELWYSNSMKVMFVDGKLRTCLIPIAGFLNHS 314

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
             P ++ Y       + +  + ++  + RP   GE   + Y           G   +S L
Sbjct: 315 LYPHIVHYGK-----VDSATNTLKFPLTRPCCFGEQCCLSY-----------GNFSSSHL 358

Query: 347 LINYGFVDE-DNPYDRLVVEAALNTED 372
           +  YGF+ + DNP D + ++  +   D
Sbjct: 359 ITFYGFMPQGDNPCDVIPLDIDVGDAD 385


>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 674

 Score = 46.2 bits (108), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 89/212 (41%), Gaps = 34/212 (16%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  +G + FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 94  FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148

Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 272
              K +Y    T    AG     QY +D+         + AF+ ++  +A   V+   + 
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205

Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
              VS+     L+P    L+ +    K    A    V  VV     +G+ I   Y     
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNY----- 254

Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
                 GP+ N +L++NYGF   DNP D  +V
Sbjct: 255 ------GPRNNEQLMMNYGFCLPDNPCDYRIV 280


>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 640

 Score = 45.8 bits (107), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 119/313 (38%), Gaps = 50/313 (15%)

Query: 64  VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
            A     VS     L  L +W+   GL   ++    +P  N+      Y+ AS  ++A  
Sbjct: 156 AAAQLAAVSTPRGALARLTAWIDNAGL---EINSNARPGLNDVDE--LYLFASNPIEAAT 210

Query: 124 AAFSVPNSLVV--TLERVLGNETIAELLTTNKLSELA------CLALYLMYEKKQGKKSF 175
              +VP  LV+  T  R L N  I  L    +   ++       LA+ L+YE  +  KS 
Sbjct: 211 LVATVPAPLVMFETYLRTLENPMI--LAIDRRFKTMSVPDPSYALAMALLYESYE-PKSM 267

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDT 233
           W  +I  L +        ++S + WS  E   L   P   K +ILER   +++ YN    
Sbjct: 268 WREWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA--- 315

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLL 292
                  L   +P+      +++E+FK A++ V S  +          +  L PL   L 
Sbjct: 316 ---TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLH 372

Query: 293 AYSSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLL 347
               +    L    + V      L   R  K GE +V              G  PN +LL
Sbjct: 373 HDPVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLV-----------RHIGELPNHQLL 421

Query: 348 INYGFVDEDNPYD 360
           + +G     NPY+
Sbjct: 422 LRFGLAMPRNPYE 434


>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
          Length = 1313

 Score = 45.8 bits (107), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 91/226 (40%), Gaps = 30/226 (13%)

Query: 283  ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
            A+VP+   L A      A L    D +Q++  +P   GE I   Y           G  P
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTY-----------GDPP 1108

Query: 343  NSKLLINYGFVDE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQV 392
            NS LL  YG+VD         +P D + + A    E  +   YQD+       G     V
Sbjct: 1109 NSDLLRRYGYVDALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFV 1168

Query: 393  FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFK 452
              V       ++ D +  L    + +  + +   S      P  P ++    + L    +
Sbjct: 1169 LDV-----TYSVPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQ 1220

Query: 453  ARLAGYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 497
             RLA YP +L+E E ML   N L+ K+R A  +   E+++L+  L+
Sbjct: 1221 KRLAMYPISLTEQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266


>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
 gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
          Length = 440

 Score = 45.8 bits (107), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 74/328 (22%), Positives = 128/328 (39%), Gaps = 44/328 (13%)

Query: 39  NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
           N G     +RR+  F+    SS+     SR V    + +  +LK W+         +I  
Sbjct: 3   NGGGRTSRIRRRKLFT----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDTTLIPA 53

Query: 99  EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
             P           + +   LQ G    S+P S ++T + V+ +   A +       S L
Sbjct: 54  HFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
             L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L  +P K + 
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
            E+   +   ++     +     LF +    I    F++   + A+ AV +  V++++  
Sbjct: 160 WEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMKRPP 215

Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
            + L+       L P   LL +S   +   A  ++     +    + G+   V       
Sbjct: 216 LLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEV------- 268

Query: 333 GHDFKC-GPQPNSKLLINYGFVDEDNPY 359
              F C GP  N +LL+ YGFV   NP+
Sbjct: 269 ---FICYGPHDNHRLLLEYGFVCVSNPH 293


>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
          Length = 292

 Score = 45.8 bits (107), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273


>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
           troglodytes]
 gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
 gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
 gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
 gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
          Length = 307

 Score = 45.8 bits (107), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 102/261 (39%), Gaps = 33/261 (12%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
            LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S 
Sbjct: 67  SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
                LF +    I    F++     A+  V +  V+L   Q+  L+       L P   
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233

Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
           LL +S   + KA       + ++     ++  E + + Y           GP  N +L +
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFL 282

Query: 349 NYGFVDEDNPYDRLVVEAALN 369
            YGFV   NP+  + V    N
Sbjct: 283 EYGFVSVHNPHACVYVSRGWN 303


>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
 gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
 gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
 gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
 gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
 gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
 gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
          Length = 296

 Score = 45.8 bits (107), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273


>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 607

 Score = 45.8 bits (107), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           L  LK W  +N +   +  ++ +   N   R    V A + L+ GD   ++P   +++++
Sbjct: 4   LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
               N  +A++L    L     L + LM+E+  G+KS W  YI+ L        L    P
Sbjct: 60  ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108

Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
           L W + + A L G+   A +LE   + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141


>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 441

 Score = 45.8 bits (107), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 7/99 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A++ LQ G+    +P  L ++    + ++    L   N+L +   +AL+LM E+ +  
Sbjct: 39  VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            SF+ P+I+ L  Q        + P+ W++++ A L G+
Sbjct: 99  DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEGT 130



 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 85/197 (43%), Gaps = 19/197 (9%)

Query: 339 GPQPNSKLLINYGFV--DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 396
           G   ++KLL +YGFV  +E N ++ + +   +   DP ++ K+ + + N     Q +   
Sbjct: 247 GSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAATRDQTYDFR 306

Query: 397 AGREKEAISD-MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 453
                + + +  L   R+  +S  +  E +    S      VS   E AV   + D  + 
Sbjct: 307 GTIHLDDVDERFLASFRIILLSQEEFREYEKAFDS----TIVSVRNELAVYAAIHDVCEK 362

Query: 454 RLAGYPATLSEDEAMLTDYNLHPKKRVATQL-VRME-KKMLNACLQVTADMIMLL----- 506
           RLA +P +L +D   L +  ++   R    + VRME KK+L +  ++  +   LL     
Sbjct: 363 RLARFPTSLEDDLKKLAELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEWRNLLENDSN 422

Query: 507 ---PDVTVSPCPAPYAP 520
              PDVT    P    P
Sbjct: 423 IYPPDVTRQQQPQLSMP 439


>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
          Length = 439

 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 73/328 (22%), Positives = 127/328 (38%), Gaps = 44/328 (13%)

Query: 39  NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
           N G     +RR+  F     SS+     SR V    + +  +LK W+         +I  
Sbjct: 3   NGGGRTSRIRRRKLFR----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDATLIPA 53

Query: 99  EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
             P           + +   LQ G    S+P S ++T + V+ +   A +       S L
Sbjct: 54  RFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
             L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L  +P K + 
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
            E+   ++  ++     +     LF +    I    F++   + A+  V +  V++++  
Sbjct: 160 WEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMKRPP 215

Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
            + L+       L P   LL +S   +   A  ++     +    + G+   V       
Sbjct: 216 QLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEV------- 268

Query: 333 GHDFKC-GPQPNSKLLINYGFVDEDNPY 359
              F C GP  N +LL+ YGFV   NP+
Sbjct: 269 ---FICYGPHDNHRLLLEYGFVSVSNPH 293


>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 701

 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           WM + G+    V +   P      R      A+ D+  GD   SVP   ++T E  +  +
Sbjct: 37  WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90

Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            + +  L  +++S       E   L L ++ E  +G +S + PY+  L   R     A  
Sbjct: 91  ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145

Query: 196 SPLLWSETELAYLTGS 211
           SPL WS  ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161


>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 45.8 bits (107), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 62/270 (22%), Positives = 105/270 (38%), Gaps = 42/270 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS------ELACLALYLMY 166
           V A+ DL+ G+   SVP+  V+T++     + + E +            +   L + +M 
Sbjct: 57  VRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEFVGDGDDEAPSPRLDKELLVIAVMC 116

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------PTKAEILE 219
           E   GK S W  Y+  +      G     S L W + ++  L G+           E L+
Sbjct: 117 EMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDETLD 172

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKVSL 278
                +  +  + T++F    LF +    +  EA         A VA  S  +   ++  
Sbjct: 173 LPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEIQ- 228

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWYNISFTGH 334
               A+VP    +L ++  C+A +    D     +Q++  R  K GE +   Y       
Sbjct: 229 ----AMVPFW-DMLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTY------- 276

Query: 335 DFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
               GP  N++LL  YGFV   NP+    V
Sbjct: 277 ----GPLRNAELLRRYGFVLPRNPHGGTTV 302


>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 452

 Score = 45.8 bits (107), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 63/285 (22%), Positives = 111/285 (38%), Gaps = 46/285 (16%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L  W +KNG+     I     S       +    A+  ++ G+   +VP +L  +++ V 
Sbjct: 66  LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122

Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            +   + I E      L + + +AL L+YE   G KS +  YI+ L R    GQ   + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L WS  E   L  S T+       + +  +Y      W  + +L +     +  ++F   
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 231

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
            F+ A   V S     +      R A      PLL  +++ +          +       
Sbjct: 232 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 282

Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
           ++++ DR    G+ IV+ Y           GP+   + L ++GFV
Sbjct: 283 LKVIADRDTDKGQEIVISY-----------GPKSGIEFLEDHGFV 316


>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
 gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
 gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
          Length = 307

 Score = 45.8 bits (107), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 102/261 (39%), Gaps = 33/261 (12%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
            LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S 
Sbjct: 67  SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
                LF +    I    F++     A+  V +  V+L   Q+  L+       L P   
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233

Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
           LL +S   + KA       + ++     ++  E + + Y           GP  N +L +
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFL 282

Query: 349 NYGFVDEDNPYDRLVVEAALN 369
            YGFV   NP+  + V    N
Sbjct: 283 EYGFVSVHNPHACVYVSRGWN 303


>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
          Length = 284

 Score = 45.4 bits (106), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273


>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
          Length = 355

 Score = 45.4 bits (106), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 45/267 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-------------LGNETIAELLTTNKLSELAC 159
             A +D+  G+   ++P+ +++  ERV             + +++I+       LSE   
Sbjct: 75  TTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNK 134

Query: 160 LALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
           + +Y    MY  K  K+SFW PY   +          + SP+ WSE EL  L GS     
Sbjct: 135 VLMYFLLQMYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNM 185

Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
                + ++  Y+EL         +F+        +AFT + +  A     S V+ L + 
Sbjct: 186 ARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQ 239

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTG 333
           +        +PL   +    S+ K  +       A  +  DR    G  +   Y      
Sbjct: 240 TGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESY------ 293

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYD 360
                G + N + L+  GFV EDNP D
Sbjct: 294 -----GNKSNYEYLLYNGFVMEDNPND 315


>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 446

 Score = 45.4 bits (106), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 63/285 (22%), Positives = 111/285 (38%), Gaps = 46/285 (16%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L  W +KNG+     I     S       +    A+  ++ G+   +VP +L  +++ V 
Sbjct: 60  LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116

Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            +   + I E      L + + +AL L+YE   G KS +  YI+ L R    GQ   + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L WS  E   L  S T+       + +  +Y      W  + +L +     +  ++F   
Sbjct: 172 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 225

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
            F+ A   V S     +      R A      PLL  +++ +          +       
Sbjct: 226 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 276

Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
           ++++ DR    G+ IV+ Y           GP+   + L ++GFV
Sbjct: 277 LKVIADRDTDKGQEIVISY-----------GPKSGIEFLEDHGFV 310


>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
          Length = 475

 Score = 45.4 bits (106), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 67/301 (22%), Positives = 120/301 (39%), Gaps = 58/301 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A  DL+ G+    VP S ++T E V+ ++ + + +  +  LS    L + L+YE  +G
Sbjct: 55  LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           K S W PY+  L           +   ++ E E   L       ++ E     ++   + 
Sbjct: 115 KTSRWHPYLMHLPH-------TYDVLAMFGEFEKHAL-------QVDEAMWVTEKAMLKA 160

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
            + W  A SL Q   +    + FTF+ +  A   + S  +H   +       L P+G  L
Sbjct: 161 KSEWKEAHSLMQDLMF--KPQFFTFKAWVWAAATISSRTLH---IPWDEAGCLCPVG-DL 214

Query: 292 LAYSSKC--KAMLAAVDDAVQL---------------------VVDRPYKAGESIVVWYN 328
             Y +     + +  +D A QL                          YK G+ +++ Y 
Sbjct: 215 FNYDAPGIEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCY- 273

Query: 329 ISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNG 386
                     G   N +LL +YGF+ ++NP D++ +  E AL +    +  + +    NG
Sbjct: 274 ----------GTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSS-TSWSKESLYIHHNG 322

Query: 387 K 387
           K
Sbjct: 323 K 323


>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
          Length = 468

 Score = 45.4 bits (106), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 52/210 (24%), Positives = 81/210 (38%), Gaps = 40/210 (19%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  + K S+W PYI  L +       A+  P +W E ++  L  +     + E  
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161

Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
             IK+EY      L  V F     + Q  Y      FT   F+ + +  Q    H+    
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217

Query: 278 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
                 L P G          PLL       +S+ +  L  VD   QL+ +  Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNL-EVDGTCQLICNNAYQPGQQV 272

Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
              Y           G + NS+LL+ YGF+
Sbjct: 273 FNNY-----------GLKSNSELLLGYGFI 291


>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
          Length = 766

 Score = 45.4 bits (106), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 39/164 (23%), Positives = 67/164 (40%), Gaps = 23/164 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
           V A ED++  +A   VP  L++T+E    +  I  +   NK          E   L +++
Sbjct: 27  VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           +YE ++G +SFW PY+  L          VE    WS++++  +       E       I
Sbjct: 87  LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
           + E+ E++        L   YP    +      +F + F  V S
Sbjct: 139 EIEWKEIEL-------LLLHYPQHFSSALIDKHLFMRIFAQVCS 175


>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 72/319 (22%), Positives = 123/319 (38%), Gaps = 62/319 (19%)

Query: 81  LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
            + WM  NG+    C++    +PS N ++      A  ++ Q        P  L +T   
Sbjct: 18  FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71

Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           VL +  +     +L+   ++ +   + L+L+ E+ +G+ SFW PY+  L  + G      
Sbjct: 72  VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125

Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
            +PL +SE EL+ L G+   +A   +   G+      LD     A S+F     +IP   
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDR----ANSVFWTRALNIPCP- 179

Query: 254 FTFEIFKQAFVAVQSCVVH----------LQKVSLARRFALVPLGPPLLAYSSKCKAM-- 301
                F   F        H             V +     +  L P +   +   KA+  
Sbjct: 180 ---HSFNNRFAVDLDSTTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVAL 236

Query: 302 ---------LAAVDDAVQLV--VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
                    +  V +++ LV  +D     G  I + Y           G + N +LL  Y
Sbjct: 237 WEVDGPEGSVTGVPNSMYLVTGLDVVISNGSEIFISY-----------GNKSNEELLYLY 285

Query: 351 GFVDEDNPYDRLVVEAALN 369
           GFV  +NP D L+V + + 
Sbjct: 286 GFVLVENPDDYLMVRSTIG 304


>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloropl [Ectocarpus
           siliculosus]
          Length = 460

 Score = 45.1 bits (105), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 65/281 (23%), Positives = 112/281 (39%), Gaps = 46/281 (16%)

Query: 84  WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
           W+ K+G  L    V+    P   E+      + A++ ++ G +  ++P SL +T    L 
Sbjct: 18  WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71

Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
           +  IA+ +        E   +AL +++E+ QG+ S   P+I  L ++   G+L  E PL 
Sbjct: 72  SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W E +L     S T+        G   + +E D  W ++ + F ++P   P + F    F
Sbjct: 127 WGEADLTLADASSTRG-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDF 179

Query: 260 KQAFVAVQSCVVHLQK-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
           + A     S    +         V  A   +L  +  P    +     +  +   AV L 
Sbjct: 180 RWAVGVALSRSFFVDGELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLR 233

Query: 313 VDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
             + Y+ GE   V Y           GP+  +  L   GFV
Sbjct: 234 AGKNYEEGEEFFVSY-----------GPKGAAGYLEENGFV 263


>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
           gorilla]
          Length = 440

 Score = 45.1 bits (105), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 114/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G++S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGRRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 45.1 bits (105), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 63/151 (41%), Gaps = 17/151 (11%)

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
           LE  E +++   E    W    + F+    D+P E +T+        +  +      K  
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFK 337
              R AL+P+          CK   +A  D   +V DR YK GE +     IS++ H   
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELF----ISYSSHS-- 220

Query: 338 CGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
                N  +L+ YGF+ +++  D + ++ A+
Sbjct: 221 -----NDYILLEYGFIPDESLDDDVYIDDAV 246


>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
           reading frame 18 (C21orf18)) [Danio rerio]
          Length = 440

 Score = 45.1 bits (105), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 72/330 (21%), Positives = 135/330 (40%), Gaps = 50/330 (15%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
           L+ W+++ G     +I            P+++      + +++ ++A ++  S+P   ++
Sbjct: 37  LRRWLNERGFTSQSLI------------PVNFHGNGRGLMSTQTIKAKNSLISLPEECLL 84

Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           T   VL +  +A+ +      +S L  L  +L+ E+  G+ S W PYI  L +       
Sbjct: 85  TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
               PL + +  +  L  S  K    ++ E  +  ++   T +     LF Q     PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQKKAT-QQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190

Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 308
             F+ +  + A+ +V +  V   H Q   L+R   +  L P  L   + C  +       
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAP-YLDLLNHCPNV------Q 243

Query: 309 VQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY-----DRLV 363
           V+   ++  +  E   V     F       GP  N +LL+ YGFV   NP+     D   
Sbjct: 244 VEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLET 303

Query: 364 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           ++  L+ +D Q ++K +  + N  L    F
Sbjct: 304 LKVGLDEKDKQLKEKLLYLKDNDFLRNLTF 333


>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
 gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
          Length = 442

 Score = 45.1 bits (105), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 50/206 (24%), Positives = 80/206 (38%), Gaps = 31/206 (15%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  +GK SFW PY+  L          +  P  W E ++AYL  +     I E  
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170

Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 273
             +K E+ +    L    F   + +    Y      FT   F+ + +   +   H+    
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230

Query: 274 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWY 327
            Q V L   F+++    PLL  ++     + + D     DA  LV    Y  G  +    
Sbjct: 231 PQSVEL-DDFSIL---QPLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQV---- 282

Query: 328 NISFTGHDFKCGPQPNSKLLINYGFV 353
              F  +  K     NS+LL+ YGF+
Sbjct: 283 ---FNNYGLKT----NSELLLGYGFI 301


>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
 gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
           SB210]
          Length = 726

 Score = 45.1 bits (105), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 31/188 (16%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLM 165
           AA++D+    A  S+PN ++++ +R   +E       +E L + K ++ A    L ++ M
Sbjct: 67  AATKDIAPLTAFISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFM 126

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
           YE+ +GKKS W  Y   L+          E+ L W+  E+  +     + +         
Sbjct: 127 YERLKGKKSLWHEYFEILENN--------ETILTWTAEEINRIPDPYIQKQA-------- 170

Query: 226 REYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 283
           REY E +D +W     L    P        T E+F  A+  V S C  + QK +     +
Sbjct: 171 REYKEQVDELWDELKELLHSQPNFFQKATATKELFLWAYNIVMSRCFGYTQKGT-----S 225

Query: 284 LVPLGPPL 291
           +VP    L
Sbjct: 226 IVPFADCL 233


>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
 gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
          Length = 402

 Score = 45.1 bits (105), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 107/230 (46%), Gaps = 46/230 (20%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
           +S    LAL+L+ ++  G KS W  ++  L DR+ G     ++ PL WS+ +   LT  P
Sbjct: 79  MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132

Query: 213 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 270
               +L++  +  + +Y++  T  F+A     +Y  D P +A+ +     A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179

Query: 271 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 318
                ++  ++ A      + L P   L+ +S     + C+   +++   + L   R Y 
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238

Query: 319 AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
           A E I + Y           GP+ NS LL  YGF   +NP+D + +  AL
Sbjct: 239 ADEEIFLCY-----------GPRSNSVLLCEYGFTVPENPWDDVDISDAL 277


>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
          Length = 748

 Score = 45.1 bits (105), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
           V A++D+ A  A   VP +L+++ E+  L + +I      EL   N+ S  E   L  YL
Sbjct: 46  VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
             EKK+G++SF+ PYI+ +            + + W++ EL+ +       E    +E +
Sbjct: 106 FNEKKKGEQSFFYPYIQAIQTNN--------TLIDWTKEELSQIEDPIVLDEFAIVSEDL 157

Query: 225 KREYNELDTVWFMAGSLFQQY 245
           K        +W  A  +F ++
Sbjct: 158 K-------VLWNYAQDIFNEF 171


>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 502

 Score = 45.1 bits (105), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 98/234 (41%), Gaps = 38/234 (16%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           A+VP+   L A      A L   +  +++V  +P +AGE I   Y           G  P
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTY-----------GDPP 305

Query: 343 NSKLLINYGFVD----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKL 388
           NS LL  YG VD            NP D + V A L     ++  +Y  +  V     + 
Sbjct: 306 NSDLLRRYGHVDVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEA 365

Query: 389 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 448
              VF +    + E   +++ + RL ++S+   +++   S  P   V P     VL    
Sbjct: 366 EDDVFILRT--DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAI 419

Query: 449 DYFKARLAGYPATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 497
           D   ARL  YP ++ EDE +L+       +L+ K  V  +L   EK++L   L+
Sbjct: 420 DVLSARLKEYPTSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471


>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1188

 Score = 45.1 bits (105), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 49/212 (23%), Positives = 85/212 (40%), Gaps = 31/212 (14%)

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           PL W++ EL +L G+     I ER   ++ ++  +  V      L ++ P   P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298

Query: 257 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
           E +  AF    S    L+  V+         +G P+      C   L  + D +      
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPM---KEPCVECLLPLLDMMN----- 350

Query: 316 PYKAGESIVVWYNIS-----FTGHDFK--------CGPQPNSKLLINYGFVDEDNPYDRL 362
            ++ G SI  W+        FTG   +         GP+ N +LL+ YGF   +N  D +
Sbjct: 351 -HQFGASI-TWFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHV 408

Query: 363 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 394
            ++  +   DP  + K  + + +G       H
Sbjct: 409 KIQLTVGN-DPDGEAKLAILRWHGLSLTHFLH 439


>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
 gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
          Length = 479

 Score = 45.1 bits (105), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 14/126 (11%)

Query: 97  LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           LK    H + H  IH        A   +   +  F +PN L+++++       +   L  
Sbjct: 24  LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            +L     L + ++YE  QG++S W PY R L         + ++ + W++ +L+ L GS
Sbjct: 82  KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134

Query: 212 PTKAEI 217
               +I
Sbjct: 135 AVVGKI 140


>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score = 45.1 bits (105), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 45.1 bits (105), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 106/267 (39%), Gaps = 39/267 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A EDL  GD    +P +++++ E V  +     L     +     + L+ M EK      
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMMLLWSMKEKHIVDSE 248

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L         A  + L +    +  L G+    E+++  E ++++YNEL   
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
            F A  L   +P   P E +++E F  A     S  + +       R  LVP        
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
           L P +L Y       + +  D+++  + RP +AGE   + Y           G    S L
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSY-----------GNYSGSHL 398

Query: 347 LINYGFVDE-DNPYDRLVVEAALNTED 372
           +  YGF+ E DN  D + ++     +D
Sbjct: 399 VTFYGFLPEGDNVNDVIPLDIDFGDDD 425


>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
 gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
           Full=SET domain-containing protein 7
 gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
 gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
 gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
 gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
 gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
 gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
 gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
          Length = 515

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 494

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 419

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 110/265 (41%), Gaps = 49/265 (18%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV--LGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           A+ D Q GD    +P  L++   R   L N   A +     L +   +AL++ ++K    
Sbjct: 49  ATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIV----ALKQHPSIALFIAWQKIHPT 104

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
              W PYI  L R      L ++  LL     L Y        +I E A   K + ++LD
Sbjct: 105 PE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLD 149

Query: 233 TVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RFAL 284
           T +    +      Y+ IP +     IFK A++ V +  + +   ++++       +  +
Sbjct: 150 TDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPI 204

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
           + L P L   +    A ++A  D V+    +    PYK G  + + Y           GP
Sbjct: 205 ITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINY-----------GP 253

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVE 365
             N+ LL  YGF    NP++ +V++
Sbjct: 254 HDNNFLLAEYGFAILKNPFNHVVLD 278


>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
          Length = 805

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)

Query: 69  EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           E  +K+ EDL +   W+ +NG+   KV L+    H       + + A + ++  +  F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMG----NSLYARQMIKEDELLFRI 595

Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YE-------- 167
           P  +    + V  + T+  ++   ++      E   L+L LM        YE        
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655

Query: 168 ---KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
              K   + SFWLPYI+ L +          +P+ W+E E   L GS     + +     
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708

Query: 225 KREYNELDTV 234
           +RE+  +  V
Sbjct: 709 RREWEMMKIV 718


>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
          Length = 440

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 63/292 (21%), Positives = 114/292 (39%), Gaps = 44/292 (15%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
           L+ G    S+P S ++T + V+ +   A ++      S L  L  +L+ EK  G +S W 
Sbjct: 68  LREGQMIISLPGSCLLTTDTVIRSSLGAYIIKWKPPPSPLLALCTFLVSEKHAGDQSVWK 127

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       +   P+   E E+  L   P KA+  E+   +++ +      +  
Sbjct: 128 PYLDILPK-------SYTCPVC-LEPEVVNLLPEPLKAKAEEQRMSVQQFFASSRDFFSS 179

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS- 296
              LF++    +    F++     A+  V +  V+L+     RR   + L P   A +  
Sbjct: 180 LQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLSLEPDTCALAPY 231

Query: 297 ----------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
                     + KA         ++     Y+  + + + Y           GP  N +L
Sbjct: 232 LDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFICY-----------GPHDNHRL 280

Query: 347 LINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           L+ YGFV   NP+      R ++   L + D Q   K  + + +G L    F
Sbjct: 281 LLEYGFVSLCNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLENLTF 332


>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
          Length = 515

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
 gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
          Length = 401

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 10/119 (8%)

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           ++AE L   +L     L + +M+E+  G+ S W  Y   L     RG+  +  P+ W+  
Sbjct: 38  SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92

Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           +L +L G+     + E AE ++ ++NE      +   L   +P   P    T E + +A
Sbjct: 93  QLEHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146


>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 509

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 62/232 (26%), Positives = 93/232 (40%), Gaps = 34/232 (14%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           A+VP+   L A      A L   +  +++V  +P KAGE I   Y           G  P
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTY-----------GDPP 312

Query: 343 NSKLLINYGFVDE----------DNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KL 388
           NS LL  YG VD            NP D + V A L       + K  + +R      + 
Sbjct: 313 NSDLLRRYGHVDLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEA 372

Query: 389 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 448
              VF +    + E   +++ ++RL  +      ++   S  P     P +++ VL    
Sbjct: 373 DDDVFVLRT--DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAV 426

Query: 449 DYFKARLAGYPATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 497
           D  + RL  YP TL EDEA+        L   KR A  +   EK++L   L+
Sbjct: 427 DVLEKRLKDYPTTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478


>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
 gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
          Length = 236

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 80/195 (41%), Gaps = 36/195 (18%)

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           + S W PYI  L    G     +++  LW +TEL+YL  SP   +  ER E I  E+ ++
Sbjct: 74  QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                +   LF +          + E F   +  V S     + +++     LV +  P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172

Query: 292 LAYSSKCKAMLAAVDDAVQL---VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
           L + +   A  A +     L   VV       E+  +W N         CG   N++L +
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWIN---------CGDLSNAELAL 223

Query: 349 NYGFVDEDNPYDRLV 363
           +YGF     P +RL+
Sbjct: 224 DYGFT---VPENRLI 235


>gi|402862437|ref|XP_003895567.1| PREDICTED: SET domain-containing protein 4 [Papio anubis]
          Length = 456

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 67/305 (21%), Positives = 113/305 (37%), Gaps = 54/305 (17%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTADTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARR--------FALV 285
               LF +    I    F++     A+  V +  V+L   Q+  L+            L 
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPACLC 234

Query: 286 PLGP------------PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
           P  P            PL   S + KA       + ++     ++  E + + Y      
Sbjct: 235 PFSPLCSSGLQNLMLHPLSGLSQQVKAAFNEETHSYEIRTTSRWRKHEEVFICY------ 288

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKL 388
                GP  N +L + YGFV   NP+      R ++   L + D Q   K  + + +G +
Sbjct: 289 -----GPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYI 343

Query: 389 SVQVF 393
               F
Sbjct: 344 ENLTF 348


>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
 gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
          Length = 471

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 53/241 (21%), Positives = 102/241 (42%), Gaps = 47/241 (19%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
           A++L+ +   G++SFW PYI+ L +       A+  PLLW E++L +L G+  +  +  +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           + + +KR        W  A    Q+Y +D P++ FT E+   A+    S           
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYS----------- 207

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAA---VDDAVQLV--VDRPYKAGESIVVWYNIS---- 330
           R F  + L P +     + + ++ A   +DD  +++  +       +     YN+     
Sbjct: 208 RYFWSIILEPDVANIKPEFQHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGL 267

Query: 331 --------------FTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 376
                         +  +D +     N+ LL ++GF+  DN    LV+ +  +   P + 
Sbjct: 268 SVSKNIELKPGDPFYIAYDKETQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHL 327

Query: 377 D 377
           D
Sbjct: 328 D 328


>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
          Length = 1658

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 81/365 (22%), Positives = 143/365 (39%), Gaps = 44/365 (12%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
            V  ++ ++  +   SVP   ++ ++    +  +  +L      L++   L L+++YEK +
Sbjct: 1212 VVTTKKVEENECVVSVPRKFLINVDCARKHPVLNSILFEEATGLNDDTILFLFVIYEK-E 1270

Query: 171  GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
               SFW P+   L            + + ++ TEL  L G+     + E    IK     
Sbjct: 1271 NPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQIKEHLES 1319

Query: 231  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
            +  + F    L  QYP   P   FT E F  A     S  + L K+       LVP    
Sbjct: 1320 IRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCLVP---- 1372

Query: 291  LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES----IVVWYNISFTGHDF-KCGPQPNSK 345
                      M   ++   Q  + + Y   E+    ++   NI  T   F + G   + +
Sbjct: 1373 ----------MADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWE 1422

Query: 346  LLINYGFVDEDNPYDRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 404
            L + YGFV  +N YD + +   +  ED P+ ++++        L+V   ++H        
Sbjct: 1423 LALYYGFVISNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN---IP 1479

Query: 405  SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
            S +L  LR+  +++  E    +    PI   S   E  VL  L       L  + +T  E
Sbjct: 1480 SKLLASLRVALLAE-DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSSTCDE 1535

Query: 465  DEAML 469
            D+ +L
Sbjct: 1536 DQQLL 1540


>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 791

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 72/328 (21%), Positives = 122/328 (37%), Gaps = 47/328 (14%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           + D+K W+ +NG     V +K    +         + A+++ +  +   S+P S  + + 
Sbjct: 1   MEDIKKWVIQNGGVIDGVDVKTFEGYGRG------LCANKEFKKDEVIMSIPYS--IQIN 52

Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           R+  N    E+         +   +L  L    +   K   K F  PYI  L        
Sbjct: 53  RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
              + PL ++  EL  + G+   A + E+          +  V +    L QQ+P     
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYF-- 157

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
           ++F  ++FK+   A QS       V   + F  V    P   +S+ C           Q 
Sbjct: 158 QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQT 216

Query: 312 VV------DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
                   +   K GE I   Y I           + N KLL+ YGFV+E+NP D L++ 
Sbjct: 217 ETFSFQTNEELVKPGEQIFNNYRI-----------RSNEKLLLGYGFVEENNPCDNLLLR 265

Query: 366 AALNTEDPQYQDKRMVAQRNGKLSVQVF 393
                +D QY +   + ++    S   F
Sbjct: 266 IYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
          Length = 440

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
          Length = 447

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 91/461 (19%), Positives = 184/461 (39%), Gaps = 85/461 (18%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT-----LER 138
           W   NG    K I+    + +   R +  VA   D++A  + F++P  +V++      + 
Sbjct: 10  WFTTNGGEFSKDIVAIGENVDGMGRGLVAVA---DIKAQTSLFTIPRDIVLSTRTSSFKE 66

Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
            +G +   +L   N +     L + + +E  QG  S W  Y + L +Q         S +
Sbjct: 67  KVGQDVYKQLENDN-IGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLM 118

Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
            WS+ EL+ L G+    +I    E I+ E+  +  +     ++F     DI    +T ++
Sbjct: 119 FWSKEELSLLKGTTVVDKI--GLEDIENEFERVRDIVKQNENVFG----DIAN--YTLDL 170

Query: 259 FKQ--AFVAVQSCVVHLQKV---------------------SLARRFALVPLGPPLLAYS 295
           FK+  + +  +S  V   K                      +     A+VP+   L + +
Sbjct: 171 FKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRT 230

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD- 354
               A     ++ ++++  +  KAG+ I   YN             PN+ L+  YG VD 
Sbjct: 231 DSVNAHTEYEENCLRMISLQDIKAGDQIFNTYN-----------DPPNADLIRRYGHVDY 279

Query: 355 ----ED-----NPYD------RLVVEAALNTEDPQYQDKRM--VAQRNGKLSVQVFHVHA 397
               +D     N  D       +++E AL      ++++R+  +    G+ S ++ H   
Sbjct: 280 SPLSQDPDFMGNKNDVVELPADILLELALPDAKESHKERRVEFLLDECGEDSFELTH--- 336

Query: 398 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 457
               + + ++L    L +    +E ++   S     P +    +   + L    K R+  
Sbjct: 337 ---DDLVPELLKICVLLFTESEAEFKTREKSRK--LPKASGFTKGKAEFLIKAIKQRMEQ 391

Query: 458 YPATLSEDEAMLTDYNLHPKKRVATQLVRM-EKKMLNACLQ 497
           Y +TL +D + L + +  P+      +V + E+++LN  ++
Sbjct: 392 YGSTLEDDISKLDNKDSLPENNFKALVVTVGERRILNKAIE 432


>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 485

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 15/140 (10%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +  SW+ +   P  KV  K K +          + A +D+   +  F++P SLV++ +  
Sbjct: 19  EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
             N  + +LL  N+       CL + ++YE  QG  S W  Y + L           ++ 
Sbjct: 75  --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125

Query: 198 LLWSETELAYLTGSPTKAEI 217
           + W++ EL  L+GS    +I
Sbjct: 126 MFWTDEELRELSGSAVLNKI 145


>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
 gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
 gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
 gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
          Length = 440

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
          Length = 440

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S +   KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
 gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
          Length = 728

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
           +WM K G+    V +   P H  +        A+ D++ GD    VP + ++T +  V G
Sbjct: 51  AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104

Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           +  +A  L    L        E   L L +M E  +G++S + PY+  L   R     A 
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159

Query: 195 ESPLLWSETELAYLTGS 211
            SPL WS  EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176


>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
           troglodytes]
 gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
 gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
 gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
 gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
          Length = 440

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
          Length = 417

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 113/266 (42%), Gaps = 37/266 (13%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDXELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +   YNI      
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY----- 264

Query: 336 FKCGPQPNSKLLINYGFVDED-NPYD 360
              G  PNS+LL  YG+V+ D + YD
Sbjct: 265 ---GEHPNSELLRRYGYVEWDGSKYD 287


>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
          Length = 565

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 77/174 (44%), Gaps = 21/174 (12%)

Query: 311 LVVDRPYKAGESIVVWYNISFTGHDF------KCGPQPNSKLLINYGFVDEDNPYDRL-- 362
           ++V R    G   V+    S T  D         GP  N +LL  YGFV E N +DRL  
Sbjct: 336 VLVRRVEPNGGDFVMQIGASLTAQDIGREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPF 395

Query: 363 ---VVEAALNTEDPQYQDKR--MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 417
                EAA+  E   +  +R  +VA+    L+   +  H GR   A   ++  LR+ +++
Sbjct: 396 PLDFPEAAVGDE---WDGRRAALVAKYGLHLAGCCWICHDGRPPPA---LVALLRV-HLA 448

Query: 418 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
           +  E  ++  + GP   +    E  V   +AD  +  L  +  +L EDE +L +
Sbjct: 449 EAEEFDTMERN-GPFASLGAGTEARVFATIADTIRCILDLFSTSLEEDERLLEN 501


>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
          Length = 440

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 65/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L+   +  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELDTCALAPYLDL 234

Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 724

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 81/331 (24%), Positives = 126/331 (38%), Gaps = 43/331 (12%)

Query: 71  VSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP- 129
           + K  E     + W  ++G+    + L E          +  +AA++++  G+   S+P 
Sbjct: 67  IQKSGEGPLGFQEWALQSGITSPSLRLAEFAG-------LRGMAAADNIAKGEVLVSLPV 119

Query: 130 -NSLVVT-LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
             +LVV+  ER     T       +K      +AL L+YE++ G  S   PY+  L    
Sbjct: 120 AAALVVSPKERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALP--- 176

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------DTVW 235
               +   +PL WSE +L  L       E+  + EG+KR + EL            D +W
Sbjct: 177 ----VDFSTPLSWSEAQLQALCYPQLIREVATQREGLKRLHAELAVSTPGTPITEQDLIW 232

Query: 236 FMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLL 292
            +     + +  PY  PT     + F        + +     ++ A   AL   L   +L
Sbjct: 233 ALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVL 292

Query: 293 AYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWYNISFTGHD--FKCGPQPNSK 345
           +   K  AM   VD       VQ  V+  Y A    V   +    G       G Q N  
Sbjct: 293 SQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSVRCQSYFSKGEQVFISYGKQSNDS 352

Query: 346 LLINYGFVDEDNPYDRLVV----EAALNTED 372
           LL  YGFV+   P+D   +     AAL   D
Sbjct: 353 LLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383


>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
 gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
          Length = 478

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 126/316 (39%), Gaps = 62/316 (19%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           SW+  +G P  KV  K + +          V A  D+  G+  F++P   V++ +    N
Sbjct: 22  SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             + +LL+ +  +L     L L ++YE   G +S W  Y + L R+        ++ + W
Sbjct: 76  SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128

Query: 201 SETELAYLTGSPTKAEILER--AEGIKREYNELDTVWFMAG-SLF------QQYPYDIPT 251
           S +EL  L GS     I++R   EG +    E+      A  SLF        Y  D  T
Sbjct: 129 SPSELQELQGSA----IVDRIGKEGAEESILEMIAPIVRANPSLFPPVDGLASYDGDAGT 184

Query: 252 EAF-------TFEIFKQAF-----------VAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           +A           I   AF              +S  V   +  L++   +VPL   L A
Sbjct: 185 QALLNLAHVMGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSK--GMVPLADLLNA 242

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
            + +  A L   +  + +   +P  AG  I   Y           G  P + LL  YG+V
Sbjct: 243 DADQNNARLFQEETGLVMKAIKPISAGAEIFNDY-----------GEIPRADLLRRYGYV 291

Query: 354 -DEDNPYDRLVVEAAL 368
            D  +PYD  VVE +L
Sbjct: 292 TDNYSPYD--VVELSL 305


>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
           24927]
          Length = 468

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 69/310 (22%), Positives = 122/310 (39%), Gaps = 52/310 (16%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTT--NKLSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+S++ T+E    +  I  +L +    LS    LA+Y+++ +        
Sbjct: 34  FKEGERILTIPSSILWTVEHAYADSIIRPVLQSMQGALSVDDTLAIYILFVRS------- 86

Query: 177 LPYIRELDRQRGRGQL-----AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +     +  S + +++ EL    GS       +  + I+ +Y  L
Sbjct: 87  ----RESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF QY        FT E +K A   V S  +   +        L P    +
Sbjct: 143 ------VERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-M 195

Query: 292 LAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLI 348
           L +SS   KC     +  D + ++  + Y+ G+ + + Y           G  PN++LL 
Sbjct: 196 LNHSSDVKKCHVYDTSSGD-LSILAGKDYEPGDQVFINY-----------GSIPNNRLLR 243

Query: 349 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 408
            YGFV  +NP D   +      E P ++ K+ +    G  SV    +       +++D L
Sbjct: 244 LYGFVVPNNPNDSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISL-------SLNDPL 296

Query: 409 P-----YLRL 413
           P     YLR+
Sbjct: 297 PKSVLQYLRI 306


>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 392

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
           K+E+   DL  W   NG            +  E    +++     + A+ +++A +    
Sbjct: 74  KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121

Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
           VP  L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L 
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +        ++PL + E E+ YL  +    ++  + +   R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216


>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
 gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
 gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
          Length = 416

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 114/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+ +  +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 44  LQEGQMIISLPESCLLTTDTVIRS-YLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 102

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 103 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 154

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 155 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 210

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 211 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 259

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 260 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308


>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
          Length = 770

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
           A  ED+Q  +A   +PN  ++T+ER   +E          +  +++  +   L +++M E
Sbjct: 67  AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126

Query: 168 KKQGKKSFWLPYIRELD 184
            ++G +SFW PY   +D
Sbjct: 127 FQKGDQSFWYPYFNAVD 143


>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
 gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
          Length = 534

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 9/144 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  DL AG     +P S + +      N TI+ LL   ++  +  L L  +YE    +
Sbjct: 40  VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95

Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNE 230
            KS W PY++ +     +G ++V  P  WSE     L G  T  + L  A   ++E Y  
Sbjct: 96  EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRG--TTLDTLYDALSPQQEVYEG 152

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF 254
            +    +A    Q++   +P E F
Sbjct: 153 FEISLHVAKKWNQEFSLPLPEEYF 176


>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
 gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
          Length = 469

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 82/384 (21%), Positives = 143/384 (37%), Gaps = 80/384 (20%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           L     L L ++YE  QG  S W PY   L +Q        ++P+ WS+ EL  L G+  
Sbjct: 91  LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
            AE + + E      +++  +     ++F       PTE     +  +    + +    L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203

Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           +                       +VP+   L A +++  A +   +      +    +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262

Query: 320 GESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDK 378
           G+ ++ +Y           GP P S+LL  YG+V  E + YD + V   L  E       
Sbjct: 263 GDQVLNYY-----------GPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE------- 304

Query: 379 RMVAQRNGKLSVQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSP 437
             V      LS + +  V +  + E I D   Y  +   S         ++   +  VSP
Sbjct: 305 --VIVSCLSLSAEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSP 359

Query: 438 CMERAVLDQLADYFKA-----------------------------RLAGYPATLSEDEAM 468
                +++QL ++ KA                             RLA YP ++  DE +
Sbjct: 360 ----ELVEQLKEFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKL 415

Query: 469 LTDYNLHPKKRVATQLVRMEKKML 492
           L + +L  ++R+A  +   EKK+L
Sbjct: 416 LAEADLPARRRMAVVVRLGEKKLL 439


>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Entamoeba nuttalli P19]
          Length = 791

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 73/329 (22%), Positives = 122/329 (37%), Gaps = 49/329 (14%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           + D+K W+ +NG     V +K    +         + A+++ +  +   S+P S  + + 
Sbjct: 1   MEDIKKWVIQNGGVIDGVDVKTFDGYGRG------LCANKEFKKDEIIMSIPYS--IQIN 52

Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           R+  N    E+         +   +L  L    +   K   K F  PYI  L        
Sbjct: 53  RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
              + PL ++  EL  + G+   A + E+          +  V +    L QQ+P Y  P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
            +    ++FK+   A QS       V   + F  V    P   +S+ C           Q
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQ 215

Query: 311 L------VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
                    +   K GE I   Y I           + N KLL+ YGFV+E+NP D L++
Sbjct: 216 TETFSFQTNEALVKPGEQIFNNYRI-----------RSNEKLLLGYGFVEENNPCDNLLL 264

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
                 +D QY +   + ++    S   F
Sbjct: 265 RIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
          Length = 440

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 65/289 (22%), Positives = 113/289 (39%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  + +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S +   KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
          Length = 512

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A++D+ AG     VP  L++++E    +  +   L  ++  L+    LA++L+ E  +
Sbjct: 37  VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96

Query: 171 GKKSFWLPYIREL 183
           G  SFW PY+R L
Sbjct: 97  GAASFWQPYLRSL 109


>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 791

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 72/329 (21%), Positives = 126/329 (38%), Gaps = 49/329 (14%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           + D+K W+ +NG     V +K    +         + A+++ +  +   S+P S  + + 
Sbjct: 1   MEDIKKWVIQNGGIIDGVDVKTFEGYGRG------LCANKEFKQDEIIMSIPYS--IQIN 52

Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           R+  N    E+         +   +L  L    +   K   K F  PYI  L +      
Sbjct: 53  RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWPYINVLPK------ 106

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
              + PL ++  EL  + G+     + E+          +  V +    L QQ+P Y  P
Sbjct: 107 -TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-KAMLAAVDDAV 309
            +    ++FK+   A QS       V   + F  V    P   +S+ C +A +  + +  
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTR 215

Query: 310 QLVV-----DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVV 364
                    +   K GE I   Y I           + N KLL+ YGFV+E+NP D L++
Sbjct: 216 TETFSFQTNEEVVKPGEQIFNNYRI-----------RSNEKLLLGYGFVEENNPCDNLLL 264

Query: 365 EAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
                 +D QY +   + ++    S   F
Sbjct: 265 RIYFEVDDNQYNEIEEILKQEEIKSFDFF 293


>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
 gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
          Length = 490

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 71/185 (38%), Gaps = 18/185 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL AGD    VP S  ++  R  G   IA LL  + L  +A L +  +YE+  G 
Sbjct: 36  VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNEL 231
           +S W  +   L           E P  WS  E   L       EI     G   E Y EL
Sbjct: 92  QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGTEVEEIGGLETGEDEEVYQEL 148

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
              +F            I  E  +F+ FK+  V + S    + +    R   LVP G  L
Sbjct: 149 IVPFFEDNGKL------INLECPSFDEFKKLVVVIASRAFEVDQF---RELCLVP-GACL 198

Query: 292 LAYSS 296
             +S 
Sbjct: 199 FNHSD 203


>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
 gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
          Length = 389

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 9/120 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  ++AG+    +P+ LV+T E++  ++ + +LL+T    +   L L ++ E+ +G+ S
Sbjct: 14  AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PY+  L            S + W + EL +L  +       ER E I  EY  +  V
Sbjct: 70  RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEYISVKNV 124


>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 440

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 112/289 (38%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA         ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 105/267 (39%), Gaps = 39/267 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A EDL  GD    +P +++++ E V  +     L     +       L+ M EK      
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVDSE 248

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L         A  + L +    +  L G+    E+++  E ++++YNEL   
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
            F A  L   +P   P E +++E F  A     S  + +       R  LVP        
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKL 346
           L P +L Y       + +  D+++  + RP +AGE   + Y           G    S L
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSY-----------GNYSGSHL 398

Query: 347 LINYGFVDE-DNPYDRLVVEAALNTED 372
           +  YGF+ E DN  D + ++     +D
Sbjct: 399 VTFYGFLPEGDNVNDVIPLDIDFGDDD 425


>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 513

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 105/451 (23%), Positives = 164/451 (36%), Gaps = 93/451 (20%)

Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
           A +D+  G   FSVP SL ++     L +++G E   + L  NK      L L +M+E+ 
Sbjct: 34  ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           +   S W  Y   L R       A ++P+ W+  EL  L G+    +I +  E  +R+Y 
Sbjct: 91  RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141

Query: 230 EL-------DTVWFMAGSLFQQYPYD--------IPTEAFTFEIFKQAFVAVQS------ 268
           E+           F  G +   Y  +        I + +F  E +K+     QS      
Sbjct: 142 EILNPAVRTRPDLFDPGHIASFYSLENYHVMGSRILSRSFHVEKWKEQTPGSQSRASSEL 201

Query: 269 -----CV------VHLQKVSLAR--------------RFALVPLGPPLLAYSSKCKAMLA 303
                C+       +L  V                    A+VP+   L A      A L 
Sbjct: 202 HENGDCMDIDDESSNLSAVGAENGGDDDSDDEAENPSDIAMVPMADMLNAQYGSENAKLF 261

Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD--------- 354
                + +V  +P + GE I   YN          G  PNS LL  YG VD         
Sbjct: 262 YEPTHLNMVSTKPIRRGEQI---YNA--------YGDLPNSALLREYGHVDLVPLPGVPW 310

Query: 355 -EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS------DM 407
            E NP D + + A L          R+ A+ + K  +  +    G +   +       D+
Sbjct: 311 KEGNPADVVEIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVFVLGTDLELPDV 369

Query: 408 LPYLRLGYVSDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARLAGYPATLSEDE 466
           +       +    E +   S   P  P +    +      +    + RLA YP TL  DE
Sbjct: 370 MISFLKLLLLSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRLAKYPTTLEHDE 429

Query: 467 AMLTDYNLHPKKRVATQLVRM-EKKMLNACL 496
           A+L+     P       +VR+ EK +L  C+
Sbjct: 430 ALLSGQTSLPYNVRNAIIVRIGEKHILVGCM 460


>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 889

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 15/135 (11%)

Query: 79  GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
           G L+ W+   GLPP KV +  +            V   E L       +VP  L++T + 
Sbjct: 32  GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85

Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
            L +     LL +  +   + LA +L   ++Q  G K+ W  Y+  L  Q G        
Sbjct: 86  ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138

Query: 197 PLLWSETELAYLTGS 211
            L W+  E+  L G+
Sbjct: 139 VLEWASEEVDLLRGT 153


>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 513

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 33/192 (17%)

Query: 331 FTGHDFKCGPQ--------PNSKLLINYGFVD----------EDNPYD------RLVVEA 366
            T H+ K G Q        PNS LL  YGF+D            NP D       LVVEA
Sbjct: 301 MTVHEIKAGEQIWNTYGDPPNSDLLRRYGFIDVTKLESPLSGAGNPADIVEIPANLVVEA 360

Query: 367 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 426
           A      + QD+  V     +    VF V  G + E   +M+   RL  +   +E +   
Sbjct: 361 ATKHTTSKTQDR--VDWWLEEAEDDVFVV--GTDCELPPEMVSLARL-LLQPKAEWEKT- 414

Query: 427 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN-LHPKKRVATQLV 485
            + G +    P M+  +     D  ++RL  YP ++ EDE +L D + L   +++A  + 
Sbjct: 415 KAKGKVP--KPTMDTTIAAIAMDVLQSRLKEYPTSVEEDERLLADESQLGFNRKMAVTVR 472

Query: 486 RMEKKMLNACLQ 497
             EK++L   L+
Sbjct: 473 LGEKRILAGTLR 484


>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
          Length = 209

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP SL +T + +       +   +  +  +A LA  L+  K  G+ S
Sbjct: 66  ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  Q       + + + W+E+EL  +  S    E + +   I++++ E+  V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177


>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
 gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
          Length = 484

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 12/79 (15%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           +++PL   L A +SKC A L    +++++   +P   GE +   YNI         G  P
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQV---YNIY--------GDHP 261

Query: 343 NSKLLINYGFVD-EDNPYD 360
           NS+LL  YG+V+ E + YD
Sbjct: 262 NSELLRRYGYVEWEGSKYD 280


>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
          Length = 485

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 69/313 (22%), Positives = 116/313 (37%), Gaps = 56/313 (17%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+ +   P  KV  K K +          + A +D+   +  F++P +LV+  +    N 
Sbjct: 23  WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76

Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            + +LL  N+       CL + ++YE  QG  S W  Y + L           ++ + W+
Sbjct: 77  RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129

Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
           + EL  L+GS     +L +      E N L  +  +       +P       +D P    
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185

Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                         A+ F+I K      +    ++          +VPL   L A + + 
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDN 357
            A L   D  + +   +P + GE I   Y           G  P + LL  YG+V D   
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDY-----------GELPRADLLRRYGYVTDNYA 294

Query: 358 PYDRLVVEAALNT 370
            YD   VE ++ T
Sbjct: 295 QYDE--VEISMRT 305


>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
          Length = 485

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 69/313 (22%), Positives = 116/313 (37%), Gaps = 56/313 (17%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+ +   P  KV  K K +          + A +D+   +  F++P +LV+  +    N 
Sbjct: 23  WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76

Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            + +LL  N+       CL + ++YE  QG  S W  Y + L           ++ + W+
Sbjct: 77  RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129

Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
           + EL  L+GS     +L +      E N L  +  +       +P       +D P    
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185

Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                         A+ F+I K      +    ++          +VPL   L A + + 
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV-DEDN 357
            A L   D  + +   +P + GE I   Y           G  P + LL  YG+V D   
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDY-----------GELPRADLLRRYGYVTDNYA 294

Query: 358 PYDRLVVEAALNT 370
            YD   VE ++ T
Sbjct: 295 QYDE--VEISMRT 305


>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
           +AA+ DL  G+   +VP S ++T + +L +E ++  +  +  LS    L + L+ E  +G
Sbjct: 51  LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110

Query: 172 KKSFWLPYIRELDR 185
           K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124


>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
           +AA+ DL  G+   +VP S ++T + +L +E ++  +  +  LS    L + L+ E  +G
Sbjct: 51  LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110

Query: 172 KKSFWLPYIRELDR 185
           K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124


>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 510

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A+ DL+ G+    VP S ++T +  L +  +   +  +  LS    L + L+YE  +G
Sbjct: 57  LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116

Query: 172 KKSFWLPYIRELDR 185
           + SFW PY+  L R
Sbjct: 117 QSSFWYPYLMHLPR 130


>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
 gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
          Length = 353

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 11/82 (13%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
             ++PL       S+K    +  V++  Q++ ++P K GE I   Y           G  
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNY-----------GLY 219

Query: 342 PNSKLLINYGFVDEDNPYDRLV 363
            N  LL+ +GF++ DNP+D  +
Sbjct: 220 TNEMLLLTHGFIEFDNPHDHFI 241


>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
          Length = 749

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 59/238 (24%), Positives = 89/238 (37%), Gaps = 34/238 (14%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           LYL   K   K   W PYI  L           + PL ++  EL  + G+   A + E+ 
Sbjct: 42  LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
                    +  V +    L QQ+P     ++F  ++FK+   A QS       V   + 
Sbjct: 93  NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV------DRPYKAGESIVVWYNISFTGHD 335
           F  V    P   +S+ C           Q         +   K GE I   Y I      
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRI------ 198

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
                + N KLL+ YGFV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 199 -----RSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251


>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
          Length = 591

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 70/284 (24%), Positives = 120/284 (42%), Gaps = 50/284 (17%)

Query: 100 KPSHNEKH-RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK----- 153
           KP   E+  R I  +A  E++      FS+P  ++++ +  + + +IA +   +K     
Sbjct: 35  KPHDGERGVRVISDIAPCEEM------FSIPEKILMSRKSCMAS-SIAHVFRKHKDVLFS 87

Query: 154 -LSELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
              ELA L L ++YEK  QG  SFW P I  L    G       +   WSE EL  L   
Sbjct: 88  SRDELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDE 139

Query: 212 PTKAEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
             KAE +     +++ Y   L  +    G +F         + +T+E F+ A + V+S  
Sbjct: 140 SLKAEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRT 192

Query: 271 VHLQKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
                     RF    ++VP    L   + +        +     + D    +GE +   
Sbjct: 193 FG--------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCD---ASGEHVHRR 241

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
              +F  +    GP+ N++LL++YGF  + N Y+ + +   +NT
Sbjct: 242 GEEAFMSY----GPRSNAELLLHYGFALQSNRYEAVELNFRINT 281


>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
 gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
          Length = 418

 Score = 42.7 bits (99), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 77/183 (42%), Gaps = 33/183 (18%)

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 256
           W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F   + 
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 311
           E  K  +  V S     + +++     LV +  P+L +     +S  K     + +   +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311

Query: 312 VVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 371
             DR Y   + I  W N          G   N++L ++YGF   +NPYD    E  L T+
Sbjct: 312 TADRDYAENDQI--WINY---------GDLSNAELALDYGFTVPENPYD----ETELLTQ 356

Query: 372 DPQ 374
            P+
Sbjct: 357 FPE 359


>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
           gallopavo]
          Length = 241

 Score = 42.7 bits (99), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 9/100 (9%)

Query: 94  KVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE 147
           K  LK++   +   RP  +      +  +  LQAG+   S+P   +VT   VL N  + E
Sbjct: 36  KKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVL-NSCLGE 94

Query: 148 LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
            +   K  +S L  L  +L+ EK  G+KS W PY+  L +
Sbjct: 95  YIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK 134


>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
 gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
          Length = 491

 Score = 42.7 bits (99), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 65/284 (22%), Positives = 111/284 (39%), Gaps = 56/284 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +  F +P  L++++E     E +   L   +L     L + ++YE  QG+
Sbjct: 61  LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYNE 230
           +S W PY   L         + ++ + W+E +L  L G     +I + A  E I ++   
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVP 171

Query: 231 L--------------------DTVWFMA---GSLFQQYPYDI-PTEAFTFEIFKQAFVAV 266
           L                    D +  +A   GSL   Y +DI  TE    +  +  ++  
Sbjct: 172 LIQANPHHFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTD 231

Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
                   +   A+   +VPL     A + +  A L   + +  +   R  +AGE I   
Sbjct: 232 -------DEDEPAK--GMVPLADIFNADAQRNNARLFQEEGSFVMKAIRNIQAGEEIFND 282

Query: 327 YNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 370
           Y           G  P + LL  YG+V  DN     VVE +L++
Sbjct: 283 Y-----------GELPRADLLRRYGYV-TDNYAQYDVVEFSLDS 314


>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 372

 Score = 42.7 bits (99), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 58/256 (22%), Positives = 101/256 (39%), Gaps = 35/256 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ D+QAG+    VP  L  TL+ V   + I+  L  N +S  A LA YL  +K    
Sbjct: 33  IIATRDIQAGETILFVPFKLFRTLKHV--PKAISRRLPRN-MSLHALLATYLSLDKTD-- 87

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            +F +P     D       +    P LW   EL      P    ++++    KR+     
Sbjct: 88  -TFAIPNKTLPDLSSFEAGM----PFLWP-AELHPFLPKPALDLLMKQQRSFKRD----- 136

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             W +    +     D    A+   +  ++F         ++++    R A++P+     
Sbjct: 137 --WDIVSKAYSNISQDQYLHAWLL-VNTRSFYCTTPI---MERLPHDDRLAILPVADLFN 190

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGF 352
                C+A  A+  +    + DR Y+ GE + + Y           G      LL  YGF
Sbjct: 191 HADVGCEARFAS--ENYSFIADRDYRTGEELHISY-----------GSHSTDFLLTEYGF 237

Query: 353 VDEDNPYDRLVVEAAL 368
           V  +N +D + ++ A+
Sbjct: 238 VPTENCWDVVCLDEAI 253


>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
 gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
          Length = 486

 Score = 42.7 bits (99), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 72/173 (41%), Gaps = 29/173 (16%)

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 254
           S   W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F 
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369

Query: 255 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 307
             + E  K  +  V S     + +++     LV +  P+L +     +S  K     + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422

Query: 308 AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
              +  DR Y   + I  W N          G   N++L ++YGF   +NPYD
Sbjct: 423 YAVVTADRDYAENDQI--WINY---------GDLSNAELALDYGFTVPENPYD 464


>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
          Length = 440

 Score = 42.4 bits (98), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 112/289 (38%), Gaps = 38/289 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q   L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   + KA       + ++     ++  E + + Y           GP  N +L + 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICY-----------GPHDNQRLFLE 283

Query: 350 YGFVDEDNPY-----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 393
           YGFV   NP+      R ++   L + D Q   K  + + +G +    F
Sbjct: 284 YGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 468

 Score = 42.4 bits (98), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 10/119 (8%)

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           ++A+ L   +L     L + +M E+  G +S W  Y   L   RG   L    P+ W+E 
Sbjct: 85  SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139

Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
            L  L G+     + E AE ++ +Y+E      +   L   +P     E  TFE + +A
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193


>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 165

 Score = 42.4 bits (98), Expect = 0.56,   Method: Composition-based stats.
 Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
           V A E++ A     ++PN+L+++   V  +E    L        L  +  ++   LALYL
Sbjct: 51  VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
           M EK +G+ SFW PY+          Q+A ES  L  W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143


>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
          Length = 777

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 39/165 (23%), Positives = 74/165 (44%), Gaps = 31/165 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLM 165
           VAA + +   +A   +PN L++  +++  +E          E   T K S+   L  ++ 
Sbjct: 112 VAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPNEFKNTEK-SDQTSLIFFVA 170

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVES--PLLWSETELAYLTGSPTKAEILERAEG 223
            E  +G++S+W PY           + A +S  P  W +  +  L  +  KAE+      
Sbjct: 171 LELLKGEESYWHPYF----------ETAQDSDLPQFWEDQNIDELEDALIKAEL------ 214

Query: 224 IKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
              + +++D +  + +A  +   YP  +  E FT EI+K+A+  V
Sbjct: 215 ---QMHQVDFIGDYEIAHGIANHYPDLVHAEKFTIEIYKRAYNIV 256


>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 454

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 73/347 (21%), Positives = 142/347 (40%), Gaps = 49/347 (14%)

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIREL 183
            ++P+ L+ T++R   +  +  LL++ +  LS    LA Y+++ + + K  +  P     
Sbjct: 41  LTIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLATYILFIRAR-KSGYDGP----- 94

Query: 184 DRQRGRGQL--AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
             Q     L  +  S + +++ EL    GS         A  I+ +Y +L         L
Sbjct: 95  --QSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARL 146

Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKC 298
           F ++    P++ FT + +K A   V S  +   K+       L+     +L +S    +C
Sbjct: 147 FGRHRDVFPSDKFTIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQC 205

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP 358
                   + + ++  + Y+ G+ + + Y           GP PN++L   YGFV   NP
Sbjct: 206 HVYDPQSGN-LSILAGKSYEPGDQVFINY-----------GPIPNNRLSRLYGFVVPGNP 253

Query: 359 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 413
            D   +  + +   P ++ K  +    G  S     +        ++D LP     YLR+
Sbjct: 254 NDSYDLVLSTHPMAPFFEQKHKLWIAAGLDSTSTVSL-------TLTDPLPRSVLRYLRI 306

Query: 414 GYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGY 458
             +++T ++ +V +    +    +S   E  VL  L +   A L G+
Sbjct: 307 QRLNET-DLAAVGTRQSDVAFEKISDSNETEVLTFLVESISALLDGF 352


>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 42.4 bits (98), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 87/364 (23%), Positives = 143/364 (39%), Gaps = 59/364 (16%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL- 218
           L L L+YE+   + S W     EL      G   V  P  W   +LA L G     ++L 
Sbjct: 201 LVLSLIYERYVAETSHW----NELLLSCPGGYPNV--PSFWDWEDLAELEGLDVLDDVLA 254

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQ--QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
           ++A+  + +   +  + F+  +L    ++  D   E F+ E    A     S   +L  V
Sbjct: 255 KKAQLAQFQTETMAVLPFIHEALAGGCRFGKDEFLECFSIEAMMWARATFDSRAFNL-NV 313

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
                 ALVP+   ++ + ++   ++  V+             G   V+    S T  D 
Sbjct: 314 DGRVVIALVPVAD-MINHHNRSDVLVRKVE-----------PNGGDFVMQIGASLTAQDI 361

Query: 337 ------KCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 383
                   GP  N +LL  YGFV E N +DRL       E  +  E   +  +R  +VA 
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVAT 418

Query: 384 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 443
               L+ + +  H GR   A+  +L      ++++  E  + +   GP   +    E  V
Sbjct: 419 YGLHLAGRCWICHDGRPPPALVALLRV----HLAEAEEFDT-MERKGPFASLGAGTEARV 473

Query: 444 LDQLADYFKARLAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRME 488
           +  +AD  +  L  +  +L EDE +L             D N  P    KR+A  L+RM 
Sbjct: 474 VATIADTIRCILDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMG 532

Query: 489 KKML 492
            K +
Sbjct: 533 MKRI 536


>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 481

 Score = 42.0 bits (97), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 108/274 (39%), Gaps = 61/274 (22%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
           + A  ++   +  F++P +LV++ +    N  + +LL  +   L    CL L ++YE  Q
Sbjct: 50  IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI--LERAEGIKRE- 227
           G  S W  Y + L  +        ++ + W++ EL  L+GS    +I   +    I R+ 
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGSAVLNKIGKSDAEAAILRDI 158

Query: 228 -------------------YNELD---TVWFMA---GSLFQQYPYDIPTEAFTFEIFKQA 262
                              Y+  D   T+  +A   GSL   Y +DI       E  +  
Sbjct: 159 FPIVSTNPHLFPPISGLGSYDSPDGRATLLSLAHRMGSLIMAYAFDI-------EKGEDE 211

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
              VQ   +  +   L +   +VPL   L A + +  A L   D  + +   +P + GE 
Sbjct: 212 EGEVQDGYITDEGEELTK--GMVPLADLLNADADRNNARLFQEDGYLAMKSIKPIRNGEE 269

Query: 323 IVVWYNISFTGHDFKCGPQPNSKLLINYGFVDED 356
           I   Y           G  P + LL  YG+V ++
Sbjct: 270 IFNDY-----------GELPRADLLRRYGYVTDN 292


>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 406

 Score = 42.0 bits (97), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 87/403 (21%), Positives = 164/403 (40%), Gaps = 48/403 (11%)

Query: 81  LKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           + +W++K+G +    + L + P           V A    +  +   ++P + + T++  
Sbjct: 1   MDAWLNKSGAVGLGDLDLADFPETGRG------VKAQRPFKEDERILTIPANCLWTVKGA 54

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
             +     +L + +  LS    LALY+++ + +G+   +       +RQ     L  E  
Sbjct: 55  YADPLFGPVLQSVQPPLSVEDTLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYT 108

Query: 198 L--LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAF 254
           L   +++ EL    GS            +  +Y +L T  FM    LF       P + F
Sbjct: 109 LSMYFTDEELRVCAGSSLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKF 161

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
           +F+ +K A  ++ S  +    +S      L+     +L ++S  K   A       L V 
Sbjct: 162 SFQHYKWALSSIWSRGMDF-TISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVL 220

Query: 314 -DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 372
             R Y+ G+ + ++Y           G   NS+LL  YGFV  DNP D   +    ++  
Sbjct: 221 ACRDYEVGDQVFIYY-----------GNVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMA 269

Query: 373 PQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG 430
           P Y+ K    QR  KL+   ++  +    +      +L YLR+  + D S++ ++   + 
Sbjct: 270 PLYEQK----QRLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIA 324

Query: 431 --PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 471
                 +S   E  +L  L+   +A L G+  +L + E  L +
Sbjct: 325 TESYTKISDENESQILLFLSQSIEALLEGFEISLEKLETQLAE 367


>gi|409079523|gb|EKM79884.1| hypothetical protein AGABI1DRAFT_119942 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 401

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 124/346 (35%), Gaps = 79/346 (22%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE--- 137
            K+W H+ G             +        Y  A+ED+       + P S+ +T E   
Sbjct: 9   FKAWFHERG------------GYLHDGVGFTYTVATEDIPTNATIITCPFSIAITEELAK 56

Query: 138 RVLGN----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           R L N    +++ +      +S   CL   L   K    + +  PY   LD      QL 
Sbjct: 57  RNLSNFFSADSLVDWTEKQWISTYICLHWILSDSKPTELEHY--PY---LDTLPPLNQL- 110

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
             +PL +++ E+    GS      L R   +K E+ E  +V      L  Q   D   + 
Sbjct: 111 -RTPLQFTKIEIETFKGSNLYHATLNRERQLKEEWQECQSV------LISQN--DSWGKG 161

Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRF--ALVPLGPPLLAYSSKCKAMLAAVD----- 306
           FT+E +  A   V S           R F   ++   P L+A       +L  VD     
Sbjct: 162 FTWERYLTAATYVSS-----------RAFPSTILSPNPSLIATPETKFVLLPGVDAFNHK 210

Query: 307 --DAVQLVVDRPYKAGESIVVWY---NISFTGHDFKC---------GPQPNSKLLINYGF 352
              AV   V  P K+G S+   Y    IS   H             GP+PN  L++ YGF
Sbjct: 211 RAQAVSWSVTYPDKSG-SLASSYKGPTISLVPHTKTSAGEEIFNNYGPKPNGNLILGYGF 269

Query: 353 VDEDNPYDRLVVE------------AALNTEDPQYQDKRMVAQRNG 386
               NP D ++++            AA   E    +  RMV +  G
Sbjct: 270 SLPANPDDTILLKIGGFEKKWEIGRAATGVEGLWTELLRMVKENQG 315


>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
 gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
          Length = 480

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 97/245 (39%), Gaps = 42/245 (17%)

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           +T+E +LG            LS    L+ Y+ +EK++G  SFW P+I  L  +     LA
Sbjct: 135 LTMEEMLG------------LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA 181

Query: 194 VESPLLWS------ETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYP 246
              PL+W         EL  L  + TK  + +  +  + +YN + D +      +     
Sbjct: 182 ---PLVWKVLKVDHYEELLKLLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNER 238

Query: 247 YDIPTEAFT----FEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCK 299
            +  T+A       E++  +++ + S  +++   Q  + A  F + P     L +S   +
Sbjct: 239 SNDLTDAIRHLVPIELYLWSWMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQ 297

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
             L       Q+     Y   E + + Y           GP  N  LL  YGF   +N +
Sbjct: 298 CGLKIDGTGFQVYTTCSYNPDEQLFLSY-----------GPHSNEFLLCEYGFTLPENKW 346

Query: 360 DRLVV 364
           + L V
Sbjct: 347 NDLDV 351


>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
 gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
          Length = 1339

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 68/325 (20%), Positives = 126/325 (38%), Gaps = 40/325 (12%)

Query: 42   SSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP 101
            S ++L   KN+FS     S   +  S EV  + E       +W+   G+   K+ +    
Sbjct: 728  SLIQLSNLKNQFSKPTIKSVPFIKPSDEVYKRFE-------NWLKAGGVQFPKLQIA--- 777

Query: 102  SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA 161
              N        V  ++ +   +A   VP   ++ ++    +  +  +     L++   L 
Sbjct: 778  --NFTDSTGRGVVTTKKVDENEAVVVVPKKYLINVDVAKAHPILGPIFEELHLNDDTILF 835

Query: 162  LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            L+++YEK     SFW P+   L            + + +S TEL  L G+     + E  
Sbjct: 836  LFVIYEKGNAN-SFWRPFYDTLPS-------YFTTSIHYSATELLELEGT----NLFEET 883

Query: 222  EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
               K++ N      F    L +QYP   P   F++E F  A   + S  + L K+  + +
Sbjct: 884  LHTKQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIK 940

Query: 282  FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCG 339
              LVP+   +  +++    +        + +++      A   I + Y           G
Sbjct: 941  SCLVPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHY-----------G 989

Query: 340  PQPNSKLLINYGFVDEDNPYDRLVV 364
               N +L + YGF+  +N YD L +
Sbjct: 990  ALQNWELALYYGFIIPNNIYDSLHI 1014


>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
 gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
          Length = 658

 Score = 42.0 bits (97), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----------------LS 155
           + A+ DLQ G+A   VP  L++T      +  +A  L  +                  L 
Sbjct: 32  IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
               LA +L+ E  +G +SFW PY+++L R
Sbjct: 92  PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121


>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
 gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
          Length = 496

 Score = 42.0 bits (97), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 19/121 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
           A +DL  G   F++P +L ++     L  + G E    L    KL +  A L L +M+E 
Sbjct: 42  ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            QGK+S W  Y+  L         A ++P+ W+E +L+ L G+    ++ +  E  +R+Y
Sbjct: 98  AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148

Query: 229 N 229
           +
Sbjct: 149 D 149


>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 514

 Score = 42.0 bits (97), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 59/241 (24%), Positives = 98/241 (40%), Gaps = 31/241 (12%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           N +  +  LAL LMYE+ +   S W  ++R L          +ES L W++ EL  +   
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
             K    ER   ++  Y  + T           Y  D+    FT E F  A V  Q+  +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
           H    S     +L+P+   ++ +  +  A++ A    + +      KAGE I + Y +S 
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITIDYEMS- 292

Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
                      +  LL+ YGFV+     D   +  +  ++D  Y  +  + +  G LS Q
Sbjct: 293 -----------SHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYPRRLRLLEGRG-LSRQ 340

Query: 392 V 392
           V
Sbjct: 341 V 341


>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 449

 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 63/261 (24%), Positives = 102/261 (39%), Gaps = 47/261 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A+ DL + +  F +P + ++T E     + I + LT   LS    L L +++E   G 
Sbjct: 42  VVATSDLTSDEEIFRIPRTSILTTETTDLPQEILQQLTDPWLS----LILAMIFEYLLGT 97

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY--- 228
            S + PY+  L         +  + + W++ EL YL GS   ++I  E A+    E    
Sbjct: 98  NSRFKPYLDILPE-------SFNTLMFWTDNELQYLQGSAILSKIGKEEADNTFSEQLLP 150

Query: 229 ----------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
                            +L  +    GS+   Y +D+     T      +          
Sbjct: 151 IITKNPEIFKIGTCNNQDLLALCHRMGSIIMSYAFDLDPPPTT--TTSSSEEWESDSDSE 208

Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
            +K+S     AL+PL   L A      + L    D+  +   +P  AGE ++        
Sbjct: 209 NEKISPK---ALIPLADMLNANGDLTNSKLFFSSDSFIMKTLQPVAAGEELL-------- 257

Query: 333 GHDFKCGPQPNSKLLINYGFV 353
            +DF  GP P + LL  YGFV
Sbjct: 258 -NDF--GPLPPADLLRRYGFV 275


>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
 gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 11/83 (13%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
             ++PL       S+K    +  V+   Q++ ++P K GE I   Y           G  
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNY-----------GLY 219

Query: 342 PNSKLLINYGFVDEDNPYDRLVV 364
            N  LL+ +GF++ DNP+D  + 
Sbjct: 220 TNEMLLLTHGFIEFDNPHDHFIT 242


>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
          Length = 565

 Score = 42.0 bits (97), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 73/333 (21%), Positives = 127/333 (38%), Gaps = 76/333 (22%)

Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
           +P ++ A S+    G A+  +P  LVVT ++  G +   +  T    +  + L +YL Y 
Sbjct: 28  KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKD--TYKNANHSSLLKIYLCYS 81

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           + Q  +SF  PY+  L   +     A++SP +WS  + A L G+     + E    +  E
Sbjct: 82  RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134

Query: 228 YNELDTVWFMAGSLFQQYPYDIP------------------TEAFTFEIFKQAFVAVQSC 269
                  W+ A +L    P D+P                  T+   +  F +   +  + 
Sbjct: 135 -------WWNAINLL---PEDVPKPEQHFINLKFYYENKFYTDDDYYSYFNEVDTSNWTS 184

Query: 270 VVHLQKVSL---ARRFALVPLGPP-------------LLAYSSKCKAMLAAVDDAVQLVV 313
             +    SL   +R F    + P              LL ++ K K   +  D       
Sbjct: 185 FPNYLWASLVLKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKTKVQWSGTDGGFLFQS 244

Query: 314 DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 373
           D    +GE +   Y           G + N +LL+ YGF  E+NP D     AAL  + P
Sbjct: 245 DDA-SSGEELFNNY-----------GQKGNEELLLAYGFAIENNPADS----AALKIKIP 288

Query: 374 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 406
              D ++   ++  + +   H +     + +SD
Sbjct: 289 ---DSKLQVVKDLGIKLPSIHDYTNSVIDQVSD 318


>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 481

 Score = 41.6 bits (96), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 65/288 (22%), Positives = 112/288 (38%), Gaps = 35/288 (12%)

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           + +S+ E+    GS       +    I  +Y +L T   M      ++    P   F  E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 314
            +K A   V S  +    VS      L+     +L +SS   +C A      D + ++  
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226

Query: 315 RPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 374
           + Y  G+ + ++Y           GP PN++LL  YGFV  +NP+D   +    +   P 
Sbjct: 227 KDYNVGDQVFIYY-----------GPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPL 275

Query: 375 YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL----- 429
           Y+ K  + +  G  +     + A       +D LP   L Y+      +S++ ++     
Sbjct: 276 YEQKERLWKLAGLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIA 328

Query: 430 -GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 476
            G    +S   E  +L  L D   A L G+   L    A L   +++P
Sbjct: 329 TGADEKISDDSETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376


>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 41.6 bits (96), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 394
           GP  N +LL  YGFV E+N +DRL             ++  +R  +VA     L+ + + 
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429

Query: 395 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 454
            H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V+  +AD  +  
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484

Query: 455 LAGYPATLSEDEAMLTD 471
           L  +  +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501


>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
           206040]
          Length = 378

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 39/182 (21%), Positives = 76/182 (41%), Gaps = 35/182 (19%)

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
             V  P++W   EL +L     +  + +R +  + ++++           F +   DI  
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
           E +T+     A++ V +   + +     K     R AL+P+        + C+   +   
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202

Query: 307 DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEA 366
           +   +V DR YK GE +     IS++ H        N   L+ YGFV ++NP D + ++ 
Sbjct: 203 EGYHIVADRDYKRGEELY----ISYSSHS-------NDYNLVEYGFVPDENPSDDVYIDD 251

Query: 367 AL 368
            +
Sbjct: 252 VI 253


>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
          Length = 482

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 11/85 (12%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
           R AL+PL   +  YS     M    D A+ L V   +  G+     Y           GP
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASY-----------GP 263

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVE 365
             N  LL  YGFV++DNP D  V+E
Sbjct: 264 VSNDDLLQYYGFVEQDNPSDTYVLE 288


>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 691

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 88/204 (43%), Gaps = 26/204 (12%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  +G + FW PYI  L +    G L   +PL +   +L +L G+ + A   E+ 
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168

Query: 222 EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
           E + +E     + EL    F      ++Y +++   A T  + +     V + VV   ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF 336
                  L+P    +L +    K    A +  V  VV     AGE +   Y         
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNY--------- 275

Query: 337 KCGPQPNSKLLINYGFVDEDNPYD 360
             GP+ N +L++NYGF  ++NP D
Sbjct: 276 --GPRNNEQLMMNYGFCLQNNPCD 297


>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
 gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
          Length = 689

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 43/207 (20%), Positives = 82/207 (39%), Gaps = 25/207 (12%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++++E+ +G+ S W PY+  L R        + S L + + +L +L G+        
Sbjct: 104 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDNDLEWLQGTNLYQTHQA 158

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+   ++    G L          E++ ++IF  A+  + S      +   +
Sbjct: 159 YRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 204

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNI---SFTG--- 333
           R         P L    + + ML  VD +    + +     E+  +   +   +F+G   
Sbjct: 205 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFSGEEV 264

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYD 360
           H+          ++  YGF   DNP D
Sbjct: 265 HNNYGSLNNQQSVMTTYGFCIVDNPCD 291


>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
          Length = 1243

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP  + +TL+++         L  + + + + LA  L+ E+  G +S
Sbjct: 66  ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            W PYI+ L  +       + + +LW   EL  +  S    E +E  E  K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171


>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
          Length = 497

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A  DL+ G+    VP S ++T E V+ ++ + + +  +  LS    L + L+YE  +G
Sbjct: 55  LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114

Query: 172 KKSFWLPYIREL 183
           K S W PY+  L
Sbjct: 115 KTSRWHPYLMHL 126


>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
           + A  +L+ G+    VP + ++T E ++  +      ++    LS    L++ L+YE  +
Sbjct: 54  LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113

Query: 171 GKKSFWLPYIRELDR 185
           GK+SFW PY+  L R
Sbjct: 114 GKRSFWYPYLVHLPR 128


>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 497

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 83/213 (38%), Gaps = 38/213 (17%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            L L +++E  +G  S W PY+  L  +        E+P+ WS  ELA L  SP  A + 
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155

Query: 219 ERAEG-------IKREYNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 261
            RAEG       I     E + ++F AG           L  +    I   AF  E    
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215

Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           A     +      +    R    +VP+   +L   ++  A +   ++A+  +  R   AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274

Query: 321 ESIVVWYNISFTGHDFKCGPQPNSKLLINYGFV 353
           E I+ +Y           GP PN +L   YG+ 
Sbjct: 275 EEILNYY-----------GPLPNGQLCRRYGYT 296


>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
          Length = 471

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 60/276 (21%), Positives = 112/276 (40%), Gaps = 46/276 (16%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P  ++ T++    +  +   L + +  LS    LA Y+++ K        
Sbjct: 42  FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 94

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S +L++E +L    G+       +  + I+ ++  L
Sbjct: 95  ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 150

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEI----FKQAFVAVQSCVVHLQKVSLARRFALVPL 287
                    LF Q+P   P + FT E     +K A   V S  +      LA   ++  L
Sbjct: 151 VV------RLFVQHPDLFPLDKFTVEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLL 201

Query: 288 GP--PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPN 343
            P   +L ++S+ K   +       + +   + Y+AG+ + + Y           GP PN
Sbjct: 202 APFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINY-----------GPVPN 250

Query: 344 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 379
           S+LL  YGFV   NP D   +  + + + P ++ K+
Sbjct: 251 SRLLRLYGFVIPGNPNDSYDLVLSTHPQAPFFEQKQ 286


>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
          Length = 502

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 68/318 (21%), Positives = 121/318 (38%), Gaps = 69/318 (21%)

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
           L++    + I + +    L+ + C        +K    S WLPY+  L +          
Sbjct: 142 LKKCFEQDMIVKTMDNVALALMVCC-------QKLSPDSSWLPYLDALPQ-------TFS 187

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS--------------- 240
           +PL +S  EL  L+ SP   E L     + R++     V+F+A                 
Sbjct: 188 TPLYFSALELRKLSPSPAYEESLIMYRNVARQF-----VYFLAAVQRSERSRSAKKDKNH 242

Query: 241 -------LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL----QKVSLARRFALVPLGP 289
                  LF   P+ +    FTF++++ A   V + +  +     K S  +  A VP   
Sbjct: 243 AAVGMEPLFLNAPFTVSN--FTFDLYRWAVACVTTRINFIPSQYAKDSNGQPVA-VPCLI 299

Query: 290 PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNS 344
           PLL      +        +   D   +   + YKAG+ + ++Y I           + N 
Sbjct: 300 PLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFYGI-----------RTNR 348

Query: 345 KLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGRE 400
           +  ++ GFV D +N  D   ++      D Q + +  +   A  N +  V VF V+A   
Sbjct: 349 QFFLHNGFVPDGENKNDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVFEVNASER 408

Query: 401 KEAISDMLPYLRLGYVSD 418
              +S +L + R+  V +
Sbjct: 409 PVPLS-LLDFARVFLVEN 425


>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
           24927]
          Length = 611

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 131/325 (40%), Gaps = 54/325 (16%)

Query: 158 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
           AC  L    L+ E+ Q    FW PYIR L +         ++PL +++ E+  L G+   
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159

Query: 215 A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 266
           A ++L R +    E+       F+ G      ++Y +D+   A   +T   F    V + 
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217

Query: 267 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             S +     +S    F  L+PL   +L +    K +      +  L+       G  + 
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276

Query: 325 VWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 384
             Y           GP+ N +LL+ YGFV  +NP D L ++  ++   P+ Q  ++  QR
Sbjct: 277 NNY-----------GPKGNEELLMGYGFVIPENPGDSLAMKFTIS---PRGQAAQIWEQR 322

Query: 385 NGKLSV-QVFHVHAGREKEAISDMLPYLRLGY-----------VSDTSEMQSVISSLGPI 432
             K +  +VFH+    +    +  +P L   +           V++ +E+  + +     
Sbjct: 323 ALKQTWREVFHLTKSADSGQKTSTVPALESDWPEAFVDLFRILVANENEIDDLENGDINA 382

Query: 433 CPVSPCMERAVLDQLADYFKARLAG 457
            P+S   E AV   L    K +LA 
Sbjct: 383 TPISIRNELAVALGLKAAIKQKLAA 407


>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
 gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
          Length = 427

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 52/216 (24%), Positives = 89/216 (41%), Gaps = 30/216 (13%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           +N+ +E A LA+ L  E+++G  S + P++                  L+  T       
Sbjct: 79  SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121

Query: 211 SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
           S    E L    AE + R Y+E D V  W  A   F+ +P     + FT   F++A   V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180

Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
           ++    + +    R   LVPL   L+  +S     +  VDD   + VD  ++AG+ +   
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239

Query: 327 YNISFTGHDF-KCGP------QPNSKLLINYGFVDE 355
           +        F + G       + N++ +I + F DE
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARDVITFTFPDE 275


>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
 gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
          Length = 479

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 72/350 (20%), Positives = 126/350 (36%), Gaps = 77/350 (22%)

Query: 97  LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           LK    H + H  IH        A   +   +  F +P+ L+++++       +   L  
Sbjct: 24  LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            +L     L + ++YE  QG++S W  Y R L         + ++ + W++ +L+ L GS
Sbjct: 82  KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
               +I + A          DT+                               +Q  V 
Sbjct: 135 SVVGKIGKAAAD--------DTI-------------------------------LQKVVP 155

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISF 331
            +Q  S  R F   P  PPL +  S+   +  A            ++ G SI++ Y    
Sbjct: 156 LIQANS--RHFPPRPNMPPLNSPDSQNALLCLA------------HRMG-SIIMAYAFDI 200

Query: 332 TGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 391
              D       +   + +    DED P   +V  A +   D Q  + R+  +    +   
Sbjct: 201 EKTDEADEHTADDGYMTD----DEDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKA 256

Query: 392 VFHVHAGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 436
           + ++++G E       LP      R GYV+D      V+  SL  IC V+
Sbjct: 257 IKNIYSGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDAICKVA 306


>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 514

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 101/428 (23%), Positives = 169/428 (39%), Gaps = 76/428 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
           VA + ++ AG+    VP    +       +  +   +  +  S  A LA +++ E    G
Sbjct: 85  VATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFDAG 144

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYN 229
            KS + P++R L R        V+S + W+E EL+ L+GS     +  RA     + EY+
Sbjct: 145 SKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVVVFTRAIKAQWRMEYD 195

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
            LD       +L +++P     E    +TF+ F  A   + S  + L   S       + 
Sbjct: 196 ALDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIR 248

Query: 287 LGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
           +  PLL     A   K +    A  +AV++     ++             T   F    +
Sbjct: 249 VLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFR-----------EHTELRFNYDTK 297

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 401
           P+   L+ YGF+ E NP +   VEA +   D         + R+ K  +   H    +++
Sbjct: 298 PSQYFLLQYGFIPETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLDPKKR 348

Query: 402 E------AIS-DMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-LDQLA 448
                  +I  D+L   R+    D +EM    S     S   +   +    +AV L  LA
Sbjct: 349 NFEWKPRSIDYDLLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLLKSLA 407

Query: 449 DYFKARLAGYPATLSEDEAML-------TDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 501
            +    L  Y  TL+ED   +        D  L  K++    L+RM +K +   L  +AD
Sbjct: 408 SF----LESYTTTLAEDNEYVARVDDESNDEPLPGKRKRFAVLLRMREKQI---LLASAD 460

Query: 502 MIML-LPD 508
            +   LPD
Sbjct: 461 ALFKELPD 468


>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
 gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
          Length = 1335

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 55/241 (22%), Positives = 96/241 (39%), Gaps = 34/241 (14%)

Query: 129  PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
            P   ++ ++    N  +  +     L++   L L+++YEK +   +FW P+   L     
Sbjct: 849  PRKYLINVDVAKSNPILGPIFEELHLNDETILFLFVIYEK-ENPNTFWRPFYDTLPS--- 904

Query: 189  RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK--REYNELDTVWFMAGSLFQQYP 246
                   + + +S TEL  L G+   AE L   + ++  R+Y        +   L  QYP
Sbjct: 905  ----YFTTSIHYSSTELLELEGTNLFAETLAVKQQLQAFRDY--------LFPELSNQYP 952

Query: 247  YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
               P   F++E F  A   + S  + L K+    +  LVP+   ++ + +  +      D
Sbjct: 953  DIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMAD-MINHHTNAQISERHFD 1010

Query: 307  ---DAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLV 363
               +  ++V      A   I + Y           G   NS L + YGFV  +N YD   
Sbjct: 1011 QDSNCFRMVSSCNIPANNQIFLHY-----------GALQNSDLALYYGFVIPNNIYDSFH 1059

Query: 364  V 364
            V
Sbjct: 1060 V 1060


>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 451

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           +D + LV  + + +G  +            F  GP PNS+LL+ +GF   DNP++ + + 
Sbjct: 229 NDCLHLVTLQDWASGSEV-----------KFSYGPLPNSRLLLLHGFCLPDNPFESVELW 277

Query: 366 AALNTEDPQYQDKRMVAQRNG 386
           A +    P + +K  +   NG
Sbjct: 278 AMMEPGAPGFAEKNKIMLDNG 298


>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
          Length = 527

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 13/74 (17%)

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
           S  CK + +A+  +VQ   DR YK GE + V Y           GP  N  LL  YGF+ 
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSY-----------GPHSNDFLLTEYGFIL 240

Query: 355 EDNPYDRLVVEAAL 368
           + N +D + ++  +
Sbjct: 241 DTNRWDEVYLDEVI 254


>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
 gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
          Length = 485

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 9/105 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + AS D+   +  F +P  L+++++       +   L   +L     L + ++YE  QG+
Sbjct: 51  ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
           +S W PY   L         + ++ + W++ +L+ L GS    +I
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKI 146


>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
           [Gallus gallus]
          Length = 283

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 42/182 (23%), Positives = 85/182 (46%), Gaps = 23/182 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYA 218

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
               V        Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 219 YFYKV-------IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PL 287
           PL
Sbjct: 272 PL 273


>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 661

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552


>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 622

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 41/89 (46%), Gaps = 13/89 (14%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQ 341
            A+VPL   L A +    A L    D +++   R  K GE I   YN          G  
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQI---YNT--------YGDP 393

Query: 342 PNSKLLINYGFVDEDNPYDRLVVEAALNT 370
           PNS LL  YG VD+ N +D  VVE ++ T
Sbjct: 394 PNSDLLRRYGHVDDPNRFD--VVEISIKT 420


>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 693

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 85/205 (41%), Gaps = 22/205 (10%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LAL++ +++ + K S W PY+  L R        + S L +   +L +L G+        
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
               +K EY+   ++    G L  + Y +D+   A+T  I  +AF + +   V+L +   
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220

Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
            ++     +  PL+  S+     K    A    + L V  P  + E I   Y        
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNY-------- 272

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYD 360
              GP  N +L+  YGF   DNP D
Sbjct: 273 ---GPLNNQQLMTTYGFCIVDNPCD 294


>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 354

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 12/79 (15%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           +++PL   L A +SKC A L    + +++V  R  +  E +   YNI         G  P
Sbjct: 80  SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQV---YNIY--------GEHP 128

Query: 343 NSKLLINYGFVDED-NPYD 360
           NS+LL  YG+V+ D + YD
Sbjct: 129 NSELLRRYGYVEWDGSKYD 147


>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
 gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
          Length = 441

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 15/152 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL--- 281

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
                 L   +    P E +T+E +  A+  V
Sbjct: 282 ----IPLLSNHREVFPPELYTWEHYLWAYFDV 309


>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
           206040]
          Length = 463

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 76/355 (21%), Positives = 140/355 (39%), Gaps = 52/355 (14%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            Q G+   ++P   + T+E    +  +  +L + +  LS    LA+YL++ +        
Sbjct: 34  FQQGERILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDTLAVYLLFVR-------- 85

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
              +RE   +  R  +A       S + ++E EL    G+       +  E I+ +Y   
Sbjct: 86  ---LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR-- 140

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                +   +F Q+P  +P    + + +K A   V S  +     +      L P    +
Sbjct: 141 ----VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDFVLPNGKPLRVLAPFAD-M 195

Query: 292 LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           + +S + K   A    +  L V   + Y+ G+ I + Y           G  PN++LL  
Sbjct: 196 INHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYISY-----------GSIPNNRLLRL 244

Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 409
           YGFV  +NP D   +  + +   P Y+ K+ +    G  S     +        + D LP
Sbjct: 245 YGFVIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPL-------TLIDPLP 297

Query: 410 -----YLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGY 458
                YLR+  + D S++ ++ +  L     +S   E  +L  L +   A L G+
Sbjct: 298 KSVLRYLRIQRL-DASDLAAIALQKLDTNEKISNSKEVEILQFLVESISALLDGF 351


>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
          Length = 478

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 14/88 (15%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
           R   +P+   L  +   CK + +A+  +VQ   DR YK GE + V Y           GP
Sbjct: 178 RLVCMPVAD-LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSY-----------GP 223

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAAL 368
             N  LL  YGF+ + N +D + ++  +
Sbjct: 224 HSNDFLLTEYGFILDTNRWDEVYLDEVI 251


>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
          Length = 440

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 76/337 (22%), Positives = 134/337 (39%), Gaps = 47/337 (13%)

Query: 65  AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASED 118
           A SR V    + +  +LK W            LK++   +    P H+      + +   
Sbjct: 20  AESRGVNESFKCEFIELKKW------------LKDRKFEDTNLMPAHFPGTGRGLMSKTS 67

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P + ++T + V+ +  +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPENCLLTTDTVIES-YLGAYITKWKPPPSPLLALCTFLVSEKHAGDQSPW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L   P KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
               LF +    I    F++     A+  V +  V+L+     RR   +   P   A + 
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRH----RRRECLSAEPDTCALAP 230

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD--FKC-GPQPNSKLLINYGFV 353
               +  + +  V+   +   +  E   +    S+  H+  F C G   N +LL+ YGFV
Sbjct: 231 YLDLLNHSPNVQVRAAFNEETRCYE---IRTASSWRKHEEVFICYGHHDNQRLLLEYGFV 287

Query: 354 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
              NP+  + V   +  +     DK+M    N K+S+
Sbjct: 288 SIQNPHACVYVSREILVKYLPSTDKQM----NKKISI 320


>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 684

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 86/205 (41%), Gaps = 22/205 (10%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++ +++ + K S W PY+  L R    G+L   S L +   +L +L  +        
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165

Query: 220 RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
             + +K EY+   ++    G  L + Y ++I   A+T  I  +AF + +    ++ K   
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223

Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHD 335
            R+     +  PL+  S+     K    A    + L V  P  A E I          H+
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEI----------HN 273

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYD 360
              GP  N +L+  YGF   DNP D
Sbjct: 274 -NYGPLNNQQLMATYGFCIVDNPCD 297


>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
          Length = 637

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
           V ASEDL +      +P SL+++ ++   +  I       E+     TN+ +E   L  Y
Sbjct: 46  VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
           +  EKK+G++SF+ PYI+ +  Q     +A        +E PL+  E +L      G  +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162

Query: 214 KAE-ILERAEGI 224
           KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174


>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
          Length = 370

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 113/288 (39%), Gaps = 55/288 (19%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE--------LACLALYLMY 166
           A + +  G     +P+ L++T     GN+    L   N +          +  ++++L +
Sbjct: 38  AKKPITKGSILLEIPDPLLIT-----GNKVCKWLERNNWIGHQQISSVQGVLLVSIFLFF 92

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
           E +Q   SFW PY++ L        L  +  LL      +Y+T    +A+I++  E ++R
Sbjct: 93  ESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQMVESVRR 141

Query: 227 EYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
                     +    FQ Y  P+     D       F+ F + + AV S + +L      
Sbjct: 142 ----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICYLPDDIAG 191

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDF-KC 338
              ALVPLG               AVD  V ++  +  +         N S     F   
Sbjct: 192 ---ALVPLGDIFNH---------EAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSY 239

Query: 339 GPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
           G   N++L++ YGF   DNP+D L        E  ++ + R+V  R G
Sbjct: 240 GALSNTELMMYYGFTLNDNPWDTLSFYPHELDESIKFYE-RVVLDREG 286


>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 705

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
           +AA+ED+ A      +PN ++++L ++   E + +++  N           +E   +A+Y
Sbjct: 21  IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
           +++EK +G+KSF+ PY   + R        +E   L    E+ Y
Sbjct: 80  VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123


>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
           nagariensis]
 gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
           nagariensis]
          Length = 163

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 36/73 (49%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           + + A E ++ G     VP  L+++ +    +E     +    L+E   L L+L+ E+  
Sbjct: 77  YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136

Query: 171 GKKSFWLPYIREL 183
           G +SFW PY+  L
Sbjct: 137 GSRSFWAPYLDTL 149


>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
          Length = 321

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)

Query: 57  VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
           V++ D   A   +  ++ +ED      W   NG +   K+ +K +            V  
Sbjct: 50  VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
            E ++ G+   S P +L +  +  +  +   + + L  +K      + L++++E K GK 
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162

Query: 174 SFWLPYIRELDRQ 186
           SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175


>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
          Length = 371

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 13/88 (14%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
           R AL+P+       S  C    +   +   +  DR Y+A E +   Y           G 
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSY-----------GA 224

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAAL 368
             N  LL  YGF+ +DNP+D+L ++A L
Sbjct: 225 HSNDFLLAEYGFMLQDNPHDQLCLDAVL 252


>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 490

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 66/307 (21%), Positives = 123/307 (40%), Gaps = 58/307 (18%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
           L SW   NG+    +  ++  S +   +    VA +E   + G+A    +VP+ L +TLE
Sbjct: 11  LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70

Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
            V  +           + + +   T + + L  L + + +         +K G  + W  
Sbjct: 71  YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
           YIR +         +V  P  +S  E   L G+  +  +  +   +++E++ L     + 
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
            W       Q++ +D  T  FTF+ +K      +S VV L +       A+VP   +   
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233

Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
               S K K       +AV QL   +  + GE + + Y             +P S+++ +
Sbjct: 234 ACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYG----------DEKPASEMVFS 283

Query: 350 YGFVDED 356
           YGFV+ +
Sbjct: 284 YGFVENE 290


>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 62/311 (19%), Positives = 121/311 (38%), Gaps = 66/311 (21%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE----DLQAGDAAFSVPNSLVVTL 136
           L +W   NG+    +  ++    +   +    VA +E    D +A D   +VP+ L +TL
Sbjct: 11  LSTWAKLNGMSLEGIAFQKLHGEHGTDKGTAIVATAEKKDEDAEA-DTLLTVPSDLALTL 69

Query: 137 ERVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWL 177
           E V  +           + + +   T + + L  L + + +         +K G  + W 
Sbjct: 70  EYVHNHAKTDRHLREVLDAVGDFGRTARGAILIFLIVQITHASPDFANQRQKIGVSNPWT 129

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----D 232
            YIR +         ++  P  +S  E   L G+  +  +  +   +++E+  L     D
Sbjct: 130 EYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFEHLRQATED 182

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             W       Q++ +D  T  FTF+  K      +S VV L +   A       + P + 
Sbjct: 183 IHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA-------IVPCVD 229

Query: 293 AYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSK 345
             +  C+ ++ A  D        +QL   +  + GE + + Y             +P S+
Sbjct: 230 MANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYG----------DEKPASE 279

Query: 346 LLINYGFVDED 356
           ++ +YGFV+ +
Sbjct: 280 MVFSYGFVENE 290


>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 544

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435


>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
          Length = 546

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437


>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 436

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 49/221 (22%), Positives = 79/221 (35%), Gaps = 30/221 (13%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 210
           LS    L +Y+  E ++GK SFW P+   LD         +  PL+W   ++ +L  L  
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172

Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
            P +    +       +YN +        +   Q   D        + F  A++ + S  
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224

Query: 271 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWY 327
           +++      S +  F + P     L +S      L       Q+     Y   E + + Y
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283

Query: 328 NISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
                      GP  N  LL  YGF   DN ++ L V   L
Sbjct: 284 -----------GPHSNDFLLCEYGFTISDNKWNDLDVTEYL 313


>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
          Length = 305

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 55/261 (21%), Positives = 100/261 (38%), Gaps = 53/261 (20%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           D+Q G+  F++P S V++      +  + +L     L   A L + ++YE  +G  S W 
Sbjct: 3   DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----------IKRE 227
           PY   L           ++ + WS+ ELA L  S    +I + +            ++R 
Sbjct: 61  PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKIGKDSANEMFTNTIIPLVRRH 113

Query: 228 YN---------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
            +               +L  +    GS    Y +DI  +  + ++ ++ + +       
Sbjct: 114 ASVFFPDPNTAQGASDGDLLALAHRMGSTIMAYAFDIEPDPASKQVDEEGYASDD----- 168

Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFT 332
            +  +L +   +VPL   L A + +  A L    DA+ +       AG+ +   Y     
Sbjct: 169 -EDEALPK--GMVPLADMLNADADRNNARLHYGPDALTMEAVTNISAGDEVFNDY----- 220

Query: 333 GHDFKCGPQPNSKLLINYGFV 353
                 G  P S LL  YG+V
Sbjct: 221 ------GSLPRSDLLRRYGYV 235


>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
          Length = 253

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
           V A+ D+       +VP  L++++E       + +L+  +K L  +    L+++L+ EK 
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
           +G  SFW PYI  L +          + L +S  EL  L GSPT    L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225


>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
           [Gallus gallus]
          Length = 225

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217


>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 503

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 60/231 (25%), Positives = 88/231 (38%), Gaps = 35/231 (15%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQP 342
           A+VP+   L A      A L      +++V  +P  +GE I   Y           G  P
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTY-----------GDPP 303

Query: 343 NSKLLINYGFVD---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNG 386
           NS LL  YG VD           NP D + V A   LN  + + Q   +  +       G
Sbjct: 304 NSDLLRRYGHVDLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEG 363

Query: 387 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 446
              V VF     R+ E  S+++  +RL  +  T   ++      P   V       +L  
Sbjct: 364 GDDVFVFT----RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHV 416

Query: 447 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 497
           +      RL  YP ++ +DEA+L    L   KR A  +   EK +L   L 
Sbjct: 417 VTGALHERLQQYPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466


>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
 gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
          Length = 393

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 6/72 (8%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LAL ++ E+ +G+ + W PYI  L +        +++   W +TEL+YL  SP   +  E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220

Query: 220 RAEGIKREYNEL 231
           R E I  E+ ++
Sbjct: 221 RLEMITTEFGQV 232


>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
          Length = 440

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 76/358 (21%), Positives = 133/358 (37%), Gaps = 50/358 (13%)

Query: 41  GSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEK 100
           G     +RR+  F   VS         R V    + +  +LK W+         +I    
Sbjct: 5   GGRTSRIRRRKLFRCSVS---------RGVNESYKPEFIELKKWLKDRKFEDTNLIPACF 55

Query: 101 PSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELAC 159
           P           + +   L+ G    S+P S ++T + V+ +     +       S L  
Sbjct: 56  PGTGRG------LMSKTSLREGQMIISLPESCLITTDTVIRSYLGTYIAKWQPPPSPLLA 109

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P KA+  E
Sbjct: 110 LCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLKAKAEE 161

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKV 276
           +   ++  ++     +     LF +    I    F++     A+  V +  V   H Q+ 
Sbjct: 162 QRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVKHRQRQ 217

Query: 277 SLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFT 332
             +       L P   LL +S + +   A  ++    ++      +  E + + Y     
Sbjct: 218 CFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICY----- 272

Query: 333 GHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
                 GP  N +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+
Sbjct: 273 ------GPHDNQRLLLEYGFVSIHNPHACVYVSEDILVKYLPTTDKQM----NKKISI 320


>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
          Length = 854

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
           V A E +Q G+    + N  V+ L   L +    +  +     N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571

Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
              ++ S + P++ +L             P+L S+ +L +L GS    E+      + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626

Query: 228 Y 228
           +
Sbjct: 627 F 627


>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 706

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 88/217 (40%), Gaps = 33/217 (15%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 210
           + E   LA +LM +   G +SFW PYI+ L  D Q  R +      L W E T L  L  
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179

Query: 211 SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
              +    +   G++  +E+   +T    W  F+  S        I + AF+ E+ K   
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233

Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
               + V  L+  S+     LVPL   +  +    +   A   + + L+V +    GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLV-DISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287

Query: 324 VVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYD 360
              Y           GP+ N +L++NYGF    N  D
Sbjct: 288 PNNY-----------GPRSNERLMMNYGFCIRGNVCD 313


>gi|397642897|gb|EJK75526.1| hypothetical protein THAOC_02751 [Thalassiosira oceanica]
          Length = 395

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 70/329 (21%), Positives = 131/329 (39%), Gaps = 54/329 (16%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            K W    G+        E+   ++K R I Y   +   +AG A   VP  L+++  + +
Sbjct: 82  FKYWASTMGIEKNDCFKLEE--QDKKQREI-YAMTTRSTEAGTAVLYVPEHLILSSSKAM 138

Query: 141 ----------GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
                       E +A +   ++L E   L L ++ E ++G  S W  ++  L R     
Sbjct: 139 AELRTDGMAEAEEYLASVGAESQLREY-YLMLKVLLEYQKGSDSEWHKWLDALPRYYSNA 197

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
                  +  +E  L  L     K  + ER       Y+ + +V F+A  + + +P D+ 
Sbjct: 198 -------VAMTEFCLTCLPPLMKKLAVEERDAQKLLSYDSIQSVPFLADDIKEGFPRDMV 250

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
           T A+               +V+ + V        ++P+G     ++S    ++   D+A 
Sbjct: 251 TWAYQ--------------IVYTRSVETEDGDLKIIPMG-DFFDHASDYAEIVPQYDEAG 295

Query: 310 QLVVDRPYK--AGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNP--YDRLVVE 365
                  Y   AG+ +   Y+           P+  S LL  YGF+DE  P  Y +L + 
Sbjct: 296 NYYAVTAYDVPAGKKLRYIYS----------NPRNPSHLLARYGFIDEICPATYCKL-LP 344

Query: 366 AALNTE--DPQYQDKRMVAQRNGKLSVQV 392
             +N E  +  Y  ++M+  R+G+++ +V
Sbjct: 345 PTVNEEMIELGYSQEKMLFYRSGEVADEV 373


>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 490

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 65/307 (21%), Positives = 123/307 (40%), Gaps = 58/307 (18%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
           L SW   NG+    +  ++  S +   +    VA +E   + G+A    +VP+ L +TLE
Sbjct: 11  LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70

Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
            V  +           + + +   T + + L  L + + +         +K G  + W  
Sbjct: 71  YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
           YIR +         +V  P  +S  E   L G+  +  +  +   +++E++ L     + 
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
            W       Q++ +D  T  FTF+ +K      +S VV L +       A+VP   +   
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233

Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
               S K +       +AV QL   +  + GE + + Y             +P S+++ +
Sbjct: 234 ACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYG----------DEKPASEMVFS 283

Query: 350 YGFVDED 356
           YGFV+ +
Sbjct: 284 YGFVENE 290


>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
          Length = 199

 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 51/207 (24%), Positives = 85/207 (41%), Gaps = 23/207 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L ++L+ E+ +G  SFW PY+  L         +    L W+  E+  L    TK    +
Sbjct: 1   LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
                +  +N L   +     L +Q P      AFT+++FK A+ +V +  V++   S  
Sbjct: 53  LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYM---SQP 105

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW----YNISFTGHD 335
           +   L P      A +     +   VD  V    D   K+ +   +     Y+  F  + 
Sbjct: 106 QNSVLSPDEEDKSALAPFLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINY- 164

Query: 336 FKCGPQPNSKLLINYGFVDEDNPYDRL 362
              GP  N KLL+ YGF    NP++ +
Sbjct: 165 ---GPHSNEKLLLEYGFTLPCNPHNNI 188


>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
          Length = 440

 Score = 39.3 bits (90), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 65/281 (23%), Positives = 114/281 (40%), Gaps = 37/281 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
           LQ G    S+P + ++T + V+ +   A +       S L  L  +L+ EK  G +S W 
Sbjct: 68  LQEGQVIISLPETCLLTTDTVIRSYLGAYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWK 127

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L   P +A+  E+   ++  ++     +  
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLRAKAEEQRARVREFFSSSRGFFSS 179

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PL 291
              LF +    I    F++     A+  V +  V++ K    R F+  P    L P   L
Sbjct: 180 LQPLFSEAVGSI----FSYRALLWAWCTVNTRAVYV-KPRRRRCFSAEPDTCALAPYLDL 234

Query: 292 LAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLIN 349
           L +S   +   A  ++    ++      +  E + + Y           GP  N +LL+ 
Sbjct: 235 LNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFICY-----------GPHDNQRLLLE 283

Query: 350 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
           YGFV   NP+  + V   +  +     DK+M    N K+S+
Sbjct: 284 YGFVSIHNPHACVYVSEDILVKYLPSTDKQM----NKKISI 320


>gi|406978090|gb|EKE00118.1| hypothetical protein ACD_22C00090G0009 [uncultured bacterium]
          Length = 478

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 44/191 (23%), Positives = 74/191 (38%), Gaps = 28/191 (14%)

Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
           P+HY+   ++   G   FS P   + T   + G E + E L     SE     +Y MY K
Sbjct: 77  PMHYMQKEKEHVKG---FS-PELAIAT---IAGGEKLTEELAIRPTSETI---MYDMYRK 126

Query: 169 KQGKKSFWLPYIRELD----------RQRGRGQLAVE-SPLLWSETELAYLTGSPTKAEI 217
                  W    R+L           R   R  L +  S  LW E   A+LT   +   +
Sbjct: 127 -------WTNSWRDLPVLINQWCNVVRWEKRTYLFLRTSEFLWQEGHCAHLTHEESTDTV 179

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
           +      K+ YN+L +++ + G   +   +    + +TFE       ++Q+C  H    +
Sbjct: 180 IWAINAYKKTYNDLMSIYGIVGVKSESEKFAGAVKTYTFESLMPNGKSLQTCTSHDLGQN 239

Query: 278 LARRFALVPLG 288
            ++ F     G
Sbjct: 240 FSKSFEWTVQG 250


>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
          Length = 458

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 67/327 (20%), Positives = 126/327 (38%), Gaps = 66/327 (20%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           ++ W+ ++G     + L   P+     R +         + G+   ++P+  + T+E   
Sbjct: 1   MEGWLRESGAELDGLELAHFPAIGRGVRTLRC------FKQGERILTIPSGCLWTVEHAY 54

Query: 141 GNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV---- 194
            +  +  +L + +  LS    LA+Y+++ +            RE      R  +A     
Sbjct: 55  ADAVLGPVLRSAQPPLSVEDTLAIYILFVRS-----------RESGYDGLRSHVAALPAS 103

Query: 195 -ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT-VWFMAGSLFQQYPYDIPTE 252
             S + + + EL    GS       +  + I+ +Y  L   V+ +   LF       P  
Sbjct: 104 YSSSIFFEDDELEVCAGSSLYTITRQLEQRIEEDYRGLVVRVFGLHLDLF-------PLN 156

Query: 253 AFTFEI--FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---------LAYSSKCKA- 300
            FT E   +K A   V S  +            ++P G PL         + +S + K  
Sbjct: 157 KFTIENVGYKWALCTVWSRAMDF----------VLPNGNPLRLLAPFADMVNHSPEVKQC 206

Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPY 359
            +  A    + ++  + Y+A + + ++Y           GP PNS+LL  YGFV  DNP 
Sbjct: 207 HVYDASSGNLSILAGKDYEAEDQVFIYY-----------GPMPNSRLLRLYGFVIPDNPN 255

Query: 360 DRLVVEAALNTEDPQYQDKRMVAQRNG 386
           D   +  + +   P Y+ K+ +    G
Sbjct: 256 DSYDLVLSTHPLAPFYEQKQKLWASAG 282


>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 400

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 50/215 (23%), Positives = 89/215 (41%), Gaps = 34/215 (15%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 219
           A+  + E     +SFW PY+ EL         AV +   W++ ELA +  +    E++E 
Sbjct: 49  AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           + + I  +Y  +   +    +  Q +   IP+E    E +++A   V S      +  L 
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153

Query: 280 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTG 333
                +P     L +S      + C        D  +L+    Y  GE + + Y      
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISY------ 206

Query: 334 HDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 368
                G + +S+LL +YGF+ E+N  D + + A+L
Sbjct: 207 -----GEKTSSQLLASYGFMYENNAEDTVDITASL 236


>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
           melanoleuca]
          Length = 440

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 65/280 (23%), Positives = 110/280 (39%), Gaps = 35/280 (12%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
           L+ G    S+P S ++T + V+ +   A +       S L  L  +L+ EK  G +S W 
Sbjct: 68  LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 127

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L   P KA+  E+   ++  ++     +  
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 179

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
              LF +    I    F++     A+  V +  V   H Q+   +       L P   LL
Sbjct: 180 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 235

Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
            +S +   KA         ++      +  E + + Y           GP  N +LL+ Y
Sbjct: 236 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY-----------GPHDNQQLLLEY 284

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
           GFV   NP+  + V   +  +     DK+M    N K+S+
Sbjct: 285 GFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISI 320


>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 508

 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 17/109 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEK 168
           A +D+  G   FS+P  L ++L       T+  LL  ++  E         L L +M+E+
Sbjct: 43  ALQDIHEGTTLFSLPRELTLSLR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEE 98

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
            +G +S W  Y+  L           ++P+ WS  +L  L G+    +I
Sbjct: 99  SRGVESKWDVYLSSLPS-------TFDTPMFWSAEDLEELKGTAVPDKI 140


>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 518

 Score = 38.9 bits (89), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 51/222 (22%), Positives = 92/222 (41%), Gaps = 36/222 (16%)

Query: 306 DDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVE 365
           D+   +   +P K G+ I   Y           G + N  LL+ YGF    N YD   + 
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCY-----------GQRTNKFLLMWYGFCFNKNRYDSYSLR 298

Query: 366 AALNTEDPQYQD---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-D 406
             +N    Q  +   +++V Q               +  K+++     +   +K  I+ D
Sbjct: 299 LWINMRQEQLNNDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINID 358

Query: 407 MLPYLRLGYVSDTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 464
           ++ YLRL  +      +++ V+ SL    PVSP  E  VL          L+ +  T+ +
Sbjct: 359 LIIYLRLYLMMHYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKD 414

Query: 465 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 506
           D+ +L + NL+ K R A      +K++L   + +    ++LL
Sbjct: 415 DKELLQNQNLNYKYRFAIIYRLNQKEILQEQISLMNQALILL 456


>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
          Length = 483

 Score = 38.9 bits (89), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 64/292 (21%), Positives = 118/292 (40%), Gaps = 69/292 (23%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
           +AA+ DL+ G+    +P + ++T +RV  ++  IA  ++ +K  LS +  L + L+ E  
Sbjct: 51  LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
           +G  S W PY+ +L                   T LA  T +  + E L+  + I   ++
Sbjct: 111 KGSNSVWYPYLCQLPSYY---------------TILA--TFNDFEVEALQVDDAIWVAQK 153

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
             + + + W  A  L ++  +    +   F+ +  AF  V S  +H   ++      L P
Sbjct: 154 AKSAIKSDWEDATPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 208

Query: 287 LG-------------------PPLLAYSSKCKAMLAAVD----------DAVQLVVDRPY 317
           +G                     L  Y  K   M  + +          +A  L   + Y
Sbjct: 209 VGDLFNYAAPDDDTLLEDEDTAELTNYQQK-NGMTNSSERLTDGGYEDCNAYCLYARKNY 267

Query: 318 KAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 369
           K GE +++ Y           G   N +LL +YGF+  +NP ++  +E  L+
Sbjct: 268 KKGEQVLLAY-----------GTYTNLELLEHYGFLLGENPNEKTFIELDLD 308


>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
          Length = 415

 Score = 38.9 bits (89), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 65/280 (23%), Positives = 110/280 (39%), Gaps = 35/280 (12%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
           L+ G    S+P S ++T + V+ +   A +       S L  L  +L+ EK  G +S W 
Sbjct: 44  LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 103

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L   P KA+  E+   ++  ++     +  
Sbjct: 104 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 155

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
              LF +    I    F++     A+  V +  V   H Q+   +       L P   LL
Sbjct: 156 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 211

Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINY 350
            +S +   KA         ++      +  E + + Y           GP  N +LL+ Y
Sbjct: 212 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICY-----------GPHDNQQLLLEY 260

Query: 351 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 390
           GFV   NP+  + V   +  +     DK+M    N K+S+
Sbjct: 261 GFVSIQNPHACVYVSEDVLVKYLPLTDKQM----NKKISI 296


>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 241

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           + ++ +  L  W  KNG      IL     H  ++   + + A++++  GD   +VP +L
Sbjct: 81  RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134

Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
           ++T E +    L      +++  N  +    LA++++ E  ++ KKSFW  Y+  L    
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
               +   +P+ +   +L  L GSP     L+    I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226


>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
          Length = 422

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 54/123 (43%), Gaps = 28/123 (22%)

Query: 305 VDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVDEDNPYDR--L 362
           V  +++  + RP KAGE   + Y           G  P S L+  YGF+  DNPYD   L
Sbjct: 315 VTKSLKFPLSRPCKAGEQCFLSY-----------GKHPGSHLITFYGFLPRDNPYDVIPL 363

Query: 363 VVEAALNTED----------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 412
            ++ +++ ED            +  +RM+  R  +  +Q     + ++   +     YLR
Sbjct: 364 DLDTSVDEEDSSSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVHCYFVYLR 418

Query: 413 LGY 415
           LG+
Sbjct: 419 LGH 421


>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
          Length = 147

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
           +AA+ DL+ G+    +P + ++T +RV  ++  IA  ++ +K  LS +  L + L+ E  
Sbjct: 51  LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110

Query: 170 QGKKSFWLPYIREL 183
           +G  S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124


>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 297

 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
           A+ D++AG+    VP  LV+T+E    N  +  L + +++ +      LA +L+  ++  
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170

Query: 172 KKSFWLPYIREL 183
             SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182


>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
          Length = 789

 Score = 38.5 bits (88), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 43/194 (22%), Positives = 85/194 (43%), Gaps = 26/194 (13%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T +   +  + K++E   + + W+  NG+    V   + P    +   +  +AA +D+  
Sbjct: 28  TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84

Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLMYEKKQGKKS 174
             A   VP  L+++ E  + N  IA LL+ +       + +E   +  ++ +E  +G+ S
Sbjct: 85  QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-NELDT 233
           FW PY + ++          + P+LWS+ E+        + +I    +  K EY NE   
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEYENEWKL 191

Query: 234 VW--FMAGSLFQQY 245
           V+  F     + +Y
Sbjct: 192 VYEAFSKDETYDEY 205


>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
          Length = 216

 Score = 38.5 bits (88), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V  ++ ++  +   +VP S+ +T E+V  N       T    S     +L+L+ +K  GK
Sbjct: 22  VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KSF+ PYI  L ++       + + L + E ++ Y+  +  +  + ER   ++ ++++L
Sbjct: 77  KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128


>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
 gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
          Length = 555

 Score = 38.5 bits (88), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 118/286 (41%), Gaps = 56/286 (19%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+ ++ T+E    +  +  +L + +  LS    LA Y+++ +        
Sbjct: 121 FKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLATYILFIRS------- 173

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYN 229
               RE      R  +A       S + ++E EL    G+   T  + L+R+  I+ +Y 
Sbjct: 174 ----RESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYR 227

Query: 230 ELDTVWFMAGS--LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRF 282
            L  V  +A S  LF       P + F+ E +K A   V S  +         + L   F
Sbjct: 228 TL-VVRVLAQSRDLF-------PLDKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPF 279

Query: 283 ALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGP 340
           A       +L +SS+ +   +  A    + ++  + Y+AG+   ++Y           G 
Sbjct: 280 A------DMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFIYY-----------GS 322

Query: 341 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 386
            PNS+LL  YGFV   NP D   +  + +   P ++ K+ +    G
Sbjct: 323 IPNSRLLRLYGFVMPGNPNDSYDLVISTHPSAPFFERKQKLWASAG 368


>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
           DL-1]
          Length = 499

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 58/231 (25%), Positives = 91/231 (39%), Gaps = 42/231 (18%)

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           GN+ + E      L++   L L L YE   G++S W  Y+  L  +         S + W
Sbjct: 74  GNQEVLE-----TLNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFW 121

Query: 201 SETELAYLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQ 243
           S  EL  L  S     I  E+AE              G K+  EY  +D  + +  S+  
Sbjct: 122 SSEELEKLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIM 180

Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
            Y +D+       E  +              K     + ++VPL   L + ++   A L+
Sbjct: 181 SYSFDVDDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLS 239

Query: 304 AVDDAVQLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYGFVD 354
             +DA+ +   +  K GE I   YNI         G  PNS++L  YG+V+
Sbjct: 240 YENDALVMTATKDIKKGEQI---YNIY--------GELPNSEILRKYGYVE 279


>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 498

 Score = 38.5 bits (88), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 47/213 (22%), Positives = 90/213 (42%), Gaps = 27/213 (12%)

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W  YIR +         ++  P  ++E EL  L G+  +  +  +   +++E+  L    
Sbjct: 134 WTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS- 185

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLL 292
                  Q+Y +D  T   TF+ +K      +S VV L +       A+VP   +     
Sbjct: 186 TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHAS 241

Query: 293 AYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWYNISFTGHDFKCGPQPNSKLLINYG 351
             S K +   ++ +DA+ QL   R   +GE + + Y             +P S+++ +YG
Sbjct: 242 EDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYG----------SEKPASEMVFSYG 291

Query: 352 FVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQ 383
           FV+ E     ++ ++  +  +DP    K+M  +
Sbjct: 292 FVENERTDAKQIFLDLEIPDDDPLRMAKQMFCK 324


>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
 gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
          Length = 489

 Score = 38.5 bits (88), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
           +AA+ DL+ G+     P + ++T +RV  ++  IA  ++ ++  LS +  L + L+ E  
Sbjct: 57  LAAARDLRRGELVLRAPRAALLTSDRVTADDPRIAACVSAHRPRLSSVQILIVCLLAEVG 116

Query: 170 QGKKSFWLPYIREL 183
           +G+ S W PY+ +L
Sbjct: 117 KGRNSVWYPYLSQL 130


>gi|428183324|gb|EKX52182.1| hypothetical protein GUITHDRAFT_150712, partial [Guillardia theta
           CCMP2712]
          Length = 205

 Score = 38.1 bits (87), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 15/116 (12%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           ++D+++     S+P+ L   L R         +   +KL     +AL ++YEK + ++SF
Sbjct: 88  TQDVKSNSVVCSIPSKLF--LSRSTTRAAFGSM--ADKLDVRTAMALQILYEKSKKEESF 143

Query: 176 WLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           W  +++ L DR+       + +P LW E +   L G+     + E  E  K  Y++
Sbjct: 144 WCEWLKVLPDREN------LGTPCLWPEDDQNLLKGT----SVFEEVEASKSLYSK 189


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,913,178,062
Number of Sequences: 23463169
Number of extensions: 319442466
Number of successful extensions: 833965
Number of sequences better than 100.0: 734
Number of HSP's better than 100.0 without gapping: 84
Number of HSP's successfully gapped in prelim test: 650
Number of HSP's that attempted gapping in prelim test: 832737
Number of HSP's gapped (non-prelim): 998
length of query: 523
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 376
effective length of database: 8,910,109,524
effective search space: 3350201181024
effective search space used: 3350201181024
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)