BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 010377
         (512 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 521

 Score =  828 bits (2139), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/499 (80%), Positives = 442/499 (88%), Gaps = 10/499 (2%)

Query: 21  HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
           H PL + S IS+S  R  +F +S   +RR N  S   SSS+TLVAGSR       E V+K
Sbjct: 24  HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81

Query: 74  KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           KE+D  GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82  KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141

Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
           AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTE
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261

Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
           AF+FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELV 321

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 372
           VDRPYKAGESI VWCGPQPNSKLL+NYGFVDEDN YDRLVVEAALNTEDPQYQDKRMVAQ
Sbjct: 322 VDRPYKAGESIAVWCGPQPNSKLLLNYGFVDEDNRYDRLVVEAALNTEDPQYQDKRMVAQ 381

Query: 373 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 432
           RNG+LS+Q F+V+AG+EKEA+ DMLPYLRLGYV+  SEMQSVISS GP+CPVSPCMERA+
Sbjct: 382 RNGRLSIQAFYVYAGKEKEAVLDMLPYLRLGYVTHPSEMQSVISSQGPVCPVSPCMERAM 441

Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           L+Q+ADYFK RLAGYP TLSEDE +L D NL+PKKRVATQLVR+EKK+L++CL+VT D I
Sbjct: 442 LEQVADYFKRRLAGYPTTLSEDEFLLADGNLNPKKRVATQLVRLEKKLLHSCLEVTIDFI 501

Query: 493 MLLPDVTVSPCPAPYAPLL 511
             LPD TVSPCPAPYAPLL
Sbjct: 502 NQLPDHTVSPCPAPYAPLL 520


>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Vitis vinifera]
          Length = 509

 Score =  828 bits (2138), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/462 (85%), Positives = 423/462 (91%), Gaps = 7/462 (1%)

Query: 58  SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
           S SDTLVAGSR       E   KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48  SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           LDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPP
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPP 287

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
           LLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDEDN YDR
Sbjct: 288 LLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDEDNSYDR 347

Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
           +VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+SDMLPYLRLGYVSD SE
Sbjct: 348 IVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGYVSDPSE 407

Query: 411 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVA 470
           MQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+PKK VA
Sbjct: 408 MQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLNPKKLVA 467

Query: 471 TQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
           TQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL 
Sbjct: 468 TQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 509


>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Vitis vinifera]
          Length = 515

 Score =  821 bits (2121), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/468 (84%), Positives = 423/468 (90%), Gaps = 13/468 (2%)

Query: 58  SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
           S SDTLVAGSR       E   KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48  SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107

Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           HY+AASEDLQ      AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
           VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 347

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 404
           DN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV  G+E+EA+SDMLPYLRLGY
Sbjct: 348 DNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHVSVGKEREAVSDMLPYLRLGY 407

Query: 405 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
           VSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RLAGYP T+SEDE +L D NL+
Sbjct: 408 VSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRLAGYPTTMSEDECLLADSNLN 467

Query: 465 PKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
           PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCPAPY PLL 
Sbjct: 468 PKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCPAPYTPLLK 515


>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
 gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
          Length = 502

 Score =  809 bits (2090), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/517 (78%), Positives = 439/517 (84%), Gaps = 22/517 (4%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME +C    +K ISP        L++ S +SIS    P    S    RR   F+      
Sbjct: 1   MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45

Query: 61  DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
            TLV G R  EVVSK+    E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46  -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           WFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPPLLAY
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAY 284

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           SS CKAML AVD AV+LVVDRPYKAGE IVVWCGPQPNSKLL+NYGFVDEDNPYDR+ VE
Sbjct: 285 SSNCKAMLTAVDGAVELVVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDEDNPYDRIAVE 344

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 414
           AALNTEDPQYQDKRMVAQRNGKLSVQVF V+AG+EKEA+SD+LPYLRLGYVSD SEMQSV
Sbjct: 345 AALNTEDPQYQDKRMVAQRNGKLSVQVFQVYAGKEKEAVSDILPYLRLGYVSDPSEMQSV 404

Query: 415 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
           ISS GP+CPVSPCME+AVLDQL  YF+ RLAGY  ++SEDE ML D NL+PKKRVATQLV
Sbjct: 405 ISSQGPVCPVSPCMEQAVLDQLTVYFRTRLAGYCTSISEDELMLADPNLNPKKRVATQLV 464

Query: 475 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           R+EKKML ACLQ T D+I  LPD T+ PCPAPYAPLL
Sbjct: 465 RLEKKMLKACLQATVDLINQLPDHTMPPCPAPYAPLL 501


>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
          Length = 428

 Score =  793 bits (2047), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/427 (88%), Positives = 401/427 (93%)

Query: 85  MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
           MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1   MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61  IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120

Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
           A+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIV
Sbjct: 181 AIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIV 240

Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 384
           VWCGPQPNSKLL+NYGFVDEDN YDR+VVEAALNTEDPQYQDKRMVAQRNGKL+VQ FHV
Sbjct: 241 VWCGPQPNSKLLLNYGFVDEDNSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLTVQKFHV 300

Query: 385 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 444
             G+E+EA+SDMLPYLRLGYVSD SEMQSVISS GPICPVSPCMERAVLDQL DYF+ RL
Sbjct: 301 SVGKEREAVSDMLPYLRLGYVSDPSEMQSVISSQGPICPVSPCMERAVLDQLVDYFERRL 360

Query: 445 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCP 504
           AGYP T+SEDE +L D NL+PKK VATQLVR+EKKMLNACL+ T D+I  LPD TVSPCP
Sbjct: 361 AGYPTTMSEDECLLADSNLNPKKLVATQLVRLEKKMLNACLKATVDLINQLPDHTVSPCP 420

Query: 505 APYAPLL 511
           APY PLL
Sbjct: 421 APYTPLL 427


>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
 gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
          Length = 501

 Score =  782 bits (2019), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/476 (78%), Positives = 418/476 (87%), Gaps = 2/476 (0%)

Query: 36  RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
           R P+F S     RR+ R     S+SDTLVA + +   K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27  RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84

Query: 96  ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
           +LK+KPS ++  +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85  VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K 
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
           EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ 
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264

Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
           VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KL
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKL 324

Query: 336 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 395
           L NYGFVDEDN  DRL+VE AL+TEDPQYQDKR+VAQRNGKLS+Q F+V+ G+E+EA+SD
Sbjct: 325 LTNYGFVDEDNSNDRLIVEVALSTEDPQYQDKRIVAQRNGKLSIQTFYVYTGKEREAVSD 384

Query: 396 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           M+PY+RLGYVSD SEMQSVISS GP+CPVSPCMERAVLDQLADYF  RLA YP TL+EDE
Sbjct: 385 MIPYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAVLDQLADYFNTRLAAYPTTLAEDE 444

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           +MLTD +L+PK+RVATQLVR+EKKML+ACLQ   D+I  LPD +VSPCPAPYAP L
Sbjct: 445 SMLTDGSLNPKRRVATQLVRLEKKMLHACLQAIMDLISQLPDHSVSPCPAPYAPSL 500


>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
 gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
 gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
          Length = 514

 Score =  778 bits (2010), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/520 (74%), Positives = 433/520 (83%), Gaps = 16/520 (3%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  +   S+ VSSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVA           +V SKKE +D  DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54  DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353

Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
           +VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 354 IVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 413

Query: 412 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           QSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D +L P+KRVAT
Sbjct: 414 QSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 473

Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           +LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L
Sbjct: 474 RLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSL 513


>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
          Length = 514

 Score =  776 bits (2004), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/520 (74%), Positives = 432/520 (83%), Gaps = 16/520 (3%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  +   S+ VSSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVA           +V SKKE +D  DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54  DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353

Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
           +VEAALNTE PQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 354 IVEAALNTEGPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 413

Query: 412 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           QSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D +L P+KRVAT
Sbjct: 414 QSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 473

Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           +LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L
Sbjct: 474 RLVQLEKKILVACLTTTVDLLNQLPDTAISPCPAPYAPSL 513


>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score =  772 bits (1994), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/521 (73%), Positives = 431/521 (82%), Gaps = 14/521 (2%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  ++   +  SSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVAG          +V SKKE +D  DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56  DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 296 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRI 355

Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
           +VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+ DMLPYLRLGY+SD SEM
Sbjct: 356 IVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAVQDMLPYLRLGYMSDPSEM 415

Query: 412 QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           QSVISS GP+C +SPCMERAVLDQLA+YF  RL+GYP T  ED+A+L D +L P+KRVAT
Sbjct: 416 QSVISSQGPVCTMSPCMERAVLDQLANYFMRRLSGYPTTPKEDDALLADPSLSPRKRVAT 475

Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
           +LV++EKK+L ACL  T D++  LPD  +SPCPAPYAP L 
Sbjct: 476 RLVQLEKKILAACLTTTVDLLNQLPDTAISPCPAPYAPSLK 516


>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 1 [Glycine max]
 gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 2 [Glycine max]
          Length = 502

 Score =  769 bits (1987), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/432 (83%), Positives = 397/432 (91%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           DLKSWMHK+GLPPCKV+LK+KP  N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70  DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
           LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
           G+ IVVWCGPQPNSKLLINYGFVDE+N  DRL+VEAALNTEDPQYQDKRMVAQRNGKLSV
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSV 369

Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 439
           QVFHV+AG+E+EA+ DML Y+RLGYVSD SEM+SVISS GP+CPVSPCMERA LDQLADY
Sbjct: 370 QVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMESVISSQGPVCPVSPCMERAALDQLADY 429

Query: 440 FKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 499
           FKARLAGYP TL+EDE+MLTD NL+PKKRVATQ VR+EKKML+ACLQ T D I  LPD T
Sbjct: 430 FKARLAGYPTTLAEDESMLTDDNLNPKKRVATQYVRLEKKMLHACLQATTDFINQLPDHT 489

Query: 500 VSPCPAPYAPLL 511
           +SPCPAPYAPLL
Sbjct: 490 ISPCPAPYAPLL 501


>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Glycine max]
          Length = 503

 Score =  765 bits (1975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/433 (83%), Positives = 396/433 (91%), Gaps = 1/433 (0%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           DLKSWMHK+GLPPCKV+LK+KP  N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70  DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
           LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
           G+ IVVWCGPQPNSKLLINYGFVDE+N  DRL+VEAALNTEDPQYQDKRMVAQRNGKLSV
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVEAALNTEDPQYQDKRMVAQRNGKLSV 369

Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 439
           QVFHV+AG+E+EA+ DML Y+RLGYVSD SEMQSVISS GP+CPVSPCMERA LDQLADY
Sbjct: 370 QVFHVYAGKEREAVLDMLRYMRLGYVSDPSEMQSVISSQGPVCPVSPCMERAALDQLADY 429

Query: 440 FKARLAGYPATLSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDV 498
           FKARLAGYP  L+EDE+MLTD  NL+PKKRVATQ VR+EKKML+ACLQ T D I  LPD 
Sbjct: 430 FKARLAGYPTILAEDESMLTDGGNLNPKKRVATQYVRLEKKMLHACLQATIDFINQLPDH 489

Query: 499 TVSPCPAPYAPLL 511
           T+SPCPAPYAPLL
Sbjct: 490 TISPCPAPYAPLL 502


>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
          Length = 507

 Score =  724 bits (1870), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/476 (73%), Positives = 403/476 (84%), Gaps = 7/476 (1%)

Query: 42  SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
           S +RL  R   F + V S+DTL A S      ++ +  KEE++ DLKSWMH++GLPPC+V
Sbjct: 32  SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90

Query: 96  ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
           +LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91  MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS  EL Y TGSP K 
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            +LER  GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ 
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQN 270

Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
           V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++L
Sbjct: 271 VNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARL 330

Query: 336 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD 395
           L+NYGFVDEDNP+DRL+VE +L+T+DP YQDKR++AQRNGKLSVQ F+++ GREKEA+ D
Sbjct: 331 LLNYGFVDEDNPHDRLIVEVSLDTKDPLYQDKRIIAQRNGKLSVQTFNIYIGREKEAVLD 390

Query: 396 MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           MLPYLRL YVSD SEMQSV+SS GP+CPVSPC ERAVLDQL+ YF+ RLAGYP T SEDE
Sbjct: 391 MLPYLRLAYVSDPSEMQSVLSSQGPVCPVSPCTERAVLDQLSRYFRERLAGYPTTASEDE 450

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
            +L D   +PK++VATQLV +EKKMLN+CL    ++I  LPD+ V+PCP+PY+P+L
Sbjct: 451 IVLADPTTNPKRQVATQLVLIEKKMLNSCLAAVYEIIDQLPDLAVTPCPSPYSPIL 506


>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
          Length = 537

 Score =  698 bits (1802), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/489 (73%), Positives = 392/489 (80%), Gaps = 34/489 (6%)

Query: 1   MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
           ME   +   +K +S PIR    PLS  S  S+   R+    SS R V  +   S+ VSSS
Sbjct: 1   MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53

Query: 61  DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           DTLVA           +V SKKE +D  DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54  DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PYIRELDRQRGRGQL  ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL------------------ 273
           DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHL                  
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
           Q V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNA 353

Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 393
           KLL+NYGFVDEDNPYDR++VEAALNTEDPQYQDKRMVAQRNGKLS QVF V  G+E+EA+
Sbjct: 354 KLLLNYGFVDEDNPYDRVIVEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRVGKEREAV 413

Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
            DMLPYLRLGY+SD SEMQSVISS GP+CP+SPCMERAVLDQLA+YF  RL+GYP T  E
Sbjct: 414 QDMLPYLRLGYMSDPSEMQSVISSQGPVCPMSPCMERAVLDQLANYFMRRLSGYPTTPKE 473

Query: 454 DEAMLTDYN 462
           D+A+    N
Sbjct: 474 DDALEASCN 482


>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
          Length = 505

 Score =  678 bits (1750), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/434 (73%), Positives = 369/434 (85%), Gaps = 3/434 (0%)

Query: 81  LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
              W+ ++GLPP KV + ++P       K  P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71  FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
           IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 251 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 310

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
           KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL
Sbjct: 311 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKL 370

Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
           ++Q FHV  G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL 
Sbjct: 371 AIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLV 430

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 497
            Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD
Sbjct: 431 GYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPD 490

Query: 498 VTVSPCPAPYAPLL 511
            TVSPCPAP+AP L
Sbjct: 491 HTVSPCPAPFAPEL 504


>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
 gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
          Length = 509

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/434 (73%), Positives = 369/434 (85%), Gaps = 3/434 (0%)

Query: 81  LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
              W+ ++GLPP KV + ++P       K  P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75  FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
           IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 255 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 314

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
           KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN EDPQ+Q+KRMVAQRNGKL
Sbjct: 315 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLNIEDPQFQEKRMVAQRNGKL 374

Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
           ++Q FHV  G+EKE I++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL 
Sbjct: 375 AIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLV 434

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 497
            Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD
Sbjct: 435 GYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAANEFINDLPD 494

Query: 498 VTVSPCPAPYAPLL 511
            TVSPCPAP+AP L
Sbjct: 495 HTVSPCPAPFAPEL 508


>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 516

 Score =  677 bits (1748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/432 (74%), Positives = 366/432 (84%), Gaps = 3/432 (0%)

Query: 84  WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           W+  +GLPP KV + E+P   S   K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85  WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
           +E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFK
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAG
Sbjct: 265 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAG 324

Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 380
           E I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +EA+LN EDPQYQ+KRMVAQRNGKL++Q
Sbjct: 325 EPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAIEASLNMEDPQYQEKRMVAQRNGKLAIQ 384

Query: 381 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 440
            F V  G+EKE IS+MLPYLRLGY+SD  EMQ ++SS G  CPVSPC ERAVLDQL  Y 
Sbjct: 385 KFQVCVGKEKETISEMLPYLRLGYISDPDEMQCILSSEGDTCPVSPCSERAVLDQLVVYL 444

Query: 441 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
           K+RLAGYP TL EDEAML D NL PKK VAT+LVR+EKK+L+ CLQ   + I  LPD TV
Sbjct: 445 KSRLAGYPTTLDEDEAMLADGNLEPKKEVATRLVRLEKKLLHGCLQAAHEFISALPDHTV 504

Query: 501 SPCPAPYAPLLN 512
           SPCPA YAP L 
Sbjct: 505 SPCPALYAPNLK 516


>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 523

 Score =  677 bits (1746), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/519 (64%), Positives = 395/519 (76%), Gaps = 23/519 (4%)

Query: 8   RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
           RSS+  +PP     +   HH L +   +     R P  GS     R  +R  +    +DT
Sbjct: 14  RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64

Query: 63  LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
           L++ S                 G    W+  NGLPP K+ + E+P   S   + RP+H+V
Sbjct: 65  LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
           AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ 
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           +WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 304

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 305 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 364

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
           EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V  G+EK+ IS+MLPYLRLGY+SD  EMQ 
Sbjct: 365 EASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQC 424

Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
           ++SS G  CPVSPC ERAVLDQL  Y K+RLAGYP  L EDEAML D +L PKK VAT+L
Sbjct: 425 ILSSEGDTCPVSPCSERAVLDQLVVYLKSRLAGYPTNLDEDEAMLADGSLEPKKEVATRL 484

Query: 474 VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLLN 512
           VR+EKKML+ CL+   + I  LPD TVSPCPA YAP L 
Sbjct: 485 VRLEKKMLHGCLEAANEFISGLPDHTVSPCPALYAPELK 523


>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
 gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
          Length = 512

 Score =  673 bits (1736), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/431 (73%), Positives = 367/431 (85%), Gaps = 3/431 (0%)

Query: 84  WMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           W+   GLPP KV ++E+P     N K  P+ YVAA  DLQAGD AF VP SLVVTLERVL
Sbjct: 81  WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
           +E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFK
Sbjct: 201 TESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFK 260

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAG
Sbjct: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKAG 320

Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ 380
           E I++WCGPQ NS+L++NYGFVDEDNP+DR+ +EA+LN+EDPQYQ+KRMVAQRNGKL++Q
Sbjct: 321 EPIIIWCGPQTNSRLVLNYGFVDEDNPFDRIAIEASLNSEDPQYQEKRMVAQRNGKLAIQ 380

Query: 381 VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYF 440
            F+V+ G+EK+ +++MLPYLRLGY+SD  EMQS++SS G  CP+SPC ERAVLDQL  Y 
Sbjct: 381 NFNVYVGKEKQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPLSPCTERAVLDQLVGYL 440

Query: 441 KARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
           ++RLAGYP TL EDEAML D +L PKK VAT+LVR+EKKM++ACLQ T + I  LPD TV
Sbjct: 441 ESRLAGYPTTLDEDEAMLADGSLEPKKEVATRLVRLEKKMIHACLQATNEFINDLPDHTV 500

Query: 501 SPCPAPYAPLL 511
           SPCPAPYAP L
Sbjct: 501 SPCPAPYAPEL 511


>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 512

 Score =  672 bits (1734), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/457 (70%), Positives = 374/457 (81%), Gaps = 3/457 (0%)

Query: 58  SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
           SSS+   A    V    E        W+   GLPP KV ++E+P     + K +P+ YV+
Sbjct: 55  SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A  DLQAGD AF V  SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTY 294

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
            S CKAML A  D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDEDNP+DR+ +E
Sbjct: 295 RSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDEDNPFDRVAIE 354

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 414
           A+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+EK+ +++MLPYLRLGY+S+  EMQS+
Sbjct: 355 ASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKEKQTVAEMLPYLRLGYISNPDEMQSI 414

Query: 415 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
           +SS G  CPVSPC ERAVLDQL  Y ++RLAGYP TL EDEAML D NL PKK VAT+LV
Sbjct: 415 LSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPTTLDEDEAMLADGNLEPKKEVATRLV 474

Query: 475 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           R+EKKML+ACLQ T + I  LPD TVSPCPAPYAP L
Sbjct: 475 RLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAPEL 511


>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
          Length = 536

 Score =  648 bits (1671), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/393 (77%), Positives = 345/393 (87%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
           LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
           GSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S C
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNC 322

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
           KAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+EA+LN
Sbjct: 323 KAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIEASLN 382

Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 418
            EDPQ+Q+KRMVAQRNGKL++Q FHV  G+EKE I++MLPYLRLGY+SD  EMQS++SS 
Sbjct: 383 IEDPQFQEKRMVAQRNGKLAIQNFHVCVGKEKETIAEMLPYLRLGYISDPDEMQSILSSE 442

Query: 419 GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 478
           G  CPVSPC ERAVLDQL  Y ++RLA YP TL ED+AML D NL PKK VAT+LVR+EK
Sbjct: 443 GDTCPVSPCTERAVLDQLVGYLESRLADYPTTLDEDDAMLADGNLEPKKEVATRLVRLEK 502

Query: 479 KMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           K+L+ CLQ   + I  LPD TVSPCPAP+AP L
Sbjct: 503 KLLHGCLQAANEFINDLPDHTVSPCPAPFAPEL 535


>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 523

 Score =  607 bits (1566), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 305/481 (63%), Positives = 361/481 (75%), Gaps = 10/481 (2%)

Query: 38  PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
           P FG+    V  + R S   ++  T V    E     SKK+E     DLK WM + GLP 
Sbjct: 45  PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104

Query: 93  CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           CKV L E +PS  +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
           NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  EL  Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224

Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
           S  K  +LER  GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCV
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
           VHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VWCGPQ
Sbjct: 285 VHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQ 344

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 390
           PNSKLL+NYGFVDEDNPYDRL VEA+L+TEDP YQ KR + Q+N +L++Q F ++ G+E 
Sbjct: 345 PNSKLLLNYGFVDEDNPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQIYKGKEM 404

Query: 391 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 450
           EA+ DMLPY+RL +++D  EM++V  + GP+CPVS C ERAVL+QL  YF+ RLAGY ++
Sbjct: 405 EAVLDMLPYMRLAHLADPEEMETVSFAQGPVCPVSACNERAVLEQLEQYFEKRLAGYKSS 464

Query: 451 LSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 510
            + +     D   + KKRVA +L+ +EK +L   L    ++I  LPD  +SPC  PY P 
Sbjct: 465 HATEGG---DAKKNAKKRVAEKLLCIEKSILRNALAAVQELISQLPDSAISPCIGPYLPN 521

Query: 511 L 511
           L
Sbjct: 522 L 522


>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
 gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
          Length = 432

 Score =  593 bits (1530), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 293/432 (67%), Positives = 344/432 (79%), Gaps = 7/432 (1%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           WM + GLPPCKV LKE+  +    + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3   WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  
Sbjct: 60  TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119

Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           EL  Y TGS  K  +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A  D V+L VDR YK GE 
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           I+VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F
Sbjct: 240 ILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNF 299

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
            +  GREKEA+ +MLPYLRLG+VSD   M+SV S+ GP CPVS C ERAVLDQLA YF+ 
Sbjct: 300 QIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQE 359

Query: 443 RLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
           R+A Y  T+ ED A+L D   +++PK+RVATQL+ +EK++L+  L V       LPD +V
Sbjct: 360 RIAKYKTTIDEDRALLEDGSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSV 419

Query: 501 S-PCPAPYAPLL 511
           + PC   + P L
Sbjct: 420 APPCCGDFVPKL 431


>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
 gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
          Length = 432

 Score =  593 bits (1529), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 293/432 (67%), Positives = 344/432 (79%), Gaps = 7/432 (1%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           WM + GLPPCKV LKE+  +    + I YV ASEDL+ GD A SVP SLVVTLERVLGNE
Sbjct: 3   WMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNE 59

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           TIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+  
Sbjct: 60  TIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSE 119

Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           EL  Y TGS  K  +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQA
Sbjct: 120 ELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQA 179

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           FVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A  D V+L VDR YK GE 
Sbjct: 180 FVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGEQ 239

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           I+VWCGPQPN++LL+NYGFVD DNP+DRL VEA+LNT DP YQ+KR++ Q+N +L++Q F
Sbjct: 240 ILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVEASLNTRDPFYQNKRIIVQKNNRLTIQNF 299

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
            +  GREKEA+ +MLPYLRLG+VSD   M+SV S+ GP CPVS C ERAVLDQLA YF+ 
Sbjct: 300 QIFKGREKEAVLEMLPYLRLGHVSDPYHMESVFSAEGPTCPVSACNERAVLDQLAQYFQE 359

Query: 443 RLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTV 500
           R+A Y  T+ ED A+L D   +++PK+RVATQL+ +EK++L+  L V       LPD +V
Sbjct: 360 RIAKYKTTIDEDRALLEDCSSDINPKQRVATQLLLIEKEILHNTLDVVNGFRNQLPDGSV 419

Query: 501 S-PCPAPYAPLL 511
           + PC   + P L
Sbjct: 420 APPCCGDFVPKL 431


>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 427

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 281/429 (65%), Positives = 330/429 (76%), Gaps = 5/429 (1%)

Query: 85  MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           M + GLP C V L E + +  +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1   MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS  
Sbjct: 61  TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120

Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           EL  Y TGS  K  +LER  GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQA
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           FVAVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD  YK G+ 
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDP 240

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           I VWCGPQPNSKLL+NYGFVDEDNP+DRL VEA+LNTEDP YQ KR V Q+N +L++Q F
Sbjct: 241 IAVWCGPQPNSKLLLNYGFVDEDNPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLTIQTF 300

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
            ++ G+E EA+ DMLPY+RLG+++D  E+++V  +  P+C VS C ERAVL+Q+  +F+ 
Sbjct: 301 QIYKGKEMEAVRDMLPYMRLGHLADPEEIETVSFAQEPLCYVSACNERAVLNQIEHFFER 360

Query: 443 RLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSP 502
           RLAGY    S D     D     K+ VA +L+ +EK +L   L    ++I  LPD  +SP
Sbjct: 361 RLAGYK---SSDTTKAVDAKKDAKRTVAKKLMSIEKNILRNALAAVHELIRELPDGAISP 417

Query: 503 CPAPYAPLL 511
           C  PY P L
Sbjct: 418 CIGPYLPNL 426


>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 425

 Score =  555 bits (1430), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 274/426 (64%), Positives = 326/426 (76%), Gaps = 23/426 (5%)

Query: 8   RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
           RSS+  +PP     +   HH L +   +     R P  GS     R  +R  +    +DT
Sbjct: 9   RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59

Query: 63  LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
           L++ S                 G    W+  NGLPP K+ + E+P   S   + RP+H+V
Sbjct: 60  LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
           AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ 
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           +WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 299

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 300 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 359

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
           EA+LNTEDPQYQ+KRMVAQRNGKL++Q F V  G+EK+ IS+MLPYLRLGY+SD  EMQ 
Sbjct: 360 EASLNTEDPQYQEKRMVAQRNGKLAIQKFQVCVGKEKQTISEMLPYLRLGYISDPDEMQC 419

Query: 414 VISSLG 419
           ++SS G
Sbjct: 420 ILSSEG 425


>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 197/372 (52%), Positives = 264/372 (70%), Gaps = 9/372 (2%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+ DLQAG+ A  +P+ LV+TL+RV  +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49  AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
            W  +I+ELDR +GRGQ+  +SPLLW E ++  YL GSP  AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           VWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFALVPLGPPLL+
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFALVPLGPPLLS 228

Query: 294 YSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           YSS  KAML    +A  VQL VDR Y  GE I  WCGPQPN +LL+NYG V ++NP+D++
Sbjct: 229 YSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAWCGPQPNRRLLLNYGIVTDNNPHDKM 288

Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM 411
            +   L   DP +Q KR V Q+N   + Q F +   R+K     +LPYLRL + +D + +
Sbjct: 289 ALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQLQ--RDKGLPELLLPYLRLAHCTDAASL 346

Query: 412 QSVISSLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
           +  +++L   C  P+SP  ER VL QLA + + RL  Y  T  EDE ++      P+++V
Sbjct: 347 K--LATLDTCCAAPISPENERTVLHQLASHLQDRLDRYKTTCEEDEVIIRSTTAGPRQKV 404

Query: 470 ATQLVRMEKKML 481
           A +L+R+EK +L
Sbjct: 405 AARLLRIEKAIL 416


>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
 gi|194693232|gb|ACF80700.1| unknown [Zea mays]
 gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
 gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 252

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 173/242 (71%), Positives = 206/242 (85%)

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
           ++  QKVSLARRFALVPLGPPLL Y S CKAML A  D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
           Q NS+L++NYGFVDEDNP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGKL++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDEDNPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYVGKE 129

Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
           K+ +++MLPYLRLGY+S+  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISNPDEMQSILSSEGDTCPVSPCTERAVLDQLVGYLESRLAGYPT 189

Query: 450 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 509
           TL EDEAML D NL PKK VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPAPYAP
Sbjct: 190 TLDEDEAMLADGNLEPKKEVATRLVRLEKKMLHACLQATNEFINDLPDHTVSPCPAPYAP 249

Query: 510 LL 511
            L
Sbjct: 250 EL 251


>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
 gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
          Length = 348

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 174/190 (91%), Positives = 182/190 (95%)

Query: 85  MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
           MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1   MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           + ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61  VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120

Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180

Query: 265 AVQSCVVHLQ 274
           A+QSCVVHLQ
Sbjct: 181 AIQSCVVHLQ 190



 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 130/159 (81%), Positives = 148/159 (93%)

Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 412
           ++AALNTEDPQYQDKRMVAQRNGKLS+QVF ++ G+EKEAISD+LPYLRLGYVSD SEMQ
Sbjct: 189 LQAALNTEDPQYQDKRMVAQRNGKLSIQVFQIYVGKEKEAISDILPYLRLGYVSDPSEMQ 248

Query: 413 SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
           SVISS GPICPVSPCME+AVLDQLADYFK RLAGYP +L+EDE ML D+NL+PKKRVATQ
Sbjct: 249 SVISSQGPICPVSPCMEQAVLDQLADYFKRRLAGYPTSLNEDELMLADHNLNPKKRVATQ 308

Query: 473 LVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPLL 511
           LVR+EKK+LNACLQ TAD+I  LPD++VSPCPAPYAP+L
Sbjct: 309 LVRLEKKILNACLQATADLINQLPDLSVSPCPAPYAPIL 347


>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
          Length = 435

 Score =  365 bits (936), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 191/404 (47%), Positives = 262/404 (64%), Gaps = 9/404 (2%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           +  W+ ++G P  KV L+       +   +    A+E LQ GD A  +P  L+VTL+RVL
Sbjct: 9   MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
            + T+AEL+TT KLSELACL LYL YEKK+GK+  W  +I+ELDR +GRG    +SPLLW
Sbjct: 66  EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125

Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
            E + A  L GSP   EI  R +GI++EY ELDTVW++AGSLF + P+  PTE F+F +F
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 317
           +QAF AVQS VVHLQ V+L +RFALVP+GPPLL YSS  KAML    ++  V+L VDR Y
Sbjct: 186 RQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAY 245

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
           + GE+++ WCGPQPNS+LLINYG VDE NPYD+L +   + ++DP Y+ KR      G  
Sbjct: 246 QPGEAVLAWCGPQPNSRLLINYGIVDESNPYDKLPLSITIPSDDPLYRLKRDRLAERGLS 305

Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
           + Q F + A     A   +LPYLRL + +  ++++ V        PV+P  E  VL+QL 
Sbjct: 306 TQQTFQLQAAASLPA--QLLPYLRLVHSTREADVEGVKWE-EEAGPVAPENELTVLNQLI 362

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
            + + R + Y  T+ EDEA++ D    P+  VA +L+++EK +L
Sbjct: 363 THLRLRQSRYRTTIEEDEAIIADPAKGPRPTVAARLLKIEKGIL 406


>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
 gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 252

 Score =  363 bits (933), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 166/240 (69%), Positives = 203/240 (84%)

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
           Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129

Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
           K+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189

Query: 450 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAP 509
           TL+EDEAML D +L PK+ VAT+LVR+EKKML+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 190 TLNEDEAMLADGSLEPKQEVATRLVRLEKKMLHACLQATNEFITDLPDHTVSPCPAQYAP 249


>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
 gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
          Length = 488

 Score =  347 bits (890), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 192/408 (47%), Positives = 255/408 (62%), Gaps = 14/408 (3%)

Query: 80  DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           +L  W+ +NG  +   +V   + PS     RP+  V A   L AG+ A SVP  L +TL+
Sbjct: 43  ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           R+  +E +AELLTT+KLSELACLALYLMYEKK  KKSFW PYI+ELD+Q+ RG  A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159

Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           LLW + EL + L GSP    + +R  GI++EY  LDTVWFMAGSLF +YP+D+PTE F+F
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVD 314
           E+F+QAF  VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS  K M+   +D  +V+LVV 
Sbjct: 220 ELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVS 279

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--AALNTEDPQYQDKRMVAQ 372
            P +AG  +  WCGPQPNS+LL+NYG VDE NP+D+L       L T DP +  KR V  
Sbjct: 280 GPVEAGRPVAAWCGPQPNSRLLLNYGVVDEHNPFDKLQARFTFTLPTSDPLFPAKRAVLS 339

Query: 373 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 432
             G  + Q F V   R       +LPY+ L   +   ++ SV  S          +E A 
Sbjct: 340 EAGLATQQSFDVSVARPLP--PQLLPYMMLALATTPEQVASV--SFSDTAGHDRELEAAA 395

Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
           L  L  Y + R A Y   L  D  ++ D +  P+++VA +L ++EK +
Sbjct: 396 LAALMAYVQRRTAAYAHPLWRDLEIINDPSSTPRQKVAARLTKIEKSI 443


>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 281

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 166/269 (61%), Positives = 203/269 (75%), Gaps = 29/269 (10%)

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
           ++  QKVSLARRFALVPLGPPLL Y S CKAML    ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10  LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
           Q NS+L++NYGFVDE+NP+DR+ +EA+LNTEDPQYQ+KRMVAQRNGK ++Q F+V+ G+E
Sbjct: 70  QTNSRLVLNYGFVDENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYVGKE 129

Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
           K+ +++MLPYLRLGY+SD  EMQS++SS G  CPVSPC ERAVLDQL  Y ++RLAGYP 
Sbjct: 130 KQTVAEMLPYLRLGYISDPDEMQSILSSEGDTCPVSPCTERAVLDQLGGYLESRLAGYPT 189

Query: 450 TLSEDEAM-----------------------------LTDYNLHPKKRVATQLVRMEKKM 480
           TL+EDEAM                             L D +L PK+ VAT+LVR+EKKM
Sbjct: 190 TLNEDEAMVMSCDFLRVVSWSLYKLAECYGIGFGHCQLADGSLEPKQEVATRLVRLEKKM 249

Query: 481 LNACLQVTADMIMLLPDVTVSPCPAPYAP 509
           L+ACLQ T + I  LPD TVSPCPA YAP
Sbjct: 250 LHACLQATNEFITDLPDHTVSPCPAQYAP 278


>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
          Length = 130

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 108/128 (84%), Positives = 117/128 (91%)

Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
           ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL 
Sbjct: 1   ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
           YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61  YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120

Query: 267 QSCVVHLQ 274
           QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128


>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 638

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 127/445 (28%), Positives = 201/445 (45%), Gaps = 40/445 (8%)

Query: 45  RLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHN 104
           R V  + R  +R  +  ++   S + ++     L  L  W+ K G P   VIL       
Sbjct: 54  RTVWNEGRRGLRGVARCSMSGNSMQSMA-----LHQLSEWLSKQGFPTQDVILT---GFG 105

Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           E+   +   AA  D + G+ A  +P +  VT   V+ +  +A         ++  L L+L
Sbjct: 106 EEGVGL---AAGRDFKEGEVALKIPENYTVTGVDVVNHPVVAAPAAGR--GDVIGLTLWL 160

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-ETELAYLTGSPTKAEILERAEG 223
           MYE+  G+KS W PY++              SP+LW+ E +   L GSP   E+ +R+  
Sbjct: 161 MYERSLGEKSVWYPYLQTFPS-------TTLSPILWTAEEQQKLLKGSPALEEVQQRSAA 213

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
           ++ EY +L        S F + P   P E F+ E FK AF  + S  V+L    L   FA
Sbjct: 214 LEGEYEDLQ-------SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYLPSADL---FA 263

Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCG-PQPNSKLLINYG 340
           LVP    L  + +  +A L  +  D AV   VDR YK GE +    G  + N+ LLI YG
Sbjct: 264 LVPYADAL-NHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNADLLITYG 322

Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 400
           FVDE+N  D L +E  L   D     K+ + Q+    S Q F ++  R     + +L Y+
Sbjct: 323 FVDENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFP---TQLLTYM 379

Query: 401 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
           RL  + D +    ++     +  +    E   L  L    + +L  Y   + ++  +L +
Sbjct: 380 RLSRLQDPALFPKIVFDKDIM--LDQANEYECLQLLMGECRTKLGNYEGGVDDEIRLLKN 437

Query: 461 YNLHPKKRVATQLVRMEKKMLNACL 485
             +  ++RVA QL   EKK+L + +
Sbjct: 438 KKISQRERVAAQLRLCEKKILTSTM 462


>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Ectocarpus siliculosus]
          Length = 521

 Score =  141 bits (355), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 122/433 (28%), Positives = 197/433 (45%), Gaps = 39/433 (9%)

Query: 81  LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
           LK WM +NG+           P    +  +    NE       + A+ +++ GD  F++P
Sbjct: 92  LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
             L++T +     E  A+++T + LSE   +AL  ++EK +GK+SFW  YI  L      
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
               V    LW+E +LA L GSP  A        ++ EY  ++        L  ++P  +
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEIL 258

Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------PPLLAY-SSKCKAML 302
           P E  T+E F+ AF  + S  + L  +S     ALVP        P   +Y  ++ + + 
Sbjct: 259 PREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLF 318

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
            +  D V +  DR YK  E + +  GP+ NS LL+ YGF  + NPY+ + V  +L+  D 
Sbjct: 319 FSKTDEVVVYADRSYKKMEQVYISYGPKGNSDLLLLYGFSLDRNPYNSVDVTVSLDENDE 378

Query: 363 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC 422
            Y+ K+      G    + F ++  R  +   ++L YLRL  ++        +  L    
Sbjct: 379 LYERKKAFLSEAGLPPTKAFPLYNDRYPD---ELLQYLRLIQLNTDQLRGRTLEDLSFEK 435

Query: 423 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD----YNLHPKKRVATQLVRMEK 478
             +   E  VLD L +  KA +AGYP T  +D  ++ D      L   +R+A +  R EK
Sbjct: 436 KQTDVNELMVLDSLVEACKATIAGYPTTEEQDSKLMNDPGMFRALSKTQRMAVKHRRQEK 495

Query: 479 KMLNACL-QVTAD 490
            +L   +  VT D
Sbjct: 496 VILRRTIAAVTKD 508


>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Galdieria sulphuraria]
          Length = 490

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 119/451 (26%), Positives = 208/451 (46%), Gaps = 52/451 (11%)

Query: 56  RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
           R S + +  +G   V      ++     W+ +NG+           P ++++ E+ + +E
Sbjct: 58  RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116

Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
                  + +S  +  G+    +P  L+ T  R L  ET    +  +   E   + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
           YEK +G  SF+ PY+  L        L   +PL LWS  +L  L GSPT +   +  + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
            REY       ++  ++  Q P +  ++   F+ F+ AF  + S  +       ++R AL
Sbjct: 229 LREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIAL 278

Query: 285 VPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
           VP    LL +S  C A +          V +AV + VDR Y+  E + V  GP+ N +LL
Sbjct: 279 VPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQELL 336

Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM 396
           + YGF  E NP+D + +   L+  DP Y +K  + +  GK  +Q F ++  R      +M
Sbjct: 337 LLYGFSLERNPFDCVEITIGLDKTDPLYLEKCRMLESYGKSPLQSFPLYMDRYP---VEM 393

Query: 397 LPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
             +LR   +   +++Q+   ++     VS   E + LD+L +Y   +L  YP +L +DE 
Sbjct: 394 AEFLRFCCIDTETDLQADFGTI-----VSASNEESALDKLLNYIVDQLRKYPTSLEDDEK 448

Query: 457 MLTD----YNLHPKKRVATQLVRMEKKMLNA 483
           ++ D      L   +R+A +    EK++L+A
Sbjct: 449 IIRDRAMFQTLEKNQRMAIRQRLGEKRILHA 479


>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
 gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
          Length = 481

 Score =  125 bits (313), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 121/409 (29%), Positives = 185/409 (45%), Gaps = 35/409 (8%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           D+  W+ + G P   +++    S  +K        A+ DLQAGDAA S+P +  VT   V
Sbjct: 49  DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
             +  I+         EL  LAL+LMYE+++ + S W PY++           +  SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153

Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFE 257
           W + E   L  GS   A++ ++   +++ ++ L DT+         +   D P E FTF 
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFS 204

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
            FK AF  V S  V+L    L   FALVP G  +   SS+        +  V+L VD+ Y
Sbjct: 205 AFKAAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRY 261

Query: 318 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
           K G+ +   +     ++  LI YGF+DE +  D + +E  L + D     KR + Q  G 
Sbjct: 262 KKGDQVFASYAQNLTSADFLIRYGFLDESDENDFIEIEVGLVSGDSLAPLKREILQEVGL 321

Query: 377 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 436
              Q F V+  R     + +L Y RL  + D+     +      I  V    E   L  L
Sbjct: 322 TVPQKFPVYLNRFP---TQLLTYTRLARIQDSGLFAKITFEKDLI--VCQTNEYETLMLL 376

Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
               + +L  +  T+ +D   L   NL  K+RVA QL   EK++L   +
Sbjct: 377 MADCRTKLLSFSDTMEDDMQTLKRKNLSYKQRVAAQLRLKEKRILTDTM 425


>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
 gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
          Length = 600

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 116/405 (28%), Positives = 180/405 (44%), Gaps = 33/405 (8%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           D+  W+ + G P   +++    S  +K        A+ DLQAGDAA S+P +  VT   V
Sbjct: 49  DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
             +  I+         EL  LAL+LMYE+++ + S W PY++            +   L 
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
             E +   L GS   A++ ++   +++ ++ L DT+         +   D P E FTF  
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSA 205

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
           FK AF  V S  V+L    L   FALVP G  +   SS+        +  V+L VD+ YK
Sbjct: 206 FKTAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYK 262

Query: 319 AGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
            G+ +   +     ++  LI YGF+DE +  D + +E  L + D     KR + Q  G  
Sbjct: 263 KGDQVFASYAQNLTSADFLIRYGFLDESDENDCIEIEVGLVSGDSLAPLKREILQEVGLT 322

Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
             Q F ++  R     + +L Y RL  + D+     +      I  VS   E   L  L 
Sbjct: 323 VPQKFPLYLNR---FPTQLLTYTRLARIQDSGLFAKITFEKDLI--VSQTNEYETLMLLM 377

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
              + +L     T+ ++   L   NL  K+RVA QL   EK++L 
Sbjct: 378 ADCRTKLLSSSDTMEDEMQTLRRKNLSYKQRVAAQLRLKEKRILT 422


>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 504

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 152/315 (48%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    VP SLV++
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P+  P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K   
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDA 349

Query: 371 AQRNGKLSVQVFHVH 385
            +++G  + Q F V 
Sbjct: 350 LKKHGLSTPQCFPVR 364


>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
          Length = 504

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 151/315 (47%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    VP SLV++
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P   P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFT 289

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K   
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDA 349

Query: 371 AQRNGKLSVQVFHVH 385
            +++G  + Q F V 
Sbjct: 350 LKKHGLSTPQCFPVR 364


>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 504

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 151/315 (47%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    VP SLV++
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P   P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K   
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDA 349

Query: 371 AQRNGKLSVQVFHVH 385
            +++G  + Q F V 
Sbjct: 350 LKKHGLSTPQCFPVR 364


>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
          Length = 504

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 149/315 (47%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L++W+  +GLPP K+ + ++    E+      + AS++L+ G+    V  SLV+ 
Sbjct: 72  ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N    E++    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TEL  YL  S  +   +ER   +   Y +L +       +F ++P   P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
             E FK +F  + S +V L   S+  RFALVP    +L ++ + +  L        V   
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWAD-MLNHNCEVETFLDYDKSSKGVVFT 289

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K   
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDA 349

Query: 371 AQRNGKLSVQVFHVH 385
            +++G  + Q F V 
Sbjct: 350 LKKHGLSTPQCFPVR 364


>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 575

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 103/406 (25%), Positives = 184/406 (45%), Gaps = 54/406 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
           A  D+  GD    +P +L +T  +    + + + +  ++++E   +A +L+YE+  +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188

Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           S W PY+    ++D         V     W + +LA+L GSP  A        ++REY+ 
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240

Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           L     + G   L  +YP   P EAF F+ ++ AF  + S  + L+ +      ALVP  
Sbjct: 241 L-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYA 295

Query: 289 PPLLAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
             L+ +S   +A + A           D+ V L  DR Y+  E I +  GP+ N++LL+ 
Sbjct: 296 D-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYGPKSNAELLLL 354

Query: 339 YGFVDEDNPYDRLVVEAALNTE---------------DPQYQDKRMVAQRNGKLSVQVFH 383
           YGF  E NP++ + V  ++                  DP  ++K    ++ G+ +   F 
Sbjct: 355 YGFAVERNPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQVGRDATVDFP 414

Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
            +A R      +ML YLRL  ++        ++       +S   E AVL  +      +
Sbjct: 415 CYADRYP---VEMLEYLRLMQMTPEDTRGKPLAEFDYSRTISLGNEAAVLTSVITAVSRQ 471

Query: 444 LAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 485
           L+ YP +  +D A++ D +L       +R+A +  R EK++L   +
Sbjct: 472 LSNYPQSEEDDAALIKDKSLFRLLSYNQRMAVRHRRNEKRLLKRTI 517


>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
 gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/312 (29%), Positives = 145/312 (46%), Gaps = 27/312 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+  +GLPP K+ + +K    E+      + A ++++ G+    VP SLV+ 
Sbjct: 71  ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL  YL  S  +   +ER   +   YN+L         +F +YP+  P E F
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVF 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
             E FK +F  + S +V L   S+  R ALVP    +L +SS+ +  L        V   
Sbjct: 232 NMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSSEVETFLDYDKSSKGVVFT 288

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K   
Sbjct: 289 TDRPYQPGEQVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEA 348

Query: 371 AQRNGKLSVQVF 382
            +++G    Q F
Sbjct: 349 LKKHGLSVSQCF 360


>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 431

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 89/327 (27%), Positives = 154/327 (47%), Gaps = 31/327 (9%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+ K GL   K++L    S          + A++ L+ G+    VP+ L++T +   
Sbjct: 16  LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G      ++    L E   LA++L+ E  + + S W PY   L +          S L W
Sbjct: 70  GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +E E+  +LT SP + + LE    +   Y +L        ++F ++P   P++ +T   F
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAF 176

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRP 316
           K AF  + S +V L  V    + ALVP    +L +S +  + L    +   +V  V DR 
Sbjct: 177 KWAFGILFSRLVRLPSVG---KLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRA 232

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
           Y++GE + +  G + + +L + YGF+  E N +D + +E  ++++DP ++ K   A   G
Sbjct: 233 YQSGEQVFISYGKRSSGELFLAYGFIPSELNVHDSVELEMEIDSDDPSFEAKLRAANEQG 292

Query: 376 KLSVQVFHVHAGREKEAISDMLPYLRL 402
             S Q F V   R+    + +L Y RL
Sbjct: 293 LSSPQRFPV---RKDGFPAQLLAYARL 316


>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 499

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 147/315 (46%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLPP K+ + E+    E+      + A ++++ G+    VP SLV+T
Sbjct: 67  ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    + S W  YI  L RQ         
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W++ EL  YL  S  +   +ER   +   YN+L         +F +YP   P E F
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVF 227

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
             E FK +F  + S +V L   S+    ALVP    +L +S   +  L        +   
Sbjct: 228 NIESFKWSFGILFSRLVRLP--SMGGNVALVPWAD-MLNHSCDVETFLDYDKTSKGIVFT 284

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  +
Sbjct: 285 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLEL 344

Query: 371 AQRNGKLSVQVFHVH 385
            +  G  + Q F + 
Sbjct: 345 LKNYGLSASQCFPIQ 359


>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
 gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
 gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
          Length = 497

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 148/315 (46%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLP  K+ + +K    E+      + A  +++ G+    VP  LV+T
Sbjct: 65  ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L WS+ EL  YL  S  +   +ER   +   YN+      M   +F +YP   P E F
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 225

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
             E FK +F  + S +V L   S+  + ALVP    ++ +S + +  L     +  +V  
Sbjct: 226 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 282

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  +
Sbjct: 283 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLEL 342

Query: 371 AQRNGKLSVQVFHVH 385
            ++ G    Q F + 
Sbjct: 343 LKKYGLSGSQCFPIR 357


>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
 gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
          Length = 494

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 148/315 (46%), Gaps = 27/315 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLP  K+ + +K    E+      + A  +++ G+    VP  LV+T
Sbjct: 62  ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L WS+ EL  YL  S  +   +ER   +   YN+      M   +F +YP   P E F
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 222

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
             E FK +F  + S +V L   S+  + ALVP    ++ +S + +  L     +  +V  
Sbjct: 223 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 279

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DRPY+ GE + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  +
Sbjct: 280 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLEL 339

Query: 371 AQRNGKLSVQVFHVH 385
            ++ G    Q F + 
Sbjct: 340 LKKYGLSGSQCFPIR 354


>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 92/316 (29%), Positives = 151/316 (47%), Gaps = 29/316 (9%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+  +GLPP K+ + E+    E+      + A ++++ G+    VP SLV+T
Sbjct: 65  ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +         E+L  N + +   LA YL+ E    + S W  YI  L RQ         
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL  YL  S  +   +ER   +   YN+L         +F ++P+  P E F
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVF 225

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
             E FK +F  + S +V L   S+  + ALVP    +L +S + +  L   D + Q VV 
Sbjct: 226 NMETFKWSFGILFSRLVRLP--SMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVF 281

Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 369
             DR Y+  E + +  G + N +LL++YGFV  +  NP D++ +  +L   D  Y++K  
Sbjct: 282 TTDRTYQPSEQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSE 341

Query: 370 VAQRNGKLSVQVFHVH 385
             +++G  + Q F + 
Sbjct: 342 AMKKHGLSTSQCFPIQ 357


>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 508

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 147/311 (47%), Gaps = 29/311 (9%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+  NGLP  K+ + +K    E+      + A ++++ G+    VP SLV+T +   
Sbjct: 81  LQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 134

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
                 E+L    + +   LA+YL+ E    K S W  YI  L RQ         S L W
Sbjct: 135 SCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PYSLLYW 187

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +  EL  YL  S  +   +ER   +   YN+L         +F +YP   P E F  E F
Sbjct: 188 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNLETF 241

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRP 316
           K +F  + S +V L   S+  + ALVP    +L +S + +  L   D + Q VV   DR 
Sbjct: 242 KWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQ 297

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           Y+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K    +++
Sbjct: 298 YEPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKH 357

Query: 375 GKLSVQVFHVH 385
           G  + Q F V 
Sbjct: 358 GFSASQCFPVR 368


>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
 gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
          Length = 499

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 104/386 (26%), Positives = 175/386 (45%), Gaps = 59/386 (15%)

Query: 9   SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
           S+  + PP+R P H    P S +S+ S                R + R  IR S++    
Sbjct: 4   STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47

Query: 65  AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
              RE  +        +  E    L+ W+  +GLP  ++ + ++    E+      + A 
Sbjct: 48  PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101

Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
           ++++ G+    VP SLV+T +   G   + E++  N + +   +A YL+ E      S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTV 234
             YI  L RQ         S L W+  EL AYL  SP +   ++R   +   YN+L D +
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           +     LF       P E +  E F  +F  + S +V L   S+  + ALVP    +L +
Sbjct: 215 FSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWA-DMLNH 264

Query: 295 SSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
           S + +  L   D + Q +V   DR Y+ GE + +  G + + +LL++YGFV ++  NP D
Sbjct: 265 SPEVETFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 323

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNG 375
            + +  +L+  D  Y++K    +RNG
Sbjct: 324 SVELLVSLDKSDKCYKEKLQALKRNG 349


>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 568

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 123/461 (26%), Positives = 194/461 (42%), Gaps = 70/461 (15%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           +DL  L+ W+ KNGL    +   E  ++  +      V A +D + G+    VP  L+ T
Sbjct: 66  DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122

Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
            +  + N  + +LL  +K    S   CLAL+L+ EK     SFW PYI+ L +  G    
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
              + L ++  EL  L GSPT    ++    +  +Y  +         LFQ     +   
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIH-------DLFQIRKDILHIN 226

Query: 253 AFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
           AFT++ F  A  AV   Q+ V      +L+  +AL+P            +       D+ 
Sbjct: 227 AFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YALIPAWDMCNHDHGDLQTFWDVNSDST 285

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-R 368
           +    R YK GE + ++ GP+PNS LL++ GFV E+N +D L +   L  +    +DK R
Sbjct: 286 ESHAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYENNRFDALAIRVRLAPDAEHIKDKLR 345

Query: 369 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISS----------- 417
           ++   N K+  Q +    G       D++ +LR+ +  +  E+Q V+ +           
Sbjct: 346 LLHLNNMKMDSQYYLYGLG----LAVDLMAFLRI-HAMNEQELQQVLGAYDQQEAKVHNG 400

Query: 418 --------------LGPICPVSPCMERAVLDQLADYFKARLAGYPATLS---------ED 454
                           P   ++   E A L        + L+ YP TL          ED
Sbjct: 401 EHPASNGEVVASGVFDPRVKLNDRNELAALQLAEAKCLSLLSLYPTTLQVANGVELKQED 460

Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
           +A L   +L P  R  T L   EK++LN  L    D I LL
Sbjct: 461 QAALRTTSLTPNMRAVTLLRLKEKEILNRTL----DAIRLL 497


>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
 gi|194699272|gb|ACF83720.1| unknown [Zea mays]
 gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 83/305 (27%), Positives = 145/305 (47%), Gaps = 27/305 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T
Sbjct: 71  ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +   G   + +++  N + +   +A YL+ E      S W+ YI  L RQ         
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL AYL  SP +   ++R   +   YN+L         +F ++P   P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E F  +F  + S +V L   S+  R ALVP    +L +S + +  L        +   
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DR Y+ GE + +  G + + +LL++YGFV ++  NP D + +  +L+  D  Y++K   
Sbjct: 289 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQA 348

Query: 371 AQRNG 375
            +RNG
Sbjct: 349 LKRNG 353


>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
          Length = 648

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 106/414 (25%), Positives = 180/414 (43%), Gaps = 58/414 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
           A   +  GD    +P  L +T +   R LG + + E      ++E   +A  L++EK  +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264

Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           G+ SF+  Y+    E+D         V     W + +LA+L GSP  A        ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316

Query: 228 YNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
           Y++L     + G   L  ++P   P E +TFE ++ AF  + S  + L+ + +  R A+V
Sbjct: 317 YDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMV 371

Query: 286 PLGPPLLAYSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
           P    L+ +S+  +A + A           ++ V L  DR Y+  E + +  G + N++L
Sbjct: 372 PYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAEL 430

Query: 336 LINYGFVDEDNPYDRLVVEAALN-------------TEDPQYQDKRMVAQRNGKLSVQVF 382
           L+ YGF  E NPY+ + V  ++               EDP   +K       G+     F
Sbjct: 431 LLLYGFALERNPYNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDF 490

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
             +A R      +ML YLRL  ++        +S       +S   E +VL  +    K 
Sbjct: 491 PCYADRYP---VEMLEYLRLMMMTPEDTRGKPLSDFDYSRTISSANEASVLRSVVAAVKY 547

Query: 443 RLAGYPATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACLQVTADMI 492
           +L  +P T  +D A++ D  +       +R+A +  R EK++L   L      I
Sbjct: 548 QLGLFPQTEEDDAAIIKDKGMFRLFSYNQRMAVRHRRNEKRLLKRTLAALEKQI 601


>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
           carboxylase/oxygenase large subunit N-methyltransferase,
           chloroplastic-like [Cucumis sativus]
          Length = 497

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 92/316 (29%), Positives = 150/316 (47%), Gaps = 29/316 (9%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E+   L+ W+ ++GLP  K+ ++     N   R +    A ++++ G+    VP SLV++
Sbjct: 65  ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            E         E+L  N + +   +A YL+ E    K S W  YI  L RQ         
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL  YL  S  +   +ER   +   YN+L         +F ++P   P E F
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVF 225

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
             E FK +F  + S +V L   S+  + ALVP    +L ++ + +  L   D A Q VV 
Sbjct: 226 NIETFKWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVF 281

Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 369
             DR Y+ GE + +  G + N +LL++YGFV ++  NP D + +  +L   D  Y++K  
Sbjct: 282 TTDRAYQPGEQVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLE 341

Query: 370 VAQRNGKLSVQVFHVH 385
             +++G  + Q F + 
Sbjct: 342 ALKKHGLRASQCFPIQ 357


>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 82/305 (26%), Positives = 144/305 (47%), Gaps = 27/305 (8%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T
Sbjct: 71  ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +   G   + +++  N + +   +A YL+ E      S W+ YI  L RQ         
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL AYL  SP +   ++R   +   YN+L         +F ++P   P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E F  +F  + S +V L   S+  R  LVP    +L +S + +  L        +   
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMV 370
            DR Y+ GE + +  G + + +LL++YGFV ++  NP D + +  +L+  D  Y++K   
Sbjct: 289 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQA 348

Query: 371 AQRNG 375
            +RNG
Sbjct: 349 LKRNG 353


>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
          Length = 485

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 112/423 (26%), Positives = 189/423 (44%), Gaps = 40/423 (9%)

Query: 49  RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
           R  R   R+++   L+     AGS E+ +  E +  +LK+W+ + GLPP K+     P  
Sbjct: 26  RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83

Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
                    + A++ +  G++  S+P  LV+T    L    +  LL    L   + LAL+
Sbjct: 84  GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
           L  ++  G    W PY+R L  + G         L WSE E+ +L GS   ++ LE    
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
            +  + E+  V   A +  +   +     AF     + AF  + S +V L    L  + A
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEA 244

Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
           L+P    LL +     + L  +A + AV L  +R Y+AGE +++  G + + +LL++YGF
Sbjct: 245 LLPWA-DLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGF 303

Query: 342 VDE--DNPYD--RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
             +   NP+D  RL++E  L   D     K    +++G  + Q+F +   R   A  +++
Sbjct: 304 CPDLGSNPHDGCRLLLE--LAPGDAARNWKAAALRQHGLAASQLFPL---RMAAAPFELV 358

Query: 398 PY--LRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
            Y       V    E + +   L     + P ++ A L+ +    KA LA YP +   D 
Sbjct: 359 HYTAFSAAVVGSRQEAEQLARRLFEEGDIPPALQTAALEAVVAACKAALAAYPRSFDGDR 418

Query: 456 AML 458
           A L
Sbjct: 419 AEL 421


>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 507

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 85/306 (27%), Positives = 147/306 (48%), Gaps = 29/306 (9%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ L EK    E+      + A ++++ G+    VP +LV+T
Sbjct: 74  ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +    N  + +++    + +   LA YL+ E      S W  YI  L RQ         
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+ TE+ AYL  SP +   + R   +   YN+L         +F ++P   P + +
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRD------RIFSKHPDLFPEKVY 234

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
             E F+ +F  + S +V L+  S+  + ALVP    +L +S +  A L   D + Q +V 
Sbjct: 235 NMENFRWSFGILFSRLVRLE--SMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVF 290

Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRM 369
             DR Y+ GE + +  G + + +LL++YGFV ++  NP D +    +L   D  Y++K  
Sbjct: 291 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQ 350

Query: 370 VAQRNG 375
             +++G
Sbjct: 351 ALKKHG 356


>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
 gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
          Length = 434

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 171/401 (42%), Gaps = 46/401 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
           A   +  GD    +P  L +T  R    + + +    + ++E   +A  L++EK   G +
Sbjct: 50  ARRSINDGDELLKIPMDLCIT--RKSARKALGKDALQDGINEYLAIACQLIHEKYVLGDE 107

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W  Y+  L          V     W + +LA+L GSP  A        ++REY+ L  
Sbjct: 108 SEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-- 160

Query: 234 VWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
              + G   L  ++P   P E FT+E +  AF  + S  + L+ + +  R A+VP    L
Sbjct: 161 ---LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYAD-L 216

Query: 292 LAYSSKCKAMLAAVD----------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
           + +S+   A + A +          + V L  DR Y+  E + +  G + N++LL+ YGF
Sbjct: 217 INHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGF 276

Query: 342 VDEDNPYDRLVVEAALNTE-------------DPQYQDKRMVAQRNGKLSVQVFHVHAGR 388
             E NPY+ + V  ++                DP  Q+K       G+     F  +A R
Sbjct: 277 ALERNPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVEFLASVGRDQTVDFPCYADR 336

Query: 389 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 448
                 +ML +LRL  ++        ++       +SP  E AVL  + +  K +L  YP
Sbjct: 337 YP---VEMLEFLRLMMMTPEDTRGKPLADFDYSRTISPANEAAVLSSVVEAVKYQLNLYP 393

Query: 449 ATLSEDEAMLTDYNLHP----KKRVATQLVRMEKKMLNACL 485
            +  +D  ++ D  L       +R+A +  R EK++L   L
Sbjct: 394 QSEEDDANIIKDKALFRLLSYNQRMAVRHRRNEKRLLKRTL 434


>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
 gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
          Length = 467

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 82/299 (27%), Positives = 136/299 (45%), Gaps = 26/299 (8%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+ + GLP  KV LK   +          + +   L  GD    +P +L +T E   
Sbjct: 41  LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
                 +++    L E   LA YL+ E   GK S W PYI  L R+ G       S LLW
Sbjct: 95  ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +  ++ A+L+ +  K   L+    ++  +N+L+   FM          + P E F  E F
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESF 201

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
           K AF  + S +V L   SL ++ AL+P G  +L + ++    L   +   ++   +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258

Query: 318 KAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
           ++ + + +  G + N +LL+ YGFV    N  D + +   L+  D  Y+ K    + +G
Sbjct: 259 ESNKEVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGALKEHG 317


>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
 gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
          Length = 445

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 110/433 (25%), Positives = 180/433 (41%), Gaps = 45/433 (10%)

Query: 68  REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           R      E+ L  L  WM + G L P   I+++             V A  D+  G+   
Sbjct: 2   RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54

Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
            +P + + TLER     +G    ++L   N   +   LA +L+ EK +G  SFW P++  
Sbjct: 55  QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
           L       +     PL +SE E A + GS      LER   ++R+  E +         +
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEVQRQSFEQE---------Y 151

Query: 243 QQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
            Q    +P  E F FE +  A +++ S +  L+        +LVPL   +  +      +
Sbjct: 152 AQLREKLPEYERFGFEEYVWARISLYSRLFSLKGGLQGP--SLVPLSD-MFNHRQPPDVL 208

Query: 302 LAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 358
            +  +D    +++  R   AG  I    G + +   L++ GFV D +   D + +   L 
Sbjct: 209 WSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFLLHSGFVPDGNEENDEVYLSVGLP 268

Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM---QSVI 415
             DP    K+ +       +   F V    +  A   +  +LR+ + S    +     ++
Sbjct: 269 PGDPLASVKQQMFGLASATAKHPFKVSRQGKYLASWSVFSFLRMAHASPDEFLALSNRLL 328

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           S    I PVS   E  VL  LA   + RL  +P TL EDE +L +  L P +R    L R
Sbjct: 329 SGTKTIAPVSVACEERVLGTLAAACEERLKAFPTTLEEDERLLREGPLSPNERSCVLLRR 388

Query: 476 MEKKMLNACLQVT 488
            EK++L   L++T
Sbjct: 389 QEKRLLGDYLELT 401


>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
          Length = 502

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 145/308 (47%), Gaps = 29/308 (9%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T +   
Sbjct: 75  LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G   +  +L  N + +   +A YL+ E      S W  YI  L RQ         S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           +  EL AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E 
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
           F+ +F  + S +V L   S+  R ALVP    +L +S + +  L        +    DR 
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           Y+ GE + +  G + + +LL++YGFV ++  NP D + +  +LN  D  Y++K    +RN
Sbjct: 292 YQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRN 351

Query: 375 GKLSVQVF 382
           G    + F
Sbjct: 352 GLSEFESF 359


>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
 gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
          Length = 505

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 108/410 (26%), Positives = 173/410 (42%), Gaps = 67/410 (16%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           DL  W+  NGL   K+ L+   +          + A+E+++ G+A   V  S ++T+ER 
Sbjct: 66  DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
           +    +       +L E + LA +L  +    + G    +  YIR L R+ G       S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170

Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            L W E E+   L GSP++    ER E +     E+ +           +P DI   A  
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALR 219

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
           +     AF  + S ++ L   ++    ALVP    +L +   C A +     AV L  DR
Sbjct: 220 W-----AFDILFSRLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGSAVNLTTDR 271

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
            Y AGE +    G +P+S+LLI+YGF  E  +NP D   +   ++  DP  Q K  V +R
Sbjct: 272 AYAAGEQVWASYGQRPSSELLISYGFAPEVGENPDDEYSLTLGVDVNDPYAQAKADVLRR 331

Query: 374 NGKLSVQVFHVH-AGREKEAI-------------SDMLPYLRLGYVSDTSEMQSVISSL- 418
            G   V+ F +   G  ++ +             S++    R  +    +  QS+  S+ 
Sbjct: 332 MGLSPVETFPLRLNGYPRQLLQYASFILCNPDKPSELEGLARTAFTGSANFGQSIFDSVR 391

Query: 419 -----------GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
                      G I    P  E AV + LAD     L+ YP +L +D+ +
Sbjct: 392 GLAQGQARGKQGVILGGVPG-EIAVREMLADMCAEALSAYPNSLEKDKGI 440


>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
          Length = 502

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 145/308 (47%), Gaps = 29/308 (9%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T +   
Sbjct: 75  LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G   +  +L  N + +   +A YL+ E      S W  YI  L RQ         S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           +  EL AYL  SP +   ++R   +   YN+L D ++     LF       P E +  E 
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
           F+ +F  + S +V L   S+  R ALVP    +L +S + +  L        +    DR 
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           Y+ GE + +  G + + +LL++YGFV ++  NP D + +  +LN  D  Y++K    +RN
Sbjct: 292 YQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRN 351

Query: 375 GKLSVQVF 382
           G    + F
Sbjct: 352 GLSEFESF 359


>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
          Length = 516

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 151/343 (44%), Gaps = 34/343 (9%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
            +  +E   +AL L+ E+ +G +SFW  YI  L      G     +   W   ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216

Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
           SP  +        ++ E+         A  L      D   E FTFE ++ AF  + S  
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265

Query: 271 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 320
           + L+        A+VP        P  +     + +  A      +D V L  DR YK  
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325

Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK-LSV 379
           E + +  GP+ N+ LL+ YGF  + NP++ + +    + +D  Y  K   A+  G+ +S 
Sbjct: 326 EQVFISYGPKSNADLLLLYGFALDRNPFNSVDLAVGASKDDALYDAKERFARGAGRDVSS 385

Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVS-DTSEMQSVISSLGPICPVSPCMERAVLDQLAD 438
             F ++A R  +   +++ +LR+   + D    + +      +  +S   E AVLD + D
Sbjct: 386 AAFPLYADRFPD---ELVQFLRMACATEDHLGARPLDDPDNYVDILSLDNELAVLDTIRD 442

Query: 439 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
              A +A YPA   +D   + D  L   +R+A +LV  EK++L
Sbjct: 443 ACDAAVAAYPAKSGDD---VPDAFLSRNQRMAKRLVNTEKRIL 482


>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 389

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 77/269 (28%), Positives = 131/269 (48%), Gaps = 23/269 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  +++ G+    VP SLV++ +    N  + +++ +  + +   LA YL+ E     
Sbjct: 13  LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
            S W  YI  L RQ         S L W+ TE+ AYL  SP +   + R   +   YN+L
Sbjct: 73  SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    +F ++P   P E +  E F+ +F  + S +V L   S+  + ALVP    +
Sbjct: 126 ------RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWA-DM 176

Query: 292 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 346
           L ++ +  A L   D + Q +V   DR Y+ GE + +  G + + +LL++YGFV ++  N
Sbjct: 177 LNHNPEVDAFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTN 235

Query: 347 PYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
           P D +    +LN  D  Y++K    +R+G
Sbjct: 236 PNDSVEFSVSLNKSDDCYREKLQALKRHG 264


>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
 gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
          Length = 467

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 80/299 (26%), Positives = 134/299 (44%), Gaps = 26/299 (8%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L+ W+ + GLP  KV LK   +          + +   L  GD    +P +L +T E   
Sbjct: 41  LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
                 +++    L E   LA YL+ E   GK S W PYI  L R+ G       S LLW
Sbjct: 95  ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147

Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +  ++  +L+ +  K   L+    ++  +N+L+   FM          + P E F  + F
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSF 201

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
           K AF  + S +V L   SL ++ AL+P G  +L + ++    L   +   ++   +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258

Query: 318 KAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
           ++   + +  G + N +LL+ YGFV    N  D + +   L+  D  Y+ K    + +G
Sbjct: 259 ESNREVFISYGKRSNGELLVAYGFVPSGKNSEDSVSITLGLDPADEMYEAKLGTLKEHG 317


>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
          Length = 541

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 98/423 (23%), Positives = 184/423 (43%), Gaps = 35/423 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
           S +E++      W+  N +    V ++        H  +  + + A+ D + G+   ++P
Sbjct: 71  SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
            S+++T +    N  +  L+  N++ +      LAL+++ E      SFWLPY++ L   
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                 +  SPL ++  +L  L  SPT +E++ +   I R+Y     + F    L  + P
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFNL-FQGHELASKLP 232

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAV 305
             I  +   ++ ++ A  +V +    +  +   R   AL+PL       + +     +  
Sbjct: 233 --IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCNHTNGQITTDFSLK 290

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           +D  +        AG  + ++ G + N++LLI+ GFV   N  DRL +   ++  DP + 
Sbjct: 291 NDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLTIRLGISKNDPLFS 350

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICP-- 423
            K  V  R    + ++F +H G      SD L +LR+  +++  ++++ ++    I    
Sbjct: 351 MKSEVLSRLSMQASRLFSLHCG-VNPVDSDTLAFLRVVVMTE-DDLRTALACRQQISKLR 408

Query: 424 -----VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 478
                VS   ER     LA      L  YP +  ED  +L   +L    R+A QL   EK
Sbjct: 409 DFDDFVSEDNERKAWAFLATRVLLLLKAYPTSAQEDATLLQGNDLSTHARLAVQLRHCEK 468

Query: 479 KML 481
            +L
Sbjct: 469 NIL 471


>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
           tropicalis]
 gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 581

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 105/455 (23%), Positives = 202/455 (44%), Gaps = 47/455 (10%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   +L  W  +NG       L E P           + A+ +++A +    VP  
Sbjct: 72  GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E   G+  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL ++E E+ YL  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 229

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FTF+ ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 230 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 289

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + +K+GE I ++ G + N++ +I+ GF  E+N +DR+ ++  ++  D  Y
Sbjct: 290 EDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLY 349

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+         G++     +  + 
Sbjct: 350 AMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIF 408

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           +      PVS   E  +  +L  + +AR    L  Y  T+ +D  +L   ++     +A 
Sbjct: 409 TLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAI 464

Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
           +L R+EK++L   L+  +D   L    +    P P
Sbjct: 465 KLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 499


>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Spinacia oleracea]
          Length = 491

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 171/384 (44%), Gaps = 37/384 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP    +  + V  +E  +     N L     +AL+LM EKK G 
Sbjct: 86  LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PYI  L         +  S + WSE EL+ L GS      L   E +  E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +     Q +P+D+  + F F  F        +C+         +   L+PL     
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCLE-------GQSLVLIPLADLAN 245

Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
             P + A  Y+ + + A L + +    L    P KAG+ +++ +   + N++L ++YG  
Sbjct: 246 HSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLT 305

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           +  +  +   +   +   D  Y DK  +A+ NG      F +    E+   ++MLPYLRL
Sbjct: 306 ESRSERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRL 363

Query: 403 GYVS--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
             +   D   ++S+   S  G +  P+SP  E  +   + D   + L+GY  T++EDE +
Sbjct: 364 VALGGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKL 423

Query: 458 LTDYNLHPKKRVATQLVRMEKKML 481
           L + ++ P+  +A  +   EKK+L
Sbjct: 424 LAEGDIDPRLEIAITIRLGEKKVL 447


>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
          Length = 497

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 195/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 72  GKREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRK 125

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L+ T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 126 LLXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 181

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 182 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 229

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 230 NKLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLITTGYNL 289

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 290 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 349

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+          ++   S +  + 
Sbjct: 350 AXKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCXTEEELKEHLLGDSAIDRIF 408

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L +++L  + + A +L  
Sbjct: 409 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKXAIKLRL 468

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 469 GEKEILEKAVKSAA 482


>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 582

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 105/455 (23%), Positives = 202/455 (44%), Gaps = 47/455 (10%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   +L  W  +NG       L E P           + A+ +++A +    VP  
Sbjct: 73  GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E   G+  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL ++E E+ YL  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FTF+ ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + +K+GE I ++ G + N++ +I+ GF  E+N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+         G++     +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           +      PVS   E  +  +L  + +AR    L  Y  T+ +D  +L   ++     +A 
Sbjct: 410 TLGNSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAI 465

Query: 472 QLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
           +L R+EK++L   L+  +D   L    +    P P
Sbjct: 466 KLRRVEKEILEKALKSASDNRKLYSKNSEEGTPLP 500


>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
 gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
          Length = 520

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 110/432 (25%), Positives = 184/432 (42%), Gaps = 48/432 (11%)

Query: 76  EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           ED  +L +W+ +  G+    +  KE      +      V    D +AG  A  VP S  V
Sbjct: 48  EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           T   V  +  ++EL +     EL  LAL+L  E+ +G  S W PY++ L       +   
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152

Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
           ++PL W++  + A L GSP  A+ +ER++  + EY  +  V        +  P   P EA
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEV-------IKSDPSSYPPEA 205

Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL----------GPPLLAYSS-----KC 298
           + F    +   A+ +       +  A+ +ALVPL           P +L  S+     +C
Sbjct: 206 YEFLTEARFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRC 265

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEAA 356
                 VD A  ++      A  S V+      + N +L +N G+VD+ +P D + ++  
Sbjct: 266 GPADYDVDTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKTD 325

Query: 357 LNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
           + T D  +  K+ V +  G  +  Q F V+  R     + +  YLR   V D  EM +V 
Sbjct: 326 IQTSDRLFTAKKQVLEGMGFTAADQYFPVYKDRMP---TQLYSYLRFSRVQDPGEMMAVS 382

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
                I  VS   E  +L  L    +  +A Y     ++  +L   +  P + +    +R
Sbjct: 383 FEEDKI--VSVMNEYEILQILMGDCRELMAEYDTNEEDELNLLKLSDQMPVREIEAAKLR 440

Query: 476 M-EKKMLNACLQ 486
           M EKK++ A + 
Sbjct: 441 MSEKKLIGATMN 452


>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
 gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
          Length = 595

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 194/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W   NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L   +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-- 246
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPQA 230

Query: 247 YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
           + +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 HKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEGELKEHLLGENAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+  L ++ L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTFLKNHGLSVRATMAVKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVESAA 483


>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
           [Spinacia oleracea]
 gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase II [Spinacia oleracea]
          Length = 495

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 95/381 (24%), Positives = 173/381 (45%), Gaps = 27/381 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP    +  + V  +E  +     N L     +AL+LM EKK G 
Sbjct: 86  LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PYI  L         +  S + WSE EL+ L GS      L   E +  E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195

Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSLARRFALVP 286
               +     Q +P+D+  + F   F + + +AF  ++     ++ L  + + +      
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPD 253

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDED 345
           +  P  A+  +  A L + +    L    P KAG+ +++ +   + N++L ++YG  +  
Sbjct: 254 ITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLTESR 312

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
           +  +   +   +   D  Y DK  +A+ NG      F +    E+   ++MLPYLRL  +
Sbjct: 313 SERNAYTLTLEIPESDSFYGDKLDIAESNGMGESAYFDIVL--EQPLPANMLPYLRLVAL 370

Query: 406 S--DTSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
              D   ++S+   S  G +  P+SP  E  +   + D   + L+GY  T++EDE +L +
Sbjct: 371 GGEDAFLLESIFRNSIWGHLDLPISPANEELICQVIRDACTSALSGYSTTIAEDEKLLAE 430

Query: 461 YNLHPKKRVATQLVRMEKKML 481
            ++ P+  +A  +   EKK+L
Sbjct: 431 GDIDPRLEIAITIRLGEKKVL 451


>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
          Length = 594

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 198/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L +++L  + ++A +L  
Sbjct: 410 TLGNSKFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
 gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
 gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
 gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
          Length = 594

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 198/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L +++L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
          Length = 585

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 198/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHS-TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED++ L D++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKDHDLSVRAAMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3 [Ovis aries]
          Length = 596

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 102/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 80  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 416

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 417 TLGNSEYPVSWDNEVRLWAFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAVKLRL 476

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 477 GEKEILERAVKSAA 490


>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
          Length = 594

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 102/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 231 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILERAVKSAA 483


>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
          Length = 805

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E     E+   +    A+ D++A +    VP  
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 409

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 410 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 469

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 470 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 529

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 530 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 588

Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED + L +++L  +  +A +L  
Sbjct: 589 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTVEEDRSFLRNHDLSVRAAMAVKLRL 648

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 649 GEKEILERAVKSAA 662


>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
          Length = 488

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 101/431 (23%), Positives = 193/431 (44%), Gaps = 39/431 (9%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 80  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 416

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 417 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRL 476

Query: 476 MEKKMLNACLQ 486
            EK++L   ++
Sbjct: 477 GEKEILERAVK 487


>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
          Length = 512

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/412 (23%), Positives = 191/412 (46%), Gaps = 38/412 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
           A+ D++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+   
Sbjct: 28  ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS- 85

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y   
Sbjct: 86  PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY--- 135

Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
              +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
                  +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N 
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 251

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
           +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++
Sbjct: 252 HDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTE 310

Query: 408 ---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
                     S +  + +      PVS   E  +   L D     L  Y  T+ ED+++L
Sbjct: 311 EELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVL 370

Query: 459 TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYAPL 510
            +++L  + ++A +L   EK++L   ++  A    + P + +SP     APL
Sbjct: 371 KNHDLSVRAKMAIKLRLGEKEILEKAVKSAA----VNPGI-LSPTDGGKAPL 417


>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
          Length = 601

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 102/434 (23%), Positives = 196/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 80  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 190 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       S         
Sbjct: 238 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 298 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 416

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 417 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSARATMAIKLRL 476

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 477 GEKEILERAVKSAA 490


>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 507

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/452 (23%), Positives = 201/452 (44%), Gaps = 47/452 (10%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           E+   +L  W  +NG       L E P           + A+ +++A +    VP  L++
Sbjct: 1   EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54

Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           T+E   G+  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +     
Sbjct: 55  TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DI 249
              ++PL ++E E+ YL  +    ++  + +   R+Y      +F    + Q +P    +
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNANKL 158

Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDD 307
           P  ++FTF+ ++ A  +V +    +     +R   AL+PL       +S         DD
Sbjct: 159 PLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTGYNLEDD 218

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
             + V  + +K+GE I ++ G + N++ +I+ GF  E+N +DR+ ++  ++  D  Y  K
Sbjct: 219 RCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKLGVSKSDRLYAMK 278

Query: 368 RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYVSDTSEMQSVISSL 418
             V  R G  +  VF +H   E    + +L +LR+         G++     +  + +  
Sbjct: 279 AEVLARAGIPTSSVFALHV-TEPPISAQLLAFLRVFCMNEDELKGHLIGDHAIDKIFTLG 337

Query: 419 GPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
               PVS   E  +  +L  + +AR    L  Y  T+ +D  +L   ++     +A +L 
Sbjct: 338 NSEFPVS--WENEI--KLWTFLEARASLLLKTYKTTVEDDNKVLEQPDMTFHSAMAIKLR 393

Query: 475 RMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
           R+EK++L   L+  +D   L    +    P P
Sbjct: 394 RVEKEILEKALKSASDNRKLYSKNSEEGTPLP 425


>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
 gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
          Length = 463

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 136/327 (41%), Gaps = 57/327 (17%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L SW+   G      +LK  P           + A  D++AG+    V    ++T +R+
Sbjct: 39  ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
                  +LL++  +SE A LAL L++EK+ G+ S W PYI  L R        + S   
Sbjct: 92  --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W + ELA +  S    E + R   I+ E+NE+  +       FQ+Y + +     ++  F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASF 195

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD------------ 307
           K A+V    C     ++    + A+VP     + +     AML    D            
Sbjct: 196 KHAYVTATVCS-RAWRIDGLEKLAMVPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVY 253

Query: 308 ---------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
                      QL  D+ Y AGE + +  GP  N+ L +++GF    NP+D++ +   ++
Sbjct: 254 SDCLDIALFCAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGIS 313

Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVH 385
             D   ++K           +Q  H H
Sbjct: 314 RRDSLRKEK-----------LQYLHAH 329


>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
           gorilla gorilla]
          Length = 594

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKTVKSAA 483


>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
 gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
 gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
          Length = 595

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           troglodytes]
 gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
           troglodytes]
 gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
           paniscus]
 gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           paniscus]
 gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
 gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
 gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
 gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
          Length = 594

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Nomascus leucogenys]
 gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Nomascus leucogenys]
          Length = 595

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
          Length = 595

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Pongo abelii]
 gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Pongo abelii]
          Length = 595

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
 gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
          Length = 595

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 590

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 185/404 (45%), Gaps = 58/404 (14%)

Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
           +++ +R +H   A   +   +    +P S ++TLE  +  ET +A+ +   KL+ L+   
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227

Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
             L+ +L+ E+K  K+S W PY+  L           + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
            E+   IKR+Y+++ +V       F +Y         TFE F  A +   S V  LQ ++
Sbjct: 281 REKKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-IN 326

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 332
             +  A VPL   L     + K      DD      +Q + D P   GE +    G + N
Sbjct: 327 EQKTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCN 382

Query: 333 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL-SVQVFHVHAGREKE 391
           S+  +NYGF++ DN  + + +    + EDP  + K+ +    G +   +V+ +    +++
Sbjct: 383 SRFFLNYGFINLDNDANEVALRLTFDAEDPTIERKKEMM--GGDVPEFKVYRILENYQEQ 440

Query: 392 AISDMLPYLRLGYVSDTS-------------EMQSVISSLGP--ICPVSPCMERAVLDQL 436
            +S+ + YLR   + D S             E    +S   P    P+S   E  +  ++
Sbjct: 441 NVSEFMSYLRFILIRDNSKLLMLSSLHEQQTENSENLSGYKPQKTPPISIQNETDMWVRI 500

Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
           ++  +  ++ Y  TL ED+ +L   NL   +R    L   EK++
Sbjct: 501 SNMCQTSISLYNTTLKEDKELLAKDNLTQNQRNCVLLRSGEKEV 544


>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
 gi|223949395|gb|ACN28781.1| unknown [Zea mays]
 gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 489

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 168/387 (43%), Gaps = 38/387 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ DL  G+    VP  L +  + V  ++          L     +AL L+ E  +G 
Sbjct: 84  LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--NE 230
            S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+K EY  +E
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGVK-EYVQSE 191

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPL 287
            D+V     S  +    D+   + TF+ F  AF  ++S V   +   K++L     LV  
Sbjct: 192 FDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALIPFADLVNH 247

Query: 288 GPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
            P + +  S    K K +    +    L      K+G+ I + +   + N++L ++YGFV
Sbjct: 248 SPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLDKSNAELALDYGFV 306

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           + +   D   V   ++  DP Y DK  +A+ NG      F V     +     MLPYLRL
Sbjct: 307 ESNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDVIL--NEPLPPQMLPYLRL 364

Query: 403 GYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
             +  T           SV   L    P+SP  E ++   + D  K+ LA Y  T+ EDE
Sbjct: 365 LCIGGTDAFLLEALFRNSVWGHLE--LPLSPDNEESICQAMRDACKSALADYHTTIEEDE 422

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLN 482
            +    NL P+  +A  +   EKK+L 
Sbjct: 423 ELSGRENLQPRLAIAIGVRAGEKKVLQ 449


>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Callithrix jacchus]
          Length = 610

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 88  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 141

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 142 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 197

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 198 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 245

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 246 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 305

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 306 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 365

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 366 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 424

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 425 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 484

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 485 GEKEILEKAVKSAA 498


>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
          Length = 597

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 75  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 185 -----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 352

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 353 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 411

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 412 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 471

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 472 GEKEILEKAVKSAA 485


>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
          Length = 597

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 199/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 75  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 128

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 129 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 184

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 185 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 232

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 233 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 292

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 293 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 352

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 353 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 411

Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L  
Sbjct: 412 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKLRL 471

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 472 GEKEILEKAVKSAA 485


>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
           novemcinctus]
          Length = 589

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 198/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQSLPGE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
           ferrumequinum]
          Length = 594

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  S  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y   + ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTNIEEDKSFLKNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
           porcellus]
          Length = 789

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+A+L    L  + ++A +L  
Sbjct: 410 TLGNSEFPVSWENEVKLWSFLEDRASLLLKTYKTTIEEDKAVLKGPELPTRMKMAVKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   +Q  A
Sbjct: 470 GEKEILERTVQSAA 483


>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
           africana]
          Length = 597

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ +L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+A L  ++L  +  +A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKGHDLSIRATMAVKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILERAVKSAA 483


>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
          Length = 595

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E    DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409

Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +++L  +  +A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
 gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
          Length = 449

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 109/438 (24%), Positives = 189/438 (43%), Gaps = 46/438 (10%)

Query: 70  VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
             S   + L +L  W+ + G   P  +++ +E             V A   + AG+    
Sbjct: 10  AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61

Query: 128 VPNSLVVTLERVLGNET-----IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
           VP + ++TLE  L  E+     IAE L  +  +E   LA +L+ EK + + SFW PYI  
Sbjct: 62  VPRTHMLTLE--LARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDS 116

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
           L           + PL +   E A L G      +  +A+ ++ +Y           SL 
Sbjct: 117 LPESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYL----------SLC 160

Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML 302
           Q  P     E FT   F  A ++V S +  L+K     +  LVP+   +L +      + 
Sbjct: 161 QNVP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TLVPMAD-MLNHRRPPDVLW 215

Query: 303 AAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
              +D    V+  +    AG+ +    G + N  +L+++GFV +DN +D   +   +   
Sbjct: 216 ETTEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLLHFGFVTDDNEHDEAFLGLRILDG 275

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQS-VISS 417
           DP    K+M+       + + F +              +LR+     +D  ++ S V+S 
Sbjct: 276 DPLAATKQMLLMLPSPTAARPFKISRPYVHTTTRMAFSFLRIAAAVPNDIEDISSRVMSG 335

Query: 418 LGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
              + P+S   E  VL+ LA   +ARL+ +P +L++DE +L   +L P  R    + R E
Sbjct: 336 ERALGPLSVENEENVLELLAATCQARLSIFPTSLAQDEELLRGESLSPNARNCVLVRRAE 395

Query: 478 KKMLNACLQVTADMIMLL 495
           K+++   L++T   + LL
Sbjct: 396 KQLIEDYLEMTRVCLKLL 413


>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
          Length = 459

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 105/439 (23%), Positives = 189/439 (43%), Gaps = 65/439 (14%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W  +NG     V +K++P  +      + + A E ++       +P  LV+T      ++
Sbjct: 50  WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103

Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
            +  LL  +     +S +A LA++L+ E   G+ SFW PYI  L         +  + L 
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           ++  EL  L+GS    E L+    I R+Y     + F    L +  P+    + FT++++
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLY 210

Query: 260 KQAFVAV---QSCVVHLQKVSL----------ARRFALVPL--------GPPLLAYSS-- 296
           + A  AV   Q+ V   +   L          A   ALVPL        G  L  Y S  
Sbjct: 211 RWAVSAVMTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSA 270

Query: 297 ---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
              +C AM             R +  GE + ++ G + N++  I+ GFV EDN YD + +
Sbjct: 271 SMVRCYAM-------------RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDI 317

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
           +  ++ +DP +  K  + + +  LS+        R++    D+  +LR+  + D S+ ++
Sbjct: 318 KLGVSKKDPLFAVKSKLCE-DHDLSLSGTFALVARDRPVSEDLSTFLRILVLKDASQPEA 376

Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
              S   I   S    R  L  L    +  L  +P +  E E ++ D   + + ++A +L
Sbjct: 377 F--SAEHILTSSDSNARDALTFLVVRIELLLKAFPKSDEEYEDIIKDGASNARVKMAARL 434

Query: 474 VRMEKKMLNACLQVTADMI 492
             +E K+L + L+   + +
Sbjct: 435 RLLESKVLASVLETLGNHV 453


>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
           garnettii]
          Length = 595

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E    DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDNAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+ +L +++L  +  +A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVLKNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
          Length = 649

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 170/397 (42%), Gaps = 59/397 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ DL  G+    VP  L +  + V  ++ +   +    L     +AL L+ E  +G 
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+K EY +  
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 348

Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
                  S F+    +I +E         TF  F  AF  ++S V    +     + AL+
Sbjct: 349 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 398

Query: 286 PLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 333
           P    L+ +S            K K +    D    L      K+GE I + +   + N+
Sbjct: 399 PFAD-LVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNA 456

Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI 393
           +L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +  G   E++
Sbjct: 457 ELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG---ESL 513

Query: 394 S-DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
              MLPYLRL  +  T           +V   L    PVS   E A+   + +  K+ L 
Sbjct: 514 PPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHL--ELPVSQDNEEAICQVIRNACKSALG 571

Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
            Y  T+ EDE +L   NL P+ ++A ++   EKK+L 
Sbjct: 572 AYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 608


>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 588

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   +    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
          Length = 591

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 103/433 (23%), Positives = 194/433 (44%), Gaps = 47/433 (10%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 305
             +P  +AFT+E ++   V   S  +    + L     +   G P +      +      
Sbjct: 231 NKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKPQITTGYNLE------ 281

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 282 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 341

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
            K  V  R G  +  VF +H   E    + +L +LR+  +++    + +        I +
Sbjct: 342 MKAEVLARAGIPTSSVFALHF-TEPPVSAQLLAFLRVFCMTEEELKEHLLGDNAIDRIFT 400

Query: 418 LG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
           LG    PVS   E  +   L D     L  Y  T+ ED+A L ++NL  +  +A +L   
Sbjct: 401 LGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAFLKNHNLSVRATMAIKLRLG 460

Query: 477 EKKMLNACLQVTA 489
           EK++L   ++  A
Sbjct: 461 EKEILEKAVKSAA 473


>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
          Length = 589

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   +    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHY-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLRNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Oryza sativa Japonica Group]
 gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/395 (25%), Positives = 167/395 (42%), Gaps = 55/395 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ DL  G+    VP  L +  + V  ++ +   +    L     +AL L+ E  +G 
Sbjct: 90  LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L RQ        +S + WSE EL  + G+    ++L    G+K EY +  
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 194

Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
                  S F+    +I +E         TF  F  AF  ++S V    +     + AL+
Sbjct: 195 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 244

Query: 286 PLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSK 334
           P    +           ++  K K +    D    L      K+GE I + +   + N++
Sbjct: 245 PFADLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNAE 303

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
           L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +  G  +    
Sbjct: 304 LALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG--ESLPP 361

Query: 395 DMLPYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 447
            MLPYLRL  +  T           +V   L    PVS   E A+   + +  K+ L  Y
Sbjct: 362 QMLPYLRLLCLGGTDAFLLEALFRNAVWGHLE--LPVSQDNEEAICQVIRNACKSALGAY 419

Query: 448 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
             T+ EDE +L   NL P+ ++A ++   EKK+L 
Sbjct: 420 HTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVLQ 454


>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
          Length = 492

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/418 (23%), Positives = 189/418 (45%), Gaps = 39/418 (9%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
           +      PVS   E  +   L D     L  Y  T+ ED+++L + +L  + ++A +L
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNQDLSVRAKMAIKL 467


>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
           caballus]
          Length = 609

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/434 (23%), Positives = 194/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 87  GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 244

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 245 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLITTGYNL 304

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 305 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 364

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          + +  + 
Sbjct: 365 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDRIF 423

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED A L + +L  +  +A +L  
Sbjct: 424 TLGNSEYPVSWDNEVKLWTFLEDRALLLLKTYKTTVEEDRAFLKNSDLSVRATMAIKLRL 483

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 484 GEKEILEKAVKSAA 497


>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 591

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   +    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
          Length = 585

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  +AFT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H   +    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TDPPVSAQLLAFLRVFCMTEEELKEHLLGDNALDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED++ L +++L  +  +A +L  
Sbjct: 410 TLGNSEYPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKSFLKNHDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
          Length = 591

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 102/437 (23%), Positives = 193/437 (44%), Gaps = 45/437 (10%)

Query: 72  SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
            K+E+   +L  W   NG  +   +V+  E+            + A+ +++A +    VP
Sbjct: 73  GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
             L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                   ++PL + E E+ YL  +    ++  + +   R+Y      +F    + Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHP 228

Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
           +   +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +       
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
              DD  + V  R + AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D 
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDR 348

Query: 363 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPI- 421
            Y  K  V  R G  +  VF +H   E    + +L +LR+ +     E++  +   G I 
Sbjct: 349 LYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV-FCMTEEELREHLLGDGAID 406

Query: 422 ---------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
                     PVS   E  +   L D     L  Y  T+ ED+A+L    L  +  +A +
Sbjct: 407 RIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKAVLRSPALSARAAMAVK 466

Query: 473 LVRMEKKMLNACLQVTA 489
           L   EK++L   ++  A
Sbjct: 467 LRLGEKEILEKAVRSAA 483


>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
           latipes]
          Length = 606

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 101/436 (23%), Positives = 187/436 (42%), Gaps = 45/436 (10%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T+  GSRE      +   DL SW  +NG       +    +     R       + D++A
Sbjct: 69  TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116

Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
            +    VP  +++T+E    N  +  + + +++ +      LAL+L+ E+     SFW P
Sbjct: 117 EELFLWVPRKMLMTVESAQ-NSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YIR L ++        ++PL + + ++  L G+    ++L + +   R+Y      +F  
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY-----AYFY- 221

Query: 239 GSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAY 294
             L Q +P    +P  + F+F+ ++ A  +V +    +  V  +R   AL+PL       
Sbjct: 222 -KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHT 280

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           +          DD  + V  + YK  E I ++ G + N++ +I+ GF  +DN +DR+ ++
Sbjct: 281 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHDRVKIK 340

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL---------GYV 405
             ++  +  Y  K  V  R G  +  VF +H   +    + +L +LR+          Y+
Sbjct: 341 LGVSKSERLYAMKAEVLARAGIPASCVFALHC-NDPPISAQLLAFLRVFCMTEEELKDYL 399

Query: 406 SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 465
                +  + +      PVS   E  +   L       L  Y  T  ED ++L   +L  
Sbjct: 400 LGERAINKIFTLGNSDFPVSWENEIKLWTFLETRAALLLKTYKTTSEEDRSILEKPDLSL 459

Query: 466 KKRVATQLVRMEKKML 481
             R+A QL   EK++L
Sbjct: 460 HTRLAVQLRLAEKQIL 475


>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
           boliviensis boliviensis]
          Length = 513

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 93/391 (23%), Positives = 182/391 (46%), Gaps = 33/391 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
           A+ D++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+   
Sbjct: 28  ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFW PYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y   
Sbjct: 86  PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY--- 135

Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
              +F    + Q +P+   +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
                  +          DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N 
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 251

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
           +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    + +L +LR+  +++
Sbjct: 252 HDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTE 310

Query: 408 TSEMQSV--------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
               + +        I +LG    PVS   E  +   L D     L  Y  T+ ED+ +L
Sbjct: 311 EELKEHLLGDNAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKFVL 370

Query: 459 TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
            + +L  + ++A +L   EK++L   ++  A
Sbjct: 371 KNQDLSVRAKMAIKLRLGEKEILEKAVKSAA 401


>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
 gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
          Length = 529

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 164/395 (41%), Gaps = 50/395 (12%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           D++AG+    +P +L VT   V  +  +A L       EL  LAL+L  E+ +G  S W 
Sbjct: 83  DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
           PY+  L           + PLLW+  EL   L GSP + + + R E    EY  +     
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASI----- 189

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL-------- 287
                 +  P D P +A+ F + + AFV A+ + +     ++ A  +A+VPL        
Sbjct: 190 --ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAMVPLVDLLPLVG 246

Query: 288 -----------------GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
                            G P LA ++      AA +    +  +   +    + V    +
Sbjct: 247 SPPPGVSPAAAAGGPAVGKPGLAAAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLAR 306

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGRE 389
               L +  G VDE +  D L   A+    D  Y+ KR + +  G     Q F V A R 
Sbjct: 307 NAGDLFLATGAVDESHCGDYLAFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADRM 366

Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
                 +L Y+R   V D  E+ SV      I  VSP  E  VL  L    +  LA Y +
Sbjct: 367 P---MQLLAYMRFARVQDPGELMSVSFEEDRI--VSPMNEYEVLQLLMQDAREMLAEYES 421

Query: 450 TLSEDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 483
           +  E E + L +  L  ++RVA +L   EK+++NA
Sbjct: 422 SSEEFELLQLKEKGLSARQRVAAKLRLAEKRLINA 456


>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
          Length = 585

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 100/434 (23%), Positives = 194/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   +L  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 62  GKREDYFPNLMKWASENG---ASVEGFEMFNFKEEGFGLR---ATRDIKAEELFLWVPRK 115

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 116 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 171

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 172 -----YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 219

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 220 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 279

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  R ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 280 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 339

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          + +  + 
Sbjct: 340 AMKAEVLARAGIPTSSVFALHF-MEPPISAQLLAFLRVFCMTEEELKDHLLGDNAIDKIF 398

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T  ED++ L +++L  + R+A +L  
Sbjct: 399 TLGNSEYPVSWDNEVKLWTFLEDRASLLLKTYKTTSEEDKSFLKNHDLSVRARMAIKLRL 458

Query: 476 MEKKMLNACLQVTA 489
            EK++L   +   A
Sbjct: 459 GEKEILEKAVTSAA 472


>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
           guttata]
          Length = 593

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 91/393 (23%), Positives = 184/393 (46%), Gaps = 33/393 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ +L  +    ++  + +   R+Y 
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
           PL       +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
           N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPAISAQLLAFLRVFCM 390

Query: 406 SDTSEMQSVIS--SLGPICPVSPCMERAVLD---QLADYFKAR----LAGYPATLSEDEA 456
           S+    + +I   ++G I  +         D   +L  + +AR    L  Y  T+  D++
Sbjct: 391 SEEELKEHLIGEHAIGKIFTLGNSDFPVSWDNEVKLWTFLEARASLLLKTYKTTVEVDKS 450

Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
            L  ++L P   +A +L   EK++L   ++  A
Sbjct: 451 FLETHDLTPHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 524

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 134/309 (43%), Gaps = 40/309 (12%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L  W+    LP  K+ L+   +          + A+E+++ G+A   VP + ++T+ER 
Sbjct: 83  ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
           +    +       +L E + LA +L  +    + G    +  YIR L R+ G       S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187

Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            L W E E+   L GSP++    ER + +    +E+ +           +P        T
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------IT 231

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
               + AF  + S ++ L   ++    ALVP    +L +   C A +    DAV L  DR
Sbjct: 232 VGALRWAFDILFSRLIRLD--AMGGELALVPW-ADMLNHKPGCAAFIDLNGDAVNLTTDR 288

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
            Y  GE +    G +P+S+LLI+YGF  E  +NP D   +   ++  DP    K  V + 
Sbjct: 289 SYVKGEQVWASYGQRPSSELLISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRD 348

Query: 374 NGKLSVQVF 382
            G   V+ F
Sbjct: 349 MGLSPVETF 357


>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Strongylocentrotus purpuratus]
          Length = 682

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 104/438 (23%), Positives = 189/438 (43%), Gaps = 37/438 (8%)

Query: 64  VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
           VAG     S +E        W++ NG+    V +        K    + + A++D++   
Sbjct: 66  VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118

Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
              ++P  +++T +  + + TI +L+  ++L +      LA++++ EK +   SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L         +   PL ++  E+    GS    E L++ + I R+Y  L    F   +
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL----FKLLN 226

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLLAYSSKCK 299
           L +     I  E FT++ ++ A   V +    +  K       +L+PL       + + K
Sbjct: 227 LPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCNHANGEMK 285

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
                  D+   +  R +  GE I +  G + ++ LL+  GFV   N YD + ++  L++
Sbjct: 286 TDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLYSGFVYPGNVYDGMAIQLGLSS 345

Query: 360 EDPQYQDKRMVAQRNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSD---------T 408
            D  Y  K  +     KL V  Q +H+ AG+E   + ++L +LR+  + D          
Sbjct: 346 SDRLYAMKAQLCSVM-KLGVPSQNYHISAGKEPVTL-ELLTFLRIFCMQDLELRDRLLGD 403

Query: 409 SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 468
           +  Q++ S +     +S   E      LA         Y  ++ EDE  L D NL  ++R
Sbjct: 404 NRAQALFSLVDRSQIISKLNELRTCVYLATRVTLLQRQYKTSIQEDEEKLKDGNLSAQER 463

Query: 469 VATQLVRMEKKMLNACLQ 486
            A QL+ +EK  L   L+
Sbjct: 464 SALQLLLIEKCTLENVLE 481


>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 115/431 (26%), Positives = 175/431 (40%), Gaps = 52/431 (12%)

Query: 76  EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           ED  +L +W+ +  G+    ++ KE        R    VA   D+ AG    +VP    V
Sbjct: 47  EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           T   V  +  ++ L       EL  LAL+L  E+ +G  S W PY++ L           
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTLAAN-------P 151

Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
           ++PL W+E E  A L GSP   + +ER+   + EY  +  V        +  P   P EA
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEA 204

Query: 254 FTF---EIFKQAFVAVQSCVVHLQKVSLARRFALVPL-------GPPLLAY---SSKCKA 300
           + F   E F  A   V +    L   S    +ALVPL       G P+      S+K   
Sbjct: 205 YEFFTEERFVDALATVCAKATWLPTASC---YALVPLLDVITIAGSPVPGVSPPSAKDGI 261

Query: 301 MLAAVD---DAVQLVVDRPYKA-GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVE 354
              A D   D+  +V+    KA   S VV   P  + N +L +N G VD+ +P D L + 
Sbjct: 262 ARCAADYDVDSACVVLSAVVKAPANSRVVQLDPLQRNNGELFLNTGRVDQKHPGDYLYMR 321

Query: 355 AALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
             +   D  +  K+ V +  G     Q F V+   E    + +  YLR   V D  EM +
Sbjct: 322 TEIQPSDRLFSAKKQVLEGMGFTAENQYFPVY---EDRMPTQLYSYLRFARVQDPGEMMA 378

Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
           V      I  VS   E  +L  L    +  ++ Y     ++  +L   +    + +    
Sbjct: 379 VSFEEDKI--VSVMNEYEILQLLMGDCRELMSEYDTNEEDELNLLKLSDTMRVREIEAAK 436

Query: 474 VRMEKKMLNAC 484
           +RM +K L  C
Sbjct: 437 LRMSEKKLIGC 447


>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
          Length = 597

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 76  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 129

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 130 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 185

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 186 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 233

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 234 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 293

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 294 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 353

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I 
Sbjct: 354 AMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 412

Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L  
Sbjct: 413 TLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 472

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 473 GEKEILEKAVKSAA 486


>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
 gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=Endothelial differentiation inhibitory protein D10;
           AltName: Full=SET domain-containing protein 3
 gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
 gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
 gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
 gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
 gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
 gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
          Length = 594

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 100/434 (23%), Positives = 195/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H+  E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L  
Sbjct: 410 TLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
 gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
          Length = 389

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/251 (26%), Positives = 125/251 (49%), Gaps = 23/251 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS  +  G+    V + L++T E++   E + +LL+ + +S  A LAL+L+  +K+ + S
Sbjct: 5   ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L         ++ S + W++ EL YL  SP   E ++R + ++ E+   +  
Sbjct: 62  AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
             +        P+   +     E FK A+  V S    ++ +   +  ALVP       +
Sbjct: 117 LLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLALVPF-VDFFNH 164

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
            + C+AML+  +D    ++V DR Y  G+ +V+  G   N+ L +++GF    NP+D++ 
Sbjct: 165 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVA 224

Query: 353 -VEAALNTEDP 362
            +  +L+ +DP
Sbjct: 225 GIWLSLSEKDP 235


>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
           griseus]
 gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
          Length = 577

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 100/434 (23%), Positives = 194/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
 gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
          Length = 594

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 102/434 (23%), Positives = 197/434 (45%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  E+FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAESVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------IS 416
             K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 417 SLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L  
Sbjct: 410 TLGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
 gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
          Length = 439

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 167/377 (44%), Gaps = 39/377 (10%)

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           S+P +L +  + V  +E I E      L     +ALYL++EK +   S W  YIR L R 
Sbjct: 37  SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                  ++SPL WSE ELA L G+   + +    E +KREY+++ T       + +  P
Sbjct: 92  ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMT------EVIEPRP 139

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSLA----RRFALVPLGPPLLAYSS 296
                  +T E F  AF  ++S         +L  V LA      F L    P    +  
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDP---GWKV 196

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVE 354
           K   + A  +           +  E ++ +   + N++L  +YGFVD D  N  D   + 
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLT 256

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQ 412
             ++  +    DK  +AQ  G  S   F+++  R +    DM+ YLRL   + SD+  ++
Sbjct: 257 LQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLE 314

Query: 413 SVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
           ++  +        P+S   E A+ + + +  +A L  Y +T+ ED  +L    L  +K++
Sbjct: 315 ALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKM 374

Query: 470 ATQLVRMEKKMLNACLQ 486
           A  +   EK++L   LQ
Sbjct: 375 AVVVRLGEKRILQEQLQ 391


>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
          Length = 507

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 94/380 (24%), Positives = 168/380 (44%), Gaps = 28/380 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +A   VP    +  + V  +E  +       L     +AL+L+ EK +  
Sbjct: 81  LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L            S + WSE EL  + G+      L   E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
               +  S    +P  +    + F I + +AF  ++      Q + L     L+   P +
Sbjct: 190 EEVILPHSQLFPFPVTLDDFLWAFGILRSRAFSRLRG-----QNLVLIPLADLINHSPSI 244

Query: 292 LA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
               Y+ + K A L + D    L      KAGE +++ +   + N++L ++YGF++    
Sbjct: 245 TTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFIESRPN 304

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS- 406
            +   +   ++  DP + DK  +A+ NG   +  F +  G+   A   MLPYLRL  +  
Sbjct: 305 RNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--MLPYLRLVALGG 362

Query: 407 -DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
            D   ++S+  +   G +  PVS   E  +   + D  K+ L+GY  T+ EDE +  + N
Sbjct: 363 PDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKLKEEGN 422

Query: 463 LHPKKRVATQLVRMEKKMLN 482
           LHP+  +A  +   EKK+L 
Sbjct: 423 LHPRLEIAVGVRTGEKKVLQ 442


>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 170/385 (44%), Gaps = 38/385 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +A   VP    +  + V  +E  +       L     +AL+L+ EK +  
Sbjct: 81  LVAQRDIARNEAVLEVPKRFWINPDAVAASEIGS---VCGGLKPWVSVALFLIREKLR-D 136

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L            S + WSE EL  + G+      L   E ++ E+ +++
Sbjct: 137 ESPWRSYLDILPEY-------TNSTIYWSEEELVEIQGTQLSNTTLGVKEYVQSEFLKVE 189

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +  S  Q +P+ +     T + F  AF  ++S      +    +   L+PL     
Sbjct: 190 EEVILPHS--QLFPFPV-----TLDDFLWAFGILRSRAFSRLR---GQNLVLIPLADLIN 239

Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
             P +    Y+ + K A L + D    L      KAGE +++ +   + N++L ++YGF+
Sbjct: 240 HSPSITTEEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVLIQYDLDKSNAELALDYGFI 299

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           +     +   +   ++  DP + DK  +A+ NG   +  F +  G+   A   MLPYLRL
Sbjct: 300 ESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAYFDIVLGQSLPAA--MLPYLRL 357

Query: 403 GYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
             +   D   ++S+  +   G +  PVS   E  +   + D  K+ L+GY  T+ EDE +
Sbjct: 358 VALGGPDAFLLESIFRNTIWGHLELPVSRANEELICQVIQDACKSALSGYLTTIEEDEKL 417

Query: 458 LTDYNLHPKKRVATQLVRMEKKMLN 482
             + NLHP+  +A  +   EKK+L 
Sbjct: 418 KEEGNLHPRLEIAVGVRTGEKKVLQ 442


>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
 gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
          Length = 430

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 168/377 (44%), Gaps = 39/377 (10%)

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           S+P +L + ++ V  +E I E      L     +ALYL++EK +   S W  YIR L R 
Sbjct: 37  SIPKTLWMDVDTVRRSE-IGECCAG--LRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                  ++SPL WSE ELA L G+   + I    E +KREY+++ T       + +  P
Sbjct: 92  ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMT------EVIEPRP 139

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSLA----RRFALVPLGPPLLAYSS 296
                  +T E F  AF  ++S         +L  V LA      F L    P    +  
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDP---YWHV 196

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVE 354
           K   + A  +           +  E ++ +   + N++L  +YGFVD D  N  D   + 
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGFVDSDEKNNRDSFTLT 256

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG--YVSDTSEMQ 412
             ++  +    DK  +AQ  G  S   F+++  R +    DM+ YLRL   + SD+  ++
Sbjct: 257 LQVSLSERFADDKVDIAQMAGLDSTAYFNLY--RNQGPPEDMIAYLRLIALFGSDSFLLE 314

Query: 413 SVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
           ++  +        P+S   E A+ + + +  +A L  Y +T+ ED  +L    L  +K++
Sbjct: 315 ALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYSSTIDEDTMLLNSSELSTRKKM 374

Query: 470 ATQLVRMEKKMLNACLQ 486
           A  +   EK++L   LQ
Sbjct: 375 AVVVRLGEKRILQEQLQ 391


>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
 gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 85.5 bits (210), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 89/383 (23%), Positives = 168/383 (43%), Gaps = 29/383 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
           + A++D++A +    +P  +++T+E    N  +  L + +++ +      LAL+L+ E+ 
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               S WLPYI+ L  +        ++PL + E E+ +L  +    ++L + +   R+Y 
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
                +F            +P  +AFTF+ ++ A  +V +    +     +R   AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
                  +          DD  + V  + YK GE I ++ G + N++ +I+ GF  EDN 
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNA 333

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----- 402
           +DR+ ++  ++  +  Y  K  V  R G  +  +F +H   E    + +L +LR+     
Sbjct: 334 HDRVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTE 392

Query: 403 ----GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
                Y+     +  + +      PVS   E  +   L       L  Y     ED +ML
Sbjct: 393 EELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSML 452

Query: 459 TDYNLHPKKRVATQLVRMEKKML 481
              +L    R+A +L   EK++L
Sbjct: 453 EKPDLSLHSRIAIKLRLAEKEIL 475


>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 99/434 (22%), Positives = 194/434 (44%), Gaps = 39/434 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y      +F    + Q +P+ 
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVI 415
             K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + 
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIF 409

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           +      PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L  
Sbjct: 410 TLGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRL 469

Query: 476 MEKKMLNACLQVTA 489
            EK++L   ++  A
Sbjct: 470 GEKEILEKAVKSAA 483


>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
           rubripes]
          Length = 598

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/392 (23%), Positives = 175/392 (44%), Gaps = 43/392 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
           A+ D++A +    +P  +++T+E       +  L   +++ +      LAL+L+ E+   
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERAN- 167

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFWLPYIR L ++        ++PL + + E+  L G+    ++L +     R+Y   
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY--- 217

Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
              +F    L Q +P    +P  ++FTF+ ++ A  +V +    +     ++V+LA    
Sbjct: 218 --AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA---- 269

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           L+PL       +          DD  + V  + YK  E I ++ G + N++ +I+ GF  
Sbjct: 270 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFY 329

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 403
           ++N +D++ ++  ++  +  Y  K  V  R G     +F ++   E+   + +L +LR+ 
Sbjct: 330 QENAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCN-EQPISAQLLAFLRV- 387

Query: 404 YVSDTSEMQSV---------ISSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
           +     E++           I +LG +  PVS   E  +   L       L  Y  T  E
Sbjct: 388 FCMKEEELRDYLLGGHAINKIVTLGSMEFPVSWDNEIKLWTFLETRVALLLKAYKTTSEE 447

Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
           D + L    L P  R+A QL   EK +L   L
Sbjct: 448 DSSTLEKSELSPHSRMAIQLRLAEKWILEKAL 479


>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
          Length = 593

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/397 (23%), Positives = 184/397 (46%), Gaps = 41/397 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +         +PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
           PL       +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
           N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCM 390

Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
           S+    + +I         +LG    PVS   E     +L  + +AR    L  Y  T+ 
Sbjct: 391 SEEELKEHLIGEHAIDKIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVE 446

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
           +D++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 447 DDKSFLETHDLTSHAIMAIKLRLGEKEILEKAVKSAA 483


>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/383 (22%), Positives = 167/383 (43%), Gaps = 29/383 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
           + A++D++A +    +P  +++T+E    N  +  L + +++ +      LAL+L+ E+ 
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               S WLPYI+ L  +        ++PL + E E+ +L  +    ++L + +   R+Y 
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
                +F            +P  +AFTF+ ++ A  +V +    +     +R   AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
                  +          DD  + V  + YK GE I ++ G + N++ +I+ GF  EDN 
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNA 333

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----- 402
           +DR+ ++  ++  +  Y  K  V  R G  +  +F +H   E    + +L +LR+     
Sbjct: 334 HDRVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCS-EPPISAQLLAFLRVFCMTE 392

Query: 403 ----GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
                Y+     +  + +      PVS   E  +   L       L  Y     ED +ML
Sbjct: 393 EELRDYLVGDHAINKIFTLGNTEFPVSWENEIKLWTFLETRAALLLKTYKTASEEDRSML 452

Query: 459 TDYNLHPKKRVATQLVRMEKKML 481
              +L    R+  +L   EK++L
Sbjct: 453 EKPDLSLHSRITIKLRLAEKEIL 475


>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 75/274 (27%), Positives = 128/274 (46%), Gaps = 28/274 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  ++ G+    V   L++T  ++     + ELL T  ++E A LAL+++ E+  G+ S
Sbjct: 5   AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L         A+ S + W + EL  +  +    E ++R   I  E+  +  V
Sbjct: 62  QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                   Q+ P+ I  E      FKQA+   +S      + S   R   VP       +
Sbjct: 117 -------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILTVPF-VDFFNH 162

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
            S C+A+L+  ++    +++ D+ Y  GE +V+  G  PN+ L +++GF    NPYD++ 
Sbjct: 163 DSNCRALLSYDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFTISCNPYDQVE 222

Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 386
           V  AL+  DP  + K  +   +G  +V    VHA
Sbjct: 223 VWMALSHRDPLRKMKLALLHAHGMPTV----VHA 252


>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 444

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/391 (23%), Positives = 173/391 (44%), Gaps = 41/391 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
           A+ D++A +    +P  +++T+E       +  L T +++ +      LAL+L+ E+   
Sbjct: 28  ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
             SFWLPYIR L ++        ++PL + + ++  L G+    ++L +     R+Y   
Sbjct: 86  PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY--- 135

Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
              +F    L Q +P    +P  ++FTF+ ++ A  +V +    +     ++V+LA    
Sbjct: 136 --AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA---- 187

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           L+PL       +          DD  + V  + YK  E I ++ G + N++ +I+ GF  
Sbjct: 188 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFY 247

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL- 402
           ++N +D++ ++  ++  +  Y  K  V  R G     VF ++   E    + +L +LR+ 
Sbjct: 248 QENAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCN-EPPISAQLLAFLRVF 306

Query: 403 --------GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
                    Y+     +  +++      PVS   E  +   L       L  Y  T  ED
Sbjct: 307 CMMEEELKDYLFGAQAINRLVTLGSMEFPVSWENEIKLWTFLETRAALLLKAYKTTAEED 366

Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
            + L   +L P  R+A QL   EK +L   L
Sbjct: 367 SSTLDKTDLSPHSRMAVQLRLAEKAILEKAL 397


>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
 gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
          Length = 389

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 67/251 (26%), Positives = 125/251 (49%), Gaps = 25/251 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS  +  G+    V + L++T E++   E + +LL+ + +S  A LAL+L+  +K+ + S
Sbjct: 5   ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L         ++ S + W++ EL YL  SP   E ++R + ++ E+   + V
Sbjct: 62  AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L QQ    + +   T       ++ V S    ++ +   +  ALVP       +
Sbjct: 117 CM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLALVPF-VDFFNH 162

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
            + C+AML+  +D    ++V DR Y  G+ +V+  G   N+ L +++GF    NP+D++ 
Sbjct: 163 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQVA 222

Query: 353 -VEAALNTEDP 362
            +  +L+ +DP
Sbjct: 223 GIWLSLSEKDP 233


>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
 gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
           gallus]
 gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
          Length = 593

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/397 (23%), Positives = 185/397 (46%), Gaps = 41/397 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
           PL       +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
           N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCM 390

Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
           ++    + +I         +LG    P+S   E     +L  + +AR    L  Y  T+ 
Sbjct: 391 NEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVE 446

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
           +D++ L  ++L     +A +L   EK++L   ++  A
Sbjct: 447 DDKSFLETHDLTSHATMAIKLRLGEKEILEKAVKSAA 483


>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 492

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 169/387 (43%), Gaps = 38/387 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ +L  G+    VP  L +  + V  ++      +   L     ++L ++ E  +G 
Sbjct: 86  LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L RQ        +S + WSE EL  + G+   +  +   E ++ E++ ++
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
               + G     +P     +  TF+ F  AF  ++S V    +     + AL+P    L+
Sbjct: 199 AK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLALIPFAD-LI 247

Query: 293 AYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYG 340
            +S+           + K  L   D    L      K+GE + V +   + N++L ++YG
Sbjct: 248 NHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLDKSNAELALDYG 306

Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL 400
           F + ++  D   +   ++  DP Y DK  +A+ NG      F V  G  +     M+ YL
Sbjct: 307 FTETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYL 364

Query: 401 RLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           RL  +  T    ++++  +   G +  PVS   E ++   +    K+ L  Y  T+ EDE
Sbjct: 365 RLLCLGGTDAFLLEALFRNKVWGFLELPVSRDNEESICQVIQTACKSALTAYHTTIEEDE 424

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLN 482
            +L   +L  + ++A ++   EKK+L 
Sbjct: 425 ELLKREDLQSRHQIAVEVRAGEKKVLQ 451


>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
           pulchellus]
          Length = 485

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/417 (23%), Positives = 175/417 (41%), Gaps = 35/417 (8%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W   NG     V +K+ P         +   A E ++  +    VP  L++T        
Sbjct: 80  WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132

Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
            +  LL  +     +S +A LA++L+ E   G+ SFW PYI  L         +  + L 
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           +S  EL  L GS    E L+    I R+Y+    + F    L +  PY    + FT++++
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLY 240

Query: 260 KQAFVAVQS--CVVHLQKVSLARR-------FALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           + A  AV +    V L   +            A+VPL         K         + ++
Sbjct: 241 RWAVSAVMTRQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLR 300

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
               R ++ G+ + ++ G + N++  I+ GFV  +N +D + ++  ++ +DP Y  K  +
Sbjct: 301 CYAMRDFEKGQEVTIFYGRRTNAEFFIHNGFVFPENRHDSVDIKLGISKQDPLYAVKAKL 360

Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
              +      +F +   RE+    D+  +LR+  + D S+  S       I   +    R
Sbjct: 361 CDDHELTPSGIFAL-VPRERPVCEDLSTFLRILVLKDASQAASFTDE--HIMVATDDNAR 417

Query: 431 AVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 487
             L+ L    +  L  +P +  E E ++ D   + + ++A QL  +E+K+L A L+ 
Sbjct: 418 EALNFLIVRIQLLLRAFPKSDQEYENIIADEGSNGRLKMAAQLRLLERKILTAVLET 474


>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
           harrisii]
          Length = 602

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 102/438 (23%), Positives = 195/438 (44%), Gaps = 47/438 (10%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG       L      N K      + A+ +++A +    VP  
Sbjct: 80  GKREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 133

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 134 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 189

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ +L  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 190 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 237

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 238 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 297

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + +  GE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 298 EDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 357

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS-------- 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +I         
Sbjct: 358 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIF 416

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           +LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L  ++L     +A 
Sbjct: 417 TLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAI 472

Query: 472 QLVRMEKKMLNACLQVTA 489
           +L   EK++L   ++  A
Sbjct: 473 KLRLGEKEILEKAVKSAA 490


>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
           3-like [Meleagris gallopavo]
          Length = 593

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 92/397 (23%), Positives = 184/397 (46%), Gaps = 41/397 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    +  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 N-PNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
           PL       +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
           N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-IEPPISAQLLAFLRVFCM 390

Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
           ++    + +I         +LG    P+S   E     +L  + +AR    L  Y  T+ 
Sbjct: 391 NEEELKEHLIGEHAIDKIFTLGNSEFPISWDNEV----KLWTFLEARASLLLKTYKTTVE 446

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
           +D+  L  ++L     +A +L   EKK+L   ++  A
Sbjct: 447 DDKLFLETHDLTSHATMAIKLRLGEKKILEKTVKSAA 483


>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
          Length = 592

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 94/397 (23%), Positives = 182/397 (45%), Gaps = 41/397 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ D++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATRDIKAEELFLWVPRKLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER- 165

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +         + L + E E+ YL  +    +I  + +   R+Y 
Sbjct: 166 YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY- 217

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
                +F    + Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 218 ----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALI 271

Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
           PL       +          DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
           N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    + +L +LR+  +
Sbjct: 332 NSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCM 390

Query: 406 SDTSEMQSVIS--------SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLS 452
           ++    + +I         +LG    PVS   E     +L  + +AR    L  Y  T+ 
Sbjct: 391 TEDELKEHLIGEHTIDRIFTLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIH 446

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
           +D+ +L   +L     +A +L   EK++L   ++  A
Sbjct: 447 DDKFILETQDLTHNATMAIKLRLGEKEILEKAIKSAA 483


>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Monodelphis domestica]
          Length = 595

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 102/438 (23%), Positives = 194/438 (44%), Gaps = 47/438 (10%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W   NG       L      N K      + A+ +++A +    VP  
Sbjct: 73  GKREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ +L  +    ++  + +   R+Y      +F    + Q +P  
Sbjct: 183 -----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + +  GE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 291 EDDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS-------- 416
             K  V  R G  +  VF +H   E    + +L +LR+  +++    + +I         
Sbjct: 351 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGEHAIDRIF 409

Query: 417 SLG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVAT 471
           +LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L  ++L     +A 
Sbjct: 410 TLGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLATHDLTFHATMAI 465

Query: 472 QLVRMEKKMLNACLQVTA 489
           +L   EK++L   ++  A
Sbjct: 466 KLRLGEKEILEKAVKSAA 483


>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
 gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
          Length = 540

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 103/455 (22%), Positives = 184/455 (40%), Gaps = 52/455 (11%)

Query: 67  SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
            +++  K+E D   + +   W+  +G    K  +K   +  E       + ++ D++ G+
Sbjct: 56  GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111

Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
               +P  L   ++T  +  G      LL  N +  +    LALYL+ E      S   P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           Y++ L +          +   W   +   L GSP     +    G  R+Y      +F  
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY- 217

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSS 296
             LF   P  + T  FT+E F  A   VQS       V   +  AL+P        ++  
Sbjct: 218 -QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGG 273

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
           K    +  V   +     + YK GE + ++ GP+PNS+  +  GF  + N  D    +  
Sbjct: 274 KITTFIDPVKHVLTCSAAKSYKKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMD 333

Query: 357 LNTEDPQ--YQDK-RMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLRLGYVS--DTSE 410
           L+ ED +    DK  ++ +R G    Q   +      E + ++++P+ R+  +S  +T +
Sbjct: 334 LDNEDDRDIAHDKIHILEERCGLRVGQTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKK 393

Query: 411 M-----------QSVISSLGP--ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
           +                 + P     +S   E+     L D  KARL+GYP TL++DE  
Sbjct: 394 LAPPQEEGHHHHHQGPMDMKPEAFNIISEENEKKAFKLLLDSLKARLSGYPTTLAQDEQE 453

Query: 458 LTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           + + N   ++R    ++  EKK+L   ++    +I
Sbjct: 454 MKN-NPTTQRRYVLYILINEKKILERNIKYVEQLI 487


>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
           variabilis]
          Length = 398

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 171/389 (43%), Gaps = 37/389 (9%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           S+ +  G+  F+VP +  +T +    ++  + L     L     +AL+L++E+  G  S 
Sbjct: 4   SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PYI  L    G       SP+ W E +LA L GS     +       ++ +++L    
Sbjct: 61  WAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEV 113

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSLARRFALVPLGP 289
           F   S      +D P   F F+ F  A   V++         ++  V LA      P  P
Sbjct: 114 FGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWP 167

Query: 290 PLLA-YSSKCKAMLAAVDDAVQLVVDRP--YKAGESIVVWCGPQ-PNSKLLINYGFVDE- 344
           P  A +  K    L        LV++      AG++I +  GPQ  + +LL+++G +D  
Sbjct: 168 PDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPL 227

Query: 345 -DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRL 402
            + P   L +E  L+ ED  Y DK  + + N +L+    H+  A R  +A    +  L  
Sbjct: 228 VNQPSYALTLE--LSKEDRNYDDKADILELN-ELAESTEHILRADRAPDAGLLPVLRLLN 284

Query: 403 GYVSDTSEMQSVISSL---GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
              +D   ++S+  +        PVS   ER    QL D   A LA YP ++ ED A++ 
Sbjct: 285 LSGTDAFLLESIFRNEVWEHMQLPVSEDNERGCYQQLIDGCTAALAAYPTSIDEDLALMA 344

Query: 460 DYNLHPKKRVATQL-VRM-EKKMLNACLQ 486
             +L P  R  + + VR+ EK+ L+A L+
Sbjct: 345 SGSLQPGSRRQSAVRVRLGEKEALDATLR 373


>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
 gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
          Length = 500

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 175/389 (44%), Gaps = 39/389 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
           + A +D++A +   ++P  L++T E      ++  L+  +++ ++     LAL+++ EK 
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTE-TARESSLGPLIKKDRILQVMANVSLALHVLCEK- 178

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFW PYI               +PL + E E+ +L GS   +++L + + I R+Y 
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQY- 230

Query: 230 ELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKV--SLARRF-- 282
                +F    LFQ  P    +P  E FTF+ ++ A   V + +    +V  S  R    
Sbjct: 231 ----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWA---VSTVMTRQNQVPTSDGRHLIT 281

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           AL+P+       + +         D+ + +  R +     + ++ G + N++ LI+ GFV
Sbjct: 282 ALIPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAEFLIHNGFV 341

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
             +N +DR+ V+  ++  D  +  K  V  R G  +   F VH G++     ++L +LR+
Sbjct: 342 YPENVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKDP-IPPELLVFLRV 400

Query: 403 -----GYVSD--TSEMQSV-ISSLG-PICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
                G + D  TSE QS  +S LG   C V+   E      L       +  Y  ++ +
Sbjct: 401 FTMVEGDLRDLLTSEHQSAYLSCLGRSDCMVTQEQETKAWAFLETRLSLLIRSYRTSIKD 460

Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
            E  L   ++    R A QL   E ++L+
Sbjct: 461 VETELQAPDMTYHSRAALQLKLAEMQILS 489


>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 105/432 (24%), Positives = 181/432 (41%), Gaps = 61/432 (14%)

Query: 93  CKVILKEKPSHNEKHRPIHYVAASEDL--------QAGDAAFSVPNSLVVTLERVLGNET 144
            + I     SH  +   +     SE L         AGD   +VP S+ + L  V  N +
Sbjct: 43  VQTIWSWAQSHGIQGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWINLAAVQ-NSS 101

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           + +      L     +AL+L++E      S W PY+  L +       +++SPL WS+ E
Sbjct: 102 LGK--ACEGLKPWVAVALFLIHESSN-PSSKWRPYLDSLPK-------SLDSPLFWSDEE 151

Query: 205 LAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
           LA L G+     +    E ++ EYN L + V      +F    Y       TF+ FK AF
Sbjct: 152 LAELVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVY-------TFDGFKWAF 204

Query: 264 VAVQSCVVHLQKVSLARRFALVPL------------GPP--LLAYSSKCKAMLAAVDDAV 309
             ++S              ALVP+            G P  +   +S+   +     D +
Sbjct: 205 GILRSRTFSPLT---GEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLL 261

Query: 310 QLVVDRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDN--------PYDRLVVEAALNTE 360
            +     + AGE +++  G  + N+ L ++YGFV+ D           D L +   ++ +
Sbjct: 262 TVRASANFSAGEQVLMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPD 321

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVI--S 416
           D    DK  + + NG      F +  G+      +M+ +LRL  +S  D+  ++++    
Sbjct: 322 DRFVDDKADILEINGFQCSMQFDLSRGQGPS--DEMITFLRLSALSGPDSFLLEALFRNE 379

Query: 417 SLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
           + G +  PVS   E A+   + +  KA L GY  T+ +D  +L   +L  +  +A  +VR
Sbjct: 380 AWGHVSLPVSRDNEEALCTSMLEGLKAALDGYSTTVEQDMELLARGDLSTRMEIAV-VVR 438

Query: 476 MEKKMLNACLQV 487
           + +K +   LQ 
Sbjct: 439 LGEKRVMQELQT 450


>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 463

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 63/255 (24%), Positives = 119/255 (46%), Gaps = 19/255 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP  + +T + +L        L   ++  +A LA  ++ EKK G+ S
Sbjct: 65  ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L +Q   G+L   + + W+E+EL  +  S    E +++   I++++  +  +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           +  +   F    Y     A T  +F    V +    V  +        AL+P     L +
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLALIPFAD-FLNH 232

Query: 295 SSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
               +A++ + DD       ++Q++ DR Y  GE +++  G   N+ L++++GF    N 
Sbjct: 233 DGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIPYNI 292

Query: 348 YDRLVVEAALNTEDP 362
           YD++ ++  +   DP
Sbjct: 293 YDQVQIQFDIPKHDP 307


>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
 gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
          Length = 371

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 69/251 (27%), Positives = 113/251 (45%), Gaps = 39/251 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  + AG     +P   ++T E V     ++ LL+T+       L+L+L+ EK + ++S
Sbjct: 10  ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65

Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
            W PY+R L       QL  +ES + W + ELA+L  SPT  E +E  + IK E++ L+ 
Sbjct: 66  QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
             F        +  D+  E  +   F  A+   Q                 +P       
Sbjct: 120 NVF-------PWCRDVLGEV-SLTDFMHAYSTDQ-----------------IPFA-DFFN 153

Query: 294 YSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           +   C+  L+     D    V D+ YKAG+ I +  G  PNS L ++YGF    NP++++
Sbjct: 154 HDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQV 213

Query: 352 VVEAALNTEDP 362
            V   ++  DP
Sbjct: 214 EVPMGVSLTDP 224


>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 65/259 (25%), Positives = 118/259 (45%), Gaps = 22/259 (8%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
           ++ G+    V   L++T  R+    +  E   +  ++E + LAL+ +  K  GK S W P
Sbjct: 70  IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YIR L   RG     +++ + W + EL  L  S    +   R   I  +++ +  V    
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFDLVQAV---- 177

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                +YP ++  E  T E FK A+    S    ++ +       +VP    +  + S  
Sbjct: 178 ---VNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITMVPF-VDMFNHDSSA 229

Query: 299 KAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
           +A+LA  ++    ++V D+ Y  G  +V+  G  PNS L +++GF   DNP+D + +   
Sbjct: 230 RALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLPNSSLALDFGFTLPDNPHDEVQIWME 289

Query: 357 LNTEDPQYQDKRMVAQRNG 375
             + DP   +K  + + +G
Sbjct: 290 APSGDPLRAEKLKLLRDHG 308


>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
          Length = 353

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 78/298 (26%), Positives = 138/298 (46%), Gaps = 36/298 (12%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           + D  +L  WM +NG    K  L +  +  E  R +    A++  Q GD   S+P  L++
Sbjct: 28  DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81

Query: 135 TLERVLGNETIAELL---TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           T + VL +  + + +   T  KLS +  +  YL+ E+ + K SFW  YI+ L +      
Sbjct: 82  TTQNVLSS-YLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------ 134

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
            +  +P+ ++  E+ +L     K ++ +  E I   Y EL  ++ +  S F  +      
Sbjct: 135 -SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFK----- 187

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLA 303
             F +  F+ A+  V +  V++ +     +S+ R  +AL P    LL +++  + KA   
Sbjct: 188 GIFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYN 246

Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
            V    ++      K  + + ++ GP  N KL I YGFV   N ++  VVE  L+ ED
Sbjct: 247 PVSKCYEIFTCTACKKYDQMFIYYGPHDNVKLFIEYGFVLPQNQHN--VVE--LDFED 300


>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
          Length = 394

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 153/340 (45%), Gaps = 34/340 (10%)

Query: 72  SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
            K+E+   +L  W   NG  +   +V+  E+            + A+ +++A +    VP
Sbjct: 73  GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124

Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
             L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                   ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV-------IQTHP 228

Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
           +   +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +       
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
              DD  + V  R + AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D 
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDR 348

Query: 363 QYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
            Y  K  V  R G  +  VF +H   E    + +L +LR+
Sbjct: 349 LYAMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 387


>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 789

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 104/417 (24%), Positives = 181/417 (43%), Gaps = 69/417 (16%)

Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
           +R +H   A + ++ G+    +P   ++TLE  L  E  I +L+ +  +  L+     L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEIL 218
           +Y++ EKK   KSFW P++           L VE    P+L+++ EL +L GSP   ++ 
Sbjct: 428 IYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVK 477

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKV 276
           ER E I ++Y              Q     IP  A   T + F  A +   S +  L  +
Sbjct: 478 ERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FI 522

Query: 277 SLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
           +  R  A VPL         AY++      K   ML A +D          + G+ I   
Sbjct: 523 NKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIYYS 573

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRNGKLSVQVFHVH 385
           CG + NS+ L+NYGFV ++N  + + +    + +D     K +M+ +R  K    +F +H
Sbjct: 574 CGRKCNSRFLLNYGFVVKNNEANEIQLRVDFDKKDETLPIKLQMIGKR--KPESLIFRIH 631

Query: 386 AGREKEAISDMLPYLRLGYVSDTSEMQ-----SVISSLGPIC--PVSPCMERAVLDQLAD 438
              E++++ +   +LR   + D   ++     S      P+   P S   E+ +  ++  
Sbjct: 632 INYEEKSVLEFFGFLRFVLIRDYIVLEKFHEMSEGKEFDPLRTPPFSIENEKQMWTEIHK 691

Query: 439 YFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
                +  YP TL ED+ +L    L   ++    L   EK++L   + +   M  LL
Sbjct: 692 ICAEIMIQYPTTLDEDKKILETSKLTINQKNCVILRMGEKEILMYYITMADRMKKLL 748


>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
 gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
 gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
 gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 463

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 67/273 (24%), Positives = 128/273 (46%), Gaps = 25/273 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ + AGD    VP +  +T + +  +     +L +N++  +  LA  L+ EKK G+KS
Sbjct: 75  ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W+PYI  L +        + S + W E EL+ +  S    E +++   I+++++     
Sbjct: 132 RWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
            F+A +  Q  P  I TE    E F  A+  V S     +    ++R +L+P    +   
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIPFADFMNHD 233

Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
                 +L   D+ + ++  DR Y  G+ + +  G   N+ L++++GF    N +D + +
Sbjct: 234 GLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQI 293

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 383
           +  +  +DP    K  + Q +   +V+   +FH
Sbjct: 294 QMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 326


>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
          Length = 581

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 101/422 (23%), Positives = 180/422 (42%), Gaps = 72/422 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
           V A + + A +    +P S ++TLE  +  ET +A+ +   +L  L+     L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           K +   SFW PY+  L       Q     P+ ++  +L +L GSP   +I ++   +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           YN++  V       F QY         +F  F  A +   S +  +  +   +  A VPL
Sbjct: 276 YNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAFVPL 321

Query: 288 G-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
                   P L +  YS + +  +   D+ +    DR    G+ I    G + NS+ L+N
Sbjct: 322 ADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRKCNSRFLLN 373

Query: 339 YGFVDEDNPYDRLVVEAALNTEDPQYQDK--------------RMVAQRNGKLSVQVFHV 384
           YGFV +DN  + + V  A    DP  Q K              R++   +G   +  F  
Sbjct: 374 YGFVVDDNDANEVNVTVAAEFNDPLIQLKEDATEEQLKQPKTFRLIMDTDGINEITHFL- 432

Query: 385 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISS-----LGP--ICPVSPCMERAVLDQLA 437
                +  + + + Y+R   + D +++Q +++      + P  I P+    E  + D + 
Sbjct: 433 -----EATVMEFMSYIRFLVIRDQTQLQFLLNERESKYIKPTKIQPLGIHNELDMWDLIR 487

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPD 497
                 L+ YP TL +D+ +L   +L   +R    L   EK++L    Q +  M  LL +
Sbjct: 488 RICYVSLSRYPTTLEQDKEILQICDLTTNQRNCLILRMGEKEILKFYYQFSEKMKQLLSN 547

Query: 498 VT 499
             
Sbjct: 548 FN 549


>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
          Length = 466

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/344 (23%), Positives = 156/344 (45%), Gaps = 29/344 (8%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 28  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
            + +   R+Y      +F    + Q +P+   +P  E+FT+E ++ A  +V +    +  
Sbjct: 80  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132

Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H+  E    +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHS-TEPPISA 251

Query: 395 DMLPYLRLGYVSDT---------SEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNAEFPVSWDNEVKLWTFLEDRASLLLK 311

Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 312 TYKTTIEEDKIVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 482

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 98/393 (24%), Positives = 175/393 (44%), Gaps = 38/393 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  E V  ++ I  L     L     +AL+L+ EK + +
Sbjct: 79  LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L +       + +S + WSE ELA L G+   +  L   E ++ E+ +L+
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +          D+ +   T + F  AF  ++S      +    +   L+PL     
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237

Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
             P +    Y+ + K A L + D    L      KAGE + + +   + N++L ++YGFV
Sbjct: 238 HNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFV 297

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           + +   +   +   +   DP + DK  +A+ N       F V  G+   A   ML YLRL
Sbjct: 298 ESNPNRNSYTLTIEIPESDPFFGDKLDIAETNKMGETGYFDVVDGQTLPA--GMLQYLRL 355

Query: 403 GYV--SDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
             +  SD   ++S+ ++   G +  PVS   E  +   + D  K+ L+G+  T+ EDE +
Sbjct: 356 VALGGSDAFLLESIFNNTIWGHLELPVSRSNEELICRVVRDACKSALSGFSTTIEEDEKL 415

Query: 458 LTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
           L +  L P+  +A ++   EK++L    Q+  D
Sbjct: 416 LEEGKLDPRLEMALKIRIGEKRVLQQIDQIFKD 448


>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 119/249 (47%), Gaps = 22/249 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ + AGD    VP ++ +T + +  +  ++    T+++  +  LA  L+ EKK+G+KS
Sbjct: 75  ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W+PYI  L +        + S + W E E + +  S    E +++   I++E++     
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFS----- 181

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
            F+A +  Q YP  I  E    E F  A+  V S     +    ++  +L+P    +   
Sbjct: 182 -FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIPFADFMNHD 233

Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
                 +L+  D+ + ++  DR Y  G+ + +  G   N+ L++++GF    N +D + +
Sbjct: 234 GLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTVPYNIHDEVQI 293

Query: 354 EAALNTEDP 362
           +  +  +DP
Sbjct: 294 QMDVPNDDP 302


>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
          Length = 571

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 172/405 (42%), Gaps = 56/405 (13%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           S++++ GD   S+P    VT      +  +A L+   +  EL  LAL+L  EK + K S 
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTV 234
           W PY++ L+           S L ++E E    L G+    E  +R +  K EY  L   
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL------ 287
                    + P   P + + F + + AF+ A+       Q ++ A  +A+VPL      
Sbjct: 233 -------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPI 284

Query: 288 --GPPLLA-------------------YSSKCKAMLAAVDDA-VQLVVDRPYKAGESIV- 324
              PP ++                    + +C      VD A V L  +    AG  I+ 
Sbjct: 285 CGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQ 344

Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 383
           +    + NS+L +++G VD+ +P D       L+  DP Y  K+ V +  G     Q F 
Sbjct: 345 IDHSVRNNSELYLSFGDVDDQHPGDYEYWPTELSENDPLYAAKKSVLEAQGFADKGQTFP 404

Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
           V+  R      + L YLR   V+++ E+ +V  +   +  VSP  E   L  L    + R
Sbjct: 405 VYKDRMPR---EFLSYLRFARVTNSEELFAVSFTEDKV--VSPMNEYETLQLLMADCRDR 459

Query: 444 LAGYPATLSEDEAMLTDY-NLHPKKRVATQLVRMEKKMLNACLQV 487
           ++ Y  T  EDE +L    ++  K R A++L R EK+++   +  
Sbjct: 460 MSAYD-TNEEDELLLQKRDDVSLKIRNASRLRRCEKELVGEMMNA 503


>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
 gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 629

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/288 (28%), Positives = 121/288 (42%), Gaps = 43/288 (14%)

Query: 66  GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
             +E + K E    +L SW+  N      + LK   +HN        + +   +QA +  
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
            S+P   V+TL+ V  +    ++LT  K ++L     A  AL+L+ E+K+   S +  YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKILT-EKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L             P L+SE EL YL G+     + E+ E IK +Y  +  V      
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV------ 297

Query: 241 LFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                   IP     F+FE F+ AF+   S V  + KV   +   +VPL   L    S  
Sbjct: 298 --------IPEFKSEFSFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQ 348

Query: 299 KAMLAAVDDAVQLVVDRPYKA---GESIVVWCGPQPNSKLLINYGFVD 343
           +      DDA      +  K     + I    G + NSKL +NYGFVD
Sbjct: 349 EDSEWVFDDATNCFTVKALKKIQRNQQIHFSYGSKCNSKLFLNYGFVD 396


>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
 gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
          Length = 374

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 115/256 (44%), Gaps = 46/256 (17%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  + AG     +P   ++T E V     ++ LL+T+  +    L+L+L+ EK + ++S
Sbjct: 10  ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTS--NPKTRLSLFLLSEKHKAQES 65

Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
            W PY+R L       QL  +ES + W   ELA+L  SPT  E +E  + IK E++ L  
Sbjct: 66  QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLT- 118

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL-----ARRFALVPLG 288
              +A    Q +P+                     C   L +VSL     A     +P  
Sbjct: 119 ---LANK--QVFPW---------------------CRDALGEVSLTDFMHAYSTDQIPFA 152

Query: 289 PPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
                +   C+  L+     D    V D+ YKAG+ I +  G  PNS L ++YGF    N
Sbjct: 153 -DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAVASN 211

Query: 347 PYDRLVVEAALNTEDP 362
           P++++ V   ++  DP
Sbjct: 212 PHEQVEVPMGVSLTDP 227


>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
           methyltransferase; Short=rbcMT; Flags: Precursor
 gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
 gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
          Length = 491

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 168/381 (44%), Gaps = 31/381 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+  G+    VP    +  + V  +E I  + +   L     +AL+L+ EK +  
Sbjct: 87  LVAKRDIAKGETVLQVPKRFWINPDAVAESE-IGNVCS--GLKPWISVALFLLREKWR-D 142

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y+  L +       + +S + WSE EL+ + G+   +  +   + ++ E+ +++
Sbjct: 143 DSKWKYYMDVLPK-------STDSTIYWSEEELSEIQGTQLLSTTMSVKDYVQNEFQKVE 195

Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
               +     Q +P+ I  + F   F I + +AF  +++  ++ +    L    A V   
Sbjct: 196 EEVILRNK--QLFPFPITLDDFFWAFGILRSRAFSRLRNQNLILVPFADLTNHNARVTTE 253

Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
               A+  +  A L + D    L      KAG+ + + +   + N+ + ++YGF++  + 
Sbjct: 254 DH--AHEVRGPAGLFSWDLLFSLRSPLKLKAGDQLFIQYDLNKSNADMALDYGFIEPSSA 311

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
            D   +   ++  D  Y DK  +A+ NG      F +  G+       M+PYLRL  +  
Sbjct: 312 RDAFTLTLEISESDEFYGDKLDIAETNGIGETAYFDIKIGQSLPPT--MIPYLRLVALGG 369

Query: 408 TSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
           T           SV   LG   PVS   E  +   + D  K+ L+GY  T+ EDE ++ +
Sbjct: 370 TDAFLLESIFRNSVWGHLG--LPVSRANEELICKVVRDACKSALSGYHTTIEEDEKLMEE 427

Query: 461 YNLHPKKRVATQLVRMEKKML 481
            NL  + ++A  +   EK++L
Sbjct: 428 GNLSTRLQIAVGIRLGEKRVL 448


>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Oreochromis niloticus]
          Length = 607

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 102/451 (22%), Positives = 182/451 (40%), Gaps = 78/451 (17%)

Query: 74  KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           +E+   +L SW  +NG   C+             +  + + A+ D++A +    +P  ++
Sbjct: 75  REDHFPELMSWAKENG-ASCECFTVANFG-----KEGYGLRATRDIKAEELFLWIPRKML 128

Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
           +T+E    N  +  L + +++ +      LAL+L+ E+     SFWLPYIR L ++    
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----------NELDTVWF---- 236
               + PL + + ++  L G+    ++L + +   R+Y            L +V      
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFA 239

Query: 237 -----MAGSLFQQYPY--------DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
                M G LF Q+           IPTE  +                   +V+LA    
Sbjct: 240 SLTPVMGGKLFDQWAVSSVMTRQNQIPTEDGS-------------------RVTLA---- 276

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           L+PL       +          DD  + V  + YK  E I ++ G + N++ +I+ GF  
Sbjct: 277 LIPLWDMCNHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFYGTRSNAEFVIHNGFFF 336

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 403
           +D+ +DR+ ++  ++  +  Y  K  V  R G  +  VF +H   E    + +L +LR+ 
Sbjct: 337 QDDAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASYVFALHCN-EPPISAQLLAFLRVF 395

Query: 404 ---------YVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
                    Y+     +  + +      PVS   E  +   L       L  Y  T  ED
Sbjct: 396 CMTEDELKYYLLGDRAINKIFTLGNSEFPVSWENEIKLWTFLETRAALLLKTYKTTSEED 455

Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
            +ML   +L    R+A QL   EK++L   L
Sbjct: 456 RSMLEKPDLSLHSRMAIQLRLAEKQILEKAL 486


>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 612

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 89/306 (29%), Positives = 133/306 (43%), Gaps = 42/306 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA--ELLTTNKLSELACLALYLMYEKKQ 170
            AAS DL AG  A ++P+S ++T    L + T       T   L E   + L+L+YEK  
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254

Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
            G +S W P +  L           +   G L + +P  W +E   A L G+P   + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
             E   R++  L    F A  L + +P   PTE +T   F+ A  A  +  + +Q  ++ 
Sbjct: 314 ARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVG 367

Query: 280 RRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAGESIVVW 326
                    PP         A+L        AV      DDA+ L + R  +AGE I V 
Sbjct: 368 GASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVS 427

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR-----NGKLSVQV 381
            G + N++LL+ YGF   DNPYD    +  L+ E PQ + + + A R       KLS+  
Sbjct: 428 YGAKSNAELLLFYGFGVRDNPYD----DVPLSLELPQGEVRDVSALRERVLHRAKLSLSP 483

Query: 382 FHVHAG 387
             V  G
Sbjct: 484 HSVRCG 489


>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 475

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 109/424 (25%), Positives = 176/424 (41%), Gaps = 50/424 (11%)

Query: 83  SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           +W  K G    K      IL +KP           + AS D+Q G++   VP++  V++ 
Sbjct: 46  AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            V    T+ +L ++  L     LAL L+ E+    KS    Y   L    G       +P
Sbjct: 96  NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           LLWSE E   L G+     +       +  + +L         LF   P   P   FT  
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLP 201

Query: 258 IFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
            F  A  AV+S     +   K++LA    LV L     A ++K     + +    Q+ V 
Sbjct: 202 NFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVV 258

Query: 314 --DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
              R  + GE++ +   P + +  +L++YG +D  +P     +   L+  D    DK  +
Sbjct: 259 EATRAIRKGEALGMDYAPGKLDGPVLLDYGVMDTASPKPGYSLTLTLDESDKFVDDKADI 318

Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCM 428
            +  G      + +    +++   +M+ +LRL  +   D   ++S+  +      VS   
Sbjct: 319 VEGAGLRPSMTYSITP--DQQPGEEMMAFLRLMNIKAMDAFLLESIFRN-----EVSEGN 371

Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV-ATQLVRM-EKKMLNACLQ 486
           E AV   LA+  +A LAGYP TL +D A L   +     R  A  LVR+ EK+ L+A  +
Sbjct: 372 EEAVCAMLAEGARAALAGYPTTLDQDLAALRSNSTPLGSRAEAALLVRLGEKESLDAVAR 431

Query: 487 VTAD 490
              D
Sbjct: 432 FFED 435


>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
 gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
          Length = 306

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 66/252 (26%), Positives = 113/252 (44%), Gaps = 25/252 (9%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E    L+ W+  +GLP  ++ + ++    E+      + A ++++ G+    VP SLV+T
Sbjct: 71  ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            +   G   + +++  N + +   +A YL+ E      S W+ YI  L RQ         
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177

Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           S L W+  EL AYL  SP +   ++R   +   YN+L         +F ++P   P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVY 231

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
             E F  +F  + S +V L   S+  R ALVP    +L +S + +  L        +   
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288

Query: 313 VDRPYKAGESIV 324
            DR Y+ G  I+
Sbjct: 289 TDRSYQPGIYIL 300


>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
          Length = 468

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 28  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 80  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132

Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 251

Query: 395 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 252 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 311

Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 312 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 355


>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
          Length = 450

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 155/344 (45%), Gaps = 29/344 (8%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            LA +L+ E+     SFW PYI+ L  +        ++PL + E E+  L  +    ++ 
Sbjct: 10  ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
            + +   R+Y      +F    + Q +P+   +P  ++FT+E ++ A  +V +    +  
Sbjct: 62  SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114

Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
              +R   AL+PL       +          DD  + V  + ++AG+ I ++ G + N++
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 174

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
            +I+ GF  ++N +DR+ ++  ++  D  Y  K  V  R G  +  VF +H   E    +
Sbjct: 175 FVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF-TEPPISA 233

Query: 395 DMLPYLRLGYVSD---------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
            +L +LR+  +++          S +  + +      PVS   E  +   L D     L 
Sbjct: 234 QLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSEFPVSWDNEVKLWTFLEDRASLLLK 293

Query: 446 GYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
            Y  T+ ED+ +L + +L  +  +A +L   EK++L   ++  A
Sbjct: 294 TYKTTIEEDKTVLKNPDLSVRATMAIKLRLGEKEILEKAVKSAA 337


>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
           norvegicus]
          Length = 416

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 79/338 (23%), Positives = 153/338 (45%), Gaps = 30/338 (8%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 12  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 66  LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+  L  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 169

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL       +         
Sbjct: 170 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 229

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
            DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y
Sbjct: 230 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLY 289

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
             K  V  R G  +  VF +H   E    + +L +LR+
Sbjct: 290 AMKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRV 326


>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
 gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
          Length = 421

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 118/288 (40%), Gaps = 46/288 (15%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L SW+   G      +LK  P           + A  D++AG+    V    ++T +R+
Sbjct: 39  ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
                  +LL++  +SE A LAL L++EK+ G+ S W PYI  L R        + S   
Sbjct: 92  --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W + EL  +  S    E + R   I+ E+NE+ +V F          +D  + A      
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDT 198

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
                 V+   V+   + +A   A                          QL  D+ Y A
Sbjct: 199 DNGSTEVEEVKVYSDCLYIALFCA--------------------------QLFADKNYAA 232

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
           GE + +  GP  N+ L +++GF    NP+D++ +   ++  D   ++K
Sbjct: 233 GEQVTISFGPLCNASLALDFGFTVPYNPWDKVQLWLGISRRDSLRKEK 280


>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 492

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 97/430 (22%), Positives = 189/430 (43%), Gaps = 45/430 (10%)

Query: 74  KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           +E+   +L SW+ +NG     V +K     NE     + + A++DL+  +   ++P +++
Sbjct: 81  REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134

Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
           +TL+ V     +  L   N     +  +A LAL+L+ E  + + SFW  YI  L      
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD- 248
                 + L +   +   +  SPT     +  + I R+Y      +F   +LFQ    + 
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQY-----AYF--NNLFQNSNDEA 238

Query: 249 --IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAM 301
             I    FT+++++ A   V +    +   S +         L+PL   +  +++   + 
Sbjct: 239 SLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVENGINGLIPLWD-MCNHTNGYLST 297

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
              VD +  L   +P+K GE ++++ G + NS  L++ GFV ++NP+D   +   ++  D
Sbjct: 298 QYKVDRSECLAC-KPFKKGEQVLIFYGERSNSDFLVHNGFVYDENPHDSFRLRLGISKSD 356

Query: 362 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY--VSDTSEMQSVISSLG 419
             +  +  + +  G      F++++G E     ++L +LR+    V + +  +S  S L 
Sbjct: 357 KLHGLRCELLKDLGIPDSGDFYLYSGSEP-VRENLLAFLRIFNMDVENLNHWKSHSSRLS 415

Query: 420 PI----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
            +    C +   +E  V + L D     L  Y     E E  + D N    +R+    ++
Sbjct: 416 DLMWKDCALDTKIESKVWNFLYDRINLLLKTYKG--DEVEVRVEDSNSTECRRLVRAQLK 473

Query: 476 MEKKMLNACL 485
            EKK L++ L
Sbjct: 474 CEKKFLSSIL 483


>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
 gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase, chloroplastic; AltName:
           Full=Aldolases N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase-like; Short=AtLSMT-L;
           Short=LSMT-like enzyme; Flags: Precursor
 gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
          Length = 482

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 96/394 (24%), Positives = 171/394 (43%), Gaps = 40/394 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  E V  ++ I  L     L     +AL+L+ EK + +
Sbjct: 79  LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L +       + +S + WSE ELA L G+   +  L   E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+ +   T + F  AF  ++S      +    +   L+PL   + 
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
                     AY  K  A L + D    L      KAGE + + +   + N++L ++YGF
Sbjct: 238 HNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGF 296

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
           V+ +   +   +   +   DP + DK  +A+ N       F +  G+   A   ML YLR
Sbjct: 297 VESNPKRNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLR 354

Query: 402 LGYVS--DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
           L  +   D   ++S+ ++   G +  PVS   E  +   + D  K+ L+G+  T+ EDE 
Sbjct: 355 LVALGGPDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEK 414

Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
           +L    L P+  +A ++   EK++L    Q+  D
Sbjct: 415 LLDKGKLEPRLEMALKIRIGEKRVLQQIDQIFKD 448


>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
           74030]
          Length = 483

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 109/440 (24%), Positives = 180/440 (40%), Gaps = 57/440 (12%)

Query: 84  WMHKNGLPPC-KVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL-G 141
           W+ K G+    K+ LK+  S          V A+ D +  +  F +P + V+ +  V  G
Sbjct: 16  WLSKIGVRINPKMTLKDLKSEGRGR----GVVAAADFEEDEVVFCIPRTAVLNVNNVFAG 71

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            ++ A      ++     L   +M E +Q   S W PY+  L ++       ++S + WS
Sbjct: 72  QDSGASKEALLQMPNWLALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWS 123

Query: 202 ETELAYLTGSPTKAEILERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEA 253
           E ELA L  S    +I   +  E   +  + L    F         S+   Y +DIP E 
Sbjct: 124 EEELAELQASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE- 182

Query: 254 FTFEIFKQAFVAVQSCVVHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
              E  KQ     +     L       +K  L+    ++PL   L A + +  A +   +
Sbjct: 183 ---EPAKQENGGAEGETDDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYEN 235

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAA----LNTED 361
           + +++   +P  AGE I    G  P S LL  YG+V E+   YD + + +A    L TE 
Sbjct: 236 EDLEMRTIKPIMAGEEIFNDYGQLPRSDLLRRYGYVTENYAQYDVVEISSASIKSLMTEK 295

Query: 362 PQ---------------YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
           PQ                +++  +A R G L        A  E+ AI D L  L   ++ 
Sbjct: 296 PQEIQSGQFLDPLTSAEAEERVALADREGILEDSYDVNIANAEERAIPDELLALLYLFLL 355

Query: 407 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 466
           D   ++++++S   +   S      V   L    + R A Y  TL EDE +L   NL  +
Sbjct: 356 DNENLEAIVTSQSALPSRSKLATELVGKVLVKVLRHREAEYATTLEEDEKLLQAANLPRR 415

Query: 467 KRVATQLVRMEKKMLNACLQ 486
             +A Q+   EK++L   ++
Sbjct: 416 TAMAIQVRHGEKRVLRLAVE 435


>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           terrestris]
          Length = 484

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 101/438 (23%), Positives = 182/438 (41%), Gaps = 50/438 (11%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           K+ + +G   +W+ +NG       + E P ++        + A  +    +    +P  L
Sbjct: 77  KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130

Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           + ++         L N+ + +L+          LA+ L+ EK + + S W PY+  L   
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                    + L  +  ++  L GSPT    L++   I R+Y      +F    LFQ+  
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229

Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
             +     + FT+E +  A   V  +  ++  +  SL    AL+P+        SK    
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHEDSKITTD 288

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
             A  +  +    R +K  E I +  GP+ NS   ++ GFV  DN  D   +   ++  D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISYGPRTNSDFFVHSGFVYMDNEQDGFKLRLGISKAD 348

Query: 362 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--L 418
           P ++++  +  +    +V  F +  G   E ISD +L +LR+ +     E+   I S  +
Sbjct: 349 PLHKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKEELAHWIQSDRV 405

Query: 419 GPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
             +  +   +E  V + +  +   RL    A YP TL ED  +L +  L   K++A QL 
Sbjct: 406 NDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLR 464

Query: 475 RMEKKMLNACLQVTADMI 492
             EK++L   L+     I
Sbjct: 465 VTEKRILQGALEYVQQWI 482


>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
          Length = 474

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 94/388 (24%), Positives = 171/388 (44%), Gaps = 36/388 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  E V  ++ I  L     L     +AL+L+ EK + +
Sbjct: 79  LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  Y+  L +       + +S + WSE ELA L G+   +  L   E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+ +   T + F  AF      +++ + ++    F  +   P + 
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIK 235

Query: 292 ---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
               AY  K  A L + D    L      KAGE + + +   + N++L ++YGFV+ +  
Sbjct: 236 TEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFVESNPK 294

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS- 406
            +   +   +   DP + DK  +A+ N       F +  G+   A   ML YLRL  +  
Sbjct: 295 RNSYTLTIEIPESDPFFGDKLDIAESNKMGETGYFDIVDGQTLPA--GMLQYLRLVALGG 352

Query: 407 -DTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
            D   ++S+ ++   G +  PVS   E  +   + D  K+ L+G+  T+ EDE +L    
Sbjct: 353 PDAFLLESIFNNTIWGHLELPVSRTNEELICRVVRDACKSALSGFDTTIEEDEKLLDKGK 412

Query: 463 LHPKKRVATQLVRMEKKMLNACLQVTAD 490
           L P+  +A ++   EK++L    Q+  D
Sbjct: 413 LEPRLEMALKIRIGEKRVLQQIDQIFKD 440


>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 164/391 (41%), Gaps = 40/391 (10%)

Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
           P+  + A  +L  G+    VP  L +  + V  +       +   L     ++L ++ E 
Sbjct: 39  PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +G  S W PY+  L RQ        +S + WSE EL  + G+   +  +   E ++ E+
Sbjct: 99  ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           + ++     AG +      D+     TF+ F  AF  ++S V    +     + AL+P  
Sbjct: 152 DNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFA 201

Query: 289 PPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLI 337
             +            +  K K  L   D    L      K+GE I V +   + N++L +
Sbjct: 202 DLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELAL 260

Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
           +YGF + ++  D   +   ++  DP Y+DK  +A+ NG      F V  G  +     M+
Sbjct: 261 DYGFTESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQMI 318

Query: 398 PYLRLGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 450
            YLRL  +  T            V   L    PVS   E ++   + +  K+ LA Y  T
Sbjct: 319 TYLRLLCLGGTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHTT 376

Query: 451 LSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
           + EDE +L   +L  + ++A ++   EKK+L
Sbjct: 377 IEEDEELLEREDLQSRHQIAVEVRVGEKKVL 407


>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
 gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
          Length = 403

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 132/306 (43%), Gaps = 41/306 (13%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEK----HRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           +  SW+ + G     + +    S + +    HRP         + AG+       +LV+T
Sbjct: 41  EFMSWLRRRGEDMNSIAVAIGMSKHGRALFAHRP---------MCAGECMIKFSQNLVLT 91

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            E+ L  E IA L   N+ + ++   L +M EK++G+ S W PYI  L      G+  + 
Sbjct: 92  PEK-LPCEVIALLDQANEFTRVS---LLVMAEKRKGQNSAWAPYIECLP---SFGE--IH 142

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
           S + W   ELA L  SP      ER   ++ EY E+  V     S    Y  D+     +
Sbjct: 143 STIFWDPKELACLECSPIHRGTGERNALLQSEYREVKKV---VESCPHLYDPDV-----S 194

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV-D 314
            E FK  +  V S        S      ++PL      +  + + + +  DD   +VV  
Sbjct: 195 LEQFKHEYATVSSRAWGQGPHS---DMTMIPL-VDFANHDPRSRTLFSHADDNCTVVVAS 250

Query: 315 RPYKAGES-----IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK-R 368
           R Y+ G+      + +  G   N+ L ++YGFV  DNP+D   +   + +EDP  + K +
Sbjct: 251 RDYQTGDENFHLKVHICYGDHSNAVLALDYGFVVPDNPFDEAEIFLEIPSEDPLREIKLQ 310

Query: 369 MVAQRN 374
            +AQ N
Sbjct: 311 YMAQNN 316


>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
 gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
          Length = 336

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/275 (28%), Positives = 120/275 (43%), Gaps = 37/275 (13%)

Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
           HRP         + AG+        LV+T E+ L  E IA L   N+ + ++   L +M 
Sbjct: 72  HRP---------MCAGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
           EK++G+ S W PYI  L      G+  + S + W   ELA L  SP      ER   ++ 
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
           EY E+  V     S    Y  D+     + E FK  +  V S        S      ++P
Sbjct: 174 EYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTMIP 222

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLINYG 340
           L      +  + + + +  DD   +VV  R Y+ G+      + +  G   N+ L ++YG
Sbjct: 223 L-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALDYG 281

Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDK-RMVAQRN 374
           FV  DNP+D   +   + +EDP  + K + +AQ N
Sbjct: 282 FVVPDNPFDEAEIFLEIPSEDPLREIKLQYMAQNN 316


>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 567

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 116/444 (26%), Positives = 172/444 (38%), Gaps = 87/444 (19%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
           A  D+QAG+  F VP  L  T +   R      + EL    +   LA L LY        
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179

Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKA 215
            +  +G  SFW PY+  L       +G    ES    PL    LW+E E+ +L GSPT  
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQ 274
                   ++REY E          L++++P+    E AF  E F  AF  + S  V L 
Sbjct: 240 SARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLP 293

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VD 306
             +     ALVP    L  +S+ C + + A                             D
Sbjct: 294 AENGM--LALVPYAD-LANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSD 350

Query: 307 DA------------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
           DA                  V    DR Y   E + V  G + N++LL+ YGFV + NPY
Sbjct: 351 DAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFVSDRNPY 410

Query: 349 DRLVVEAALNTEDPQ----YQDKRMVAQRNGK--LSVQVFHVHAGREKEAISDMLPYLRL 402
           + + V  +L+  +         KR      G+     + F ++A R    +  +L +  L
Sbjct: 411 NSVEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELMQLLRFASL 470

Query: 403 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
                 +   S +  +    PV+   E A    L    K  L  YP +  ED+A L D +
Sbjct: 471 --TEQDAAGYSDLEQIDVAQPVNRENEIAAKSALLQACKIALQAYPTSADEDDAALKDKS 528

Query: 463 ----LHPKKRVATQLVRMEKKMLN 482
               L  K+R++ +L R EK++L 
Sbjct: 529 MAQLLSRKQRLSVRLRRSEKRILE 552


>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
           atroviride IMI 206040]
          Length = 956

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 97/449 (21%), Positives = 175/449 (38%), Gaps = 84/449 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
           + A +D+ A    F+VP S +V +E            L  +T  E+    +    + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RA 221
            L+YE  +G +S W PY+  L         + E+P+ WS+ E+  L  S T+++I +  A
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNA 643

Query: 222 EGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
           E +                     +   EL  +    GS    Y +D          F+ 
Sbjct: 644 EEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQN 693

Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
                +       +   A+    +VP+   +L   ++  A +   DDA+ +   R  KAG
Sbjct: 694 EDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAG 752

Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDN--------PY----DRLVVEAALNTED------- 361
           E I+ + GP PNS+LL  YG+V   +        P+    D L     L++E        
Sbjct: 753 EEILNYYGPHPNSELLRRYGYVTPKHSRYDVVELPWKMIEDALAANLGLSSEQLDSAREH 812

Query: 362 ---PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL 418
               ++++  ++ + + + +      +  +  E   D+   L+         M   I  +
Sbjct: 813 LDLDEFEETFVLERESDEPNPDGTFANPAKFSEIPEDLREQLK--------SMLKAIRKV 864

Query: 419 GPICPVSPCMERAVLDQ-LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
            P C V       V    L     A  + YP T+ EDE +L+  NL  +++ A  +   E
Sbjct: 865 DPSCIVDKRKRDEVQHTVLITALDALTSQYPTTIIEDELILSGSNLSERRKAAVTVRLGE 924

Query: 478 KKMLNACLQVTADMIMLLPDVTVSPCPAP 506
           K++L     + ++   +  D  +   PAP
Sbjct: 925 KRLLQEARVLLSE---IASDAILDDAPAP 950


>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
 gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
          Length = 425

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 72/277 (25%), Positives = 117/277 (42%), Gaps = 35/277 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
           + A+E+++ G++   +P S ++T+ER +    +        L E + LA +L  +     
Sbjct: 24  LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
                S +  Y+R L R+ G         L W E ++   L GSP++   +ER   +   
Sbjct: 82  AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
            +E+             +P   P         + AF  + S ++ L     A   ALVP 
Sbjct: 135 IDEIRA----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPW 176

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 345
              +L +   C A +     AV L  DR YK GE +    GP+P+S+LLI+YGF     +
Sbjct: 177 AD-MLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGE 235

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           NP D   V   ++  D     K    +R G   V+ F
Sbjct: 236 NPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAF 272


>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 89/407 (21%), Positives = 171/407 (42%), Gaps = 51/407 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
           V A + + A +    +P S ++TLE +    T+A+ +   +L  L+     L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +   SFW PYI  L             P+ ++ ++L +L GSP   +I ++   ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYP------SFPIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           N++  V       F QY          F  F  A +   S +  +  ++  +  A VPL 
Sbjct: 277 NDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLA 322

Query: 289 -------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
                  P L +  YS + +  +   D+ ++         G+ I    G + NS+  +NY
Sbjct: 323 DMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRKCNSRFFLNY 374

Query: 340 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
           GFV E N  + + +    +  DP  Q K    + + +   + F +    ++ A+ D + +
Sbjct: 375 GFVVEGNDANEVNLAVEADQNDPLLQLKEQAIKESLQWP-KNFKLLMDTDETAVIDFMSH 433

Query: 400 LRLGYVSDTSEMQSVISSLG-------PICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
           +R   + D ++++ +++             P+    E  +   +    K  L  YP T  
Sbjct: 434 IRFLVIRDEAQLKLLLNQKNSQNFKSTKTQPLGIYNELEMWKMIGRICKKTLKQYPTTFE 493

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 499
           +D+ +L    L   +R    L   EK++L    Q +  M  LL +  
Sbjct: 494 QDQEILQICELTTNQRNCLILRMGEKEILKFYFQFSERMKELLSNFN 540


>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
          Length = 430

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 97/409 (23%), Positives = 165/409 (40%), Gaps = 36/409 (8%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           H V A   L +G     VP  L +  E    ++    L    ++ +   LAL+LM+E+ +
Sbjct: 35  HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           G +SF+ P+I  +           + P+ W+E EL  L G+          + ++R++  
Sbjct: 95  GGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHLERDFEN 147

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           +        + F      +PT   T + +  A   + S       VS   ++  V L P 
Sbjct: 148 IHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSKGGKYLHV-LCPA 198

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPY---------KAGESIVVWCGPQPNSKLLINYGF 341
           +  ++     +   +DD V    ++            AG ++ +  G   N+KLL +YGF
Sbjct: 199 MDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQYSNAKLLYSYGF 257

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYL 400
           V  +N    +     +   DP ++ K+ V   N     Q +  H       + + +L  L
Sbjct: 258 VSPENFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDFHGTLLSNDVDERLLATL 317

Query: 401 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
           R+  +++    Q   +    I  V    E AV + L    + +L+ Y  TL EDEA+L +
Sbjct: 318 RVILMNEQEIRQYKKAFESSILSVRN--ELAVYENLQSTCRRKLSNYATTLEEDEAILAE 375

Query: 461 YNLHPKKRVATQL-VRMEKKMLNACLQVTADMIMLLPDVTVSPCPAPYA 508
                K R+A  + VRME K      QVT  +I  L     S    P A
Sbjct: 376 TETESKPRLAFAVRVRMEDK------QVTTSVIETLEQWKQSLASKPDA 418


>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
          Length = 577

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 99/438 (22%), Positives = 197/438 (44%), Gaps = 56/438 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            KSW+ ++G+    + ++E  S  E       V A +D++      ++P   ++T E   
Sbjct: 83  FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136

Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            +  +A L+  N++  +    CLALYL  E+     S + PYI  + ++         + 
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---F 254
           L +   E+ YL G+   +  + + + I R++  L  V+   GS  ++    +P +A   F
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAF 246

Query: 255 TFEIFKQAFVAVQS----CVVHLQKV----SLARRFALVPL--------GPPLLAYSSKC 298
           TF+ ++    AV +       H+  V          AL+P+        GP   AY+   
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--- 303

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
                A+   ++ +  + +K GE + +  G + NS LLI+ GFV +++P+D++ +   ++
Sbjct: 304 -----ALTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHLGVS 358

Query: 359 TEDPQYQDK-RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT------SEM 411
            +DP Y  K +++ + N ++S Q               +L +LR+ ++++        + 
Sbjct: 359 QKDPLYSLKAKLLEKLNVEVSGQFAVCSMDNSLPTSPQLLVFLRVFHMNEEELRSWLEKQ 418

Query: 412 QSVISSLGPI---CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR 468
           ++ +SSL  I     V    +  V + L +  K  L G+       E M+ D +L  + +
Sbjct: 419 KNELSSLREIYISGEVKFKSDVKVWEFLENRVKLLLMGFKKIGDNIEEMMEDKSLTHRSK 478

Query: 469 VATQLVRMEKKMLNACLQ 486
           +A Q    E ++L+AC+ 
Sbjct: 479 LALQFRIEEHRILSACVN 496


>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
          Length = 605

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 91/408 (22%), Positives = 172/408 (42%), Gaps = 51/408 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
           V A + + + +    VP S ++TLE  +  +T +A+ +   +L  L+     L+ +L+ E
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE--MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQE 247

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           KK  + SFW PY+  L +           P+ +++++L +L GSP   ++ ++   +K++
Sbjct: 248 KK-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKD 300

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           Y ++  V   A    Q           +F+ F  A +   S +  +  +   +  A VPL
Sbjct: 301 YCDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAFVPL 346

Query: 288 GPPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
              L           YS + +  +   D+ ++         G+ I    G + NS+ L+N
Sbjct: 347 ADMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSKCNSRFLLN 398

Query: 339 YGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP 398
           YGFV +DN  + + V    +      Q K  +++   +   + F +       + SD + 
Sbjct: 399 YGFVVDDNNANEVNVMVEPDGTISLIQLKEGLSRETLQFP-KSFRLVIDPNDVSFSDFMS 457

Query: 399 YLRLGYVSDTSEMQSVISSLGPICP-----VSPCMERAVLDQLADYFKARLAGYPATLSE 453
           ++R   + +  E  +++     I P     +S   E A  + + +     L  YP TL +
Sbjct: 458 FIRFILIQEEKEFANLLGKNSYIKPTKIHFISIQNELATWNLIENICIRALNQYPTTLEQ 517

Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVS 501
           D  +L    L   +R    L   EKK+LN   Q +  M  L  +   S
Sbjct: 518 DLEILKICELTTNQRNCLILRMGEKKILNFYKQFSEKMRQLFSNFDFS 565


>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1118

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 95/427 (22%), Positives = 172/427 (40%), Gaps = 68/427 (15%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
            + A  D+ A    F++P   ++  +     E + EL  +         L   + L L +M
Sbjct: 694  IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753

Query: 166  YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
            YE   G +S W PYI  L        L  ++P+ WSE EL+YL  S T  +I +  AE +
Sbjct: 754  YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 806

Query: 225  KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
             R                       +L  +    GS    Y +D+  E    +     +V
Sbjct: 807  FRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGWV 866

Query: 265  AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
              +     +  V++A           +L   ++  A +   D+ + +   R  KAGE I+
Sbjct: 867  EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 916

Query: 325  VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
             + GP PNS+LL  YG++ E +        P+D   V+ +L +E    QD  ++ +   K
Sbjct: 917  NYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQD--IMTETMDK 972

Query: 377  L------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPVS 425
            +       + V    +G      +   P +  G   D  E +++ I  L    G +    
Sbjct: 973  MDQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKVDGNLISDK 1032

Query: 426  PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
               +  +   + +  +   + Y  T++EDE +L   +L  ++R+A ++   EKK+L    
Sbjct: 1033 RKRDDILRSTMVETLRLIASRYSTTIAEDEILLAQDSLTRRQRMAVRVRLGEKKLLQEAF 1092

Query: 486  QVTADMI 492
               ++M+
Sbjct: 1093 DHFSEMV 1099


>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 453

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 93/382 (24%), Positives = 160/382 (41%), Gaps = 40/382 (10%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           +L  G+    VP  L +  + V  +       +   L     ++L ++ E  +G  S W 
Sbjct: 52  NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L RQ        +S + WSE EL  + G+   +  +   E ++ E++ ++     
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE----- 159

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL------ 291
           AG +      D+     TF+ F  AF  ++S V    +     + AL+P    +      
Sbjct: 160 AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFADLINHDGDI 214

Query: 292 ----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDN 346
                 +  K K  L   D    L      K+GE I V +   + N++L ++YGF + ++
Sbjct: 215 TSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNS 273

Query: 347 PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
             D   +   ++  DP Y+DK  +A+ NG      F V  G  +     M+ YLRL  + 
Sbjct: 274 SRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLG--ESLPPQMITYLRLLCLG 331

Query: 407 DTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
            T            V   L    PVS   E ++   + +  K+ LA Y  T+ EDE +L 
Sbjct: 332 GTDAFLLEALFRNKVWEHLE--LPVSRDNEESICQVIQNACKSALAAYHTTIEEDEELLE 389

Query: 460 DYNLHPKKRVATQLVRMEKKML 481
             +L  + ++A ++   EKK+L
Sbjct: 390 REDLQSRHQIAVEVRVGEKKVL 411


>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
          Length = 449

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 97/426 (22%), Positives = 179/426 (42%), Gaps = 33/426 (7%)

Query: 77  DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           D  D+  W   NG    KV+L++    + +  PI  + A ED++AG+   S+P +L+   
Sbjct: 25  DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77

Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
            RV  +  +  ++    +  +  + LYL+ E+     S W P+++ L  +          
Sbjct: 78  TRVSDHSPVVHMIENTTIGRITAICLYLISERADSS-SHWKPWLQSLPPRFFHA------ 130

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIP 250
            L +SE ++ +   S  K     + + +++EY +           F         P ++ 
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVT 189

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVD- 306
            E FT+E F+ A+  V +  +    +    R   VPL   GP   ++      +  + D 
Sbjct: 190 REDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDA 249

Query: 307 DAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
              + V    +K  ++  I +  G   N +LL N GF+ ++N  + ++++  L+     +
Sbjct: 250 QEHRCVFSALHKVAKNSPISIGVGMSSNMELLANRGFMMQNNGNNFVLMKFQLDRNSDMH 309

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 424
              R    +   LS  + +V   R  E    +L  LR+  +S   E  S   +L    PV
Sbjct: 310 ASARESMMKQLNLSNPMTYV--VRYGEMPQGLLASLRIQSLSPV-EFGSYGKALA--TPV 364

Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 484
           +   E      L     + LA YP T+ EDE +LT        R A  L+R E+K++   
Sbjct: 365 TLENEWRAYRLLISSCNSILAMYPTTIEEDEIVLTQTKTSRHLRAAV-LLRREEKLIYES 423

Query: 485 LQVTAD 490
           ++  A+
Sbjct: 424 IKTWAN 429


>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
          Length = 428

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/282 (24%), Positives = 122/282 (43%), Gaps = 28/282 (9%)

Query: 81  LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           L +WM  NG     K+ L++ P           V A E L  G+    +P SL+++    
Sbjct: 31  LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84

Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
           L  +++     T    KL+ +  L L+++ +K  G+ S W P++  L R          +
Sbjct: 85  L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           P+    T    L     + E+  R   I+R + +L  +  + G + ++      +  FT+
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTW 192

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
             F  A+ AV +  +  Q  + +  +     AL P     L +  K     A V +  ++
Sbjct: 193 NNFVWAWTAVNTRCIFAQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEI 251

Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           +  + + A E + +  GP  N +L ++YGFV  DNP D +VV
Sbjct: 252 LSHKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 293


>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
 gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
          Length = 464

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 78/382 (20%), Positives = 170/382 (44%), Gaps = 40/382 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           + A + +Q G+    +P +L+++++R  + NE        + L+E   L ++L+ +   G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
            KS W  Y   L R+       +     W+  ++ +L GS T    L   E IK ++  L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
           +   F    L  +YP  I    F    ++ A   + S  + LQ +   ++ +LVP     
Sbjct: 212 EKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADFM 262

Query: 290 ---PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
              P        K +  + +  + +  D+ Y   + I    G + N +LL+ YGF+ E N
Sbjct: 263 NHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELLLLYGFILERN 322

Query: 347 PYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLG 403
           P+D + +  +L+ +D  ++ K+      ++  +++  +F+    +E      +  +LR  
Sbjct: 323 PFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKE------LYEFLRFC 376

Query: 404 YVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AMLT 459
            +S+  E+ S  +S        +  +E+ +   +    +  L  Y   +SE++   ++ +
Sbjct: 377 -ISNQEELGSTDLSDFNFNDENNYEIEKIIRKLVLFSCEKLLKNYSKKVSEEKILNSLNS 435

Query: 460 DYNLHPKKRVATQLVRMEKKML 481
           ++ +   +++A +  + EKK++
Sbjct: 436 NFLISKNQKMALKQSKCEKKII 457


>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
          Length = 106

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)

Query: 42  SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
           S +RL  R   F + V S+DTL A S      ++ +  KEE++ DLKSWMH++GLPPC+V
Sbjct: 32  SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90

Query: 96  ILKEKPSHNEKHRPI 110
           +LKE+PS + KH+PI
Sbjct: 91  MLKERPSPDGKHKPI 105


>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           impatiens]
          Length = 484

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 100/438 (22%), Positives = 181/438 (41%), Gaps = 50/438 (11%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           K+ + +G   +W+ +NG       + E P ++        + A  +    +    +P  L
Sbjct: 77  KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130

Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           + ++         L N+ + +L+          LA+ L+ EK + + S W PY+  L   
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
                    + L  +  ++  L GSPT    L++   I R+Y      +F    LFQ+  
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229

Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
             +     + FT+E +  A   V  +  ++  +  SL    AL+P+       +SK    
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHENSKITTD 288

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
             A  +  +    R +K  E I +  G + NS   ++ GFV  DN  D   +   ++  D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISYGARTNSDFFVHSGFVYMDNEQDGFKLRLGISKAD 348

Query: 362 PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--L 418
           P  +++  +  +    +V  F +  G   E ISD +L +LR+ +     E+   I S  +
Sbjct: 349 PLQKERVELLNKLDLPAVGEFLLKPG--TEPISDTLLAFLRV-FSMRKEELAHWIQSDRV 405

Query: 419 GPICPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLV 474
             +  +   +E  V + +  +   RL    A YP TL ED  +L +  L   K++A QL 
Sbjct: 406 NDLKHMDCALETVVEENVKKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPRIKKLAIQLR 464

Query: 475 RMEKKMLNACLQVTADMI 492
             EK++L   L+     I
Sbjct: 465 VTEKRILQGALEYVQQWI 482


>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 524

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/310 (26%), Positives = 125/310 (40%), Gaps = 24/310 (7%)

Query: 52  RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
           RF  R   S+         VS     +  L++W+ K     C + L+  P          
Sbjct: 49  RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
             A    +  G     VP   ++T E     + +  L+  + L+    + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNE 230
           + SFW PYI  L ++    +L    P+LWS+     +L GSP   ++ ER   I RE  E
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223

Query: 231 LDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------AR 280
              +   AG L     +  P  I   A     +    +  +S  ++LQ   L        
Sbjct: 224 A-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLRPGSFAED 279

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPN-SKLL 336
             ALVP    L   SS  +      D       L   R Y  GE +    GP  + S+LL
Sbjct: 280 TIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQVFDSYGPSCSPSRLL 339

Query: 337 INYGFVDEDN 346
           ++YGFVDE+N
Sbjct: 340 LDYGFVDEEN 349


>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
 gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 470

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 102/409 (24%), Positives = 169/409 (41%), Gaps = 57/409 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE----LACLALYLMYEK 168
           V A+ D+   +  FS+P + V      L  + +A L  + +L E       L   LM E 
Sbjct: 42  VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
            Q + S W PY+  L  +       ++S + WS++ELA L  S    +I ++        
Sbjct: 96  -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147

Query: 222 ----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQ 274
               +G+K    E+        S+   Y +DIP  +       +   A    V      +
Sbjct: 148 YIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDE 204

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
           K  L+    ++PL   L A + +  A L   ++ +++   +P   GE I    G  P S 
Sbjct: 205 KTILS----MIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSD 260

Query: 335 LLINYGFV-DEDNPYD------RLVVEAALN----------TEDPQYQDKRMVAQRNGKL 377
           LL  YG+V D  + YD       L+V    N          T+D   + +  +A+R G  
Sbjct: 261 LLRRYGYVTDGYSAYDVAEISAELIVSLFRNGKVHPSLHKLTQD-GLKTRLELAEREGVY 319

Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
                 VH+  ++ +I D L       + D S ++++++S   I   S          LA
Sbjct: 320 EDSFDLVHSSPDEPSIPDELLAFLYLLLVDESHLKAILNSESSIPSRSKLTTELAGQVLA 379

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
              +AR   Y  TL EDE +L + +L  +  +A Q+   EKK+L A +Q
Sbjct: 380 TLLQAREKEYSTTLEEDEDLLKNADLPVRHAMAIQVRSGEKKVLRAAMQ 428


>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Chroomonas mesostigmatica
           CCMP1168]
          Length = 464

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 81/378 (21%), Positives = 160/378 (42%), Gaps = 36/378 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A   +Q G+    +P +L+  L++ L  E  +E L+   L+E   LA+  + E+  G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W  Y   L +++    +       W  +++ +L GS          E IK ++  ++  
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP----- 289
            F    L        P + F  + ++ A   + S  + LQ +   ++ ALVP        
Sbjct: 210 IFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVPYADFINHN 260

Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           P        K +  + ++ + +  D+ Y   + I    G + N +LL+ YGF+ E NP+D
Sbjct: 261 PFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGFIIERNPFD 320

Query: 350 RLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
            + +  AL+T+D  Y  K       ++  +++  VF+    +E      +  ++RL    
Sbjct: 321 SIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKE------LYEFMRLCLSG 374

Query: 407 DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN---L 463
                    S+L      +  +E+ +   +    K  L  Y  T++E++ +    N   L
Sbjct: 375 PRDFFGEEFSNLNFTDEENFNLEKIIRKTVIFACKKNLKAYNKTINEEKILNNLSNIIVL 434

Query: 464 HPKKRVATQLVRMEKKML 481
              ++ + +  + EKK+L
Sbjct: 435 TKNQKTSIKQRKCEKKIL 452


>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
 gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
          Length = 447

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/280 (26%), Positives = 117/280 (41%), Gaps = 32/280 (11%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           EE+   L  W  +NG+    V  K +P+  +   R +    A E + + +   SVP  L+
Sbjct: 47  EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99

Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
           +T    LE  +GN     +    K S    L L+LMYEK   K SFW PYIR L      
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
                 +P  ++  EL +L     + +  E+   IK+ Y      +     + Q +  + 
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF 208

Query: 250 PTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AML 302
                 FE FK A+  V +  V H +    A+      AL PL   LL +  K +     
Sbjct: 209 -WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRF 266

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
            +     ++ V   Y+ G  + +  GP  N++L + YGFV
Sbjct: 267 NSSSKNYEINVITEYQKGTQVFINYGPHDNTRLFLEYGFV 306


>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 431

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/257 (28%), Positives = 105/257 (40%), Gaps = 43/257 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
           V A+ D+  G+   SVP SLVV             + R+L  E        N L      
Sbjct: 59  VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
              L+   K    S W  YI  L            + L +S+ EL+YL GS       +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162

Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
              I+ +Y+ + T       LF  YP     E F+ + +K A   +  +S VV   K  L
Sbjct: 163 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGL 216

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKL 335
                +  + P     + + K  + AVD  +      P K GE I V  G      N++L
Sbjct: 217 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQL 272

Query: 336 LINYGFVDEDNPYDRLV 352
           L++YGFV E+NP+D +V
Sbjct: 273 LMDYGFVLENNPHDAVV 289


>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 167/394 (42%), Gaps = 40/394 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           +A +++L   +    VP    +  + V  +E        + L     +AL+L+ E  +G 
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y+  L ++        +S + WSE ELA + G+   +  L   E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+     T + F  AF  ++S      +    +   L+P    + 
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
                     A+  K  A L + D    L      KAG+ + + +   + N+ L ++YGF
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGF 318

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
           +++ +  +   +   +   D  + DK  +A+ NG      F +    E+     MLP+LR
Sbjct: 319 IEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLR 376

Query: 402 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
           L  +  T    ++S+   S  G +  PVS   E  +   + +  +A L+GY  T+ EDE 
Sbjct: 377 LLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK 436

Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
            L + NL  + R+A  +   EK++L   +Q+  D
Sbjct: 437 -LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469


>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 167/394 (42%), Gaps = 40/394 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           +A +++L   +    VP    +  + V  +E        + L     +AL+L+ E  +G 
Sbjct: 100 LATTKNLSKNEVVLEVPKRFWINPDAVADSEIGN---VCSGLKPWISVALFLIRENLKGD 156

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y+  L ++        +S + WSE ELA + G+   +  L   E +K E+ +++
Sbjct: 157 -SRWRRYLDILPQE-------TDSTVFWSEEELAEIQGTQLLSTTLNVKEYVKSEFLKVE 208

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
               +          D+     T + F  AF  ++S      +    +   L+P    + 
Sbjct: 209 EEILLRHK-------DLFPSRITLDDFFWAFGILRSRAFSRLR---GQNLVLIPFADLVN 258

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
                     A+  K  A L + D    L      KAG+ + + +   + N+ L ++YGF
Sbjct: 259 HSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQVFIQYDLKKSNADLALDYGF 318

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
           +++ +  +   +   +   D  + DK  +A+ NG      F +    E+     MLP+LR
Sbjct: 319 IEQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTAYFDIIL--ERPFPPAMLPFLR 376

Query: 402 LGYVSDTSE--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
           L  +  T    ++S+   S  G +  PVS   E  +   + +  +A L+GY  T+ EDE 
Sbjct: 377 LLALGGTDAFLLESLFRNSVWGHLEMPVSRANEELICQVVRNACEAALSGYHTTIEEDEK 436

Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTAD 490
            L + NL  + R+A  +   EK++L   +Q+  D
Sbjct: 437 -LKEENLDSRLRIAVGIREGEKRVLQQIIQIFKD 469


>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 478

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 96/433 (22%), Positives = 172/433 (39%), Gaps = 82/433 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
           + A++D+      F++P   ++ +E             GN+   E +    L     L L
Sbjct: 44  IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
            ++YE  QG  S W  Y   L  +        ++ + W   +L YL GS   ++I +   
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156

Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR- 281
                   L  +   AG  F Q     P+E+   ++  +    + +    L+      + 
Sbjct: 157 DEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQE 215

Query: 282 -------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
                          +VP+   +L   ++  A +   +D + +   RP KAGE I+ + G
Sbjct: 216 DEEWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYYG 274

Query: 329 PQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDK--RMVAQRNGKLS 378
           P PNS+LL  YG+V   +        P+D  +V++ L TE  +  D+  + VA+      
Sbjct: 275 PHPNSELLRRYGYVTPKHSRYDVVEIPWD--LVQSTL-TEQLRLTDEVWKQVAEHVDPED 331

Query: 379 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL---------------GPICP 423
           ++   V      E  S+       G++   +++Q V + L               G + P
Sbjct: 332 LEDVFVLERESGEPDSE-------GHLQTPAKVQEVSAELEEQLKDVLKAIKKVRGDLIP 384

Query: 424 -------VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
                  V  C+  + L +L       LA YP T  EDEAML   N+  ++++A ++   
Sbjct: 385 DKRKRDEVYQCVVVSTLQKL-------LAQYPTTAEEDEAMLASGNVTSRQKLAVEVRLG 437

Query: 477 EKKMLNACLQVTA 489
           EK+++   LQV  
Sbjct: 438 EKRLIKEALQVAG 450


>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
 gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
          Length = 484

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 107/401 (26%), Positives = 177/401 (44%), Gaps = 36/401 (8%)

Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           +  RP+  + AS D Q GD  FSVP+S  ++ E V     + +L     L     +AL L
Sbjct: 65  QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           + ++    KS    Y   +          +++PLLWSE EL  L G+    ++L+   G 
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARR 281
              +    T   +   LF   P   P   FT   F  A  AV+S     +   K++LA  
Sbjct: 171 LTFFRS--TFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALA-- 226

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGP-QPNSKLLI 337
             L  L     A +SK     A +    Q++V    R  + GE + +  GP + +  +L+
Sbjct: 227 -PLTELVSHRRAANSKLSVRSAGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLV 285

Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
           +YG +D  +P     +   +   D    DK  + + N      V+++    +++   +ML
Sbjct: 286 DYGVMDVTSPKPGYSLTLKMPDSDRFIDDKLDILESNDLPQSVVYNLTP--DEQPTIEML 343

Query: 398 PYLRLGYV--SDTSEMQSVISS--LGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLS 452
            +LRL  +  SD   ++S+  +   G +  PVS   E AV + L++  +A L GY  T+ 
Sbjct: 344 AFLRLMQLKGSDAFLLESIFRNDVWGFMQEPVSEGNEEAVCNTLSEGARAALGGYGTTID 403

Query: 453 EDEAMLTDYNLHPK--KRVATQLVRM-EKKMLNACLQVTAD 490
           +D A L       K  +R A  L+R+ EK+ L+A  +   D
Sbjct: 404 QDLAELRAQGSRAKGSRREAALLIRLGEKEALDAVARFFED 444


>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
           florea]
          Length = 483

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 95/435 (21%), Positives = 181/435 (41%), Gaps = 42/435 (9%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
           SK+ + +G   +W+ +NG       + E P ++        + A  +    +    +P  
Sbjct: 75  SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128

Query: 132 LVVTLERVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
           L+ ++     +    EL+T         + ++A LA+ L+ E+ + + S W PY+  L  
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181

Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
                     + L  +  ++  L GSPT    L++   I R+Y+  + V+    +     
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234

Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAA 304
             D+    FT+E +  A   V +    +     +R   AL+P+       + +      A
Sbjct: 235 LRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNA 290

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
             +  +    R +K GE I +  GP+ NS   ++ GFV  +N  D   +   ++  D   
Sbjct: 291 TSNYCECYALRDFKKGEQIFISYGPRTNSDFFVHSGFVYMENKQDGFKLRLGISKADSLQ 350

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLRLGYVSDTSEMQSVISS--LGPI 421
           +++  +  +    +V  F +  G   E ISD +L +LR+ +    +E+   I S  +  +
Sbjct: 351 KERIELLNKLDLPTVGEFLLKLG--TEPISDLLLAFLRV-FSMRKAELAHWIRSDRVNDL 407

Query: 422 CPVSPCMERAVLDQLADYFKARL----AGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
             +   +E  V + +  +   RL    A YP TL ED  +L +  L   K++  QL   E
Sbjct: 408 KHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDLQLL-ETTLPQIKKLTIQLRVTE 466

Query: 478 KKMLNACLQVTADMI 492
           K++L   L+     I
Sbjct: 467 KRILQGALEYVEQWI 481


>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 451

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 64/261 (24%), Positives = 117/261 (44%), Gaps = 25/261 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP SL +T + +       +   +  +  +A LA  L+  K  G+ S
Sbjct: 66  ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  Q       + + + W+E+EL  +  S    E + +   I++++ E+  V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                  FQ  P+      FT++ F  A   V S     +     +  +L+P     L +
Sbjct: 178 -------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSLIPFAD-FLNH 222

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
               +A++ + DD    ++  DR Y  GE +++  G   N+ L++++GF    N YD++ 
Sbjct: 223 DGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQVQ 282

Query: 353 VEAALNTEDPQYQDKRMVAQR 373
           ++  +   DP    K  + Q+
Sbjct: 283 IQYDIPKYDPLRHTKLELLQQ 303


>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 273

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/254 (28%), Positives = 104/254 (40%), Gaps = 43/254 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
           V A+ D+ AG+   SVP SLVV             + R+L  E        N L      
Sbjct: 45  VVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENAL------ 98

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
              L+   K    S W  YI  L            + L +S+ EL+YL GS       +R
Sbjct: 99  ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHYFARQR 148

Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
              I+ +Y+ + T       LF  YP     E F+ + +K A   +  +S VV   K  L
Sbjct: 149 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKSGL 202

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKL 335
                +  + P     + + K  + AVD  +      P K GE I V  G      N++L
Sbjct: 203 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQL 258

Query: 336 LINYGFVDEDNPYD 349
           L++YGFV E+NP+D
Sbjct: 259 LMDYGFVLENNPHD 272


>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
 gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
          Length = 489

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 163/388 (42%), Gaps = 40/388 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI-AELLTTNKLSELACLALYLMYEKKQG 171
           + A+ DL  G+    VP  L +  + V  ++   A       L     +AL L+ E  +G
Sbjct: 83  LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGGLRPWVAVALLLLSEVARG 142

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--N 229
             S W PY+  L RQ              S    A L  S  + ++L    G+K EY  +
Sbjct: 143 ADSPWAPYLAILPRQTD------------STIFCAGLKKSSLRYKLLSTTVGVK-EYVQS 189

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVP 286
           E D+V     S  +    D+   + TF+ F  AF  ++S V   +   K++L     LV 
Sbjct: 190 EFDSVQAEIISRNK----DLFPGSITFDDFLWAFGILRSRVFPELRGDKLALVPFADLVN 245

Query: 287 LGPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
             P + +  S    K K +    +    L      K+G+ I + +   + N++L ++YGF
Sbjct: 246 HSPDITSEGSSWEIKGKGLFGR-EPMFSLRTPVDVKSGQQIYIQYDLDKSNAELALDYGF 304

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
           V+ +   D   V   ++  DP Y DK  +A+ N       F +    ++     MLPYLR
Sbjct: 305 VESNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDIIL--DEPLPPQMLPYLR 362

Query: 402 LGYVSDTSEM-------QSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
           L  +  T           SV   L    P+SP  E ++   + D  K+ LA Y  T+ ED
Sbjct: 363 LLCIGGTDAFILEALFRNSVWGHLE--LPLSPDNEESICQVMRDACKSALAAYHTTIEED 420

Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLN 482
           E +    NL P+  +A  +   EKK+L 
Sbjct: 421 EELSERENLQPRLTIAIGVRAGEKKVLQ 448


>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 71/289 (24%), Positives = 126/289 (43%), Gaps = 51/289 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A++D++ GD  FS+P S++++       + ++EL   ++LS  + L L +MYE ++  
Sbjct: 41  VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            SFW PY   L R+         +P+ W++ +L  L G+   ++I  + E  +  +NEL+
Sbjct: 97  DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR--------- 281
            +        ++YP     +  T E+F    + +   S    LQK               
Sbjct: 149 PI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEE 201

Query: 282 ---------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
                           ++VP+   L   +    A L    D++Q+   +  K GE I   
Sbjct: 202 EEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIKEGEQIYNT 261

Query: 327 CGPQPNSKLLINYGFVDEDNPYDR------LVVEAALNTEDPQYQDKRM 369
            G   N+ LL  YGFVDE N +D       L+VE     +D   +++++
Sbjct: 262 YGDLCNADLLRKYGFVDEKNDFDLVELDGPLLVEVCCEDQDEALKERKI 310


>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
 gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
          Length = 1037

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 106/447 (23%), Positives = 178/447 (39%), Gaps = 93/447 (20%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
            + A +D+ A    F++P   ++ +E     + + ++   +          +L   + L L
Sbjct: 610  IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669

Query: 163  YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------A 215
             LMYE  QG+KS W PY   L         + ++P+ WSE+EL  L  S  +       A
Sbjct: 670  VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADA 722

Query: 216  EILERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
            E + R               G  R  ++L  +    GS    Y +D+  +    E     
Sbjct: 723  ESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDG 782

Query: 263  FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
            +V  +     +          +VP+   +L   ++  A +   ++++ +   RP KAGE 
Sbjct: 783  WVEDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEE 832

Query: 323  IVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
            I  + GP PNS+LL  YG+V E +  YD  VVE   +  +        V + N  +S QV
Sbjct: 833  IFNYYGPHPNSELLRRYGYVTERHSRYD--VVEIPWDVVES-------VMRLNFGISGQV 883

Query: 382  FHV--HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP-----------------IC 422
                 H   E+E   D     R     +T E+ S  +  GP                 + 
Sbjct: 884  LEKLRHGLEEEEEFEDTFVLER-----ETGEVNSDGTFSGPARFESMPEDLQEQLKTFLK 938

Query: 423  PVSPCMERAVLDQ----------LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
             V      A+ D+          LA   +A  + YP + SED  +L   +L  + R+A +
Sbjct: 939  GVKKAQPEAIPDKRKRDEIHHAVLAKTLQALASKYPTSTSEDGILLQRQDLSQRTRMAIE 998

Query: 473  LVRMEKKMLNACLQVTA--DMIMLLPD 497
            +   EKK+L   +  T+  D+ M + D
Sbjct: 999  VRLGEKKLLQEAIASTSSVDVEMTVDD 1025


>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
          Length = 543

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 70/262 (26%), Positives = 111/262 (42%), Gaps = 38/262 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
           + A+E ++ G+    +P   ++T+E  L    + E     +L E + LA +L        
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
            E     K  +  Y++ L R  G       S L W E+++   L GSP+    LER   +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
                E+   +        Q+ +DI        +F        S ++ L+  SL    AL
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLAL 272

Query: 285 VPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           VP    +L +   C+A   L      V L  DR Y+ GE +    G +P+S+LLI+YGF 
Sbjct: 273 VPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFA 331

Query: 343 DE--DNPYDRLVVEAALNTEDP 362
               DNP D   +   ++ EDP
Sbjct: 332 PAVGDNPDDEYALNLQIDEEDP 353


>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
 gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 353

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 106/233 (45%), Gaps = 27/233 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA  L+ EKK G+KS W+PYI  L +        + S + W E EL+ +  S    E ++
Sbjct: 2   LAAVLIREKKMGQKSRWVPYISRLPQP-----AEMHSSIFWGEDELSMIRCSAVHQETVK 56

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           +   I+++++      F+A +  Q  P  I TE    E F  A+  V S     +    +
Sbjct: 57  QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 333
           +R +L+P    +         +L   D+       +Q+  DR Y  G+ + +  G   N+
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNA 163

Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQ---VFH 383
            L++++GF    N +D + ++  +  +DP    K  + Q +   +V+   +FH
Sbjct: 164 TLMLDFGFTFPYNIHDEVQIQMDVPNDDPLRNMKLGLLQTHHTRTVKDINIFH 216


>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Vitis
           vinifera]
          Length = 480

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 69/292 (23%), Positives = 132/292 (45%), Gaps = 35/292 (11%)

Query: 75  EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           ++D  D   W+  K G+    V+   K ++         + AS+ +Q GD    VP ++ 
Sbjct: 39  DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           ++ + V     I  LL  +++  +A LA+ +  E K G+ S W PYI  L  Q G     
Sbjct: 93  ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
           + S + WSE EL  +  S    E + +   I++++  +  V    + +LF+    DI  +
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 200

Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--V 309
            F           + +C +V  +     +  +L+P     + +     ++L   +D    
Sbjct: 201 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 248

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
           +++ DR Y  GE +++  G  PN+ LL+++GF    N YD++ ++  +   D
Sbjct: 249 EVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 300


>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 679

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 15/209 (7%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
             A +LM +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159

Query: 219 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 273
           ER +  +++Y+    +LD + F      +QY +++   A T  I  +AF A V S  V  
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
             +      AL+PL   L  +    K    A D+ + L+V   + AG+ I    GP+ N 
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNE 274

Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDP 362
           +LLINYGF    NP D  +V   +  + P
Sbjct: 275 QLLINYGFCIAGNPTDYRIVLLGVKPDSP 303


>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 69/298 (23%), Positives = 134/298 (44%), Gaps = 40/298 (13%)

Query: 75  EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           ++D  D   W+  K G+    V+   K ++  +      + AS+ +Q GD    VP ++ 
Sbjct: 39  DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           ++ + V     I  LL  +++  +A LA+ +  E K G+ S W PYI  L  Q G     
Sbjct: 94  ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
           + S + WSE EL  +  S    E + +   I++++  +  V    + +LF+    DI  +
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 201

Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---- 307
            F           + +C +V  +     +  +L+P     + +     ++L   +D    
Sbjct: 202 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 249

Query: 308 ----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
                ++++ DR Y  GE +++  G  PN+ LL+++GF    N YD++ ++  +   D
Sbjct: 250 ESSSTLEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQVNIPHHD 307


>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 106/460 (23%), Positives = 186/460 (40%), Gaps = 50/460 (10%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T V+  R  +SK++    +L  W+  NG    K+ L+E   +  + R +H   + + L  
Sbjct: 18  TPVSPPRNGMSKEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVP 71

Query: 122 GDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFW 176
           G+    +P   ++T+E  +G +T I   L    +  +A     L ++L+ + +  + SF+
Sbjct: 72  GERILVIPKKCLITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFF 129

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y   L             P+ WSE EL++L GS    +I ER   I+++Y+ +  V  
Sbjct: 130 RNYYSTLPSTLS------NMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-- 181

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
                      D     F+ + F  A + V S    L  +   +  ALVP    L  Y  
Sbjct: 182 -----------DPSFARFSLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRP 229

Query: 297 K-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------D 349
           +          DA  +        G  +    G + N + L+NYGF  EDN        +
Sbjct: 230 RETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPN 289

Query: 350 RLVVEAALNTEDPQ-YQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 408
            ++++  L+  D Q + DKR     +G  ++    +          +   + RL  +  T
Sbjct: 290 EVLIDFQLSPADGQLFYDKRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVAT 346

Query: 409 SEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNL 463
            E  S +    P     P+S   E   L+ L +    +L+ Y  T+ ED  +L    Y L
Sbjct: 347 EEEFSTMKMKSPAHSSPPISFDNEIRALEYLRNLMTHQLSLYDTTIEEDNELLASKQYPL 406

Query: 464 HPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPC 503
              +  A   +R EK++     Q  AD ++ L  + ++ C
Sbjct: 407 FSNRIQALFFIRGEKQVCRY-FQELADKVIPLFSLPLAEC 445


>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
 gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 499

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 70/304 (23%), Positives = 122/304 (40%), Gaps = 50/304 (16%)

Query: 81  LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           L  W H +G    P  ++   EK   + + +P    +A+E L +   A + P S+ ++  
Sbjct: 12  LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67

Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
             L +  I   LT                N L        YL+ +  +GK S W PYI  
Sbjct: 68  NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
           L       + A+  P  W+E ++  L G+     I E  + +K EY +   +    GS  
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185

Query: 241 --LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--- 295
              + Q  Y+     FT   F+ + +  +S   ++++        L+P G  +  +S   
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LLPEGAKIDDFSILQ 237

Query: 296 -----------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
                      ++    L +   A +L+    Y+ G+ +    G + NS+LL+ YGFV E
Sbjct: 238 PLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNYGKKTNSELLLGYGFVTE 297

Query: 345 DNPY 348
           +N Y
Sbjct: 298 NNDY 301


>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
          Length = 467

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 96/427 (22%), Positives = 172/427 (40%), Gaps = 68/427 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
           + A  D+ A    F++P   ++  E     + + +L  +         L   + L L +M
Sbjct: 43  ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
           YE   G +S W PYI  L        L  ++P+ WSE EL+YL  S T  +I +  AE +
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 155

Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
            R                       +L  +    GS    Y +D+  E    +     +V
Sbjct: 156 FRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGWV 215

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             +     +  V++A           +L   ++  A +   D+ + +   R  KAGE I+
Sbjct: 216 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 265

Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQD------KRMV 370
            + GP PNS+LL  YG++ E +        P+D   V+ +L +E    QD      +RM 
Sbjct: 266 NYYGPHPNSELLRRYGYITEKHSRYDVVEIPWD--AVQHSLMSELGVPQDIMAETMERM- 322

Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSL----GPICPVS 425
             ++    + V    +G      +   P +  G   D  E +++ I  L    G +    
Sbjct: 323 -DQDDLEDIFVLERDSGEPNPDGTFAGPAVVDGMPPDLKEQLKATIKLLQKLDGNLISDK 381

Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
              +  +   + +  +   + Y  T++EDE +L   +L  ++R+A Q+   EKK+L    
Sbjct: 382 RKRDDILRSTMVETLRLIASRYSTTIAEDEVLLAQDSLTRRQRMAVQVRLGEKKLLQEAC 441

Query: 486 QVTADMI 492
              ++M+
Sbjct: 442 DHFSEMV 448


>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
 gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
          Length = 377

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 95/209 (45%), Gaps = 30/209 (14%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LAL ++ E+ +G+ S W PYI  L +        +++  LW +TEL+YL  SP   +  E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           R E I  E+ ++     +   LF +          + E FK  +  V S     + +++ 
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214

Query: 280 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
               LV +  P+L +     +S  K     + +   +  DR Y   + I +  G   N++
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAE 272

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQ 363
           L ++YGF   +NPYD    E  L T+ P+
Sbjct: 273 LALDYGFTVPENPYD----ETDLLTQFPE 297


>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 519

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 142/352 (40%), Gaps = 58/352 (16%)

Query: 160 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
           L +YL++++K  G  SF+ PY   L             P+ WS  EL  L GS   ++I 
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
           +R + I+ +Y  +  V    G+L             T + FK A + V S    LQ +  
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216

Query: 279 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
            R  ALVP    L  Y  +  K     V     +   +  +AG  +    G + N + L+
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLL 276

Query: 338 NYGFVDEDNPY------DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 391
           NYGF  EDN        + + +E  ++  D  +QDK     R               E  
Sbjct: 277 NYGFAVEDNRELDGFCPNEVPLELYVDPADILFQDKLEFWTRG--------------ETN 322

Query: 392 AISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATL 451
            IS        G V+     Q+V  S+G   P S   E      +    + RLA YP T+
Sbjct: 323 QIS--------GAVTAGLIAQAVGGSMGRGVP-SHAAESYTSGPVVK--RVRLASYPTTI 371

Query: 452 SEDEAMLTDYNLHPK---KRVATQLVRMEKKMLN---ACLQVTADMIMLLPD 497
           S+D A L D   +P+   +R A   VR EK++L+      +   DM+  + D
Sbjct: 372 SQDMADLQDEASYPQFSNRRHAKIQVRGEKEVLHHFRVWSETALDMLTFIED 423


>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
          Length = 509

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/260 (22%), Positives = 102/260 (39%), Gaps = 39/260 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASE +Q GD    VP  + +TL+++         L  + + + + LA  L+ E+  G +S
Sbjct: 63  ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI+ L  +       + + +LW   EL  +  S    E +E  E  K+E+  L   
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEFLALKPA 174

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 LF +        A   +      V     +   QK                   
Sbjct: 175 LDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK------------------- 215

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
                       D  +++ DR Y  GE +++  G   N+ L +N+GF    N YD+ ++ 
Sbjct: 216 ------------DVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFTLARNIYDQALIR 263

Query: 355 AALNTEDPQYQDKRMVAQRN 374
             +  +DP Y+ K  + Q++
Sbjct: 264 IDMPVQDPLYKKKLDIWQKH 283


>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
 gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
          Length = 439

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 101/436 (23%), Positives = 175/436 (40%), Gaps = 61/436 (13%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK      K  LK++   +   RP  +      +  ++ LQAG+   S+P   +VT   V
Sbjct: 28  HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           L N  + E +   K  +S L  L  +L+ EK  G++S W PY+  L +           P
Sbjct: 88  L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCP 139

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           +   E ++  L   P + +  E+   +   Y      +    SLF +    I    F + 
Sbjct: 140 VC-LEQDVVQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYS 194

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAV 309
             + A+  + +  +++ K S    F+L P    L P   LL +S   + KA         
Sbjct: 195 ALEWAWCTINTRTIYM-KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNY 253

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 369
           ++  +   K  E + +  GP  N +LL+ YGFV  DNP+  + V +    +     DK  
Sbjct: 254 EIQTNSQCKKYEEVFICYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDK-- 311

Query: 370 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCME 429
             Q+N KLS+   H           D+L  L  G+   +  + + +  L        C  
Sbjct: 312 --QKNAKLSILKEH-----------DLLENLTFGWDGPSWRLLTALKVLSLGGDEFTCWR 358

Query: 430 RAVLDQLADYFKARLAGYPATLSEDEAM-----LTDYNLHPKKRVATQLVRMEKKMLNAC 484
           RA+   L D   AR        +E +A+     +  + +   + V  Q+ +M++   N  
Sbjct: 359 RAL---LGDVISAR--------NEQQALNITTKICHFLIEETQHVLLQISQMKRDKENLK 407

Query: 485 LQVTADMIMLLPDVTV 500
            Q+T    + L D+ +
Sbjct: 408 EQLTLVEALRLEDLKI 423


>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
          Length = 486

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 100/442 (22%), Positives = 176/442 (39%), Gaps = 50/442 (11%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L  W+  NG    K+ L+E   +  + R +H   + + L  G+    +P   ++T+E  
Sbjct: 44  ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95

Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           +G +T I   L    +  +A     L ++L+ + ++ + SF+  Y   L           
Sbjct: 96  MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149

Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
             P+ WS+ EL +L GS    +I ER   I+++Y+ +  V             D     F
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARF 196

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVV 313
           + + F  A + V S    L  +   +  ALVP    L  Y  +          DA  +  
Sbjct: 197 SLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITS 255

Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQ-YQD 366
                 G  +    G + N + L+NYGF  EDN        + ++++  L+  D Q + D
Sbjct: 256 LGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGRNPNEVLIDFQLSQADGQLFYD 315

Query: 367 KRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPIC---P 423
           KR     +G  ++    +          +   + RL  +  T +  S +    P     P
Sbjct: 316 KRAYLHESGIYTMDA-RLSCSHSDANTREGFSFARL--IVATEDEFSSMKMKSPAHSSPP 372

Query: 424 VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT--DYNLHPKKRVATQLVRMEKKML 481
           +S   E   L  L D    +L+ Y  T+ ED  +L    Y L   +  A   +R EK++ 
Sbjct: 373 ISFDNEIRALQYLRDLMTHQLSLYDTTIEEDNELLASKQYPLFSNRIQALFFIRGEKQVC 432

Query: 482 NACLQVTADMIMLLPDVTVSPC 503
               Q  AD ++ L  + ++ C
Sbjct: 433 RY-FQELADKVIQLFSLPLAEC 453


>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 357

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 79/317 (24%), Positives = 131/317 (41%), Gaps = 63/317 (19%)

Query: 78  LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
            G L  W  ++G    P  ++ L     ++ +  P    +A+E LQ G AA S P  + +
Sbjct: 10  FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65

Query: 135 TLERVLGNETIAELLTTNKLSELACL------------------------ALYLMYEKKQ 170
           +    L +  +     +++ ++ A L                          +L+ E  +
Sbjct: 66  SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           GK SFW PYI  L       Q+AV + P  W + ++AYL G+     I E  E +KRE+ 
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182

Query: 230 ELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLARRFAL 284
           +       A  L ++  + D+P  A+T  ++K AF    S        L   +  R  AL
Sbjct: 183 Q-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRRLSAL 233

Query: 285 VPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
           +P G          PLL  ++            +V D  +L+   PY+ G  +    G +
Sbjct: 234 LPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQVYNNYGLK 293

Query: 331 PNSKLLINYGFVDEDNP 347
            NS+LL+ YGF+  + P
Sbjct: 294 TNSELLLAYGFILPETP 310


>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
          Length = 489

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 80/342 (23%), Positives = 142/342 (41%), Gaps = 60/342 (17%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           SW+H++G     V + E   +         + A++D   G    +VP  ++++ E+    
Sbjct: 88  SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140

Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
             ++E +  + L +      LAL+L+ EK     SFW PYI  L  +         + L 
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT---- 251
           ++  ELA L  SP     L+    I R+Y    N++ T+             D+P     
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNL 239

Query: 252 -EAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
            + FTF+ ++ A   V   Q+ +V     +L    A +PL          C      +  
Sbjct: 240 QDIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITT 290

Query: 308 AVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
              L ++R        Y+  E I ++ G +PNS L ++ GFV  DN YD L +   ++  
Sbjct: 291 DFNLELNRGECYALQDYRRDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPN 350

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           D     K  +  + G   V  F ++ G    ++ ++L ++R+
Sbjct: 351 DALRNGKVNLLNKLGLSGVTNFSLYKGASPISV-ELLAFIRI 391


>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 247

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 91/196 (46%), Gaps = 23/196 (11%)

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT-EDPQYQDK-R 368
           +   R + AGE +++  GP+ N  LL  YGFV++DNP D   +   ++   D   +D  R
Sbjct: 40  VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPNDVYRITGLIDKLSDVLGKDSVR 99

Query: 369 MVAQRNGKLSVQ--------VFHVHAGR-----EKEAISDMLPYLRLGYVSDTS--EMQS 413
           ++ +  GKL           V  V  GR     EKE    ++P  RL  V D    E ++
Sbjct: 100 VLRESGGKLGTTGDNAEGRPVESVTVGRSGLLGEKEE-GRVMPVFRLAVVKDDQLPEGKA 158

Query: 414 VISSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYN--LHPKKRVA 470
              SL      +SP  E A  D L         G+  TL+EDEA L+     L  +KRVA
Sbjct: 159 AGISLKDFSNEISPANEAAARDALRKLCIKEREGFATTLAEDEAYLSSLGNSLGAQKRVA 218

Query: 471 TQLVRMEKK-MLNACL 485
               RMEKK +L+A +
Sbjct: 219 FSF-RMEKKRVLDAAI 233


>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
          Length = 350

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 114/248 (45%), Gaps = 11/248 (4%)

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           E+FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 12  ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
            V  + ++AGE I ++ G + N++ +++ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 72  CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 131

Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLGPI- 421
             R G  +  VF +H   E    + +L +LR+  +++    + +        I +LG   
Sbjct: 132 LARAGIPTSSVFALHV-TELPISAQLLAFLRVFCMTEEELREHLLGENAIDRIFTLGNSE 190

Query: 422 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
            PVS   E  +   L D     L  Y  T+ ED+A L    L P+  +A +L   EK++L
Sbjct: 191 YPVSWDNEVRLWTFLEDRASLLLKTYKTTIEEDKAFLQSPGLSPRAAMAVKLRLGEKEIL 250

Query: 482 NACLQVTA 489
              ++  A
Sbjct: 251 EKAVRSAA 258


>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 471

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 89/425 (20%), Positives = 161/425 (37%), Gaps = 65/425 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
           + A++D+ A    F++P   ++ +E             GN+   E +    L     L L
Sbjct: 42  IIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 101

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
            ++YE  QG  S W PY   L  +         + + W  ++L  L GS   ++I  E A
Sbjct: 102 VMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEA 154

Query: 222 EGIKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           + + R                         EL  +    GS+   Y +D+  E    +  
Sbjct: 155 DEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEE 214

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
              ++  +     L  V +A           +L   ++  A +   DD + +   RP  A
Sbjct: 215 DDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVNHGDDELTVTALRPIPA 264

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDN--------PYD--RLVVEAALNTEDPQYQDKRM 369
           GE I+ + GP PNS+LL  YG+V   +        P+D  +  V   L   D  ++  + 
Sbjct: 265 GEEILNYYGPHPNSELLRRYGYVTPKHSRYDVVEIPWDLVQASVSEHLKIGDDVWKQVQE 324

Query: 370 VAQRNGKLSVQVFHVHAGR-EKEAISDMLPYLRLGYVSDTSEMQSVISSL----GPICPV 424
                    V V    +G  + E     +  +R        ++++V+ ++    G + P 
Sbjct: 325 YVDPEELEDVFVLERESGEPDSEGQFRTVAEVREISAELEEQLKAVLKAIKKINGDLIPD 384

Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNAC 484
               +      +    +  L+ YP +  EDEA+L   +L  ++R+A  +   EKK+L   
Sbjct: 385 KRKRDEVFHAVIVSTLQKILSQYPTSTQEDEALLATSDLTNRQRMAIHVRLGEKKLLKEA 444

Query: 485 LQVTA 489
           L+   
Sbjct: 445 LEFAG 449


>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
 gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
          Length = 429

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 122/282 (43%), Gaps = 24/282 (8%)

Query: 79  GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           G L +WM  NG     K+ L++ P           V A E L  G+    +P +L+++  
Sbjct: 30  GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83

Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
             L +   + ++  + KL+ +  L L+++ +K  G+ S W P++  L R          +
Sbjct: 84  TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 136

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           P+         L     + E+      I+R + +L  +  + G + ++      +  FT+
Sbjct: 137 PVFLRRKVFESLP-KDLREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTW 193

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
             F  A+ AV +  +  Q  + +  +     AL P     L +  K     A V +  ++
Sbjct: 194 NNFVWAWTAVNTRCIFAQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEI 252

Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           +  + + A E + +  GP  N +L ++YGFV  DNP D +VV
Sbjct: 253 LSHKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 294


>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
          Length = 380

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 58/248 (23%), Positives = 116/248 (46%), Gaps = 11/248 (4%)

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           ++FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 23  DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 82

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
            V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 83  CVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 142

Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVISSLGPI 421
             R G  +  VF +H   E    + +L +LR+  +++          S +  + +     
Sbjct: 143 LARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFTLGNSE 201

Query: 422 CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
            PVS   E  +   L D     L  Y  T+ ED+++L +++L  + ++A +L   EK++L
Sbjct: 202 FPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKNHDLSVRAKMAIKLRLGEKEIL 261

Query: 482 NACLQVTA 489
              ++  A
Sbjct: 262 EKAVKSAA 269


>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
          Length = 1000

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 95/443 (21%), Positives = 179/443 (40%), Gaps = 79/443 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
           + A +D+ A    F++P   ++  E     + I ++   +K        L   + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
           +YE  QG  S W  Y   L         + ++P+ WSE EL  L  S  + +I +  AE 
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690

Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           +                    R   EL  +    GS    Y +D+  +    E  ++   
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             +   +            +VP+   +L   ++  A +   ++++ +   RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799

Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 360
            + GP PNS+LL  YG+V E +        P+D  +VE+ L                  E
Sbjct: 800 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILEQIRGEFEE 857

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 420
           + +++D  ++ +  G+++         + +    D+   L+  ++    ++Q   S   P
Sbjct: 858 EEEFEDTFVLERDTGEVNSDGTFAEPAKFEGMPEDLQEQLK-SFLKGIKKLQ---SDTIP 913

Query: 421 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
                  + +AVL +  +   AR   YP ++SED+ +L   NL  + R+AT +   EKK+
Sbjct: 914 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQILLNGQNLDQRARMATVVRLGEKKL 970

Query: 481 LNACLQVTADMIMLLPDVTVSPC 503
           L   +   ++ + +  D    P 
Sbjct: 971 LQEAIATFSEDVEMTMDDESGPA 993


>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
          Length = 963

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 170/410 (41%), Gaps = 53/410 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
           + A +D+ A    F+VP S +++ E          +  ET  E+    +    + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           MYE  +G +S W PYI  L         + E+P+ WS+ EL  L  S T++++  +A   
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKV-GKASAE 648

Query: 225 KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 271
           +   +++  V      LF   Q Y  D   +          +++F+   +          
Sbjct: 649 EMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEE 708

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
            +++        +VP+   +L   ++  A +   DDA+ +   R  KAGE I  + GP P
Sbjct: 709 WVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIFNYYGPHP 767

Query: 332 NSKLLINYGFVD-EDNPYDRL---------VVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
           NS+LL  YG+V  + + YD +          V A+L     Q  DK      + +L    
Sbjct: 768 NSELLRRYGYVTPKHSRYDVVELPWTLVEESVAASLGLSSEQL-DKARECLDSDELEDTF 826

Query: 382 FHVHAGREKEAISDMLPYLRLGYVSDT--SEMQSVISSLGPICPVSPCMERA-------V 432
                  E      +    R   + +    +++S++ ++    P S   +R        +
Sbjct: 827 VLERETEEPNPDGTLTGSARFSEIPEDLRDQLKSLLKAIRKAVPSSVVDKRKRDEIQHNI 886

Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
           L +  D   +R   YP ++SEDE +L   ++  ++R A  +   EK+++ 
Sbjct: 887 LIRALDALASR---YPTSISEDERILAGNDISERRRAAVTVRLGEKRLIQ 933


>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
          Length = 230

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 101/226 (44%), Gaps = 26/226 (11%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 334
           +AL+PL          C      +  A   V+DR        +KAGE + ++ G + N+ 
Sbjct: 14  YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 66

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
           L ++ GFV E+N YD   +   ++  DP  Q +  +    GKLS+      + R+  +  
Sbjct: 67  LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 123

Query: 395 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 446
           D  +L +LR+ +  +  ++   I+S     LG I C +   +E      L    K  L+ 
Sbjct: 124 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 182

Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           Y  TL EDE +L +    P + +A ++   EK+++   L+     I
Sbjct: 183 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 228


>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
           NZE10]
          Length = 445

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 102/437 (23%), Positives = 172/437 (39%), Gaps = 59/437 (13%)

Query: 83  SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
           +W+  NG     K+ L +    N        + A EDL   +  FSVP S ++T E    
Sbjct: 16  NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E + + +    LS +  +AL  +     G +S W PY   L        ++ ++ + WS
Sbjct: 72  GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120

Query: 202 ETELAYLTGSPTKAEI----------------LERAEGIKREYNE-LDTVWFMAGSLFQQ 244
           + EL +L GS    +I                +ER    K   NE L  +    GS    
Sbjct: 121 DRELRHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMA 180

Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
           Y +D+ T +   +   + +         L K        +VPL   L A + +  A L  
Sbjct: 181 YGFDLETSSDQAKNDGEEWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFY 233

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN---------PYDRLVVEA 355
            DD V +   +P KAGE +    G  P + LL  YG++  DN         P D +   A
Sbjct: 234 EDDKVVMKTIKPVKAGEELYNDFGSLPRADLLRRYGYL-TDNYAQYDVVEIPADLIKERA 292

Query: 356 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
            L T+D    ++   A+  G L        A  E+    + L  L        +E + V 
Sbjct: 293 GLRTQD--VDERWQYAEEQGVLDDGYDVSRASSEEGQFPEELCVLLNLLALPRAEFEKVK 350

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-VATQLV 474
           +      P      + +L  +  Y   R A YP  + +   M +D +L+ ++R +A  ++
Sbjct: 351 NKDKIPKPDLTTNAKKLLRTILVY---RYAAYPGNVDQ---MHSDVSLNDRRRKMAIVVI 404

Query: 475 RMEKKMLNACLQVTADM 491
           + EK++L   +   +++
Sbjct: 405 QGEKQVLQEAVDAISEI 421


>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
          Length = 667

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 101/226 (44%), Gaps = 26/226 (11%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP-------YKAGESIVVWCGPQPNSK 334
           +AL+PL          C      +  A   V+DR        +KAGE + ++ G + N+ 
Sbjct: 451 YALIPLW-------DMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNAD 503

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
           L ++ GFV E+N YD   +   ++  DP  Q +  +    GKLS+      + R+  +  
Sbjct: 504 LFVHNGFVFENNDYDVYWIRLGISKSDPLQQKRGHLL---GKLSIASTCDFSIRKGASPI 560

Query: 395 D--MLPYLRLGYVSDTSEMQSVISS-----LGPI-CPVSPCMERAVLDQLADYFKARLAG 446
           D  +L +LR+ +  +  ++   I+S     LG I C +   +E      L    K  L+ 
Sbjct: 561 DGQLLAFLRV-FNMNEEQLDHWINSDKSADLGHIDCALDTALETKSWRFLHARLKLLLST 619

Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           Y  TL EDE +L +    P + +A ++   EK+++   L+     I
Sbjct: 620 YKTTLDEDEKLLAEAQATPNRLLAIKMRATEKRIIRETLEYVEQYI 665



 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
           V A+ D+       +VP  L++++E       + +L+  +K L  +    L+++L+ EK 
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
           +G  SFW PYI  L +          + L +S  EL  L GSPT    L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225


>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 499

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 39/249 (15%)

Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           ++ ASE ++AGD    VP       +SL + +  +LGNE          +  +A LA+ +
Sbjct: 77  FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           + E K G  S W PYI  L +        + + + W E+EL  +  S    E L +   I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           KRE+  +           + +P  I  +  + + F  A+  V S     +        +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           +P     L +    +AML   DD    ++V DR +  GE +++  G   N+ L++++GF 
Sbjct: 228 IPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFA 286

Query: 343 DEDNPYDRL 351
              N +D++
Sbjct: 287 LPYNIHDQV 295


>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
          Length = 467

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 102/450 (22%), Positives = 182/450 (40%), Gaps = 75/450 (16%)

Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LS 155
           N     ++   A ED+ A    F++P   ++ +E     + I +    +K        L 
Sbjct: 38  NAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLD 97

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
             + L L ++YE  QG+ S W PY   L         + ++P+ WS+ EL  L  S  + 
Sbjct: 98  SWSSLILIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRH 150

Query: 216 EILE-RAEGIKRE------------YN-------ELDTVWFMAGSLFQQYPYDIPTEAFT 255
           +I +  AE + ++            +N       EL  +    GS    Y +D+  +   
Sbjct: 151 KIGKADAENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLENDE-- 208

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
                +        V      S+     +VP+   +L   ++  A +   ++++ +   R
Sbjct: 209 -----EEEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLR 259

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAAL--NTEDPQYQ 365
           P KAGE I+ + GP PNS+LL  YG+V E +        P+D  +VE+AL  N   P   
Sbjct: 260 PIKAGEEILNYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESALTSNFGIP--- 314

Query: 366 DKRMVAQRNGKL-------SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE-MQSVISS 417
             +++ Q  G L          V     G      +   P        D  E +++ +  
Sbjct: 315 -GQVLEQIRGALEEDEEFEDTFVLERETGEVNSDGTFAEPARFESMPEDLQEQLKTFLKG 373

Query: 418 LGPICP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQL 473
           +    P  +    +R  + Q  LA   +A +A YP ++SEDE +L   +L+ + R+A  +
Sbjct: 374 IKKAQPDAIPDKRKRDEIHQAVLAKTLEALVARYPTSISEDENLLKQ-DLNQRTRMAIAV 432

Query: 474 VRMEKKMLNACLQVTADMIMLLPDVTVSPC 503
              EKK+L   +  ++  + +  D    P 
Sbjct: 433 RLGEKKLLQEAITASSGDVEMTMDDESGPA 462


>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 561

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 101/437 (23%), Positives = 180/437 (41%), Gaps = 59/437 (13%)

Query: 68  REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
           RE V+  E D+   +L  W+   G    K++L++   +  + R +H      +L  G+  
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158

Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
             +P + ++T+E  +G +T I + +  + +  +A     L LYL+ + ++   +F+  Y 
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L             P+ WS+ EL++L GS    +I ER   I+++Y+ +         
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA------ 264

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
                  D     F+ E F  A + V S    L  +   +  ALVP    L  Y  +  +
Sbjct: 265 -------DPSFSRFSLERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETS 316

Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVV 353
                  D   +       +G  +    G + N + L+NYGF  EDN        + ++V
Sbjct: 317 WTFDQKLDGFTITSLESICSGAQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNEIMV 376

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY-----VSDT 408
           +  L+  D Q     ++  +   L     +    R   + SD  P  R G+     ++ T
Sbjct: 377 DFQLDPGDGQ-----LLYDKTAYLYESGIYTMNARLSCSHSD--PSTREGFSFARLIAAT 429

Query: 409 SEMQSVISSLGPIC---PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYNL 463
            +  S +    P     P+S   E A L+ L      +L  YP +L E EA+L   +Y L
Sbjct: 430 EDEFSSMKMRSPAHASPPISFRNEIAALNLLKQLMDTQLDQYPTSLDEGEAILKSKEYPL 489

Query: 464 HPKKRVATQLVRMEKKM 480
           +  +  A   +R EK++
Sbjct: 490 YSNRIQALFFIRGEKQV 506


>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Sus
           scrofa]
          Length = 326

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 7/150 (4%)

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
           M   +F +YP   P E F  E FK +F  + S +V L   S+  + ALVP    ++ +S 
Sbjct: 73  MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129

Query: 297 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLV 352
           + +  L     +  +V   DRPY+ GE + +  G + N +LL++YGFV ++  NP D + 
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSDSVE 189

Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           +  +L   D  Y++K  + ++ G    Q F
Sbjct: 190 LSLSLKKSDESYKEKLELLKKYGLSGSQCF 219


>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
 gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
          Length = 679

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 98/213 (46%), Gaps = 23/213 (10%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
             A +LM +  +G + FW PY+R L +    GQL   +PL + E ++ ++ G+      +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
           ER +  + +Y   D+ +   G+    +P     E +T+E++  A   + S     + +S 
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211

Query: 279 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
           A +          AL+PL   L  +    K    A D  + L+V   + AG+ I    GP
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGP 270

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
           + N +LLINYGF    NP D  +V   +  + P
Sbjct: 271 RNNEQLLINYGFCIAGNPTDYRIVHLGVKPDSP 303


>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
 gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
          Length = 509

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 108/451 (23%), Positives = 187/451 (41%), Gaps = 57/451 (12%)

Query: 64  VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
           VA    V S+ + D   L +W+   G+   KV   ++   P      R    VAA ED+ 
Sbjct: 44  VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
            GDA  ++P SL +T++  L +   A        +    +AL L++E+  G+KS W  Y+
Sbjct: 99  GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L  Q       +++PL WS  E+A LTG+    ++L+ A G   +     T   +  S
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY--DSYARGTWARLKES 202

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKCK 299
            F   P   P++AF    F  AF  ++S C      V      ALVP     +A  S   
Sbjct: 203 AFDANPDVFPSDAFDEPSFLWAFGILRSRCQA---PVDQGADIALVP--GLDMANHSGLS 257

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVV-------------------WCGPQPNSKLLINYG 340
           +    +++     V    K+G S+++                   +   + +++L ++YG
Sbjct: 258 SQTWTLNNGGVAAVFGGGKSGGSMLLRTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYG 317

Query: 341 FVDEDNPYDRLVVE-AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
           F D        V+   A+   DP   DK  V +  G      F + A  + E   ++  +
Sbjct: 318 FTDAFASRPGYVLGPIAIPESDPNAFDKMDVLEVAGLREAPSFVLRAFEDPE--PELRVF 375

Query: 400 LRLGYV--SDTSEMQSVI--SSLGPIC-PVSPCMERAVLDQLADYFKARLAGYPATLSED 454
           +RL  +   D   ++++    + G I  PVS   E+     + +  +  L GY   + +D
Sbjct: 376 MRLLNLKGEDAFLLEAIFRQEAWGLISEPVSRLNEQEACGTMINGCEEALRGYATRVEDD 435

Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
             +  D  +  + R+A ++   EK+ L   L
Sbjct: 436 RRVAEDPGVGHRLRLAARVRMGEKQALADAL 466


>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
          Length = 541

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 116/251 (46%), Gaps = 25/251 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V    D+ +     S P++L +   +     + A+ +T    ++ A L ++L  E  +GK
Sbjct: 43  VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +SFW PY+  L R+        ++PL +S+ +L +L G   +A  +E  + I RE  E  
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFE-- 150

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA----LVPLG 288
                A S+ Q+  YD  TE +T+E++  A     S     + +   R        +P+ 
Sbjct: 151 ----AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPIL 204

Query: 289 PPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
            PL+   +   A +      D +++++      AG  +    GP+ N +LL+ YGF    
Sbjct: 205 FPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEVYNNYGPKANEELLMGYGFTLLQ 264

Query: 346 NPYDRLVVEAA 356
           NP+D  +++++
Sbjct: 265 NPFDSFLLKSS 275


>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
 gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
          Length = 544

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 160/392 (40%), Gaps = 45/392 (11%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           D++AG+    +P +L VT   V  +  +A L       EL  LAL+L  E+ +G  S W 
Sbjct: 98  DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
           PY+  L          V+ PL W E+E+ + L GSPT  + + RA   + EY  +     
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREEYASIRAAIA 210

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--------- 287
                +    Y+  TE      F  A   V +  V L   ++   +ALVPL         
Sbjct: 211 ADADAYPADAYEFLTEL----AFTDALATVLARAVWLNAANV---YALVPLVDLLPVVGA 263

Query: 288 -----GPPLLAYSSKCKAMLAAV-----DDAVQLV-VDRPYKAGESIVVWCGP---QPNS 333
                 P   A  +  + + AAV     D A + V V     A ++  V C     +   
Sbjct: 264 PPPGVNPAAAAADAGARGLDAAVGVVDYDAATECVAVVSANDARQTAPVVCADALGRNAG 323

Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG-KLSVQVFHVHAGREKEA 392
            L ++ G V+  +  D L    +    D  Y  K+ + +  G     Q F V A R    
Sbjct: 324 DLFLSTGRVNGAHVGDYLTFVTSTVMSDKLYAAKKQILEGMGYSADAQAFPVFADRMP-- 381

Query: 393 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
              +  Y+R   V + SE+ +V      I  VSP  E  +L  L    +  LA Y  +  
Sbjct: 382 -LQLFAYMRFARVQEPSELMTVSFEEDRI--VSPMNEYEILQLLMGDAREMLAEYENSSE 438

Query: 453 EDEAM-LTDYNLHPKKRVATQLVRMEKKMLNA 483
           E E + L + N+  ++  A +L   EK+++NA
Sbjct: 439 EFELLQLKETNISERQMTAAKLRLGEKRLINA 470


>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
 gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
          Length = 327

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 125/302 (41%), Gaps = 50/302 (16%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           +D   L  W+ +NG     ++L + P           V ++ +L+ GD   S+P +L++T
Sbjct: 4   DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57

Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
              V+ N  + + + T K  L+    L+LYL+ EK +GK SFW PYI+ L         +
Sbjct: 58  TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTS 109

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
             +P  +S  E+  L     +A +  R + ++  Y  L T       LF  +        
Sbjct: 110 YTTPSYFSTAEVDALPALVREATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTV 163

Query: 254 FTFEIFKQAFVAVQSCVVH-------LQKVSLARRFALVPL-----GPPLLA------YS 295
           FT + ++ A+  V +  V+           S    +AL P        PL+        S
Sbjct: 164 FTLKSYRWAWATVYTRSVYKRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVS 223

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 355
           SKC           ++  +   +    + +   P  N +LL+ YGFV   NP+  +   A
Sbjct: 224 SKC----------YEVKTEGACRKYRQVFINYDPYDNGRLLMEYGFVMPRNPHSVVTFTA 273

Query: 356 AL 357
           A+
Sbjct: 274 AV 275


>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
           occidentalis]
          Length = 381

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 66/279 (23%), Positives = 118/279 (42%), Gaps = 41/279 (14%)

Query: 78  LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           +G+L SW+ + G  P  V+ L   P+          +    +++AGD    +P++L++T 
Sbjct: 21  IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74

Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
           + V       EL ++   LS    L ++++ E+  G+KS W PYI  +            
Sbjct: 75  DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI------------ 117

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            P ++   +       P +      A+ I R   E   V+      F+    D+      
Sbjct: 118 -PDVFDGLQCRKSVRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----N 166

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
           FE F  A+ AV +  ++++         L P    LL +  K     + V++   +  + 
Sbjct: 167 FETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNV 221

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
            Y+AG  + +  G   N  L +NYGFV ++NP D + VE
Sbjct: 222 GYEAGSEVFIGYGSHDNRTLFLNYGFVLDENPNDCITVE 260


>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 476

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 122/302 (40%), Gaps = 51/302 (16%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
           +W+ ++G      I  E   + +  R    V AS+D+   +  F +P + ++++E  +L 
Sbjct: 13  AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E  A  L+   L     L L ++YE   G  S W PY   L  +         + + W+
Sbjct: 70  TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120

Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
           E ELA L  S    ++                            ERA+   +E   L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180

Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             M   GSL   Y +D+     T E+ ++   A +     L K        +VPL   L 
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
           A + +C A L    D +++   +P +AGE I    GP P S LL  YG+V  DN     V
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDV 291

Query: 353 VE 354
           VE
Sbjct: 292 VE 293


>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
          Length = 465

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 94/443 (21%), Positives = 176/443 (39%), Gaps = 79/443 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
           + A  D+ A    F++P    + +E     + I ++   +K        L   + L L +
Sbjct: 43  IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
           +YE  +G  S W  Y   L         + ++P+ WSE EL  L  S  + +I +  AE 
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155

Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           +                    R  +EL  +    GS    Y +D+  +    E  ++   
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
                 +            +VP+   +L   ++  A +   ++++ +   RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264

Query: 325 VWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNT----------------E 360
            + GP PNS+LL  YG+V E +        P+D  +VE+ L                   
Sbjct: 265 NYYGPHPNSELLRRYGYVTEKHSRYDVVEIPWD--IVESVLTNFGISSKILKQIRGEFEG 322

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGP 420
           + +++D  ++ +  G+++         + +    D+   L+    S    ++ V S   P
Sbjct: 323 EEEFEDTFVLERDTGEINSDGTFAEPAKFEGMPEDLQEQLK----SFLKGIKKVQSDTIP 378

Query: 421 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
                  + +AVL +  +   AR   YP ++SED+ +L   NL  + R+AT +   EKK+
Sbjct: 379 DKRKRDEIHQAVLVKTLEALAAR---YPTSISEDQTLLNGQNLDQRARMATVVRLGEKKL 435

Query: 481 LNACLQVTADMIMLLPDVTVSPC 503
           L   +   ++ + +  D    P 
Sbjct: 436 LQEAIATFSEDVEMTMDDESGPA 458


>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
 gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
          Length = 467

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 119/292 (40%), Gaps = 59/292 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  DL  G+   ++P +  +TL      + I        L     L + LMYE+ +GK
Sbjct: 34  VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
            S W  Y++ L RQ          P LWSE E+   L G+     + E    +K ++ E 
Sbjct: 90  GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 141

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
                    L ++ P + P + FTFE    +++A +S V     ++     + +VPL   
Sbjct: 142 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 191

Query: 291 LLAYSSKCKA-----MLAA-----------VDDA---------------VQLVVDRPYKA 319
              ++ K  A     ML A           +DD                +++V+ +   A
Sbjct: 192 --LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAA 249

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 369
           G  I    G   N+ LL  YGF + +NP+D   L ++  L     ++Q KR+
Sbjct: 250 GSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCVLEVLLSRFQKKRV 301


>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
          Length = 495

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 96/432 (22%), Positives = 176/432 (40%), Gaps = 40/432 (9%)

Query: 88  NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
           + LPP    L+   +H+ K  P+  V          A++ +   +  FS+P  L+++ E 
Sbjct: 75  DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131

Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
              + TI   +  +  LS++  +AL + +  +    KSFW PY+  L         + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPYDIPT 251
            + ++  E+  L GSP   + L     I R+Y+         V     +L   + Y+   
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYR 243

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
            A +  + +Q  +  Q  +    K  L    AL+PL         +              
Sbjct: 244 WAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPLWDFCNHQDGQFSTEFQLESRRTVC 303

Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 371
              R +  GE + ++ G +  ++  I+ GFVD +N +D L ++  L+  DP    +  + 
Sbjct: 304 QAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNAHDALTLKVGLSKSDPLAGQRATLL 363

Query: 372 QRNGKLSVQ------VFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 425
            +   LS +       F + AG  +     +L +LRL  ++  S  + + S         
Sbjct: 364 CKLRILSDEKISGPIAFQLKAG-PQPVDGKLLAFLRLFCMTKDSLDRWLQSDNASNLMHE 422

Query: 426 PC-MERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
            C +E  V D+   + KAR    L  YP T   D  ML + +L   +R+   L   EK++
Sbjct: 423 ECGIETEVDDKSWSFLKARCQLLLQLYPTTKEADLKMLEE-DLSSHRRMCVLLRLAEKRI 481

Query: 481 LNACLQVTADMI 492
           L + ++  A  I
Sbjct: 482 LLSAIECAAQRI 493


>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 456

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 121/249 (48%), Gaps = 27/249 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP S  +  + +L    +++LL  +++  +A LA+ L+ ++K G++S
Sbjct: 51  ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  Q G     + S + WS++EL  +  S    E +++   I++++  +  V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                   + +P    + + TF+ F  A+  V+S     +     +  +L+P     L +
Sbjct: 163 -------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNH 207

Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
               +A++   +D    ++  DR Y   E +++  G   N+ LL+++GF    N +++  
Sbjct: 208 DGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQ-- 265

Query: 353 VEAALNTED 361
           VE  +N  D
Sbjct: 266 VEIQINIPD 274


>gi|297726941|ref|NP_001175834.1| Os09g0411650 [Oryza sativa Japonica Group]
 gi|255678893|dbj|BAH94562.1| Os09g0411650, partial [Oryza sativa Japonica Group]
          Length = 206

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 82/158 (51%), Gaps = 9/158 (5%)

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGRE 389
           + N++L ++YGF + ++  D   +   ++  DP Y DK  +A+ NG      F +  G  
Sbjct: 10  KSNAELALDYGFTESNSSRDAYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLG-- 67

Query: 390 KEAIS-DMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLADYFKAR 443
            E++   MLPYLRL  +  T    ++++  +   G +  PVS   E A+   + +  K+ 
Sbjct: 68  -ESLPPQMLPYLRLLCLGGTDAFLLEALFRNAVWGHLELPVSQDNEEAICQVIRNACKSA 126

Query: 444 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
           L  Y  T+ EDE +L   NL P+ ++A ++   EKK+L
Sbjct: 127 LGAYHTTIEEDEELLGSENLQPRLQIAVEVRAGEKKVL 164


>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 538

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 112/451 (24%), Positives = 176/451 (39%), Gaps = 65/451 (14%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           ++ + D   L +W+ + G     V   ++   P           + A+ D+  GDAA  V
Sbjct: 73  ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127

Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           P +L +T E    ++ I   L          LAL L++EK  G  S W  YIR L R   
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPY 247
               A+++PL WS  ELA L G+   A        ++  +  L    F    +LF     
Sbjct: 186 ----ALDAPLFWSSEELAELAGTQLLANAAGYDSYVRGTHAALKETTFKEHPALFGDAGD 241

Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-------KA 300
           D    AF+   F  AF  ++S    L  V      AL+P G  +  +   C         
Sbjct: 242 DDGGGAFSEREFLWAFGVLRSRA--LPPVDQGESIALIP-GIDMANHDGLCSQTWQLNNG 298

Query: 301 MLAAV---------DDAVQLVVDRP----YKAGESIVVWCGP-QPNSKLLINYGFVDEDN 346
            +AAV           +V L V++      K GE I    GP   +S+  ++YGFVD   
Sbjct: 299 GIAAVFGGRGGADGGGSVLLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFC 358

Query: 347 PYDRLVVEA-ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV 405
                V+   ++  +D    DK  V    G      F + A   ++   +M+ ++RL  +
Sbjct: 359 SRPGYVLGPLSIPEDDVNAFDKMDVLSVAGLKESPAFTIRA--FEDPPPEMVVFMRLLNL 416

Query: 406 SDTS----------EMQSVISSLGPICPVSPCMERAVLDQLADY-FKARLAGYPATLSED 454
            +            E  ++IS      PVSP       D  AD   +  L  Y   + +D
Sbjct: 417 KNDDAFLLEAIFRQECWALISD-----PVSP-------DNEADAGCEEALGAYATKIEDD 464

Query: 455 EAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
             +  D +  P+ R+A ++   EK+ L   L
Sbjct: 465 RGVADDADASPRLRLAARVRMGEKQALEEVL 495


>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
           magnipapillata]
          Length = 819

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 95/394 (24%), Positives = 158/394 (40%), Gaps = 59/394 (14%)

Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL---LTT-----NKLSELA 158
           HR  + + A+ED++ G+  F+VP  L++          + E    L T     N  S   
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKWLDTHGKSLNDSSGWL 184

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEI 217
            L + LM+E  Q K SFW  Y+  +      G      PL W E E      G P   +I
Sbjct: 185 PLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDI 238

Query: 218 LERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 270
           +   E I+ EY E   ++      + GSL          E ++ E FK+  AFV   S  
Sbjct: 239 IVDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFT 288

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
              +  S+      VP+   +L + S   A L      +Q++  R  K GE +    G  
Sbjct: 289 EDEESPSM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKL 341

Query: 331 PNSKLLINYGFVD-EDNPYDRLVV-----------EAALNTEDPQYQDKRMVAQRNGKLS 378
            N++LL  YG+V+   N YD L++           +     +DP    K  +  R G   
Sbjct: 342 GNTELLQMYGYVEIPSNQYDSLLLPVKDFYKIMTSKNGTANDDPYLLAKINLLNRTGIAE 401

Query: 379 VQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS---PCMERAVLDQ 435
           V  F +          D++ +L++ + SD  E++ ++ +     P S     + +  L +
Sbjct: 402 VDAFFMFDKNGLRCGPDLIQFLKIFHASD-RELEKILKTRASKRPESFYHKLLRKLRLSK 460

Query: 436 LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
             +  K  L      ++ED+  +   N + +K V
Sbjct: 461 KTE--KNSLGMTVIDITEDDTEMDIENFNKRKNV 492


>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 426

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 92/411 (22%), Positives = 171/411 (41%), Gaps = 38/411 (9%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           H V A + L +G     +P  L + +E    ++    L    ++ +   LAL+LM+E+ +
Sbjct: 35  HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
              SF+ P+I  L           + P+ WSE+EL  L G+          + ++R++  
Sbjct: 95  RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQLQRDFEN 147

Query: 231 LDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           +        ++ + +P     +PT   T E +  A   + S    + +     R  L P 
Sbjct: 148 IHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR-VLCP- 197

Query: 288 GPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGF 341
              +  +    + +L    + D+  Q++   V +   AG ++ +  G   N+KLL +YGF
Sbjct: 198 AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKLLFSYGF 257

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR-EKEAISDMLPYL 400
           V ++N    +     +   DP  + K+ V   N     Q +       E +    +L  L
Sbjct: 258 VAKENSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTYDFCGTLFENDVDERLLATL 317

Query: 401 RLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
           R+  +++  E++ +         +S   E AV + L +  + +LA +  TL EDEA+L +
Sbjct: 318 RVILMNE-QEIR-LYKKAFETSIISIRNELAVYENLQNTCRRKLANFATTLEEDEAILAE 375

Query: 461 YNLHPKKRVATQL-VRMEKKM--------LNACLQVTADMIMLLPDVTVSP 502
                  R++  + VR+E K         L    QV A  + + P  T  P
Sbjct: 376 MATESSPRLSFAVRVRVEDKQVLTGVIDTLEKWKQVLASNLEMYPPSTTRP 426


>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 463

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 68/301 (22%), Positives = 129/301 (42%), Gaps = 30/301 (9%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           + ++ +  L  W  KNG      IL     H  ++   + + A++++  GD   +VP +L
Sbjct: 81  RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134

Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
           ++T E +    L      +++  N  +    LA++++ E  ++ KKSFW  Y+  L    
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
               +   +P+ +   +L  L GSP     L+    I R+Y     ++ ++       P 
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSND-----PA 240

Query: 248 D-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
             I  + FT+E ++ A    ++ Q+ V      S     AL+PL       S +      
Sbjct: 241 SVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSALIPLWDMFNHRSGRLSTDFV 299

Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 363
              +      D  Y A E + ++ G + N+  L++ GFV  DN +D + +   ++  DP 
Sbjct: 300 KSSNVCVCYADGDYAADEQVYIFYGVRTNADFLVHNGFVYPDNEHDAVKIRLGVSRSDPL 359

Query: 364 Y 364
           Y
Sbjct: 360 Y 360


>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 440

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 131/309 (42%), Gaps = 32/309 (10%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK+     K  LKEK  +  K RP  +      +  ++ LQ G+   S+P   ++T + V
Sbjct: 29  HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88

Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           L N  + E +   T  +S L  L  +L+ EK   +KS W PY+  L          + S 
Sbjct: 89  L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSC 139

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
            +  E ++  L   P + +  E+ + ++  +      +F    LF   P D+ +  F ++
Sbjct: 140 PVCLEQKIVNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQ 195

Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAAVDDAVQ 310
            FK A+  + +  V   H Q+   +R      L P   LL +  + + KA         +
Sbjct: 196 AFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYE 255

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
           +           + +  GP  N +LL+ YGFV  DNP+  + V      ++   +DK   
Sbjct: 256 ITTVTQCHHYNEVFICYGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDK--- 312

Query: 371 AQRNGKLSV 379
            QR  KLS+
Sbjct: 313 -QRPKKLSI 320


>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
           purpuratus]
          Length = 455

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 68/290 (23%), Positives = 122/290 (42%), Gaps = 26/290 (8%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           +E    L  WM ++G   CK    +    ++  R +      ++L+ GD+   +P  L+V
Sbjct: 40  DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95

Query: 135 TLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           T + +L  E +  ++     K +    +  +L+ E+ +GK SFW PYI  L +       
Sbjct: 96  TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148

Query: 193 AVESPLLWSETELAYLTGSPT--KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
              +P   S T+ A     PT  ++  + + + I+  +     ++      F QY     
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIFFS 206

Query: 251 TEAFTFEIF----KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--SKCKAMLAA 304
            ++F +  F    +  ++    C     K S    FAL P    LL +S  ++  A    
Sbjct: 207 LDSFVWAWFVINSRSVYIEPSGCEAFDPKAS--DDFALAPFLD-LLNHSPGAEVTAGFDP 263

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           V +  ++     Y A + + +  GP  N  LL+ YGFV   NP+D +  E
Sbjct: 264 VSNCYRIKTLDSYHAYDQVFIHYGPHDNVNLLLEYGFVIPSNPHDAVSFE 313


>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
 gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
          Length = 454

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 73/275 (26%), Positives = 116/275 (42%), Gaps = 43/275 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V ASED++  +  F +P S  +++E     + I ++    KL+    L LY+M   K G 
Sbjct: 42  VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
            + W PY   L  Q       ++S ++W++ EL  L GS    +I     G + +Y E  
Sbjct: 97  MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQEKL 147

Query: 231 -------------LDT---VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
                         DT    +   G L   Y +D P ++F+    +            L 
Sbjct: 148 KPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDEDIEHDDLY 202

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWCGPQPNS 333
              L +  A+VPL   L A++  C A L A DD    +   +P K GE +    G  PN 
Sbjct: 203 NEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYNTYGELPNC 260

Query: 334 KLLINYGFV-DEDNPYDRLVVEAALNTEDPQYQDK 367
             L  YG+V +E   +D  +VE +++     Y +K
Sbjct: 261 DFLRRYGYVENEGTEFD--IVEFSMDEISDFYANK 293


>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
           heterostrophus C5]
          Length = 476

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 92/379 (24%), Positives = 155/379 (40%), Gaps = 55/379 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           VAA +D+   +  FS+P S ++++E  + +  I    T   L     L L ++YE   G 
Sbjct: 40  VAAKQDIAEHELLFSIPRSSILSVENSILSTEIPPT-TFALLGPWLSLILVMLYEYHNGS 98

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
            S W PY   L           ++ + W+E EL  L  S    +I               
Sbjct: 99  ASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLP 151

Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
                        ERA+ + +E    + +  M   GSL   Y +D+   A + +   +  
Sbjct: 152 VIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210

Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
            A +     L K        +VPL   L A + +C A L    D +++   +P +AGE I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEI 263

Query: 324 VVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE----DPQYQDKRMVAQRNGKLS 378
               GP P S LL  YG++ E+   YD + + A L ++    D  + +KR+      ++ 
Sbjct: 264 FNDYGPLPRSDLLRRYGYITENYAQYDVVEIPADLVSQALAHDGLWHEKRIEYLDEQEIV 323

Query: 379 VQVFHVHAG---REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 435
              + + A      +E++S  L  L    +  + E +  + S G + P +  M     + 
Sbjct: 324 DTGYDIAASVPFSLEESLSPELVILVETMLLPSEEFER-LQSKGRL-PKAEKMTGKAAEI 381

Query: 436 LADYFKARLAGYPATLSED 454
           L    +AR+A YP TL +D
Sbjct: 382 LYKIVQARIAQYPTTLEQD 400


>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
          Length = 436

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/369 (21%), Positives = 156/369 (42%), Gaps = 51/369 (13%)

Query: 81  LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           L+ W+ + +G+   KV L+  P           V A+  L+ G+  F +P S  +  E V
Sbjct: 29  LRIWLEEEHGVDMSKVDLQRSPLEG------LGVFANRRLEPGETLFMIPKSCCIYPELV 82

Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
             +  + +  +  KL+       E+  LA +L  EK +G +S + P+I  L         
Sbjct: 83  FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
               PLLW++ E+  L G+    EIL   E ++      + V    G  ++Q+   I TE
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKG--WKQFFQTIETE 191

Query: 253 AFTFEIF----KQAFVAVQS----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
             T E F    + AF +V S      +      L  R  +    P L  ++         
Sbjct: 192 KMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSIT 247

Query: 305 VDDAVQLVVDRPY-----------KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
            D A++   ++ +           + GE +  + G +PN  +L  YGFV  +       +
Sbjct: 248 FDTALERDNEKGFPVRVADKGKSIEEGEELFGFYGDKPNWNMLTTYGFVSPNPKCQETTL 307

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSV-QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 412
             +++ +DP +  K  + +  G ++V Q+F +    + + +  ++ Y R+  +S+ +++ 
Sbjct: 308 SVSIDEKDPYFAQKEEILKARGMVAVEQLFDIR--HDTDPMGPLINYFRIREISNEADLT 365

Query: 413 SVISSLGPI 421
            V ++ G +
Sbjct: 366 KVQTNYGEM 374


>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
           carolinensis]
          Length = 311

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 18/197 (9%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + +KAGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 9   DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 68

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 417
            K  V  R G  +  VF +HA  E    + +L +LR+  +++    + +I         +
Sbjct: 69  MKAEVLARAGIPTSSVFALHA-TEPPISAQLLAFLRVFCMTEDELKEHLIGEHAIDRIFT 127

Query: 418 LGPI-CPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
           LG    PVS   E     +L  + +AR    L  Y  T+SED+A L   +L     +A +
Sbjct: 128 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTVSEDKAFLGTQDLTCNATMAIK 183

Query: 473 LVRMEKKMLNACLQVTA 489
           L   EK++L   ++  A
Sbjct: 184 LRLGEKEILEKAIKSAA 200


>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 713

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +LM +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185

Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
             +K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++    
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
              L+P    +L +    K    A    V  +V     AG+ I    GP+ N +L++NYG
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 304

Query: 341 FVDEDNPYDRLVV 353
           F   +NP D  +V
Sbjct: 305 FCLPNNPCDYRIV 317


>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
          Length = 703

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 7/193 (3%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +LM +  QGK+ FW PYIR L  Q G    A+ +PL +   +L +L G+       ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175

Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
             +K +Y  + T    AG    ++Y +D+   A T  + +     V S V+   ++    
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
              L+P    +L +    K    A    V  +V     AG+ I    GP+ N +L++NYG
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 294

Query: 341 FVDEDNPYDRLVV 353
           F   +NP D  +V
Sbjct: 295 FCLPNNPCDYRIV 307


>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 308

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 69/278 (24%), Positives = 122/278 (43%), Gaps = 37/278 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
           + A ED++ G+    +P++ ++T+ER +    +       +L E + LA +L  +    +
Sbjct: 25  LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82

Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
            G +S  +  Y++ L R+ G         L W E ++   L GSP++    ER   +   
Sbjct: 83  NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
             E+            ++P   P         + AF  + S ++ L   +     ALVP 
Sbjct: 136 IEEIRA----------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPW 177

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 345
              +L +   C A +      V L  DR YK GE +    G +P+++LLI+YGF  E  +
Sbjct: 178 AD-MLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGE 236

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVA-QRNGKLSVQVF 382
           NP D   +   ++  D +Y D +  A ++ G   V+ F
Sbjct: 237 NPDDEYEITLGIDPND-RYADAKAAALEKIGLRPVESF 273


>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
          Length = 514

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 115/277 (41%), Gaps = 24/277 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASE +  GD A  +P SL+++ E +  +E    L   N ++    L L+ M E+      
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    LA L G+    EI++  + ++++Y+EL  +
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +P     +  T++ F  A     S  + +   S      LVP+   L   
Sbjct: 265 ------LCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 318

Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDR 350
            S        VD+A   ++  + RP  AGE   +  G  P S LL  YGF+   DNPYD 
Sbjct: 319 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLLTFYGFLPRGDNPYDV 378

Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
           + ++      D    D+ + AQ +   S Q  H+  G
Sbjct: 379 IPLDL-----DTSADDEDITAQSSATTS-QTTHMVRG 409


>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 339

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 108/250 (43%), Gaps = 18/250 (7%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
           + AS +++ G+    VP+  V+  E      VL  E + +     ++ E+  L + +M+E
Sbjct: 68  LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           + +G +S W PY+  L             PL W   E   L G+    ++L RA+     
Sbjct: 128 RWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDA 181

Query: 228 YNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
             ++  +W  + G    ++P   +P     +E+++ A  AV S    L      +  A+V
Sbjct: 182 PTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQAMV 238

Query: 286 PLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           P+   L   +      L   +    +Q++  R   AG  +V   G   N++LL  YGFV+
Sbjct: 239 PVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYGFVE 298

Query: 344 EDNPYDRLVV 353
             N Y+ + V
Sbjct: 299 RANRYNHIPV 308


>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 470

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 98/397 (24%), Positives = 163/397 (41%), Gaps = 53/397 (13%)

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
           FS+P S V      L  +    L  + +L+E   + LAL  ++  + Q   S W PY+  
Sbjct: 55  FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-----------EGIKREYNEL 231
           L  Q       + S + WS++ELA L  S    +I ++            +G++    E+
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
                   S+   Y +DIP  +       +   A    V      +K  L+    ++PL 
Sbjct: 162 ---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEKTILS----MIPLA 214

Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNP 347
             L A + +  A L   ++ +++   +P   GE I    G  P S LL  YG+V D  + 
Sbjct: 215 DMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDLLRRYGYVTDGYSA 274

Query: 348 YDRLVVEAAL----------NTEDPQY-QDKRMV----AQRNGKLSVQVFHVHAGREKEA 392
           YD   + A L          +   P+  QDK  V    A+R G        VH+  ++ +
Sbjct: 275 YDVAEISAELIVSLFRNGKVHPSLPKLTQDKLKVRLDLAEREGVYDESFDLVHSSPDEPS 334

Query: 393 ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
           I D L       + D S +++++ S   +   S          LA   +AR   Y  T+ 
Sbjct: 335 IPDELLAFLYLLLVDESHLKAILDSESSLPSRSKLTTELAGQVLAILLQARENEYSTTVE 394

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
           EDE +L + +L  +  +A Q+   EKK+L A ++  A
Sbjct: 395 EDEDLLKNADLPIRTAMAIQVRSGEKKVLRAAIREAA 431


>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
 gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
          Length = 540

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 65/252 (25%), Positives = 105/252 (41%), Gaps = 34/252 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
           + AS+D+   +  F +P S ++       N T ++L       T KL EL+    L + +
Sbjct: 90  IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142

Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 215
           MYE K  + +S W  Y   L        L     + W++ EL++LT        G     
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197

Query: 216 EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
            +  R      E+NE D +    GS+    F   P  I   +F  EI             
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
             ++       +++PL   L A + KC A L    D+++++  +P K GE +    G  P
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTYGELP 316

Query: 332 NSKLLINYGFVD 343
           NS+LL  YG+V+
Sbjct: 317 NSELLRKYGYVE 328


>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 96/429 (22%), Positives = 173/429 (40%), Gaps = 73/429 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  ++Q G+  F +P+ +V+ ++    N+ +A+ L    L     L + ++YE   G+
Sbjct: 50  VVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKN--LGPWLSLVVVMIYEYSLGE 107

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
           +S W  Y + L  +        ++ + WSE E + L  S           + +I E+   
Sbjct: 108 QSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLP 160

Query: 224 IKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
           + R + +L            DT       +    GSL   Y +DI          +  ++
Sbjct: 161 LVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYL 220

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
                   L K        +VPL   L A + +  A L   + A+ +   +P KAG+ I 
Sbjct: 221 TDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIF 271

Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRN 374
              G  P S LL  YG+V  DN     VVE  L          N ED +Y   +++ Q  
Sbjct: 272 NDYGELPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGFDNIEDKEYPQLKLLDQL- 329

Query: 375 GKLSVQVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
            ++    + +     ++ + D+LP     L      D+ E+Q ++S      P+    E 
Sbjct: 330 -EILEDGYCILRPSPEDTLLDILPDELLALLKTLTLDSEELQRLLSKNKHPKPILGAREA 388

Query: 431 AVLDQLADYFKARLAGYPATLSEDEAMLTDY-------NLHPKKRVATQLVRMEKKMLNA 483
            +   L D  ++++  Y  T+ ED+ +L  +           ++ +A Q+   EK++L A
Sbjct: 389 RI---LLDAAQSKMGQYGTTIQEDKILLQQFASSSVLRTRERRRHMAVQVRVGEKEILQA 445

Query: 484 CLQVTADMI 492
            L +  D +
Sbjct: 446 LLMMLQDFL 454


>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
 gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
          Length = 455

 Score = 62.0 bits (149), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 90/395 (22%), Positives = 164/395 (41%), Gaps = 38/395 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P++ + T+++   +  +  +L   +  LS    LALYL++ K  
Sbjct: 28  VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                     R L  +  R  +A       + + +++ EL    GS   A   +  + + 
Sbjct: 87  ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
            +Y +L        +L  Q+    P + FT E +K A  ++ S  +       A    + 
Sbjct: 137 DDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSETASVRLVA 190

Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           PL   +L +S    +C A      D + ++  + Y+ G+ I ++ G  PN++LL  YGFV
Sbjct: 191 PLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFV 248

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 401
             DNP D   +    +   P Y+ K  +    G  S     +     K+ + +++L YLR
Sbjct: 249 LPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLR 305

Query: 402 LGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
           +  + D S +  +   L  G    V+   E  VL  L D   + L G+   L + EA L 
Sbjct: 306 IQRL-DESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSIGSLLEGFGIPLEKLEAQLV 364

Query: 460 --DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
             DY        A  +   E+++L    +   D++
Sbjct: 365 AGDYPAGGNAWAAAHVSAGEQRVLTRAKKTAEDLL 399


>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 502

 Score = 62.0 bits (149), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 92/405 (22%), Positives = 165/405 (40%), Gaps = 65/405 (16%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           A  D+   +    +P  L +  + V    +GN         + L     +AL+L+ EK +
Sbjct: 84  AERDIARNEVVLEIPKKLWINPDAVAASDIGN-------VCSGLKPWISVALFLIREKLK 136

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLW-----------------SETELAYLTGSPT 213
            + S W PY+  L            S + W                 SE ELA L G+  
Sbjct: 137 KEGSTWWPYLDILPD-------TTNSTIYWWVLLVAFYVLVLSFQRRSEEELAELQGTQL 189

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
               L   E ++RE+ +++    +     + +P  I     T + F  AF  ++S     
Sbjct: 190 LRTTLGVKEYMQREFAKVEEEILLPHK--ELFPSPI-----TLDDFLWAFGILRSRAFSR 242

Query: 274 QKVSLARRFALVPLGPPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
            +    +   L+PL   +           AY  K   + +  +    L      K+GE +
Sbjct: 243 LR---GQNLVLIPLADLINHSPDITTEDYAYEIKGGGLFSR-ELLFSLRSPISVKSGEQV 298

Query: 324 VV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           ++ +   + N++L ++YGF+++    +   +   ++  DP + DK  +A+ NG      F
Sbjct: 299 LIQYDLNKSNAELALDYGFIEKTPDRNTYTLTLQISESDPFFGDKLDIAETNGSGETADF 358

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSE--MQSVISS--LGPI-CPVSPCMERAVLDQLA 437
            +  G        MLPYLRL  +  T    ++S+  +   G +  P+S   E  +   + 
Sbjct: 359 DIVLGNPLPPA--MLPYLRLVALGGTDAFLLESIFRNTIWGHLELPISRANEELICRVVR 416

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
           D  K+ L+GY  T+ EDE  L   +L+P+  +A  +   EKK+L 
Sbjct: 417 DACKSALSGYHTTIEEDEK-LEAADLNPRLEIAVGIRAGEKKVLQ 460


>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
 gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
          Length = 403

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 90/394 (22%), Positives = 168/394 (42%), Gaps = 43/394 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGK 172
           A+ DLQ      +VP  L++++     +     +   + L  +    LAL+++ E+   +
Sbjct: 26  ATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLCERLH-E 84

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L R       +  + L +S  ++  L GSP+  E L++  GI ++Y    
Sbjct: 85  DSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY---- 133

Query: 233 TVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
            V+F            +P + +FTF+ F+ A   V +    ++  S     AL+P+    
Sbjct: 134 -VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALIPM---- 188

Query: 292 LAYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
               + C        DD+   V+ +  +P +AG+ + ++ G + N+  L + GFV  +  
Sbjct: 189 WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNADRLFHNGFVYTEAE 248

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 406
            D + ++  ++  D  Y  K  +    G L          R  E IS ++  +LR+ +  
Sbjct: 249 EDWVNIQLGVSKNDRLYAMKAQILAMVG-LDASGRSYRVLRGPEPISPELRIFLRV-FSM 306

Query: 407 DTSEMQSV--------ISSLGPICPVSPCMERAVLDQLADYFKAR----LAGYPATLSED 454
           +T E++          ++ L  +C     +      +L  +F  R    L  Y  T  ED
Sbjct: 307 NTGELKPYLFNPEGLPVTPLAELCKAEFTLSEENELKLWSFFHTRLQLILGQYKTTKQED 366

Query: 455 EAMLT--DYNLHPKKRVATQLVRMEKKMLNACLQ 486
           EA+L+  D  LH   R   +L   E+ +L + L+
Sbjct: 367 EALLSRDDNTLH--TRNCIRLRMSERDILVSALE 398


>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1033

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/295 (23%), Positives = 116/295 (39%), Gaps = 57/295 (19%)

Query: 70  VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           VV +   D  D   SWM  NG            S +EK    H +A    L  G      
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562

Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
               VV L+ +   ET+  L           +   ++      A  L+ EK +G  S W 
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
            YI  L +        +  P+L  + EL  +   P   E+++  + I+  ++ L +V  +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670

Query: 238 AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
           AG+ F++Y +    + + AFT  +F     A    + ++  ++    +    +  P+   
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG- 729

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
                         V++V  R  K GE +    GP+ N  L + YGFV +DNP+D
Sbjct: 730 ------------GKVEIVARRDIKKGEELFASFGPRANDNLFLYYGFVLKDNPFD 772


>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 455

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 90/395 (22%), Positives = 164/395 (41%), Gaps = 38/395 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P++ + T+E+   +  +  +L + +  LS    LA+YL++ +  
Sbjct: 28  VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                     R    +  R  +A       + + ++E EL    GS   A   +  + ++
Sbjct: 87  ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
            +Y +L         L  Q+    P + FT E +K A  ++ S  +    VS      LV
Sbjct: 137 DDYRQLLV------PLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDF-AVSGTTSVRLV 189

Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
                +L +S    +C A      D + ++  + Y+ G+ + ++ G  PN++LL  YGFV
Sbjct: 190 APLADMLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQVFIYYGSVPNNRLLRLYGFV 248

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI-SDMLPYLR 401
             DNP D   +    +   P Y+ K  +    G  S     +     K+ + +++L YLR
Sbjct: 249 LPDNPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTV---KDPLPNNVLRYLR 305

Query: 402 LGYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLT 459
           +  + D S +  +   L  G    VS   E  VL  L D   + L G+   L + EA L 
Sbjct: 306 IQRL-DESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGSLLEGFGIPLEKLEAQLA 364

Query: 460 --DYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
             DY        A  +   E+++L    +   D++
Sbjct: 365 AGDYPAGGNAWAAAHVSAGEQRVLTRAKRTAEDLL 399


>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 567

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 96/460 (20%), Positives = 174/460 (37%), Gaps = 58/460 (12%)

Query: 65  AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           A S ++V   E  L  +   W+   G     CKV +    S          + A++D++ 
Sbjct: 56  ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109

Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
           G+    +P++L +T            +L  ++L +      L+++L+ E      S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELSN-PTSEWGP 168

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YI+ L +Q         +   W   E     GSP     +    G  R+Y       ++ 
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY------CYLY 215

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLGP--PL 291
             + +     +P  +FT++ F  A   VQS     Q    A        AL+P       
Sbjct: 216 SMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFWDFCNH 271

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
            +  SK  +      + +     + +K GE + ++ GP+ N++LL++ GF  + N +D  
Sbjct: 272 SSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTNLHDSY 331

Query: 352 VVEAAL--NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
             E  L     + ++    ++ +R  +  V V         E   +++P+ R+ Y     
Sbjct: 332 PFELHLLEGNHEIRHDKVHLLEERGIRDGVVVNLNQNPTSNELPLELIPFYRI-YALSEQ 390

Query: 410 EMQSVIS---------------SLGPIC--PVSPCMERAVLDQLADYFKARLAGYPATLS 452
           E +++                  L P+    ++   E      L    K +LA YP TL 
Sbjct: 391 ETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENEEKAYSNLVQALKGKLASYPTTLE 450

Query: 453 EDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           EDE  L   N    +R        EKK+L+  ++    +I
Sbjct: 451 EDEQELKK-NPPANQRFILYTKINEKKILDRNIKYLESLI 489


>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
          Length = 456

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 110/247 (44%), Gaps = 20/247 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
           A+ DL+ G+   ++P + ++T E VL +     + L    +S L  L  +L+ E+  G++
Sbjct: 64  ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGER 123

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PY+  +             P+ W E E+ +L  +P + + LE+    K E  EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KTEVQELHT 171

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA---LVPLGP- 289
                 +  Q    D   + +T++  + A+  V +  V+++     R  A   +  L P 
Sbjct: 172 ESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALAPY 231

Query: 290 -PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
             LL +S + +       D    ++  +   +  +   +  GP  N +LL+ YGFV  +N
Sbjct: 232 LDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANN 291

Query: 347 PYDRLVV 353
           P+  + V
Sbjct: 292 PHRSVYV 298


>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
 gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
          Length = 460

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 75/382 (19%), Positives = 166/382 (43%), Gaps = 55/382 (14%)

Query: 133 VVTLERVLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLP 178
           ++    +L NE I E+              + +N     + LA+ L+ E  + KKSFW P
Sbjct: 102 LIASRNILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFP 161

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
           YI  L  +     L       W   EL ++ GS          + +K +Y  ++      
Sbjct: 162 YIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNK----- 211

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
             +FQ+     P++ F ++ ++ +   + S  + LQ+    ++  L+P    LL ++   
Sbjct: 212 -EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQE---TKKVVLIPY-IDLLNHNPFS 266

Query: 299 KAMLAA----VDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
            + ++     + D+ ++VV  D+     + + +  G + N +LL  YGF+ E NPYD ++
Sbjct: 267 SSFISYRKIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIAERNPYDSVI 326

Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL----GYVSDT 408
           +  +++ +D  +++K+     N K     + +   +  +   +M+ ++++      ++D 
Sbjct: 327 IRISMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPD---EMIEFIKICLFNTNINDK 383

Query: 409 SEMQSVISSLGPICPVSPC----MERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
           +   + I +      +  C    +E+++     DY   R       L E+   ++D    
Sbjct: 384 NFNLNKIENYDYTKIIKSCIVTVIEKSLNSNYNDYENLR----NIMLKENLLHISD---- 435

Query: 465 PKKRVATQLVRMEKKMLNACLQ 486
             ++++ +   +EKK+LN  L+
Sbjct: 436 -NQKISIKYNALEKKILNRFLE 456


>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 442

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 117/249 (46%), Gaps = 20/249 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A+ DLQ G+   S+P+S ++T E VL +     + T +  +S L  L  +L+ E+   
Sbjct: 62  LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           ++S W PY+  L         +   P+ W E+E+  L  +P + + LE+   +K  + E 
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTE- 172

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
              W    SL   +  +I T+ +T+   + A+  V +  V   H ++  L+ +  +  + 
Sbjct: 173 --SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMA 229

Query: 289 PPL-LAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
           P L L   S    + AA ++     ++  +   +  +   +  GP  N +LL+ YGF+  
Sbjct: 230 PYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFIAA 289

Query: 345 DNPYDRLVV 353
           +NP+  + V
Sbjct: 290 NNPHRSVYV 298


>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
           FGSC 2508]
 gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 504

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 123/302 (40%), Gaps = 40/302 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           S  +E    L  W HK+G    P  +V   E    + + +P    +A+E L +G  A S 
Sbjct: 3   SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58

Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
           P S+ ++    L +  I    TT               N L        YL+ +  +GK 
Sbjct: 59  PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           SFW PYI  L       + A+  P  W+E ++  L G+     I E    +K EY +   
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176

Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARRFALV 285
           +    G      + Q  Y+     FT   F+ + V  +S   +++++    S    F+++
Sbjct: 177 ILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDDFSIL 236

Query: 286 PLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
               PL      ++ +     L +   A +L+ +  Y  G+ +    G + NS+LL+ YG
Sbjct: 237 ---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELLLGYG 293

Query: 341 FV 342
           F+
Sbjct: 294 FI 295


>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
 gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
          Length = 504

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/307 (23%), Positives = 121/307 (39%), Gaps = 50/307 (16%)

Query: 72  SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           S  +E    L  W HK+G    P  +V   E    + + +P    +A+E L +G  A S 
Sbjct: 3   SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58

Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
           P S+ ++    L +  I    TT               N L        YL+ +  +GK 
Sbjct: 59  PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           SFW PYI  L       + A+  P  W+E ++  L G+     I E    +K EY +   
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176

Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
           +    G      + Q  Y+     FT   F+ + V  +S   ++++        L+P G 
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVER--------LLPEGT 228

Query: 290 ---------PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
                    PL      ++ +     L +   A +L+ +  Y  G+ +    G + NS+L
Sbjct: 229 KIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSEL 288

Query: 336 LINYGFV 342
           L+ YGF+
Sbjct: 289 LLGYGFI 295


>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
          Length = 318

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
             +VP+   +L   ++  A +   +D + +V  RP KAGE I+ + GP PNS+LL  YG+
Sbjct: 71  LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129

Query: 342 VDEDN--------PYDRLVVEAALNTE---------------DPQ-YQDKRMVAQRNGKL 377
           V   +        P+D  +V++ L  +               DP+ ++D  ++ + +G+ 
Sbjct: 130 VTPKHSRYDVVEIPWD--LVQSILTEQLRLTDDVWKQLAEHVDPEDFEDVFVLERDSGEP 187

Query: 378 SVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
             +       + +E  +++   L+       + ++++    G + P     +      +A
Sbjct: 188 DSEGRLTTPAKVQEVSAELEEQLK-------AVLKAIKKVRGDLIPDKRKRDEVYQHVVA 240

Query: 438 DYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
              +  LA YP T  EDEA+L   NL  ++R+A ++   EK++L   LQ+  
Sbjct: 241 AALQKLLAQYPTTAEEDEALLASGNLTSRQRMAVEVRLGEKRLLKEALQMDG 292


>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
          Length = 440

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/323 (23%), Positives = 130/323 (40%), Gaps = 34/323 (10%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           SR V    + +  +LK W+         +I    P           + +   LQ G    
Sbjct: 22  SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
           S+P S +++ + V+    +   +T  K   S L  L  +L+ EK  G +S W PY+  L 
Sbjct: 76  SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
           +           P+ W E E+  L   P +A+  E+   ++  +      +     LF +
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186

Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--S 296
              +I    FT+     A+  V +  V+L+   L R F+  P    L P   LL +S   
Sbjct: 187 AVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEPDTCALAPYLDLLNHSPDV 241

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
           + KA         ++V     +  E + +  GP  N +LL+ YGFV   NP+  + V   
Sbjct: 242 QVKAAFNEKTRCYEIVAVSSCRKHEEVFICYGPHDNHRLLLEYGFVSTRNPHACVYVSRD 301

Query: 357 LNTEDPQYQDKRMVAQRNGKLSV 379
           +  +     DK+M    N K+S+
Sbjct: 302 ILVKYLPSTDKQM----NKKISI 320


>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 705

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 116/251 (46%), Gaps = 14/251 (5%)

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           ++FT+E ++ A  +V +    +     +R   AL+PL       +          DD  +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
            V  + ++AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y  K  V
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYAMKAEV 465

Query: 371 AQRNG---KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISSLG 419
             R G    +   VF +H   E    + +L +LR+  +++    + +        I +LG
Sbjct: 466 LARAGIPTYVWSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGENAIDRIFTLG 524

Query: 420 -PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEK 478
               PVS   E  +   L D     L  Y  T  ED+A+L + +L  + ++A +L   EK
Sbjct: 525 NSEFPVSWENEVKLWSFLEDRASLLLKTYKTTTEEDKAVLKNPDLPARTKMAIKLRLGEK 584

Query: 479 KMLNACLQVTA 489
           ++L   +Q  A
Sbjct: 585 EILEKAVQSAA 595


>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 411

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 14/182 (7%)

Query: 315 RPYKAGESIVVWCGP---------QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           R   A + +VVW G          +PN +LL+  G + ++N  D L   A L   D  Y 
Sbjct: 166 RAAGARKGVVVWDGAGSEMLLNDGRPNGELLLATGTLQDNNSSDFLSWPAGLVPADRYYM 225

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVS 425
            K  V +  G  + + F V+A R       +L YLRL  V+D + +     +      +S
Sbjct: 226 MKSQVLESMGYSAAEEFPVYADRMP---IQLLAYLRLSRVADPALLAKC--TFEADVELS 280

Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACL 485
              E  +L  L    + RLA Y  +  ED  +    +L PK+R+A +L   EK+++NA +
Sbjct: 281 QMNEYEILQILMGDCRERLASYTKSYEEDVKIAQQSDLSPKERLAVKLRLGEKRIINATM 340

Query: 486 QV 487
           + 
Sbjct: 341 EA 342


>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 492

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 55/280 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+  G+  FS+P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G++S W PY + L +       + ++ + WS +EL  L GS   ++I    EG +    
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151

Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
             D++  M   + +  P   P+         EA +  + + A +    + +    ++KV 
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209

Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
                                    +VPL   L A + +  A L   DD++ +   +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269

Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
            GE I    G  P + LL  YG+V  DN     VVE +L+
Sbjct: 270 VGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD 308


>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 492

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 55/280 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+  G+  FS+P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G++S W PY + L +       + ++ + WS +EL  L GS   ++I    EG +    
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151

Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
             D++  M   + +  P   P+         EA +  + + A +    + +    ++KV 
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209

Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
                                    +VPL   L A + +  A L   DD++ +   +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269

Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
            GE I    G  P + LL  YG+V  DN     VVE +L+
Sbjct: 270 VGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD 308


>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 475

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/281 (25%), Positives = 113/281 (40%), Gaps = 52/281 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           V A+++++  +  F +P S V+++E  +L  E      T + L     L L ++YE   G
Sbjct: 40  VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------------- 217
             S W PY   L  +         + + WSE ELA L  S   A+I              
Sbjct: 98  DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANEAFLGQLV 150

Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
                             ++AE ++ E N   T+    GSL   Y +DI           
Sbjct: 151 PVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT------P 202

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           +  V  +      +  +L +   ++PL   L A + +C A L      +++   +P KAG
Sbjct: 203 RKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKALKPIKAG 260

Query: 321 ESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEAALNTE 360
           E I    GP P S LL  YG+V E+   YD + V   L +E
Sbjct: 261 EEIFNDYGPLPRSDLLRRYGYVTENYAQYDVVEVPMELVSE 301


>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
          Length = 1233

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 146/375 (38%), Gaps = 82/375 (21%)

Query: 111  HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
            H   A+ D+  G+    VP SL +T   V G +   E+L   + SEL  LAL+LM E+ +
Sbjct: 871  HGFVAARDVGQGEVLLQVPGSLAITAVDV-GKDAQLEVLARGR-SELVGLALWLMQERAK 928

Query: 171  GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
                                  A  +P+LW + E    L GSP   E   R + +++E+ 
Sbjct: 929  ----------------------ATLTPILWPDEERQQLLRGSPVLEEARTREQALRQEWQ 966

Query: 230  ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLG 288
            ++  +        + YP  +  E        QAF+   S V+ H   +  A+ FAL+PL 
Sbjct: 967  DIAAIAAQTSGGPEAYPAVVYNE--------QAFLEAMSVVLAHAAYLPKAQCFALLPLV 1018

Query: 289  PPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
              L    S   A+L      +AV +V  R    G+ + ++C                   
Sbjct: 1019 GGLCRTGSSSGALLDYDLEREAVTVVAQR--TPGQEVALYC------------------- 1057

Query: 347  PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVS 406
                L + A+L   D  Y  KR + +  G      F +    ++ A   ++ +       
Sbjct: 1058 ----LFMAASLVAADRLYTTKREILEELGLGVKAEFPIF--EDRLATQQLINF------- 1104

Query: 407  DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPK 466
               E  ++I         SP  E  +L  L    + R+  Y     +D   L   +L P+
Sbjct: 1105 ---EQDTII---------SPENEYEILQLLMGDLRDRIQAYATEFDDDIKDLQRTDLTPR 1152

Query: 467  KRVATQLVRMEKKML 481
            +R+A QL   EK++L
Sbjct: 1153 QRLAAQLRLGEKRIL 1167


>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
 gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 77/302 (25%), Positives = 120/302 (39%), Gaps = 51/302 (16%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
           +W+ K+G      I  E   + +  R    V AS+++   +  F +P + ++++E  +L 
Sbjct: 13  AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E  A  L+   L     L L ++YE   G  S W PY   L  +         + + W+
Sbjct: 70  TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120

Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
           E ELA L  S    +I                            E+A+   +E      +
Sbjct: 121 EDELAELQASAVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNL 180

Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             M   GSL   Y +D+     T E+ ++   A +     L K        +VPL   L 
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
           A + +C A L    D +++   +P +AGE I    GP P S LL  YG+V  DN     V
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDV 291

Query: 353 VE 354
           VE
Sbjct: 292 VE 293


>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
 gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
          Length = 604

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/231 (25%), Positives = 112/231 (48%), Gaps = 35/231 (15%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           T++ +    L+L+L+ EK +GK SFW PYIR L         +  +P+ ++E+EL  L+ 
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282

Query: 211 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
           S     + E+A  +K+E    +N+L+   F+   L +          FTF+ F+ A+  +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329

Query: 267 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 317
           ++  ++ +      L+ +      LVP+   L+ +S   KA           ++ V  PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED-PQYQDK 367
           +  + + +  G + N++L++ +GF   +NP D + +  +   E  PQ  D+
Sbjct: 389 RKYDQVFISYGFEENTELMLKFGFFVPENPKDFMKINLSEMLESLPQINDE 439


>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
 gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
          Length = 456

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 108/247 (43%), Gaps = 20/247 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKK 173
           A+ DL+ G+   ++P + ++T E VL +     + L    +S L  L  +L+ E+  G  
Sbjct: 64  ATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAGDC 123

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S W PY+  +             P+ W E E+ +L  +P + + LE+    K E  EL T
Sbjct: 124 SQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KTEVQELHT 171

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA---LVPLGP- 289
                 S  Q    D   + +T++  + A+  V +  V+++     R  A   +  L P 
Sbjct: 172 ESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALAPY 231

Query: 290 -PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
             LL +S + +       D    ++  +   +  +   +  GP  N +LL+ YGFV  +N
Sbjct: 232 LDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAANN 291

Query: 347 PYDRLVV 353
           P+  + V
Sbjct: 292 PHRSVYV 298


>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
           str. Neff]
          Length = 408

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 92/395 (23%), Positives = 153/395 (38%), Gaps = 45/395 (11%)

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
           A +    VP SL++     L  E +  +    K +  A      LAL++++E ++   SF
Sbjct: 4   ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PY   L  +       V  P+ W++ ++  L GSP  A +L + +  +  + E     
Sbjct: 63  WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTE----- 110

Query: 236 FMAGSLFQQYP--YDIPTE------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
                + ++YP  + +  +      +++   F+     + S       +       +VP+
Sbjct: 111 -HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPM 169

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
              +    +         DD  Q     V +PY  GE + +      N +LL  Y  + E
Sbjct: 170 ADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVE 229

Query: 345 DN-------PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
           DN         D   +      E  +    R + QR   L+ Q + V   + +E   D++
Sbjct: 230 DNYNKYTEIRLDETTIARICPDEVERLTKTRALTQRG--LAKQTYPV---KSEEFPLDLV 284

Query: 398 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
             LRL ++  T       S+     PVS   E  V D +A   K  L+ YP T  ED AM
Sbjct: 285 QALRLYHLPLTDSHTE--STCFETDPVSVQNELMVYDTIAGCVKELLSQYPITAQEDAAM 342

Query: 458 LT-DYNLHPKKRVATQLVRMEKKMLNACLQVTADM 491
           L  D  L    R+A    R +K  L     V A+M
Sbjct: 343 LAHDPRLSATARLAVAYRREDKLFLTEVGSVFAEM 377


>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
 gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
          Length = 400

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 60/255 (23%), Positives = 111/255 (43%), Gaps = 24/255 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  ++AG+    +P+ LV+T E++  ++ + +LL+T    +   L L ++ E+ +G+ S
Sbjct: 14  AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PY+  L            S + W + EL +L  +       ER E I  EYN +  V
Sbjct: 70  RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                      P+    +   F+ F  A+  V   V      +L+   ++ P       +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRPF-VDFCNH 172

Query: 295 SSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
                A ++     DA  ++ +R Y  GE + +  G + N+ L ++YGFV  +N  D+  
Sbjct: 173 DPVSHATVSHDTCKDAT-IIAERDYTKGEEVFISYGKRSNAVLAVDYGFVLPNNLSDQAE 231

Query: 353 VEAALNTEDPQYQDK 367
           +   +   DP  + K
Sbjct: 232 LWMEIPWNDPLREKK 246


>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 709

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 68/223 (30%), Positives = 99/223 (44%), Gaps = 24/223 (10%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
           A +LM +    +  FW PYIR L      G+  + +PL + E E  L +L G  + A   
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161

Query: 219 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 271
           ER   I     +R Y  L  + F      + Y +D+   A T  I  +AF A  + S + 
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
            L+   + R   L+PL   + A + K   K    A  D++ LVV     AGE +    GP
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGP 274

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 372
           + N +L++NYGF   DNP +  VV      + P  Q K    Q
Sbjct: 275 RNNEQLMMNYGFCIPDNPCEYRVVSLRAPLDSPLAQIKAQYEQ 317


>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
          Length = 1078

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 102/464 (21%), Positives = 187/464 (40%), Gaps = 97/464 (20%)

Query: 61  DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
           D  + G    ++ +++D  ++++W   +  P   +   EK            + +S D++
Sbjct: 12  DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
            G+   S+P    +++ +V   + +  L   NK+ +L   A        LY  Y  +   
Sbjct: 61  EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
              S W   + E+          + S L +S+ E   L GSP   +++ +    K  +  
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPL 287
           L        SLF++  +     A T++  K A+  V S  ++ +  +L      F  V L
Sbjct: 167 L-------KSLFKEQQFS--KCAMTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVL 217

Query: 288 GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGF 341
            P  L Y +  +   AA D      A+++V  +P K GE I +  G Q  NS LLI+YGF
Sbjct: 218 AP-FLDYFNHAEDAQAAYDFDYDESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGF 276

Query: 342 VDEDNPYDRLV---VEAALNT---EDPQYQDKRMVAQRNGKLS--VQVFHVHAGREKEAI 393
           +D+ +     V   VE  LNT    DPQ  +K  +  +  + +  +++F      E   I
Sbjct: 277 IDQSSTAKHCVNVLVEELLNTIPASDPQLIEKTELLTKAFEQNERMKLFKDSLTEELLKI 336

Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
           S  L Y                               ++L  L      ++  YP T+ E
Sbjct: 337 SKYLSYKNF----------------------------SLLPYLKSLIDMKMKAYPTTMEE 368

Query: 454 DEAML---TDYNLHPKKRVATQLVRMEK----KMLNACLQVTAD 490
           D A++   T++    ++   + ++R+++    K + A +QV  D
Sbjct: 369 DRAIIEATTEFEKLSQRSKMSIIMRLQEKETLKEIGALIQVKID 412


>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
          Length = 458

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 156 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 215

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
            K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I +
Sbjct: 216 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 274

Query: 418 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
           LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 275 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 334

Query: 477 EKKMLNACLQVTA 489
           EK++L   ++  A
Sbjct: 335 EKEILEKAVKSAA 347


>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 232

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           AL+P+       + +      A  D  +    R +K GE + +  GP+ NS   ++ GFV
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDRCECYALRDFKKGEQVFISYGPRTNSDFFVHSGFV 77

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 401
             DN  D   +   ++  D   +++  +  +    SV  F +  G   E ISD +L +LR
Sbjct: 78  CMDNEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPG--TEPISDTLLAFLR 135

Query: 402 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           + +    +E+   + S          C +   +E  V   L    +  +A YP TL ED 
Sbjct: 136 V-FSMRKAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
            +L +  L   K++A QL   EK++L+  L+     I
Sbjct: 195 ELL-ETTLPQMKKMAVQLRVTEKRILSGALEYVEQWI 230


>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
 gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
          Length = 266

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 103/242 (42%), Gaps = 29/242 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK-- 172
           AS  ++AG+    +   L++    + G  +     T    ++LA + L   Y K Q K  
Sbjct: 25  ASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERY-KGQAKLQ 83

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I  E+ ++ 
Sbjct: 84  SSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQ 138

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
               +   LF +          + E FK  +  V S     + +++     LV +  P+L
Sbjct: 139 NALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLVMI--PML 182

Query: 293 AY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
            +     +S  K     + +   +  DR Y   + I +  G   N++L ++YGF   +NP
Sbjct: 183 DFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYGFAVPENP 242

Query: 348 YD 349
           YD
Sbjct: 243 YD 244


>gi|406860468|gb|EKD13526.1| putative SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 474

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 97/410 (23%), Positives = 165/410 (40%), Gaps = 59/410 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-LALY-LMYEKKQ 170
           + A  D+   +  F++P   V+     LG+     +L      E+ C LAL  ++  + Q
Sbjct: 42  LVAQSDIGEDEVLFTIPRDAVLNTTTALGSADNPAIL------EMPCWLALTAIILTEGQ 95

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGIKREY- 228
            + S W PY+  L  +       ++S + WSE+EL  L  S    +I    AE +  E+ 
Sbjct: 96  QEDSKWAPYLALLPSR-------LDSLVFWSESELLELQASTVVNKIGRASAEQLFLEHI 148

Query: 229 ------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-----CVVHLQKVS 277
                 N    +     S+   Y +DIP         K+     +S      +V   +  
Sbjct: 149 SPLGLSNTNTEMCHKVASVVMAYAFDIPE--------KKGHDDPESPEDGDDLVSDNEEE 200

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
                +++PL   L A +    A L   ++ +++   +P   GE I+   G  P S LL 
Sbjct: 201 ENTILSMIPLADMLNADADGNNARLCCDNEELEMRSIKPISKGEEILNDYGQLPRSDLLR 260

Query: 338 NYGFV-DEDNPYDRLVVE-------AALNTEDP-------------QYQDKRMVAQRNGK 376
            YG++ D+   YD  V E       A+L+TE P             + + +  +AQR G 
Sbjct: 261 RYGYISDKYAAYD--VAELSTQSLLASLSTEQPLLAGGTLQPLSREKLEQRVELAQREGV 318

Query: 377 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 436
                   H G +  +I D L  L    + D   + ++ +S   +   S      V   L
Sbjct: 319 YEDSYDLTHPGPDDPSIPDELLALLYILLLDNENLAAIETSHASLPSRSKLATSLVGQIL 378

Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
               ++R   Y  T+  D+A+L   NL  +KR+A ++   EK +L   +Q
Sbjct: 379 TKILESRKQEYATTIEADQAILQADNLPSRKRMAVEVRLGEKLVLEKAIQ 428


>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
          Length = 536

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
            K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I +
Sbjct: 294 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 418 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
           LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 353 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 412

Query: 477 EKKMLNACLQVTA 489
           EK++L   ++  A
Sbjct: 413 EKEILEKAVKSAA 425


>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 499

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 78/191 (40%), Gaps = 12/191 (6%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  +G+ SFW PYI  L +        +  P  W E ++ +L G+     I E  
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168

Query: 222 EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 276
             IKREY +   V     F     + Q  Y      FT   F+ + +  QS   ++   +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP 331
             AR      +  PL   ++         D     +  QL+    Y+ G+ +    G + 
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKT 288

Query: 332 NSKLLINYGFV 342
           NS+LL+ YGF+
Sbjct: 289 NSELLLAYGFI 299


>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
 gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
          Length = 465

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 96/406 (23%), Positives = 164/406 (40%), Gaps = 52/406 (12%)

Query: 72  SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           +K+ E L + K W+  N     P   + L +K   +        + A + ++  D   S+
Sbjct: 34  TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85

Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
           P  ++++   + G      + I E + +  LS     A+++MY  K  +KSFW PY+  L
Sbjct: 86  PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142

Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
                    +  + L +S+ EL  L  S  K   + R +GI+R Y           S F 
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHYE----------STFS 185

Query: 244 QYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYS-SKCKA 300
           +    +P E     ++ Q  F    SCV   +  SLA     +VPL     A   SK K 
Sbjct: 186 RLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDGGMVPLADMFNAEDRSKSKV 243

Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA- 355
           +    D  +          GE I    G   P  +S++L++YGF+ DE    D + +   
Sbjct: 244 LPKVTDTTLDYYASDDIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGTVSDNVAITVP 303

Query: 356 ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
             + ++P    K+ + + N  ++ +VF +       A  D+L Y R+  +      Q+  
Sbjct: 304 VFHNDEPNLSTKQEILEENDIIN-EVFLLQKTDPLPA--DLLLYARVKNLIAKECDQAKK 360

Query: 416 SSLGP---ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
             L P     P++   E+  L  L +     L  Y   L  D+ +L
Sbjct: 361 HFLSPNTRNTPLNTRNEKVSLRFLENLIHRYLDSYGTNLESDKNLL 406


>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
          Length = 408

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 95/193 (49%), Gaps = 10/193 (5%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 106 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 165

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV--------ISS 417
            K  V  R G  +  VF +H+  E    + +L +LR+  +++    + +        I +
Sbjct: 166 MKAEVLARAGIPTSSVFALHS-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 224

Query: 418 LGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
           LG    PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 225 LGNAEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKIVLKNPDLSVRATMAIKLRLG 284

Query: 477 EKKMLNACLQVTA 489
           EK++L   ++  A
Sbjct: 285 EKEILEKAVKSAA 297


>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
 gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
 gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
          Length = 489

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 123/308 (39%), Gaps = 54/308 (17%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           +W+   G P  K+  K + +    H     V A  DL  G+  F++P + V++++    N
Sbjct: 22  TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             +  LL+ N   L     L + ++YE  QG +S W  Y R L R         ++ + W
Sbjct: 76  SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128

Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
           S +EL  L GS           +  ILE      RA        +G+     +  T   +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEESILETIAPIVRANPALFPPIDGVASYDGDAGTQALL 188

Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                 GSL   Y +DI  P +       +  ++  +                +VPL   
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEE--------EEQSSKGMVPLADL 240

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
           L A + +  A L   ++ + +   +P KAGE I    G  P S LL  YG+V  DN    
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDYGEIPRSDLLRRYGYV-TDNYAQY 299

Query: 351 LVVEAALN 358
            VVE +L+
Sbjct: 300 DVVELSLD 307


>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
           C-169]
          Length = 685

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 76/171 (44%), Gaps = 18/171 (10%)

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
           +E E++ L G+P     +E  + I+ +Y     V     +L   YP DI  +  T + F 
Sbjct: 65  TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPV---LQALTAAYPDDITPDLVTEDKFI 121

Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 312
            A     S  + ++ V  A R  LVP+         P ++ Y       L A  D+++L 
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRYGR-----LDAATDSLRLR 176

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 361
             R   AGE   +  GP PN KLL+ YGF   DNP+D   +  EA  N  D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHDTVPITFEAEKNEGD 227


>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 232

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 94/217 (43%), Gaps = 11/217 (5%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           AL+P+       + +      A  D  +    R +K GE + +  GP+ NS   ++ GFV
Sbjct: 18  ALIPMWDMCNHENGRITTDFNATSDHCECYALRNFKKGEQVFISYGPRTNSDFFVHSGFV 77

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYLR 401
             +N  D   +   ++  D   +++  +  + G  SV  F +  G   E ISD +L +LR
Sbjct: 78  YMNNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPG--TEPISDTLLAFLR 135

Query: 402 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           + +    +E+   + S          C +   +E  V   L    +  +A YP TL ED 
Sbjct: 136 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 194

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
            +L +  L   K++A QL   EK++L   L+     I
Sbjct: 195 ELL-ETTLPQIKKMAVQLRVTEKRILLGALEYVEQWI 230


>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
           972h-]
 gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
 gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
           pombe]
          Length = 547

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 106/247 (42%), Gaps = 31/247 (12%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L  +L  E  +G +S W  YI  L +          +PL ++E + A+L  +   +   E
Sbjct: 82  LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSL 278
           R    K EY E       A SL     +  PTE FTF+++   A V    C        +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179

Query: 279 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 332
            +     P+  PL+ + + K K  +    D     +VQL+       G  +    GP+ N
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGN 239

Query: 333 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG--KLSVQVFHVHAGREK 390
            +LL+ YGF   DNP+D + ++ A++ + P    K  + + +   +LS  VF +    +K
Sbjct: 240 EELLMGYGFCLPDNPFDTVTLKVAIHPDLPHKDQKAAILENDCQFQLSNLVFFLPKSPDK 299

Query: 391 EAISDML 397
           E    +L
Sbjct: 300 EIFQKIL 306


>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
           B]
          Length = 510

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 65/226 (28%), Positives = 98/226 (43%), Gaps = 31/226 (13%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           A+VP+   L A      A L   +  +++V  +P  AGE I    G  PNS LL  YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319

Query: 343 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV-FHVHAGREK- 390
           D          E NP D + + A L          R   +  G L V+V F +    +  
Sbjct: 320 DVVPLGEPLSGEGNPADVVEIRADL-----VVSAVRKARKAAGDLQVRVDFWLEEADDDT 374

Query: 391 -------EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
                  E   ++L ++RL  +S T +  + + + G +      +E  +L  + D  K R
Sbjct: 375 FVLMTDCEVPEELLSFIRL--LSLTKDEWNKVKAKGKLP--KGKLELELLPAIVDVLKER 430

Query: 444 LAGYPATLSEDEAML---TDYNLHPKKRVATQLVRMEKKMLNACLQ 486
           L  YP T+ EDE++L   +  NL   KR A  +   EK++L   LQ
Sbjct: 431 LKEYPTTIEEDESLLGPDSAVNLSFNKRNAVVVRLGEKRILRGALQ 476


>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
 gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
          Length = 464

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 150/375 (40%), Gaps = 62/375 (16%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L L L+YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 276
           R   + +  +EL        +  +  P +I   A  F        A  AV S    +  V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 326
           +     A++    PL+   +      A +++ +          ++V  R  + G +I + 
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LS 378
            GP  N  LL++YGFV  DNP+DR+     L  +    ++ RM+A   R G        S
Sbjct: 266 YGPLSNDLLLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPAS 321

Query: 379 VQVFH--------------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGP 420
            QV                V  G  +E    +L  LR+ +       E + ++S  + G 
Sbjct: 322 WQVDRLKQLGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGV 381

Query: 421 ICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKM 480
              VS   E  VL  L          +  T+ EDEA L+D +L    R+A Q    +K++
Sbjct: 382 ESMVSSDNEERVLRTLCGLGAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRL 441

Query: 481 LNACLQVTADMIMLL 495
           +   L+     +M L
Sbjct: 442 VVRVLESLKKRLMDL 456


>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
          Length = 538

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 92/193 (47%), Gaps = 10/193 (5%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + ++AG+ I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---------SEMQSVIS 416
            K  V  R G  +  VF +H   E    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
                 PVS   E  +   L D     L  Y  T+ ED+ +L + +L  +  +A +L   
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKTVLKNPDLSVRATMAIKLRLG 412

Query: 477 EKKMLNACLQVTA 489
           EK++L   ++  A
Sbjct: 413 EKEILEKAVKSAA 425


>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 93/466 (19%), Positives = 183/466 (39%), Gaps = 63/466 (13%)

Query: 58  SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE 117
            S   L   S+ +  +  +   +L  W+        KV ++ K   +E +R +    AS+
Sbjct: 20  DSESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQ 73

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSF 175
            ++ G+    VP +  ++LE V  +  I  +++  N + + +    +  + ++ + + SF
Sbjct: 74  FIRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSF 133

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PYI  L +           P  +   + A L GSPT   ++ + +  + EY+ L    
Sbjct: 134 WKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNLKE-- 185

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----------AL 284
             A   FQ+Y Y       T+  F           V  + ++++R F            L
Sbjct: 186 --AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPVYIGENEQQQLL 225

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFV 342
           VPL    + + +          DA    +   R  + GE +    G   N    +NYGF 
Sbjct: 226 VPLAD-FINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFA 284

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
              NP ++   +  L+  D  +   +M  +  G        +    + +     L  +R 
Sbjct: 285 SLTNPMNQFDFDICLDRNDRMF---KMKVELTGGNICWGNRLVNETDHDTFRQSLATVRF 341

Query: 403 GYVS---DTSEMQSVISSLGPICP---VSPC---MERAVLDQLADYFKARLAGYPATLSE 453
             +S   D  +++  + +     P    +P    +E+A    L D   + LA + +T+ +
Sbjct: 342 AQISKLDDFLQLEEDVQNYNQFWPGWHTTPKTIELEKATFKALRDLLVSELANFASTIED 401

Query: 454 DEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVT 499
           D+  L D +    +R    L   EK+++   ++V  DM++ + D T
Sbjct: 402 DQRRLNDPSTPEFRRHIIMLTMREKQIIKKNIEV-CDMMLSVIDKT 446


>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
          Length = 464

 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 74/318 (23%), Positives = 124/318 (38%), Gaps = 81/318 (25%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
           V A  D+ AG    +VP +L++T E+   +ET  +L+T+  L  +EL+          L 
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           ++L+ E+++G  S+W PYI +L  QR R       PL ++E EL  L  SP   E   + 
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
             + R+Y           + ++QY           E+ + A +   S   H + +   RR
Sbjct: 282 RNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQRRR 323

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR-------------------------- 315
              +P+    L Y   C A  A      Q+VV                            
Sbjct: 324 AQAMPVRYNELTYDLFCWASSAVATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGG 383

Query: 316 ------------------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
                             P   GE +++  G + +     N  FV  D+P D+ ++  A+
Sbjct: 384 ASSFYDTQTRSLECVAVAPIAKGEPVLLHYGDRSSMAYFGNSEFVPADHPTDQYLILLAV 443

Query: 358 NTEDPQYQDKRMVAQRNG 375
             +DP ++ K  + Q  G
Sbjct: 444 GKQDPLFKSKSTILQALG 461


>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 441

 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 95/421 (22%), Positives = 173/421 (41%), Gaps = 44/421 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A++ LQ G+    +P  L ++    + ++    L   N+L +   +AL+LM E+ +  
Sbjct: 39  VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            SF+ P+I+ L  Q        + P+ W++++ A L G  T   +L +   I R+  E D
Sbjct: 99  DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEG--TNVALLAK---IMRKQIEAD 146

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
              F A  +     Y+      T EI    +    S ++  +   + R    + +  P L
Sbjct: 147 ---FQAIHIPLLRAYEERLNLRTSEISISDYEWALS-IIWTRAFGITRYGEYLRVLCPAL 202

Query: 293 AYSSKCKAMLAAVD---------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 342
              +    +   +D         D +   V     A +   +  G   ++KLL +YGFV 
Sbjct: 203 DMFNHSVLVQEPLDEFIKYDHMKDVLAHCVVMETSANDPFYISYGSYSDAKLLYSYGFVS 262

Query: 343 -DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISD-MLPYL 400
            +E N ++ + +   +   DP ++ K+ + + N     Q +        + + +  L   
Sbjct: 263 LNEKNRFNGIDLWMRVPVTDPNFKLKQAILEGNAATRDQTYDFRGTIHLDDVDERFLASF 322

Query: 401 RLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
           R+  +S  +  E +    S      VS   E AV   + D  + RLA +P +L +D   L
Sbjct: 323 RIILLSQEEFREYEKAFDS----TIVSVRNELAVYAAIHDVCEKRLARFPTSLEDDLKKL 378

Query: 459 TDYNLHPKKRVATQL-VRME-KKMLNACLQVTADMIMLL--------PDVTVSPCPAPYA 508
            +  ++   R    + VRME KK+L +  ++  +   LL        PDVT    P    
Sbjct: 379 AELEMNSDLRKTYAISVRMEDKKILQSVCRLMKEWRNLLENDSNIYPPDVTRQQQPQLSM 438

Query: 509 P 509
           P
Sbjct: 439 P 439


>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
          Length = 353

 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 82/346 (23%), Positives = 144/346 (41%), Gaps = 36/346 (10%)

Query: 80  DLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
           DL  WM  NG     ++ ++E     E  R +   A  + + AG+    VP  L++T   
Sbjct: 28  DLLEWMIANGFELHVQLCVRE---FTETGRGL---ATLQKVTAGETFLRVPTCLLITTTT 81

Query: 139 VLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            L +     L+  ++ L+ +  L L+L+ EK +G  S W  +I  L        ++  +P
Sbjct: 82  ALSSSLHGFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTP 134

Query: 198 LLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           +      LA L  +   KAE   +   I+R +  L  +  +  +L         +E FT+
Sbjct: 135 VFLGSKLLARLPETMCRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTW 190

Query: 257 EIFKQAFVAVQS-CVVHLQKVSLARRFA---LVPLGPPLLAYSSKCKAMLAAVDDA--VQ 310
            +F  A+ AV + C+    K      F       L P L   +   KA +    +    +
Sbjct: 191 HLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFE 248

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
           +V +  Y+  + + +  G   N KLL+ YGFV  DNP D + +     T+   Y   ++ 
Sbjct: 249 IVTNNNYEPNDQVFISYGSHDNKKLLLEYGFVLADNPNDVVAI-----TKGHLY---KLN 300

Query: 371 AQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 416
           +Q+N  +      +    EK+ ISD   +   G   +   +  V+S
Sbjct: 301 SQQNDTVLYFATKLSFLEEKDIISDTCGFTTDGLTWNGKIVMQVLS 346


>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
 gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
          Length = 588

 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 123/295 (41%), Gaps = 36/295 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
            AA+  + AGD A ++P   + T+   L   G    A  +    L E    AL+L+ E+ 
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAYLTGSPTKAEILERAEG 223
            G+ S W   I  L    G G+ +       +P+ W  E   A L G+P  A+ +  +E 
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
           + R++  L    F A     ++  D+ P  A+T + F++A  A  S  + +Q        
Sbjct: 306 LARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASPGEPAA 358

Query: 283 ALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
             +P         L P ++ YS          D  ++L V R   AGE + V  G + N+
Sbjct: 359 TCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSYGAKSNA 411

Query: 334 KLLINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
           +LL+ YGF    NPYD + +   L   E       R  A     L++    V AG
Sbjct: 412 ELLLFYGFALPGNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLSPHAVRAG 466


>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
 gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
          Length = 459

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 130/316 (41%), Gaps = 52/316 (16%)

Query: 78  LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           L +   W+  N +   P  ++ + EK   +        + A + ++  +   SVP  +++
Sbjct: 35  LNEFNKWLINNKVYKNPKIEIKVLEKYGRS--------IVAKQSIKKNEKLISVPKLIIM 86

Query: 135 T----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
           +        L NE I E   +  +S     A++LMY  K   KSFW PY+  L ++    
Sbjct: 87  SNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKE---- 140

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
                + + +SE EL  L  S  K   + R +GI+R YN   T     G + +  P    
Sbjct: 141 ---FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHYNSTFTRLSNRG-IAEFSPTSTQ 196

Query: 251 T---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL-------VPLGPPLLAYS-SKCK 299
           T   + +T E+F  A     SCV        +R F+L       VPL     A   SK K
Sbjct: 197 TLQQKGYTLELFTWAL----SCV-------WSRAFSLSDSDGGMVPLADMFNAEEISKSK 245

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEA 355
                 D  +        + GE I    G   P  +S++L++YGFV D   P D + +  
Sbjct: 246 VQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSSQMLMDYGFVFDHGTPSDNVAISV 305

Query: 356 -ALNTEDPQYQDKRMV 370
              + ++P  Q K+ +
Sbjct: 306 PIFHPDEPNIQVKQSI 321


>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
          Length = 549

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 106/241 (43%), Gaps = 22/241 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A +DL+ GD A  +P S++++ E V   +    L   + +S    L L+ M EK      
Sbjct: 178 ARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNCDSK 237

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L  +   G       L +S   +  L G+    EI++  + +  +Y+EL   
Sbjct: 238 FKI-YFDTLPEKFNTG-------LSFSIQAITMLDGTLLLEEIMQARQHLHAQYDEL--- 286

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
            F A  L   +P   P E +T+E F  A     S  + +       R  L+PL   L   
Sbjct: 287 -FPA--LCNNFPDIFPPELYTWEKFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFL--N 341

Query: 295 SSKCKAML--AAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPY 348
            S C  ++    VD   ++++  + RP ++GE   +  G   +S L+  YGF+ + DN Y
Sbjct: 342 HSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECCLSYGNFSSSHLITFYGFLPQGDNSY 401

Query: 349 D 349
           D
Sbjct: 402 D 402


>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 109/258 (42%), Gaps = 34/258 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
           V A++D+  G+    VP+  V+  E    +E + +   TN       E   L L LM EK
Sbjct: 32  VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR-- 226
           K GK S W  Y+  L +           PL W   +L  L G+     ++E+  G K   
Sbjct: 92  KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGCKAMP 141

Query: 227 --------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
                   ++N +  + F+  +   + P++  +    + ++  A V+  S  +   +   
Sbjct: 142 DRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGEDRFQ- 198

Query: 279 ARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
               A+VP+   L   +      L   A   A++++       GE ++   G  PNS+LL
Sbjct: 199 ----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSYGDLPNSELL 254

Query: 337 INYGFVDED-NPYDRLVV 353
             YGFV+ D NP+D L V
Sbjct: 255 RRYGFVETDPNPHDCLEV 272


>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 489

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/308 (24%), Positives = 124/308 (40%), Gaps = 54/308 (17%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           +W+   G P  K+  K + +    H     V A  DL  G+  F++P + V++++    N
Sbjct: 22  TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             +  LL+ N   L     L + ++YE  QG +S W  Y R L R         ++ + W
Sbjct: 76  SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128

Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
           S +EL  L GS           +  I+E      RA        +G+     +  T   +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEGSIIESIAPIVRANPALFPPIDGVASYDGDAGTQALL 188

Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                 GSL   Y +DI  P +       +  ++  +      +         +VPL   
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGDRDGEDGYLTDEEEEQSSK--------GMVPLADL 240

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
           L A + +  A L   ++ + +   +P K+GE I    G  P S LL  YG+V  DN    
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKSGEEIFNDYGEIPRSDLLRRYGYV-TDNYAQY 299

Query: 351 LVVEAALN 358
            VVE +L+
Sbjct: 300 DVVELSLD 307


>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
          Length = 440

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/364 (23%), Positives = 147/364 (40%), Gaps = 49/364 (13%)

Query: 97  LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
           LK++   +   RP  +      +  ++ LQ   D   S+P   ++T + VL +  + E +
Sbjct: 39  LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97

Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
              K  +S L  L  +L+ EK  G+KS W PY+  L +           P+   E ++  
Sbjct: 98  MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVS 149

Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
           L   P + +  E+   +   Y      +     LF +    I    F +   + A+  + 
Sbjct: 150 LLPEPLRKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTIN 205

Query: 268 SCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKA 319
           +  +++ K S  + F+L P    L P   LL +S   + KA       + ++  +   K 
Sbjct: 206 TRTIYM-KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKK 264

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
            E + +  GP  N +LL+ YGFV  DNP+  + V +A   +     DK    QRN K+S+
Sbjct: 265 YEEVFICYGPHDNQRLLLEYGFVAMDNPHSSVYVSSATLLKYFPPLDK----QRNAKVSI 320

Query: 380 QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADY 439
              H           D+L  L  G+   +  + + +  L        C  R +   L D 
Sbjct: 321 LKDH-----------DLLENLTFGWDGPSWRLLTALKVLSLGADEFTCWRRTL---LGDV 366

Query: 440 FKAR 443
             AR
Sbjct: 367 ISAR 370


>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 245

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 92/217 (42%), Gaps = 11/217 (5%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           AL+P+       + +      A  D  +    R ++ GE I +  GP+ NS   ++ GFV
Sbjct: 31  ALIPMWDMCNHENGRITTDFNATSDRCECYALRNFQKGEQIFISYGPRTNSDFFVHSGFV 90

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDM-LPYLR 401
             DN  D   +   ++  D   +++  +  +    SV  F +  G   E ISDM L +LR
Sbjct: 91  YMDNEQDGFKLRLGISKADSLQKERTELLGKLDLPSVGEFLLKPG--TEPISDMLLAFLR 148

Query: 402 LGYVSDTSEMQSVISSLGPI------CPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           + +    +E+   + S          C +   +E  V   L    +  +A YP TL ED 
Sbjct: 149 V-FSMRKAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQLLIANYPTTLKEDL 207

Query: 456 AMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
            +L +  L   K++  QL   EKK+L   L+     I
Sbjct: 208 ELL-ETTLPQVKKMTVQLRVTEKKILLGALEYVEQWI 243


>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
 gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
          Length = 536

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 114/277 (41%), Gaps = 24/277 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASE +  GD A  +P  L+++ E +  +E    L   N ++    L L+ M E+      
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    LA L G+    EI++  + ++++Y+EL  +
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +P     +  T++ F  A     S  + +   S      LVP+   L   
Sbjct: 296 ------LCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 349

Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDR 350
            S        VD+A   ++  + RP  AGE   +  G  P S L+  YGF+   DNPYD 
Sbjct: 350 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYDV 409

Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
           + ++      D    D+ + AQ +   S Q  H+  G
Sbjct: 410 IPLDL-----DTSVDDEDIAAQSSATTS-QTTHMVRG 440


>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
          Length = 442

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 77/189 (40%), Gaps = 28/189 (14%)

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 244
           + +PL WS+ E   L GS                YN LD  W M         A  L Q 
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205

Query: 245 YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 300
             + D+P   ++ + FK A+  + +    +   S     R+  + P+         K   
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265

Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
              A     +L   + +  GE I +    + N++ L+ YGFV E NP+D + + A++  +
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHDYVGIAASIGND 325

Query: 361 DPQYQDKRM 369
            P Y+DK +
Sbjct: 326 QPFYRDKSL 334


>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 269

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 24/189 (12%)

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
           Y+ GE I ++ G + N+  L++ GFV  DN    + +  +LN  + Q++ ++ + ++ G 
Sbjct: 74  YRKGEQIFIYYGNRTNADFLVHNGFVYPDNANSAVAIPLSLNPTEEQFEQRKQLLEKLGL 133

Query: 377 LSVQVFHVHAGREKEAIS-DMLPYLR--------LGYVSDTSEMQSVISSLGPICPVSPC 427
            S   F+V  G     IS ++L + R        LG+      +QS +  L P C   P 
Sbjct: 134 ASSGDFNVQRGGGDSFISPELLGFARVFNMTKEQLGHWQGEDAVQSQL--LEPDC---PG 188

Query: 428 MERAVLDQLADYFKARLA----GYPATLSEDEAMLTDYN------LHPKKRVATQLVRME 477
           +E ++ +++  Y   RL         TL +DEA+L +        L   K +  Q   +E
Sbjct: 189 LEASLREKVWKYLSIRLQLALRMTGTTLDQDEALLANQGQKGAQKLGHIKSMLVQFRVVE 248

Query: 478 KKMLNACLQ 486
           KK+L+  L+
Sbjct: 249 KKILSEALE 257


>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
 gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
          Length = 481

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 80/194 (41%), Gaps = 18/194 (9%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  +G+ SFW PYI  L +       A+  P  W E ++AYL G+     I E  
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164

Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHL 273
             +K E+ +    L    F A   + Q  Y      FT   F+ + V    A Q     L
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCG 328
                   F+++    PL   ++       A D A      QLV    Y+ GE +    G
Sbjct: 225 PPGCQLDDFSILQ---PLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYG 281

Query: 329 PQPNSKLLINYGFV 342
            + NS+LL+ YGF+
Sbjct: 282 LKTNSELLLAYGFI 295


>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
 gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
          Length = 287

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 63/287 (21%), Positives = 118/287 (41%), Gaps = 31/287 (10%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           EE       W+H+NG   C+ +  +        R +    A++ L+  +    +P  L++
Sbjct: 18  EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71

Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           T++ ++ +     +   + +L+    LA++LM EK + +KSFW PYI  L  +       
Sbjct: 72  TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
              P  ++E +   L  S  + +   +     +EY EL   + M   LF         +A
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLFPD-----QEDA 178

Query: 254 FTFEIFKQAFVAVQSCVV----------HLQKVSLARRFALVPLGPPL-LAYSSKCKAML 302
           F F+ FK A+ A+++  +          HL+         + PL   +  A  +K +   
Sbjct: 179 FNFKDFKWAWSAIKTRALDVPIGRESCRHLRDAEDTPTPTMFPLVDSINHAAQAKIRHRY 238

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
                 ++   +  Y+    ++   G   N  LL+ +GFV   NP D
Sbjct: 239 NEKSRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPED 285


>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 495

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 99/418 (23%), Positives = 170/418 (40%), Gaps = 54/418 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQG 171
           V A  DL AG+    VP SL++ +E    +  +  +L   +LS+   +A +L+YE  +  
Sbjct: 79  VFALRDLAAGETVLRVPLSLLLNVEHASAS-PLGGILDDFRLSDAEAMAFWLIYELTRPE 137

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           + S WLPY+  L       QL     + +   E+  L  SP       R   ++ ++ + 
Sbjct: 138 RASPWLPYLESL--PASIKQLT----MFYDPFEMKRLQASPVAEFTSRRTVKMRNKFGKY 191

Query: 232 DTVWFMAGSLFQQYP-----YDIPTEAFTFEIFKQAFVAVQSC------VVHLQKVSLAR 280
                    + +  P      + P E  T + F  A +AVQ        V H       R
Sbjct: 192 RE------QISKHRPAHLAEIEFPVELITVDDFLWA-MAVQFTRLITVQVKHPADGEWER 244

Query: 281 RFALVPLG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQ--PN 332
              LVPL       P    + +C   L +     +    RP   G+ ++  + G +   N
Sbjct: 245 TKCLVPLADLLNTAPADQINVECATNLDSTH--FECATIRPVAEGQELLTPYGGAEQLSN 302

Query: 333 SKLLINYGFVDEDNPYDRLVV------EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHA 386
            +L+++YG    +NP D + +      E A+  +        M   R  +L + V     
Sbjct: 303 GQLIMDYGVTFRNNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQLPVLDHFE 362

Query: 387 GREKEAISDMLPYLRLGYVS---DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
              KE    +L + R+ YVS   D S+++ V+  +     ++P  ER  L+ L       
Sbjct: 363 SIPKE----LLAFARV-YVSTPSDLSDLEHVLELMKEHRAINPSNERRALELLLQLTNEM 417

Query: 444 LAGYPATLSEDEAMLTDYNLH--PKKRVATQLV-RM-EKKMLNACLQVTADMIMLLPD 497
           +  Y  T+ EDE ML + +    P       +V R+ EK++L++  Q+    I  LP+
Sbjct: 418 ILKYITTIEEDETMLRELDAESVPNANAVNAVVLRLGEKRILSSLWQLLDSAIEALPE 475


>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
 gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
          Length = 588

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 149/387 (38%), Gaps = 51/387 (13%)

Query: 49  RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
           R +R      S+   V G  +      E    L  W+ + G    +V+     +  N+  
Sbjct: 12  RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71

Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
           R    V A   L+ G  A  +P  + +   R   +  +   L   +   +      +AL 
Sbjct: 72  RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
           L+ E+++G    +  Y+  L +        ++SPL WS  EL  + G+    ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180

Query: 224 ----IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHLQKVS 277
               ++  Y EL         +F +Y      + AF    F+ AF  ++S  +  L   +
Sbjct: 181 YDAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGAN 234

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGES 322
           +A    LVP G  L+ +SS   A                      A  +  DR Y  G  
Sbjct: 235 VA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAE 289

Query: 323 IVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
           I V   P+  +SK  ++YGF+D  NP     +  ++  +D    DK  V +  G      
Sbjct: 290 IFVNYDPEGIDSKFALDYGFIDVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPT 349

Query: 382 FHVHAGREKEAISDMLPYLRLGYVSDT 408
           F +    + +   ++  +LRL +  DT
Sbjct: 350 FTLRPYSDPD--RELRTFLRLLHCKDT 374


>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
          Length = 658

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 26/143 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
           V+A  ++ A     ++PN L+++  +VL +E ++++  T+K          +E  CLALY
Sbjct: 48  VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
           + Y K QG KSFW PY+  +++          +   W   +L  L       E +     
Sbjct: 107 IFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLIDEFMYIQ-- 155

Query: 224 IKREYNELDTVWFMAGSLFQQYP 246
                +E+D  W+    L  +YP
Sbjct: 156 -----SEMDKSWYKFKGLMNKYP 173


>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 673

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 96/229 (41%), Gaps = 7/229 (3%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 77  DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
                +S  +   L+P+   +  +    K    A  D V  VV     AG+ I    GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPR 250

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
            N +L++NYGF    NP D  +V        P Y  K    Q    L+V
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAKSHQLQMYPDLAV 299


>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
           Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
           Precursor
 gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
           N-methyltransferase [Pisum sativum]
          Length = 489

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/377 (24%), Positives = 160/377 (42%), Gaps = 24/377 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   D    VP  L +  + V  +E I  + +  +L     + L+L+ E+ + +
Sbjct: 84  LVALKDISRNDVILQVPKRLWINPDAVAASE-IGRVCS--ELKPWLSVILFLIRERSR-E 139

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL  L GS      +   E +K E  +L+
Sbjct: 140 DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 192

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +    L    A V     
Sbjct: 193 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 252

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
             AY  K  A L + D    L      KAGE + + +   + N++L ++YGF++ +    
Sbjct: 253 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRH 310

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
              +   ++  DP + DK  VA+ NG      F +   R       +LPYLRL  +  T 
Sbjct: 311 AYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTD 368

Query: 410 E--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
              ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D   L + NL 
Sbjct: 369 AFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLD 427

Query: 465 PKKRVATQLVRMEKKML 481
            +  +A  +   EK +L
Sbjct: 428 SRLAIAVGIREGEKMVL 444


>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
          Length = 376

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 116/251 (46%), Gaps = 38/251 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ED++AG+   SVP + ++T      NE++ +L  T+ LS    LAL+L+   +  K
Sbjct: 1   MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S+W PY   L        +   S LL      ++L  S  K E +++ + I  +Y    
Sbjct: 54  QSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY---- 102

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
            V  +     +Q P DI     T E FK A++ V +  +H+       +   + L P L 
Sbjct: 103 -VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLD 156

Query: 292 -LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVD 343
            L ++++ K     ++    +   R        YK GE + +  GP  N  +L  YGFV 
Sbjct: 157 FLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYGFVL 211

Query: 344 EDNPYDRLVVE 354
            +N Y+ ++++
Sbjct: 212 NENIYNFVLLD 222


>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 541

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 100/244 (40%), Gaps = 29/244 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L   + ++    + L+ M EK      
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       + +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDELI-- 282

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                 L   + +  P E +T+E +  A     S  + ++      +  L+P+       
Sbjct: 283 -----PLLSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 337

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
             P ++ Y   C         +++  V RP   GE   +  G   +S LL  YGF+ + D
Sbjct: 338 IYPHIVKYGKVCVET-----SSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGD 392

Query: 346 NPYD 349
           NPYD
Sbjct: 393 NPYD 396


>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
          Length = 1234

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/252 (22%), Positives = 105/252 (41%), Gaps = 18/252 (7%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQG 171
            +  ++ ++  +    VP   ++ ++    +  +  +    + L++   L L+++YEK + 
Sbjct: 767  IVTTKKVEENEVIIKVPRKFLINVQVAREHPILGRIFEEFSGLNDDTILFLFVIYEK-EN 825

Query: 172  KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
              SFW P+   L            + + ++ TEL  L G+   AE L+    +K     +
Sbjct: 826  PNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----VKEHLQSI 874

Query: 232  DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
              + F    L +QYP   P   F++E F  A     S  + L K+       LVP+   +
Sbjct: 875  RDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCLVPMADMI 931

Query: 292  LAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
              + +    +      D   ++V          I +  G   N +L + YGFV +DNPYD
Sbjct: 932  NHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRELALYYGFVIQDNPYD 991

Query: 350  RLVVEAALNTED 361
             +++   L  ED
Sbjct: 992  SMLIGFDLPDED 1003


>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 363

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 146/376 (38%), Gaps = 92/376 (24%)

Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
           A   +P +L VT   V  +E +A L       EL  LAL+LM E+++G++S W P++  L
Sbjct: 2   ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59

Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
                    A  SP+LW  E +   L  SPT  E   R   +++E++             
Sbjct: 60  PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD------------- 99

Query: 243 QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL--ARRFA-------LVPLGPPLLA 293
                                       V  Q+++   ARRF+        + LG P   
Sbjct: 100 ----------------------------VIAQRIATGDARRFSGGDELKLWITLGSP--G 129

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           +      +L A+ D                      +PN +L +  G V++DN  D L V
Sbjct: 130 WGGTSDKLLMAIYDG---------------------RPNGELAMATGRVEDDNASDCLTV 168

Query: 354 EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS 413
              L   D  +  K+ + +  G   VQ F +   R     + +L YLRL  ++D + +  
Sbjct: 169 RVGLVQADRLFSVKKQILESLGFDIVQEFPIFRDR---MPTQLLAYLRLARLTDPALLAK 225

Query: 414 VISSLGPICPVSPCMERAVLDQLADYFKARLAGYP----ATLSEDEAMLTDYNLHPKKRV 469
           V  S      ++P  E  VL  L    + RL  Y      +  ED  +L    L  ++R+
Sbjct: 226 V--SFEEDIILNPVNEYEVLQLLLGECRDRLTSYAGMHMGSAEEDVKLLQRPGLTAQERL 283

Query: 470 ATQLVRMEKKMLNACL 485
           A +L + EK +L   L
Sbjct: 284 AARLRKAEKAILQGTL 299


>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
          Length = 566

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 104/249 (41%), Gaps = 28/249 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+EDL+ GD A  +P S+V++ E V  ++    L   + +S    L L+ M EK      
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F   Y   L         A  + L +    +  L G+    EI+E  + +  +Y EL   
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 302

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                +L + +P   P E +T E F  A     S  + +       R  L+P+       
Sbjct: 303 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 359

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
             P ++ Y       + +  ++++  V +P   GE   +  G   +S L+  YGF+ + D
Sbjct: 360 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGD 414

Query: 346 NPYDRLVVE 354
           N YD + +E
Sbjct: 415 NLYDTIPLE 423


>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
          Length = 539

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 91/193 (47%), Gaps = 10/193 (5%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + ++ GE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 234 DDRCECVALQDFRPGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 293

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD---------TSEMQSVIS 416
            K  V  R G  +  VF +H   +    + +L +LR+  +++          S +  + +
Sbjct: 294 MKAEVLARAGIPTSSVFALHF-TDPPISAQLLAFLRVFCMTEEELKEHLLGDSAIDRIFT 352

Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRM 476
                 PVS   E  +   L D     L  Y  T+ ED+++L   +L  +  +A +L   
Sbjct: 353 LGNSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTIEEDKSVLKSRDLSVRATMAIKLRLG 412

Query: 477 EKKMLNACLQVTA 489
           EK++L   ++  A
Sbjct: 413 EKEILERAVRSAA 425



 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 38/160 (23%), Positives = 73/160 (45%), Gaps = 18/160 (11%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ +++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
                 ++PL + E E+ YL  +    ++  + +   R+Y
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217


>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
          Length = 583

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/249 (24%), Positives = 104/249 (41%), Gaps = 28/249 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+EDL+ GD A  +P S+V++ E V  ++    L   + +S    L L+ M EK      
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F   Y   L         A  + L +    +  L G+    EI+E  + +  +Y EL   
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 319

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                +L + +P   P E +T E F  A     S  + +       R  L+P+       
Sbjct: 320 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 376

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
             P ++ Y       + +  ++++  V +P   GE   +  G   +S L+  YGF+ + D
Sbjct: 377 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGD 431

Query: 346 NPYDRLVVE 354
           N YD + +E
Sbjct: 432 NLYDTIPLE 440


>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
          Length = 2165

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 75/333 (22%), Positives = 135/333 (40%), Gaps = 65/333 (19%)

Query: 97  LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
           L++  SH EK +  +Y A      A+ D++ G+    VP   ++TLE  +    G +   
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213

Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
           + L    +S + + L+ Y+M EK++ + S W  YI  L +           P+ ++E E 
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRKPE-SQWQIYIDILPKNFSN------FPIFFTEEER 266

Query: 206 AYLTGSPTKAEILERAEGIKREYN---------------ELDTVWFMAGSLFQQYPYDIP 250
            +L GSP   +ILE+ E IK +Y+               E   +  M  S      + I 
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYDLICKEVPEYVQFPIREYSEIRMMVSSRI----FGIQ 322

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
            E     +    FVA    + H +           P          K   ++ A++D   
Sbjct: 323 IEG----VKTDGFVAYADMLNHKR-----------PRQTSWTYTDEKQGFIIEAMEDI-- 365

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR-M 369
                  + GE +    G + NS+  +NYGF++ +N  + + ++   +T+D   Q K+ M
Sbjct: 366 -------QRGEQVYDSYGKKCNSRFFLNYGFINLNNDANEVPIKVYYHTDDQLKQVKQDM 418

Query: 370 VAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           +   +     + F V    E   + +   +LR 
Sbjct: 419 IVDHS---EFKKFRVVENLEDRVMQEFFSWLRF 448


>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 480

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 88/386 (22%), Positives = 156/386 (40%), Gaps = 56/386 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  D+Q G+  F +P  +V+ ++    NE +A+ L    L     L + ++YE   G+
Sbjct: 50  VVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADEL--KNLGPWLSLVVVMIYEYSLGE 107

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT---------KAEILERAEG 223
           +S W  Y + L  +        ++ + WS  EL+ L  S           + +I E+   
Sbjct: 108 RSNWNQYFQVLPTK-------FDTLMFWSGEELSQLQASAVIHKIGKKDAEEDIFEKIIP 160

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEA--------------FTFEIFKQAFVAVQSC 269
           + R + +L    F   +    Y  D   +A              + F+I K      +  
Sbjct: 161 LVRSHPDL----FPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEGE 216

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
             +L          +VPL   L A + +  A L   D A+ +   +P K G+ I    G 
Sbjct: 217 DGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGE 276

Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEAAL----------NTEDPQYQDKRMVAQRNGKLSV 379
            P S LL  YG+V  DN     VVE  L          N E  +Y   +++ +   ++  
Sbjct: 277 LPRSDLLRRYGYV-TDNYAQYDVVELPLTGICHAAGLDNIESQEYPHLKLLHEL--EILE 333

Query: 380 QVFHVHAGREKEAISDMLP----YLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQ 435
             + +     +++++D+LP     L      +  E+Q + S   P  P+    E  +   
Sbjct: 334 DGYCILRPSAEDSLTDILPDELLALLKSLTLEREELQRLQSKQKPPKPILAAREARI--- 390

Query: 436 LADYFKARLAGYPATLSEDEAMLTDY 461
           L D  K++L+ Y  T+ +D+A+L  +
Sbjct: 391 LLDSVKSKLSQYGTTVEQDKAILQQF 416


>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
          Length = 433

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 92/391 (23%), Positives = 162/391 (41%), Gaps = 55/391 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
           A+ +L++    FS+P SLV+++      +++   +E+ T   +  + CL    MYE+   
Sbjct: 39  ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNE 230
             S W  Y+  +           +S + WS+ EL  L GS    +I  E AEG    Y+ 
Sbjct: 94  PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSI 144

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------------ 278
           L         +F+      P EA++  ++ +    + S   H+                 
Sbjct: 145 LVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAA 198

Query: 279 ---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
                   +VP+   L A S    A L    DA+ +   +   AGE I       PN+ L
Sbjct: 199 YHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQIFNTYNDPPNADL 258

Query: 336 LINYGFVDEDNPYDRLVVEAAL-NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAI- 393
           L  YG VDE N  D + + A L   +D          +R   L  ++  V+   + E + 
Sbjct: 259 LRRYGHVDEVNLNDNVEISADLIGCKD---------LERVDWLLDRLDDVYTLTQAEDLP 309

Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
            D +  +++   +  SE + +  +     P    ++ A   ++ +  + RLA Y +T+ E
Sbjct: 310 EDFITAVKI-LTASKSEFRKIQKA--DDLP-DDVLDEATAMRVREILQMRLAQYSSTIEE 365

Query: 454 DEAMLTDYNLHPKKRVATQLVRM-EKKMLNA 483
           DE++L    +   +  A  LVR+ EK++L A
Sbjct: 366 DESLLASSTMLTSRSRAALLVRLGEKRILAA 396


>gi|320163048|gb|EFW39947.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 476

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 106/470 (22%), Positives = 192/470 (40%), Gaps = 66/470 (14%)

Query: 83  SWMHKNGLPPC-KVILKEKPSHNEKHRP--IH---YVAASEDLQAGDAAFSVPNSLVVTL 136
           +W+  NG     K+ L+   + N   R   +H     +A       +  FS+P  L+++ 
Sbjct: 17  AWLRANGATVSPKLTLQATAAFNADSRTQVLHRRVIASAEAGFDKEEELFSIPRKLLLSA 76

Query: 137 ERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
                  +IAELL  NK    A      L + +MYE    K SFW PY+  L        
Sbjct: 77  ----STSSIAELLLENKKEACALVGWMPLVVAMMYEI-TNKDSFWRPYLDLLPE------ 125

Query: 192 LAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKRE----YNELDTVWF----MAGSLF 242
             +++P+ W++ +L  L G+ T + +  E AE I  E    + +L    F       +L+
Sbjct: 126 -TLDTPMFWNDDDLELLEGTSTLSHLGKEDAETIFTEQIVPFMKLHPTHFDLKVHNMALY 184

Query: 243 QQYPYDIPTEAFTFEIFKQAFV-------------AVQSCVVHLQKVSLARRFALVPLGP 289
            +    I   +F+ +  +                 A   C    ++  +  + A+VPL  
Sbjct: 185 HRVASVIMAYSFSEDDDEDDDDEDDDEEEDCCDGDANNECCSQKRQKRM-EKIAMVPLAD 243

Query: 290 PLLAYSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
            +L + + C  A L      + +    P  AG  +    G   NS+LL  YGF+D+ N +
Sbjct: 244 -MLDHKTGCNNARLFYGKTTLAMSCIEPCAAGHELYNTYGDLSNSELLRKYGFIDDVNEH 302

Query: 349 DRLVVEAAL---NTEDPQYQDKRMVAQRNGKLSVQVFHVHAG----REKEAISDML---- 397
           + + +   +     E   + ++ M A       +  FH+ A     +E EA   +L    
Sbjct: 303 NSVDIPVEMLEERFESCSFMEEAMEALEEIGCWLPEFHIPADALPPQELEASIALLFQSP 362

Query: 398 -PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEA 456
                L  + D  E++S +++L     V+ C  R V + L  + + R   Y  T  EDE 
Sbjct: 363 KQVRALRALDDEDEIRSFLATL-----VNKC-RRKVSETLLAFGQKRAEEYTTTREEDEE 416

Query: 457 MLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLLPDVTVSPCPAP 506
            L + +L  ++++A ++   E+ +L+  +    + +   P    +  PAP
Sbjct: 417 RLKESDLTHRQKMALRVRIGERTILHNYISHLKERLETTPPDQETKEPAP 466


>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
          Length = 444

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 90/377 (23%), Positives = 158/377 (41%), Gaps = 24/377 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   D    VP  L +  + V  +E        ++L     + L+L+ E+ + +
Sbjct: 40  LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 95

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL  L GS      +   E +K E  +L+
Sbjct: 96  DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 148

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +    L    A V     
Sbjct: 149 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 208

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
             AY  K  A L + D    L      KAGE + + +   + N++L ++YGF++ +    
Sbjct: 209 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRH 266

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
              +   ++  DP + DK  VA+ NG      F +   R       +LPYLRL  +  T 
Sbjct: 267 AYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTD 324

Query: 410 E--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
              ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D   L + NL 
Sbjct: 325 AFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLD 383

Query: 465 PKKRVATQLVRMEKKML 481
            +  +A  +   EK +L
Sbjct: 384 SRLAIAVGIREGEKMVL 400


>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
          Length = 440

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 90/377 (23%), Positives = 158/377 (41%), Gaps = 24/377 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   D    VP  L +  + V  +E        ++L     + L+L+ E+ + +
Sbjct: 36  LVALKDISRNDVILQVPKRLWINPDAVAASEIGR---VCSELKPWLSVILFLIRERSR-E 91

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL  L GS      +   E +K E  +L+
Sbjct: 92  DSVWKHYFGILPQE-------TDSTIYWSEEELQELQGSQLLKTTVSVKEYVKNECLKLE 144

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +    L    A V     
Sbjct: 145 QEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLRNENLVVVPMADLINHSAGVTTEDH 204

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
             AY  K  A L + D    L      KAGE + + +   + N++L ++YGF++ +    
Sbjct: 205 --AYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGFIEPNENRH 262

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
              +   ++  DP + DK  VA+ NG      F +   R       +LPYLRL  +  T 
Sbjct: 263 AYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPP--GLLPYLRLVALGGTD 320

Query: 410 E--MQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
              ++S+   +  G +   VS   E  +   + +  K+ LAGY  T+ +D   L + NL 
Sbjct: 321 AFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRE-LKEGNLD 379

Query: 465 PKKRVATQLVRMEKKML 481
            +  +A  +   EK +L
Sbjct: 380 SRLAIAVGIREGEKMVL 396


>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
 gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
          Length = 464

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 153/367 (41%), Gaps = 46/367 (12%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L L L+YE+ Q K S+W PYI  L         +   P+ +S  ++  +  +P   ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 277
           R   + +   EL  +  +   +       + + A  + +   A V+ ++  +H    K+ 
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWCGPQPNSK 334
            A    L+ +       ++  +  L+  A D + +++V  R  + G +I +  GP  N  
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDL 273

Query: 335 LLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA--QRNGK------LSVQVFH--- 383
           LL++YGFV  DNP+DR+     L  +    ++ RM+A   R G        S QV     
Sbjct: 274 LLLDYGFVIPDNPHDRI----ELRYDGSLMENARMIAGLSRTGSPPFSSPASWQVDRLKQ 329

Query: 384 -----------VHAGREKEAISDMLPYLRLGYVSDTS--EMQSVIS--SLGPICPVSPCM 428
                      V  G  +E    +L  LR+ +       E + ++S  + G    VS   
Sbjct: 330 LGLADSGESQKVTLGGPEEVDGRLLAALRILHAESQEPLERRELVSLQAWGVESMVSSDN 389

Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVT 488
           E  VL  L          +  T+ EDEA L+D +L    R+A Q    +K+++   L+  
Sbjct: 390 EERVLRTLCGLAAIVFNQFKTTIEEDEAKLSDKSLAETSRIAVQFRLTKKRLVVRVLESL 449

Query: 489 ADMIMLL 495
              +M L
Sbjct: 450 KKRLMDL 456


>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
 gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
          Length = 490

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 88/401 (21%), Positives = 163/401 (40%), Gaps = 50/401 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +    +P  L +  + V  +E          +     +AL+L+ EK + +
Sbjct: 80  LVAQRDISRNEVVLEIPKKLWINPDVVAASEIGN---VCGGVKPWVSVALFLIREKLK-E 135

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W PY+  L         +  S + WSE ELA L G+   +  L     ++RE+ +++
Sbjct: 136 DSTWRPYLDVLPE-------STNSTIFWSEEELAELQGTQLLSTTLGVKSYLRREFLKVE 188

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
               +     Q +P  +     T + F  AF  ++S      +    +   L+PL     
Sbjct: 189 EEILVPHK--QLFPSPV-----TLDDFSWAFGILRSRSFSRLR---GQNLVLIPLADLCN 238

Query: 289 -------------PPLL----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ- 330
                        P +      Y  K   + +  D    L      KAGE +++      
Sbjct: 239 FLHTWLLDQVNHSPDITIEDGVYEIKGAGLFSR-DLIFSLRSPISLKAGEQVLIQYNLNL 297

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH-VHAGRE 389
            N++L ++YGF++  +  +   +   ++  DP + DK  +A+ NG   +  F  V     
Sbjct: 298 SNAELAVDYGFIEAKSDRNMYTLTLQISESDPFFGDKLDIAETNGLGEIADFDIVLGNPL 357

Query: 390 KEAISDMLPYLRLGYVSDTSEMQSVISS--LGPI-CPVSPCMERAVLDQLADYFKARLAG 446
              +   L  + LG  +D+  ++S+  +   G +  PVS   E  +   + D  K+ L+G
Sbjct: 358 PPTLLPYLRLVALGG-TDSFLLESIFRNTIWGHLELPVSRANEELICRVVRDACKSALSG 416

Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQV 487
           Y  T+ EDE  L    L+P+  +A  +   EKK+L    ++
Sbjct: 417 YHTTIEEDEK-LKGEELNPRLEIAVGIRAGEKKVLQQIEEI 456


>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
 gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
 gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
 gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
          Length = 491

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 81/396 (20%), Positives = 158/396 (39%), Gaps = 56/396 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
           + A+ +L+ G+    VP   ++T E ++  +  +++ +   N LS    L++ L+YE  +
Sbjct: 51  LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110

Query: 171 GKKSFWLPYI----RELDRQRGRGQ-----LAVESPLLWSETELAYLTGSPTKAEILERA 221
            KKSFW PY+    R+ D     G      L VE  +  +E   A       +A  L + 
Sbjct: 111 EKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKE 170

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
             +K ++      W  A +        +P ++                V  L        
Sbjct: 171 LELKPKFRSFQ-AWLWASATISSRTLHVPWDS----------AGCLCPVGDLFNYDAPGD 219

Query: 282 FALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGP 329
           ++  P GP          L+  +   +      ++ V    L   R Y+ GE +++  G 
Sbjct: 220 YSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGT 279

Query: 330 QPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAG 387
             N +LL +YGF+ E+N  D++ +  E +L +    +    +   ++GKLS         
Sbjct: 280 YTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSWPKDSLYIHQDGKLSFA------- 332

Query: 388 REKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGY 447
                   ++  LRL  +  +   +SV+  +     +S   E  V+  +++   + L   
Sbjct: 333 --------LISTLRLWLIPQSQRDKSVMRLVYAGSQISVKNEILVMKWMSEKCGSVLRDL 384

Query: 448 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
           P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 385 PTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 415


>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
          Length = 689

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
           + A +D+    A   +PNS ++++ RV  +  + ++L+ ++        ++  CLA++LM
Sbjct: 74  IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
            E  QG++SFW PYI  ++          ES LL  W + E+  L       +  E  + 
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183

Query: 224 IKREYNELDTV 234
           I+ E+N+L  +
Sbjct: 184 IEDEWNKLSKI 194


>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
           8797]
          Length = 496

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/254 (23%), Positives = 109/254 (42%), Gaps = 32/254 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYE-KK 169
           V A ED++  +  F VP + ++ +E    ++   E+       + +   L + L++E K 
Sbjct: 41  VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
            G+KS W PY++ L ++    QL     + W++ EL  L         G+    E+ E  
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENV 155

Query: 222 EGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
             I  K    E D+      W    +  S+   Y +D+  + +  E  K+     +    
Sbjct: 156 VDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDN 211

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
             + V   +   ++PL   L + + KC A L    + +++   +  K GE I    G  P
Sbjct: 212 ESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQIYNIYGDHP 269

Query: 332 NSKLLINYGFVDED 345
           NS++L  YG+++ D
Sbjct: 270 NSEILRRYGYIEPD 283


>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
          Length = 381

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 74/288 (25%), Positives = 119/288 (41%), Gaps = 33/288 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVT-------LERVLGNETIAELLTTNKLSE--LACLALY 163
           V  SE+L       S P SL +T       L+R+LG    A+L   N LSE  L C  L 
Sbjct: 3   VHTSEELPPDAPVISAPFSLAITPTVAADALQRILGPG--ADL---NSLSERELVCTYLA 57

Query: 164 LMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
           + +  K+   G  +  L +   +D    R QL   +PL ++  ELA L G+   A   +R
Sbjct: 58  MHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDR 115

Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
               + E      V    G       Y   +   +   F    ++ +  ++     + + 
Sbjct: 116 EADWRSECERCRAVLGHWGEHLTWEHYLTASTHLSSRAFPSTLLSPEPALI----PTPSS 171

Query: 281 RFALVPLGPPL-LAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLL 336
              LVPL   L  A +      ++  D+      +V   P  AG  ++   GP+PN++L+
Sbjct: 172 HPVLVPLIDSLNHARAHPVSWSVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELV 231

Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV 384
           + YGF   DNP D LV++ +         D+R + +  G L   +F V
Sbjct: 232 LGYGFALPDNPDDTLVLKVS------GAADRREIWRAGGGLQRILFDV 273


>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 488

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 90/429 (20%), Positives = 168/429 (39%), Gaps = 67/429 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           + A +D+   +  F++P  LV++ +    N  + +L+  N+  L +  CL L ++YE  Q
Sbjct: 50  IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKRE- 227
           G  S W PY + L           ++ + W++ EL  L GS     I + A  E   R+ 
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158

Query: 228 -------------------YNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                              YN  D      ++    GSL   Y +D+  +       +  
Sbjct: 159 LPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVKNDEAEEVEGEGG 218

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
           +V         ++  L +   ++PL   L A + +  A L   D  + +   +  + GE 
Sbjct: 219 YVTDD------EERQLPK--GMIPLADLLNADADRNNACLFQEDGYLAMKSIKSIRKGEE 270

Query: 323 IVVWCGPQPNSKLLINYGFVDEDN--------PYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           I    G  P ++LL  YG+V ++         P   +   A L +  P   + R+    +
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDNYAQYDEAEVPIQTICKVAGLKSSTPGPDEPRLEFLDD 330

Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV-SPCMERAVL 433
            ++    + +        +++ LP   L  ++         + L     V  P +  A  
Sbjct: 331 LEVLDDGYGIPRPDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSKVPKPALGIAEA 390

Query: 434 DQLADYFKARLAGYPATLSEDEAML-----TDYNLHPKK----RVATQLVRMEKKMLNAC 484
             L +  +  L  YP T+++D+ +L        +  PK     ++A Q+ + EK++LNA 
Sbjct: 391 TLLDEVVRLILGEYPTTVAQDKELLASCANNQGSTSPKSAGRLKMALQVRKGEKEILNAV 450

Query: 485 LQVTADMIM 493
           L    D I+
Sbjct: 451 LSELEDFIV 459


>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
 gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
          Length = 487

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 119/296 (40%), Gaps = 62/296 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  ++  G+  FS+P ++V+T++    N  +  LL  N   ++     L L ++YE  
Sbjct: 50  VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI------------ 217
           QG+KS W PY R L  +        ++ + WS  EL  L  S    +I            
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQASTIVEKIGRSNAEESIRDS 158

Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFE 257
                            L   EGI  +   L  V  + GSL   Y +DI     +    E
Sbjct: 159 IAPILAKRPDLFPPPPGLASWEGIAGDA-ALIQVGHVMGSLIMAYAFDIEKAEDDDDEGE 217

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
           +  ++++        L K        +VPL   L A + +  A L   + A+ +   +P 
Sbjct: 218 VNDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRL------VVEAA-LNTEDPQYQ 365
           + G+ I    G  P + LL  YG+V ++   YD L      + EAA L   DP+ Q
Sbjct: 271 QKGDEIFNDYGEIPRADLLRRYGYVTDNYAVYDVLELSLETICEAAGLANADPESQ 326


>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
 gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
          Length = 660

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 91/428 (21%), Positives = 175/428 (40%), Gaps = 78/428 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
           + A+ D+ A    F++P + ++ +E     R+L     G    AE      L   A L L
Sbjct: 41  IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100

Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
            ++ E   G++S+W PYI  L         + ++P+ W++ EL  L G+   AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153

Query: 223 ------------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
                                    I     +L  +    GS    Y +D+  +    + 
Sbjct: 154 SDEMLRTHVLPIVTQNPTAFCPKGAIPLNEEDLLALAHRIGSTIMSYAFDLDDDKEESDA 213

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD---R 315
            ++ +V  +  +  L          +VP+   L A +       A V+   +L V     
Sbjct: 214 EEEGWVEDRDGLTML---------GMVPMADVLNANAD----FNAHVNHGEKLEVTSLRS 260

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYD---------RLVVEAALNTEDPQYQ 365
             +AG  I+ + GP P+S+LL  YG+V  E + YD         R  + A L   D   Q
Sbjct: 261 DIRAGTEILNYYGPLPSSELLRRYGYVTPEHHRYDVAEVSWELVRSTLVAHLELSDGILQ 320

Query: 366 DKRMVAQRNGKLSVQVFHV---HAGREKEAISDMLPYLRLGYVSDTSE-MQSVISSLGPI 421
               V  + G+  ++ + V    +G   +    + P       ++ SE +++++ +L   
Sbjct: 321 ---AVETQLGEDELEDYFVLERDSGEPSDEGRLVQPPQTCEVPTELSEQLKTILKALKKQ 377

Query: 422 CP--VSPCMERAVLDQ--LADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRME 477
            P  +   + R  +    + +    +L+ Y  ++ EDE +L + +L  + R+A ++   E
Sbjct: 378 QPELIGSAVRRDEILHAVIGEALNRKLSEYATSVEEDEELLENSSLTKRHRLAIEVRLGE 437

Query: 478 KKMLNACL 485
           K++L+  L
Sbjct: 438 KRLLHELL 445


>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
 gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
          Length = 490

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 103/243 (42%), Gaps = 22/243 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + ASE +  G+ A  +P SL+++ E +  +E    L   N ++    L L+ M E+    
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
             F  PY   L      G       L +    LA L G+    EI++  + ++++Y+EL 
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPP 290
            +      L   +P     +  T++ F  A       S +V L    L+    LVP+   
Sbjct: 294 PL------LCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLST--CLVPVAGL 345

Query: 291 LLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DN 346
           L    S        VD+A   ++  + RP  AGE   +  G  P S L+  YGF+   DN
Sbjct: 346 LNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDN 405

Query: 347 PYD 349
           PYD
Sbjct: 406 PYD 408


>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 227

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 81/190 (42%), Gaps = 25/190 (13%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
           W   NG     + L+  P         +  AA +D+Q G     VP  +++T    +G  
Sbjct: 11  WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61

Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
              +  LL  +   K  E   L+++L+ E   G  SFW PYI  L R       +  + L
Sbjct: 62  KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114

Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
            +S  EL  LTGS    E L+    I R+Y     + F    L +  PY    + FT+++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDL 170

Query: 259 FKQAFVAVQS 268
           ++ A  AV +
Sbjct: 171 YRWAVSAVMT 180


>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
 gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
          Length = 447

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 72/294 (24%), Positives = 119/294 (40%), Gaps = 61/294 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  DL  G+   ++P +  +TL      + IA       L     L + +MYE+ +GK
Sbjct: 10  VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
            S W  Y++ L  Q          P LWSE E+   L G+     + E    +K ++ E 
Sbjct: 66  GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 117

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
                    L ++ P + P + FTFE    +++A +S V     ++     + +VPL   
Sbjct: 118 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 167

Query: 291 LLAYSSKCKA-----MLAA-------------VDDA---------------VQLVVDRPY 317
              ++ K  A     ML A             +DD                +++V+ +  
Sbjct: 168 --LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKDV 225

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTEDPQYQDKRM 369
            AG  I    G   N+ LL  YGF + +NP+D   L ++  L     ++Q KR+
Sbjct: 226 AAGSEIFNTYGQLGNAALLHRYGFTEPNNPHDIVNLDMDCLLEVLLSRFQKKRV 279


>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 81/408 (19%), Positives = 159/408 (38%), Gaps = 41/408 (10%)

Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELA 158
            +E HR +    A++ ++ G+    +P +  ++LE V    L N  + ++    K + + 
Sbjct: 63  QSEGHRTLR---ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQI--NYKPNNIQ 117

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
              +  + ++ + K SFW PYI  L +           P  +   + A L GSPT   ++
Sbjct: 118 TYFVNHLLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVI 171

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
            + +  K EY  L      A   FQ+Y Y       T++ F +  +   S    +Q    
Sbjct: 172 NQRKVFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEK 220

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLL 336
            ++  LVPL    + + +          DA    +   R  + GE +    G   N    
Sbjct: 221 EQQQLLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFF 279

Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN---GKLSVQVFHVHAGREKEAI 393
           +NYGF    NP ++  ++  LN  D  +  K  + + N   G   V        R+  A 
Sbjct: 280 MNYGFASLTNPMNQFDLDICLNKNDRLFNLKISLTKGNMCWGNRLVNETDHDTFRQSLA- 338

Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICP------VSPCMERAVLDQLADYFKARLAGY 447
              + + ++  + D  +++  + +     P       +  +E+A    L       L  +
Sbjct: 339 --TVRFTQISKLDDFLQLEEDVQNFKQFWPGWHTTIKTIELEKATFKALKGILVTELGNF 396

Query: 448 PATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
            +T+ +DE  L D      ++    L   EK+++   + +   M+ ++
Sbjct: 397 ASTIEDDERRLNDPQTPEFRKHIIMLTLREKQIIKKNIDICDLMLQVI 444


>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
 gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
          Length = 558

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 106/244 (43%), Gaps = 28/244 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+EDL+ GD A  +P S++++ E V  ++    L   + +S    L L+ M E+      
Sbjct: 190 ATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERHNCNSK 249

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
             + Y   L ++   G       ++ S+  L +        EI++  E ++ +Y+EL   
Sbjct: 250 SKI-YFDTLPKEFNTGLSFGVDAIMASDGTLLF-------DEIMQAKEHLRVQYDEL--- 298

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
                +L   YP   P E +T+E F  A     S  + ++ +    R  L+P        
Sbjct: 299 ---VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKFLDGKLRTCLIPIAGFLNHS 355

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
           L P ++ Y       + ++ + ++  + RP + GE   +  G    + L+  YGF+ + D
Sbjct: 356 LHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSYGNFSGAHLITFYGFLPQGD 410

Query: 346 NPYD 349
           N YD
Sbjct: 411 NRYD 414


>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
 gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
          Length = 529

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 90/386 (23%), Positives = 155/386 (40%), Gaps = 55/386 (14%)

Query: 48  RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
           RR+ R+    +S            +    D   L  W+  NG     V   +  + +E  
Sbjct: 24  RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83

Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
                    A+  L  G  A  VP SL +T E  + ++ + + L        L+    LA
Sbjct: 84  GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           L L+ E+++G++S +  Y++ L          + SPL W+  EL+ + G+    ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192

Query: 222 EG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            G    ++  Y  L T       +F ++   +D+   AF+ + F+ AF  ++S       
Sbjct: 193 AGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMA 242

Query: 276 VSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPY 317
                  ALVP G  L+ +SS  +A       +  AV         DD V   V  DR  
Sbjct: 243 PCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRAL 301

Query: 318 KAGESIVVWCGPQ-PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
              E + V   P+  ++   +++GFVD   P     +  ++  +DP   DK  V    G 
Sbjct: 302 NVNEPLYVNYNPEGTDTSFALDFGFVDTITPSPGYALSLSVPEDDPNVFDKLDVLDVCGL 361

Query: 377 LSVQVFHVHAGREKEAISDMLPYLRL 402
                F + A  + +   D+  +LRL
Sbjct: 362 GETPTFTLRAYSDPD--PDLRTFLRL 385


>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
          Length = 420

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 143/339 (42%), Gaps = 55/339 (16%)

Query: 61  DTLVAGSREVVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASED 118
           D +V+ + +VV+         K+W+ +NG    P      E+  ++        V AS+D
Sbjct: 8   DGIVSANGDVVA--------FKNWLAENGAEFHPHAAFRTERSGYS--------VIASQD 51

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSE--LAC--LALYLMYEKKQ 170
           L++     S P SL +T E  +    +  LL    T    SE  L C  + ++ + +   
Sbjct: 52  LRSDTTVVSCPFSLAITPE--VSKNALTTLLGPTFTGQSWSERQLICSYICMHWILDPSA 109

Query: 171 GKKSFWLPYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
             +    PYIR L   D+ R        +PL +S+TEL  L GS      L+R    + E
Sbjct: 110 SSELAHWPYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSE 161

Query: 228 YNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
           + +    + TV    G  F    Y   +   +   F    ++    +V     S    + 
Sbjct: 162 WEQCQKTIATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYP 216

Query: 284 LVPLGPPLLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
           ++  G   L +S     S   ++  + D + + LV+ +   AG  ++   GP+PN++L++
Sbjct: 217 VLLPGIDSLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELIL 276

Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
            YGF   +NP D +V++   N+     Q K  V  RN +
Sbjct: 277 GYGFSLPENPDDTIVLKIGGNSASGLQQQKWEVG-RNAQ 314


>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
          Length = 476

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 103/429 (24%), Positives = 166/429 (38%), Gaps = 72/429 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +D+   +  FS+P S ++ +E  + +  I    T   L     L L ++YE   G 
Sbjct: 40  VVAKQDIAEHELLFSIPRSSILGVENSILSTEIPPA-TFAHLGPWLSLILIMLYEYHNGS 98

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
            S W PY   L           ++ + W+E ELA L  S    +I               
Sbjct: 99  ASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANEVFIEQLLP 151

Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
                        ERA+   +E    + +  M   GSL   Y +D+   A + +   +  
Sbjct: 152 VIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210

Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
            A +     L K        +VPL   L A   +C A L    D +++   +P +AG+ I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEI 263

Query: 324 VVWCGPQPNSKLLINYGFV-DEDNPYD-----------RLVVEAALNTEDPQYQDKRMVA 371
               GP P S LL  YG++ D    YD            L  +   + E  +Y D++ + 
Sbjct: 264 FNDYGPLPRSDLLRRYGYITDNYAQYDVVEIPVDLVSQTLAHDGLWHEERIEYLDEQEIV 323

Query: 372 QRNGKLSVQV-FHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
                ++  + F +     +E++S  L  L    +    E +  + S G + P +  M  
Sbjct: 324 DTGYDIAASIPFSL-----EESLSPELVILVETMLLPREEFER-LQSKGRL-PKAEKMTG 376

Query: 431 AVLDQLADYFKARLAGYPATLSED-----EAMLTDYNLHPKKRVA-TQLVRM-EKKMLNA 483
                L    +AR+A YP TL +D     E          ++RVA  + VR+ EKK+L  
Sbjct: 377 KAAKFLYKIVQARIAQYPTTLEQDLQISSETQPVQTMSRKERRVAMARAVRIGEKKLLVQ 436

Query: 484 CLQVTADMI 492
             +  AD I
Sbjct: 437 TEERLADKI 445


>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
          Length = 510

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 21/159 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL+       +P S ++T        +I+  L   K+ +    ++ L+YE   G 
Sbjct: 59  VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  YI+ L        L+V+ P+LW++ +L  L G+  +  + E  E +   YN+  
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK-- 165

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
              ++   L   +P       F+ + FK+A     SC+V
Sbjct: 166 ---YIKSKLIANHPDVFNEHVFSLDNFKRA-----SCLV 196


>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 487

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 117/283 (41%), Gaps = 57/283 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  ++  G+  FSVP ++V+T++    N  +  LL  N   ++     L L ++YE  
Sbjct: 50  VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA---EGIKR 226
           QG+KS W PY R L  +        ++ + WS  EL  L  S T  E + R+   E I+ 
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157

Query: 227 ----------------------EYNELDT----VWFMAGSLFQQYPYDI---PTEAFTFE 257
                                 E +  D     V  + GSL   Y +DI     +    E
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
              ++++        L K        +VPL   L A + +  A L   + A+ +   +P 
Sbjct: 218 ANDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEAALNT 359
           + GE I    G  P + LL  YG+V ++   YD  V+E +L T
Sbjct: 271 QQGEEIFNDYGEIPRADLLRRYGYVTDNYAVYD--VLELSLET 311


>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
 gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
          Length = 496

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 69/270 (25%), Positives = 107/270 (39%), Gaps = 55/270 (20%)

Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
           + A  D+ A    F++P +S++ T    L NE                +  E  T++   
Sbjct: 49  IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
               L L L+YE  QG  S W PY+  L         A ++P+ WS TELA L  S    
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161

Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
           ++  E A+ + R          D V+F  G   Q+   D       FE+  +   A+ + 
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFELAHRMGSAIMAY 214

Query: 270 VVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
              L+K   A                     +VP+   +L   ++  A +    D++   
Sbjct: 215 AFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAYINHGADSLTAT 273

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
             R  KAGE I+ + GP PN +LL  YG+V
Sbjct: 274 ALRTIKAGEEILNYYGPLPNGELLRRYGYV 303


>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 486

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 92/392 (23%), Positives = 155/392 (39%), Gaps = 62/392 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+   +  F++P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILER 220
            G +S W  Y + L R+        ++ + W+ +EL  L GS              ILE 
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADESILEM 157

Query: 221 AEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAFTFEIF 259
              I R +  L                  +  +A   GSL   Y +DI  P +       
Sbjct: 158 IAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEDEEGDG 217

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           +  ++  +      ++  L++   +VPL   L A + +  A L   ++A+ +   +P   
Sbjct: 218 EGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAIKPIAK 269

Query: 320 GESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQDKRMVA 371
           GE I    G  P + LL  YG+V D   PYD + V        A L+  DP+ Q      
Sbjct: 270 GEEIFNDYGEIPRADLLRRYGYVTDNYAPYDVVEVSLDVICKAAGLSDSDPEKQPPLEFL 329

Query: 372 QRNGKLSVQVFHVHAGREKEAISDMLP-YLRLGYVSDTSEMQSVISSLGPICPVSPCMER 430
                L          +E + ++D+LP  L +   + T   + +        P  P    
Sbjct: 330 DELELLDDGYVIPRPSQEDDQLTDILPDELIILLRTLTLSPEQLAQQRSKNKPPKPAFAE 389

Query: 431 AVLDQLADYFKARLAGYPATLSEDEAMLTDYN 462
           A    LA   + + A Y  T+++D+ +L+  N
Sbjct: 390 AEATILAKAIQLKQAQYATTIAQDQEILSQLN 421


>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
 gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
          Length = 692

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 91/200 (45%), Gaps = 23/200 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++++E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVS 277
               +K EY+       +A S+ +   Y +  E++T++IF  A+  +A ++    +    
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIASRAFTSRVLDAY 214

Query: 278 LARRFAL-----VPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
           L+   +L       +  PL+ +S+     K    A    ++L V  P   GE +    GP
Sbjct: 215 LSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEVHNNYGP 274

Query: 330 QPNSKLLINYGFVDEDNPYD 349
             N +L+  YGF   DNP D
Sbjct: 275 LNNQQLMTTYGFCIVDNPCD 294


>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 487

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 94/423 (22%), Positives = 166/423 (39%), Gaps = 68/423 (16%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+  F++P  ++ T+E    +  +   L + +  LS    LA+Y+++ +        
Sbjct: 52  FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  LA       S + +++ EL    GS   A        I+ +Y  L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
                    L  Q+    P + FT E +K A   V S  +       + + L   FA   
Sbjct: 161 VV------RLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA--- 211

Query: 287 LGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES-----IVVWCGPQPNSKLLINY 339
               +L +SS+ +   A   +   + ++  + Y+AG+      + ++ G  PN++LL  Y
Sbjct: 212 ---DMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLY 268

Query: 340 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
           GFV   NP D   +    +   P ++ KR +    G  S     +        ++D LP 
Sbjct: 269 GFVMPGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISI-------TLTDPLPK 321

Query: 400 LRLGYV----SDTSEMQSVISSLGPICP----VSPCMERAVLDQLADYFKARLAGYPATL 451
             LGY+    SD S++ S+      I P    +S   E  VL  L + F   L  +   L
Sbjct: 322 NVLGYLRIQRSDESDLASIARQ--RIDPKYEKISDSNEVEVLQSLIESFCGLLDSFGTQL 379

Query: 452 SEDEAMLTDYNLHPKKR---VATQLVRMEKKMLNACLQVTADMIMLLPDVTVS-----PC 503
              E  L +  ++P +     A  +   E+++L    +   DM+  +   + +     P 
Sbjct: 380 ESLEKQLAE-GVYPSRGNAWAAAHVSLGEQQVLRLARKRAEDMLAAVESGSGNEKGSLPA 438

Query: 504 PAP 506
           PAP
Sbjct: 439 PAP 441


>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
          Length = 543

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +    P E +T+E +  A     S  + ++      +  L+P+   L   
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344

Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
                     VD    +++  V RP   GE   +  G   +S LL  YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403


>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
 gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
 gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 543

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +    P E +T+E +  A     S  + ++      +  L+P+   L   
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344

Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
                     VD    +++  V RP   GE   +  G   +S LL  YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403


>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
 gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
          Length = 496

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 83/203 (40%), Gaps = 32/203 (15%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L L L+YE  QG+ S W PY+  L           ++P+ WS TEL+ L  S   A++  
Sbjct: 109 LILVLIYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-G 160

Query: 220 RAEG----------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIF 259
           RAE           + R + E   V+F  G           L  +    I   AF  E  
Sbjct: 161 RAEADRMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLEND 217

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
                        ++         +VP+   +L   ++  A +   DDA+     RP +A
Sbjct: 218 DSDNDEADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRA 276

Query: 320 GESIVVWCGPQPNSKLLINYGFV 342
           G+ I+ + GP PN +LL  YG+V
Sbjct: 277 GDEILNYYGPLPNGELLRRYGYV 299


>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
          Length = 915

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 88/198 (44%), Gaps = 22/198 (11%)

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
           + L + +M+E  +G +S W PY+  L         + E+P+ WS  EL  L  S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595

Query: 218 LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 264
             +A+  +  + ++  V      +F   Q Y  D   +          ++ F+   +   
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654

Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             +     ++         +VP+   +L   ++  A +   DDA+ +   R  KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713

Query: 325 VWCGPQPNSKLLINYGFV 342
            + GP PNS+LL  YG+V
Sbjct: 714 NYYGPHPNSELLRRYGYV 731


>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
           206040]
          Length = 481

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 14/193 (7%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           L L+ E  +G++SFW PYI+ L +       A+  P  W E E   L G+  +  + +  
Sbjct: 90  LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147

Query: 222 EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
           + +KRE+ E   +   +          L  +  Y+     F+   F+ + V  ++    L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207

Query: 274 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
            + VS+     L+PL   G   +A   + +   A    A QL V R ++ G+ I     P
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSP 267

Query: 330 QPNSKLLINYGFV 342
           + N++LL+ YGF+
Sbjct: 268 KTNAELLLGYGFM 280


>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
          Length = 457

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 93/392 (23%), Positives = 147/392 (37%), Gaps = 67/392 (17%)

Query: 81  LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
           LKS+M      HK      K  LKE+   +   RP  +      +  ++ LQAGD   S+
Sbjct: 17  LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76

Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
           P   ++T    L   LG               +   L    L  L C    L   + EK 
Sbjct: 77  PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G+KS W PY+  L +       A   P    E ++  L   P + +  E+   I+  + 
Sbjct: 137 AGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQELFQ 188

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--- 286
                +     LF +   +I    F F   + A+  V +  +++ K      F+L P   
Sbjct: 189 SSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEPDVY 243

Query: 287 -LGP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
            L P   LL +S   + KA       + ++  D   K  + +++  GP  N +LL+ YGF
Sbjct: 244 ALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICYGPHDNQRLLLEYGF 303

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLR 401
           V  DNP+  + V A    +     DK    QR  K+S+   H           D L  L 
Sbjct: 304 VATDNPHSSVYVSADTLLKYFSSLDK----QREAKVSILKDH-----------DFLENLT 348

Query: 402 LGYVSDTSEMQSVISSLGPICPVSPCMERAVL 433
            G+   +  + + +  L        C  R +L
Sbjct: 349 FGWEGPSWRLLTALKVLSLAADEFACWRRILL 380


>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 61/283 (21%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A  D+  G+  FS+P  LV++ +    N  + +LL+ + L EL     L L +MYE  
Sbjct: 50  VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G++S W PY + L +       + ++ + WS +EL  L GS   ++I +  EG +    
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKIGK--EGAE---- 151

Query: 230 ELDTVWFMAGSLFQQYPYDIPT----EAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
             D++  M   + +  P   P+     ++  E    A + +   +  L    +A  F   
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGLAHIMGSL---IMAYAFDIE 206

Query: 283 ---------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
                                       +VPL   L A + +  A L   +D++ +   +
Sbjct: 207 KAEDEDDEDNDEEEGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEEDSLVMKAIK 266

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
           P  AGE I    G  P + LL  YG+V  DN     VVE +L+
Sbjct: 267 PIHAGEEIFNDYGELPRADLLRRYGYV-TDNYAHYDVVELSLD 308


>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
          Length = 255

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 67/255 (26%), Positives = 110/255 (43%), Gaps = 34/255 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
           V     L  GD   ++P SL++T   VL +     I + L   +LS    L ++L+ E+ 
Sbjct: 8   VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           +G  SFW PY+  L         +    L W+  E+  L    TK    +     +  +N
Sbjct: 67  KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA----RRF 282
            L   +     L +Q P      AFT+++FK A+ +V +  V++   Q   L+     + 
Sbjct: 119 RLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKS 174

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQPNSKLL 336
           AL P    LL ++   +   A  DD+        L   +PY   + + +  GP  N KLL
Sbjct: 175 ALAPFL-DLLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHSNEKLL 229

Query: 337 INYGFVDEDNPYDRL 351
           + YGF    NP++ +
Sbjct: 230 LEYGFTLPCNPHNNI 244


>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 536

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                 L   +    P E +T+E +  A     S  + ++      +  L+P+   L   
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344

Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
                     VD    +++  V RP   GE   +  G   +S LL  YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403


>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
          Length = 441

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 69/273 (25%), Positives = 118/273 (43%), Gaps = 29/273 (10%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           +Q G    S+P S ++T   VL +  +   L + K   S L  L ++L+ E+ +G+ S W
Sbjct: 68  IQPGGMLVSLPESCLLTTSTVL-HSYLGPFLKSWKPRPSSLVALCVFLVCERHRGEASDW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDTVW 235
            PYI  L         +   P  +++T +A L +G   +AE  E+ EG++  Y  +   +
Sbjct: 127 FPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQHLY-AVHQDF 176

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLARRFALVPLGPP 290
           FM+      +P   P E  T+E  + A+ ++ +  V + +     +S    +AL P    
Sbjct: 177 FMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPDNYALAPF-LD 232

Query: 291 LLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
           LL +    + KA         ++      +      +  G   N +LL+ YGFV   NP+
Sbjct: 233 LLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINYGSHDNQRLLLEYGFVSSCNPH 292

Query: 349 DRLVVEAALNTE----DPQYQDKRMVAQRNGKL 377
             + VE  L  E    D    +K    + NG L
Sbjct: 293 SVIYVEEDLLCEVLRGDESLDEKMKFLRENGFL 325


>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 739

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 29/230 (12%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE--TELAYLTGSPTKAEIL 218
             +LM +  + ++ FW PYI+ L      G   + +PLL+ E   +LA+L  +   A   
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 268
            R +  K  Y +       A S+ Q         Y +D+   A T  I  +AF A  + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244

Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
            +  LQ      R ++  L P + A + K   K    A  D++ LVV    +AG+ +   
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNN 302

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
            GP+ N +L++NYGF   DNP +  VV      + P  Q K    Q   K
Sbjct: 303 YGPRNNEQLMMNYGFCIPDNPCEYRVVSLRAPPDSPLAQIKAQYEQHCSK 352


>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 677

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 92/203 (45%), Gaps = 27/203 (13%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  QG+  FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 97  FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151

Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 270
           E +K  Y   + V  +  + FQ   +Y +D        I + AF+ ++  +AF  +    
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
           +    VS+     L+P    L+ +    K    A    V  +V     AG+ I    GP+
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPR 260

Query: 331 PNSKLLINYGFVDEDNPYDRLVV 353
            N +L++NYGF   DNP D  +V
Sbjct: 261 NNEQLMMNYGFCLPDNPCDYRIV 283


>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
          Length = 271

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 65/133 (48%), Gaps = 7/133 (5%)

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 314
           E FK +F  + S +V L   S+  + ALVP    +L +S   +  L        +    D
Sbjct: 2   ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRLVVEAALNTEDPQYQDKRMVAQ 372
           RPY+ GE + +  G + N +LL++YGFV  +  NP D + +  +L   D  Y++K  + +
Sbjct: 59  RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLK 118

Query: 373 RNGKLSVQVFHVH 385
           + G    Q F + 
Sbjct: 119 KYGLSGSQCFPIR 131


>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
 gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
          Length = 525

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 94/477 (19%), Positives = 179/477 (37%), Gaps = 132/477 (27%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + +++DL+  +    +P  +++++       +I+ +LT   +      A+ L+YE   G+
Sbjct: 82  IISNKDLKVNNIVAKIPKDIILSIHT----SSISNILTKYTMERNIATAIALIYEASIGE 137

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           KS W  YI  L        L V+ P+LW +     L G+  +  I +    I   Y ++ 
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADI- 189

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--ALVPLG-- 288
               +   L + +P     E F+ EIF      + + +V  +   +      +LVPL   
Sbjct: 190 ----VESLLIKNHP-----EYFSKEIFSFENFKIANSIVSSRAFCIDSYHGDSLVPLADI 240

Query: 289 ---------------------------------PPLLAYSSK------------------ 297
                                             PL+  S+K                  
Sbjct: 241 FNHKTGRENVHIESNGDVCNKCGSIKTCKHRKVTPLITKSAKSYKKLTNKKKMELIEKQK 300

Query: 298 ---------CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
                    C  +    D+ + + V +  KA + +    G   N+ LL  YGF++ DNP 
Sbjct: 301 QQQINDEENCGDIAEEDDEHLYIKVVKAVKANQEVYNTYGDHSNATLLSKYGFIEMDNPC 360

Query: 349 DRLVVEAAL-----------NTEDPQYQDKRM--------VAQRNGKLSVQVFHVHAGRE 389
           D L VE +L           N  D     KR+        +  RN   S+++    +GR 
Sbjct: 361 DNLPVEKSLVDTNLISLCKENGFDSNELSKRISFYASLFDIDSRNTH-SIEI----SGRL 415

Query: 390 KEA-----------ISDMLPYLRLGYVSDTSEMQSVISSLGP--ICPVSPCMERAVLDQL 436
            +A           +S+   +L++       +++     L    I   +  +++A++  L
Sbjct: 416 DDALVCSVGIALAPLSEFEGWLKMS----EHKLEKYFEKLEAEDIVKQNAQVKKAIVQIL 471

Query: 437 ADYFKARLAGYPATLSEDEAMLTDY--NLHPKKRVATQLVRMEKKMLNACLQVTADM 491
            +    +L+ YP TL +D+  L +   N   +K ++T L   EKK++   ++   D+
Sbjct: 472 NN----KLSNYPTTLEQDQNKLKELKENEENRKIISTSLNICEKKLIYKSIKYYEDL 524


>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
          Length = 903

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 21/200 (10%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T V G+R    +  +   +L  W+H  G+     I +   S  +       V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427

Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
           G     +P  L++ +   L  +  I  +L+   ++  +   L LY+++EK   G  S W 
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           P+   L            SPLL+  TEL  L G+    E  E  +G++  +  L      
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHESL------ 534

Query: 238 AGSLFQQYPYDIPTEAFTFE 257
            G L + YP   PT+AFT+E
Sbjct: 535 -GPLAEAYPALFPTDAFTYE 553


>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
          Length = 504

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 20/176 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +DL+  +    +P   V++ +      +IA +L   +L E    ++ LMYE  +G 
Sbjct: 44  IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W  YI+ +          ++ P+LW +  + YL G+  +  ++E  E ++ +Y E  
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE-- 150

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
                   + + +P       FT E FK A   V S   ++ +       +LVPL 
Sbjct: 151 ----DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SLVPLA 199


>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 437

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 117/271 (43%), Gaps = 37/271 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+ ++ T+E    +  +  +L  T+  LS    LA+Y+++ + + K  + 
Sbjct: 34  FKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFVRSR-KSGYD 92

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
            P          R  +A       S + + E +L    G+       +  + I+ +Y  L
Sbjct: 93  GP----------RNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRIEDDYRGL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
             V  M G    QYP   P + FT E +K A   V S  +       + + L   FA   
Sbjct: 143 --VVRMLG----QYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193

Query: 287 LGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
               +L +SS+ K   +  A    + ++  + Y+AG+ + +  GP PN++LL  YGFV  
Sbjct: 194 ---DMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVP 250

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
            NP D   +  A +   P ++ K+ +    G
Sbjct: 251 GNPNDSYDLVLATHPMAPFFKQKQKLWASAG 281


>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 488

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 92/428 (21%), Positives = 165/428 (38%), Gaps = 67/428 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           + A +D+   +  F++P  LV++ +    N  + +L+  N+  L +  CL L ++YE  Q
Sbjct: 50  IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--------TKAEILERA- 221
           G  S W PY + L           ++ + W++ EL  L GS         T  E+  R  
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSTAEEVFLRDL 158

Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                         G    YN  D      ++    GSL   Y +D+  +        + 
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
                  V   ++  L +   ++PL   L A + +  A L   D  + +   +  + GE 
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLSMKSIKSIRKGEE 270

Query: 323 IVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQDKRMVAQRN 374
           I    G  P ++LL  YG+V D    YD   V        A L +  P   + R+    +
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPGPDEPRLEFLDD 330

Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV-SPCMERAVL 433
            ++    + +        +++ LP   L  ++         + L     V  P +  A  
Sbjct: 331 LEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSKVPKPALGIAEA 390

Query: 434 DQLADYFKARLAGYPATLSEDEAML---TDYNLHP------KKRVATQLVRMEKKMLNAC 484
             L +  +  L  YP T+++D+ +L    +Y          + ++A Q+ + EK++LNA 
Sbjct: 391 TLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQVRKGEKEILNAV 450

Query: 485 LQVTADMI 492
           L    D I
Sbjct: 451 LSELEDFI 458


>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
          Length = 390

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/311 (20%), Positives = 124/311 (39%), Gaps = 35/311 (11%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           EE+  +L  W+ + G    KV +       +       + A+ +   G+    +P + ++
Sbjct: 24  EEEYDELVDWLKQCGATVDKVAVDHFNGMGQG------LKATAEAAPGETLLRIPEACML 77

Query: 135 --------TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
                   TL   + ++T+ +L+    L+       + +  +     SFW PYI  L   
Sbjct: 78  SEESARRSTLGAYMDSDTMLKLMPNVTLA-------FHLLLELHDLDSFWRPYIACL--- 127

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQY 245
                ++   PL W   +L  L GS    E +   + + R+Y  L + +   A      +
Sbjct: 128 ----PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCF 183

Query: 246 PY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVPLGPPLLAYSSKCK 299
           P    +  EAFTFE ++ A   V +    + +      +    AL+PL   +   +    
Sbjct: 184 PLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS 243

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
               +  + ++ V   P K G  I +W G + N + L++ GF    +  D + V  +L+ 
Sbjct: 244 TQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGFFFAGHANDYVNVPFSLDE 303

Query: 360 EDPQYQDKRMV 370
            D  Y+ K ++
Sbjct: 304 TDSLYKIKALL 314


>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
 gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
          Length = 493

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 91/399 (22%), Positives = 160/399 (40%), Gaps = 58/399 (14%)

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
           AG+   +VP S+   +        + EL+    +SE     LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168

Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           Y+  L DR          +PL ++  ++  L  +      L+  + I R+Y  +      
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVPL----- 287
               F Q   D   + FT+++F+ A   V +      V+L +   +    AL+PL     
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMAN 272

Query: 288 -GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQPNSKLLINYGFV 342
              P  A  ++C A     A ++ ++  + R     A   I +  G + +++ L++ GFV
Sbjct: 273 HAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRTDAEFLVHNGFV 332

Query: 343 DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP---- 398
              NP+  +     L    P Y+++  + +  G  +   F     RE  A +   P    
Sbjct: 333 CPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPAAATTTTPISQE 392

Query: 399 YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVLDQLADYFKARL 444
            + L  VS  +              +  + +   + P   C   ER     LA   K  L
Sbjct: 393 LISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER----WLATVMKIML 448

Query: 445 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
             YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 449 LRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 486


>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
           1015]
          Length = 668

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 77  DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I    GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 250

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
            N +L++NYGF    NP D  +V        P Y  K
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 287


>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
           [Ostreococcus tauri]
          Length = 446

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 99/406 (24%), Positives = 162/406 (39%), Gaps = 53/406 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL-ACLALYLMYEK-KQ 170
           VA + D+  G+   +VP    V+      + T+   L+    + L   LA +++ E    
Sbjct: 25  VATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLDGILAAHVLREAFGL 84

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           G++S + P++R L  +        ++ + W E EL  L GS   A      +  + EY+ 
Sbjct: 85  GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 282
           LD         F     D P EAF        TFE F  A   V S  + L+  S +   
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185

Query: 283 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
            ++ +  P+L     A S K      A  +AV++     +K    +      +P+   L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245

Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
            YGF+ E NP + + V   L+  D   + K  + +R+G    +       R  +   D+L
Sbjct: 246 QYGFIPEANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKVRGLD--YDLL 303

Query: 398 PYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
              R+    D SE+    S     S   +   +    +AVL +        L GY  TL 
Sbjct: 304 AAARI-IAMDESELDDDTSVALSVSGASVSAKNDARTKAVLLK---SLITSLDGYGTTLG 359

Query: 453 EDEAMLTDYNLH----PKKRVA-TQLVRMEKKMLNACLQVTADMIM 493
           ED + +  +N      PKKR     L+RM +K     L  +AD + 
Sbjct: 360 EDNSYIARFNTSSDELPKKRKRFAVLLRMREK---GILLASADALF 402


>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 699

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I    GP+
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 276

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
            N +L++NYGF    NP D  +V        P Y  K
Sbjct: 277 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 313


>gi|451852693|gb|EMD65988.1| hypothetical protein COCSADRAFT_86793 [Cochliobolus sativus ND90Pr]
          Length = 478

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 146/370 (39%), Gaps = 53/370 (14%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           +  L     L L ++YE  QG+ S W  Y+  L +       A E+P+ W+  EL  L G
Sbjct: 100 SEALDSWGSLILVMLYEYLQGEASRWKTYLDILPQ-------AFETPIFWTPDELKELEG 152

Query: 211 SPTKAEILERAEGIK--RE--------------------YNELD--TVWFMAGSLFQQYP 246
           +    E + + E  +  RE                     NE D  ++    GS    Y 
Sbjct: 153 TSLTTEKIGKKESDRMLRERILPIVTSHPDVFSPPGAPRLNEDDLLSLAHRMGSTIMAYA 212

Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
           +D+  E    E  +  ++  +         SL     +VP+   +L  +++  A +   D
Sbjct: 213 FDLENEEEQSEDEEDGWIEDRDGK------SL---IGMVPMAD-MLNANAEFNAHVHHGD 262

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRL-----VVEAALNTE 360
                 +     AG  I+ + GP P+S+LL  YG+V  E + YD       +V  AL  E
Sbjct: 263 QLQVTSLRESIPAGSEILNYYGPLPSSELLRRYGYVTSEHHRYDVAEISWSLVRTALAEE 322

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHV---HAGREKEAISDML--PYLRLGYVSDTSEMQSVI 415
               +D     +R  +  ++ F V    AG E  +   +   P LR        + ++ +
Sbjct: 323 LKLSEDTIADIERKLESELEEFFVIERDAG-EPSSYGTLTQPPVLREISTELEEQTKAFL 381

Query: 416 SSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVR 475
            +L    P     E      L    + RL  YP +  +DE++L+   L  + R+A ++  
Sbjct: 382 KALKKRDPKRKRSETICNTVLEKALRTRLGQYPTSAKQDESLLSKEGLSKRHRMAVEVRL 441

Query: 476 MEKKMLNACL 485
            EK++L   L
Sbjct: 442 GEKRLLQEAL 451


>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
           Pb03]
          Length = 488

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 91/428 (21%), Positives = 164/428 (38%), Gaps = 67/428 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           + A +D+   +  F++P  LV++ +    N  + +L+  N+  L +  CL L ++YE  Q
Sbjct: 50  IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
           G  S W PY + L           ++ + W++ EL  L GS     I + A         
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158

Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                         G    YN  D      ++    GSL   Y +D+  +        + 
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
                  V   ++  L +   ++PL   L A + +  A L   D  + +   +  + GE 
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLAMKSIKSIRKGEE 270

Query: 323 IVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVE-------AALNTEDPQYQDKRMVAQRN 374
           I    G  P ++LL  YG+V D    YD   V        A L +  P   + R+    +
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDSYAQYDEAEVPIQTICRVAGLKSSTPGPDEPRLEFLDD 330

Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV-SPCMERAVL 433
            ++    + +        +++ LP   L  ++         + L     V  P +  A  
Sbjct: 331 LEVLDDGYGIPRHDRSTPLAETLPTELLVVLNILVMPLEQFNQLKQKSKVPKPALGIAEA 390

Query: 434 DQLADYFKARLAGYPATLSEDEAML---TDYNLHP------KKRVATQLVRMEKKMLNAC 484
             L +  +  L  YP T+++D+ +L    +Y          + ++A Q+ + EK++LNA 
Sbjct: 391 TLLDEVVRLILGEYPTTVAQDKELLASCANYQGSTSPISAGRLKMALQVRKGEKEILNAV 450

Query: 485 LQVTADMI 492
           L    D I
Sbjct: 451 LSELEDFI 458


>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 563

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 80/298 (26%), Positives = 124/298 (41%), Gaps = 30/298 (10%)

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP-TEAFT 255
           P+ W+E E+  L GS    ++ ER + I+ +Y          G +   YP   P  +  T
Sbjct: 212 PIFWTEEEMRLLQGSYLVTQVEERNQAIEGDY----------GVICDLYP---PFRDVAT 258

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVD 314
            E FK A + V S    L    L R  ALVP    L  Y  +  K           +   
Sbjct: 259 LEEFKWARMCVCSRNFGLDINGL-RTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTL 317

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY------DRLVVEAALNTEDPQYQDKR 368
                G  +    G + N + L+NYGF  E+N        + + +   L+  DP  Q K 
Sbjct: 318 HRILGGAQVYDSYGQKCNHRFLLNYGFAIENNQEANGFCPNEVPLLFRLDARDPLRQKKA 377

Query: 369 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQS--VISSLGPI-CPVS 425
              + +G    +V  +  G + +A+   L  LR+  V+D +EM +  +  ++  +  P+S
Sbjct: 378 RFWRMDGPEQRRV-RLCVG-DTDAVRGALSMLRV-IVADAAEMGARYMYRTVKDVRFPLS 434

Query: 426 PCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP--KKRVATQLVRMEKKML 481
              E A +++L       L  YP TL ED A L +  L P   +R A   V  EK +L
Sbjct: 435 VRNEVAAMERLLLLTTGALDAYPTTLEEDRAALKNGGLEPFSNRRHALIQVYGEKVVL 492


>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 340

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 97/212 (45%), Gaps = 28/212 (13%)

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
           LL +SS+ +++       +++V ++  + GE++V+  GP  N  LL++YGFV   NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177

Query: 351 --------------LVVEAALNT-EDPQYQDKRMVAQRN--GKLSVQVFHVHAGREKEAI 393
                         LV +  +++ +DP      ++ + N  G  S Q+  V  G  +   
Sbjct: 178 VELRYDDQLLHMACLVAKVNIDSFKDPTTSQLALLTRLNLHGPSSSQM--VTLGGTELVE 235

Query: 394 SDMLPYLRLGYVSDTSEMQSV----ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
             +L  +R+ +  D  E+  V    + +     P+    ER  +  L       LA +P 
Sbjct: 236 GRLLAAVRVMHAQDPMELLDVDLEALQTWNQSPPLGVLNERKTIRTLIGLGMLALASFPT 295

Query: 450 TLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
            + ED++ L   ++    R+A Q   ++K++L
Sbjct: 296 EIEEDQSELVKGDISENHRLAIQFRMLKKRLL 327


>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
          Length = 762

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 72/305 (23%), Positives = 117/305 (38%), Gaps = 60/305 (19%)

Query: 66  GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
           GS  VV+       +  +W+   G    K+ +K    H    R +   AA E    G+  
Sbjct: 11  GSSAVVTS------EFVAWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETL 57

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPY 179
            SVP +L++T+++      +A  L   +   +        LAL+L  ++ +     W PY
Sbjct: 58  LSVPEALLLTVDKASRRADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPY 113

Query: 180 IRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG 239
              + R           P  W   + A L GSP   +++ R + I+R+   L        
Sbjct: 114 RNVISRS------VSHLPCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDR 167

Query: 240 SLFQQYPYDIPTEAFTFEIFKQA--FVAVQSCVVHLQK-VSLA-RRFALVPLGPPLLAYS 295
             F      + + AF F   +    F  + +   H ++ V  A  R A V          
Sbjct: 168 QAFAFAEAQVLSRAFAFNGTRAMVPFADLMNTARHHERHVDFAFERGAFV---------- 217

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN--PYDRLVV 353
                 + AV         R   AGE +    GP+ N++ L+NYGF   DN     RL+ 
Sbjct: 218 ------MRAV---------RRGAAGEPVTDSYGPKSNARYLLNYGFAMADNRDEAGRLLD 262

Query: 354 EAALN 358
           +AAL+
Sbjct: 263 DAALD 267


>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
 gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
 gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
           nidulans FGSC A4]
          Length = 484

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/277 (23%), Positives = 108/277 (38%), Gaps = 49/277 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
           V A  D+   +  F++P  LV++      N  + +LL+ +  +L     L L +++E  Q
Sbjct: 50  VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
           G KS W PY + L +         ++ + WS  EL  L GS    +I ++          
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGSAVVEKIGKQGAEESILKLI 158

Query: 222 --------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQA 262
                          G+     ++     +      GSL   Y +DI T     E   + 
Sbjct: 159 IPVVRANPALFPPINGLASYDGDVGAQALLGLAHTMGSLIMAYAFDIETPENEDEREGED 218

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
                       K        +VPL   L A + +  A L   ++++ +   +P +AGE 
Sbjct: 219 GYLTDEEEEQSSK-------GMVPLADMLNADAYRNNARLFQEEESLVMKAIKPIRAGEE 271

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
           I    G  P S LL  YG+V  DN     V+E +L+T
Sbjct: 272 IFNDYGEIPRSDLLRRYGYV-TDNYASYDVIELSLDT 307


>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
          Length = 625

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 92/217 (42%), Gaps = 7/217 (3%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + + E      +L+ +  +G + FW PYIR L  Q G    ++ +P  +   +L +L G+
Sbjct: 29  DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
              A   +R E +K +Y +  T    AG      Y +D+   A +  I +     V S V
Sbjct: 84  SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
                +S  +   L+P+   +  +    K    A  D +  VV     AG+ I    GP+
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 202

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDK 367
            N +L++NYGF    NP D  +V        P Y  K
Sbjct: 203 NNEQLMMNYGFCIPGNPCDHRIVSLRAPPGSPLYMAK 239


>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
          Length = 486

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 77/159 (48%), Gaps = 18/159 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ + AGD    VP +  +T + +  +     +L +N++  +  LA  L+ EKK G+KS
Sbjct: 75  ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W+PYI  L +        + S + W E EL+ +  S    E +++   I+++++     
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS--CVV 271
            F+A +  Q  P  I TE    E F  A+   +   C+V
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALGEKVLCIV 217


>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
 gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/221 (26%), Positives = 98/221 (44%), Gaps = 35/221 (15%)

Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
            A +   L++ K+QG +S   P+I +L    G        PL WS+ +LA L      A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190

Query: 217 ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 258
           + E+    +RE+  L DT+                 W+  G + +   +  P    T   
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 316
             +    V + VV L + SL +++A+ PL   L  ++S  ++ ++     D+  +V  R 
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           +K GE + +  G Q N  L+  YGF + DNP D  V+   L
Sbjct: 303 FKKGEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVISDVL 343


>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
          Length = 350

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 16/154 (10%)

Query: 317 YKAGESIVVWCG-PQPNSKLLINYGFVDED----NPYDRLVVEAALNTEDPQYQDKRMVA 371
           ++ GE +++  G  + N +L ++YGFV+ +    +  D   +   ++  DP + DK  +A
Sbjct: 136 FRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFFADKLDIA 195

Query: 372 QRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEM-------QSVISSLGPICPV 424
           + NG  +   F +  G  +     ML +LRL  +  T           SV   L    PV
Sbjct: 196 ELNGMETTAYFDITQG--QGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS--LPV 251

Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
           S   E A+   + D  ++ L+GY  T+ EDEA+L
Sbjct: 252 SQENEAAICKVVLDGCQSTLSGYGTTIEEDEALL 285


>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 484

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 90/236 (38%), Gaps = 75/236 (31%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
           L+++ L     L L L+YE  +G  S W PY+  L           E+P+ W+  EL  L
Sbjct: 96  LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148

Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
               T  ++        RE  E DT   +   + + +P          ++F+ +      
Sbjct: 149 QAGATAGKV-------GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDE 190

Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYS---------------------SKCKAMLAAV-- 305
            +V     +LA R     +G  ++AY+                        KAM+  V  
Sbjct: 191 ALV-----ALAHR-----MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPM 240

Query: 306 -----------------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
                            D+ + +   RP KAGE I+ + GP PNS+LL  YG+V E
Sbjct: 241 ADILNADAEFNAHVNHGDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTE 296


>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 503

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 109/443 (24%), Positives = 189/443 (42%), Gaps = 56/443 (12%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
           E L  L +W+ + GLP  K+ ++ +    +       +  S+  + G    +VP+S  +T
Sbjct: 50  ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            ++V+ + +I  L+    L     +AL+L++E+ +   + W  ++  +         A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
            PL WSE EL+ L G+   + +    +  + +Y EL+        LF  +    P ++  
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQ 207

Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAV 309
            + F  A   V+S V            ALVPL   L+ +     ++ +  LA      A 
Sbjct: 208 LDDFLWAVATVRSRV---HSPLDGEDVALVPLAD-LVQHRKLQGARWQLQLAGGLFSKAQ 263

Query: 310 QLVVD--RPYKAGESIVVWCGP--------QPNSKLLINYGFVDEDNPY-DRLVVEA--- 355
            LVV+  R Y  GE + +  G         + +S++L++YG +D D P  D  VV+    
Sbjct: 264 ALVVEAQRDYAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRPQADPGVVQGGFI 323

Query: 356 ---ALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ 412
              AL  +D  Y DK  + + NG      F + A  E          L      D   ++
Sbjct: 324 LSLALPEDDKYYDDKADILELNGLSEAASFVLRANEEPSEQLLGFLRLLNLSGQDAFLLE 383

Query: 413 SVISSLG---PICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRV 469
            +  +      + PVS   ERAV + + +  +A L GY  ++ +D   L D    P  R+
Sbjct: 384 PLFRNEAWGHMLAPVSEANERAVYESMMEGCRAALQGYATSIDDDLRALRDT--QPGTRL 441

Query: 470 ATQ-LVRM-EKKMLNACLQVTAD 490
               LVR+ EK+ L+A L    D
Sbjct: 442 EKAILVRLGEKETLDATLAFFED 464


>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
          Length = 1753

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 86/394 (21%), Positives = 156/394 (39%), Gaps = 36/394 (9%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
            V A    + G+   ++P++ + T E    +  +  +L + +  LS    LA++L++ K  
Sbjct: 912  VKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFVKS- 970

Query: 171  GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                      R    +  R  +A       + + ++E EL    GS       +  + ++
Sbjct: 971  ----------RTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLHTLTTQLEQRVQ 1020

Query: 226  REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
             ++ +L         L  Q+    P + FT E +K A   + S  +            + 
Sbjct: 1021 DDFRQLLV------QLLSQHRDLFPLDQFTIEDYKWALCTIWSRAMDFAVSDTTSVRLVA 1074

Query: 286  PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
            PL   +L +S    +C A      D + ++  + Y+ G+ I ++ G  PN++LL  YGFV
Sbjct: 1075 PLAD-MLNHSLDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFV 1132

Query: 343  DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
              DNP D   +    +   P Y+ K  +    G  S     + A  +     ++L YLR 
Sbjct: 1133 LLDNPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRT 1190

Query: 403  GYVSDTSEMQSVISSL--GPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
              + D +++  +   L  G    V+   E  VL  L D   + L G+   L + EA L  
Sbjct: 1191 QRL-DAADVADMTLQLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAG 1249

Query: 461  --YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
              Y        A Q+   E+ +L    +   DM+
Sbjct: 1250 GFYPAGGNAWAAAQVSAGEQGILTRAKKTAEDML 1283


>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 345

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 18/197 (9%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           DD  + V  + + AGE I ++ G + N++ +I+ GF  ++N +DR+ ++  ++  D  Y 
Sbjct: 42  DDRCECVALQDFTAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKLGVSKSDRLYA 101

Query: 366 DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS--------S 417
            K  V  R G  +  VF +H   E    + +L +LR+  +++    + +I         +
Sbjct: 102 MKAEVLARAGIPTSSVFALHF-TEPPISAQLLAFLRVFCMTEEELKEHLIGDHAIDKIFT 160

Query: 418 LG-PICPVSPCMERAVLDQLADYFKAR----LAGYPATLSEDEAMLTDYNLHPKKRVATQ 472
           LG    PVS   E     +L  + +AR    L  Y  T+ ED++ L   +L     +A +
Sbjct: 161 LGNSEFPVSWDNEV----KLWTFLEARASLLLKTYKTTIEEDKSFLETPDLTFHATMAIK 216

Query: 473 LVRMEKKMLNACLQVTA 489
           L   EK++L   ++  A
Sbjct: 217 LRLGEKEILEKAVKSAA 233


>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 524

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 74/335 (22%), Positives = 131/335 (39%), Gaps = 71/335 (21%)

Query: 77  DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           +L  L SW  ++G    K+ L++             +   + L  G+   S+P SL +T+
Sbjct: 27  ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81

Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
           + VL                    AL+LM E+++G  SFW  Y+R L          V++
Sbjct: 82  DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114

Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIP 250
           PL W    +E E   L G  T   +L R     +++++ E          L + +P  + 
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG 166

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLG-------------PPLLA 293
               TFE +  A  ++ S     Q+     S   R A+VP+                ++ 
Sbjct: 167 --GVTFEDYLWAMSSIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIE 224

Query: 294 YSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
           + ++   +   + +       +++   R Y A E   +  G   N+KLL +YGFV   NP
Sbjct: 225 FQAQEGGLSMGIGEPGRARGTLRVSAGRDYAAREQFFILYGRYSNAKLLYSYGFVLASNP 284

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           Y  L     +   DP +  K+ +   +   + Q +
Sbjct: 285 YGGLDYWVRVPQTDPGFAWKQALLDEHPLTAAQAY 319


>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
          Length = 485

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 71/345 (20%), Positives = 123/345 (35%), Gaps = 72/345 (20%)

Query: 97  LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
           LK    H + H  IH            + AS D+   +  F +P+ L+++++       +
Sbjct: 24  LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83

Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
              L   +L     L + ++YE  QG++S W PY R L         + ++ + W++ +L
Sbjct: 84  G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134

Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
           + L GS    +I + A          DT+      L Q  P+  P       +       
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPHHFPPRPNMPPLNSPDSQN 186

Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
              C+ H              +G  ++AY+   +    A +D  +               
Sbjct: 187 ALLCLAHR-------------MGSIIMAYAFDIEKADEADEDTAE--------------- 218

Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 385
                        Y   DED P   +V  A +   D Q  + R+  +    +   + ++H
Sbjct: 219 -----------DGYMTDDEDEPAKGMVPLADIFNADAQRNNARLFQEEGSFVMKAIKNIH 267

Query: 386 AGREKEAISDMLPYL----RLGYVSDTSEMQSVIS-SLGPICPVS 425
           +G E       LP      R GYV+D      V+  SL  IC V+
Sbjct: 268 SGEEIFNDYGELPRADLLRRYGYVTDNYAQYDVVEFSLDGICKVA 312


>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
           distachyon]
          Length = 532

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/264 (23%), Positives = 114/264 (43%), Gaps = 30/264 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + ASE++  G  A  +P SL+++ E +  ++    L   N ++    L L+ M E+    
Sbjct: 178 MVASENIGVGHIALEIPESLIISEELLCQSDMFLALKDLNSITTETMLLLWSMRERHNPS 237

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            +F + +   L      G       L +    LA L G+    E+++  + + ++Y+EL 
Sbjct: 238 SNFKM-FFETLPSNFNTG-------LNFGIGALAALEGTLLFDELMQARQHLHQQYDELF 289

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
            +      L  ++P     + +T++ F  A     S  + +   S      L+P+     
Sbjct: 290 PM------LCTKFPEIFTQDIYTWDNFLWACELWYSNSMMVVLSSGKLTTCLIPVAGLLN 343

Query: 289 ----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-D 343
               P +L Y    +A       +++  + RP KAG+   +  G    S L+  YGF+  
Sbjct: 344 HSVYPHILNYGRVDQAT-----KSLKFPLSRPCKAGQQCFLSYGKHSGSHLITFYGFLPR 398

Query: 344 EDNPYDR--LVVEAALNTEDPQYQ 365
           EDNPYD   L ++ +++ ED   Q
Sbjct: 399 EDNPYDVVPLDLDMSVDEEDGTAQ 422


>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 640

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 118/296 (39%), Gaps = 39/296 (13%)

Query: 70  VVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
            VS     L  L +W+   GL   ++    +P  N+      Y+ AS  ++A     +VP
Sbjct: 162 AVSTPRGALARLTAWIDNAGL---EINSNARPGLNDVDE--LYLFASNPIEAATLVATVP 216

Query: 130 NSLVV--TLERVLGNETIAELLTTNKLSELA------CLALYLMYEKKQGKKSFWLPYIR 181
             LV+  T  R L N  I  L    +   ++       LA+ L+YE  +  KS W  +I 
Sbjct: 217 APLVMFETYLRTLENPMI--LAIDRRFKTMSVPDPSYALAMALLYESYE-PKSMWREWIS 273

Query: 182 ELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAG 239
            L +        ++S + WS  E   L   P   K +ILER   +++ YN          
Sbjct: 274 SLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA------TTP 318

Query: 240 SLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
            L   +P+      +++E+FK A++ V S  +          +  L PL   L     + 
Sbjct: 319 RLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLHHDPVQT 378

Query: 299 KAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
              L    + V      L   R  K GE +V   G  PN +LL+ +G     NPY+
Sbjct: 379 NIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGELPNHQLLLRFGLAMPRNPYE 434


>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
 gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
          Length = 518

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 21/151 (13%)

Query: 56  RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
           R+ +S T++  +     K+ ED G          W    G+  C   L   P        
Sbjct: 9   RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68

Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
            S    H P      +AA  DL+ G+    VP S+++T + +L +E +   +  N  S L
Sbjct: 69  HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128

Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDR 185
           +    LA+ L+YE  +GK S+W PY+  L R
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR 159


>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
          Length = 1114

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 145/373 (38%), Gaps = 54/373 (14%)

Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
           DA     + LV + ER   +      L      E   LAL L+YE+++G KS W P+I  
Sbjct: 64  DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHIAL 123

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTK--------------AEILERAEGIKREY 228
           L            + L WSE ELA L GS                 +EI++++     E 
Sbjct: 124 LP------ATPPHALLRWSEAELAELAGSDALELANRWRSQVSSDFSEIVDKSRAAVEES 177

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
           +    +   A     ++P+ +  E F++ +           ++  + VS++R+ A     
Sbjct: 178 DPGKQL-SAAVKASLRFPW-LDLEGFSWAV----------SMIWSRCVSVSRKGA----- 220

Query: 289 PPLLAY--------SSKCKAMLAAVDDAVQLVVDR---PYKAGESIVVWCGPQPNSKLLI 337
           PP+ A+                   DDA    V R     K G+ + +     PN+ LL+
Sbjct: 221 PPIKAFLPVVDMHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYDGLPNAWLLL 280

Query: 338 NYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDML 397
            YGF  +   +    + A L+ E P Y+ KR   +   KL +      A   + A  D L
Sbjct: 281 LYGFALDHAAHAGRDLYAPLSPEAPHYEAKRAALE---KLGLGATADGAAPFRLAADDAL 337

Query: 398 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
           P  RL  ++     ++ +  L  +   S    RA    L     A LA Y  +  ED A 
Sbjct: 338 PE-RL--LTALMAQRATLDELPGLPATSEATARAAAGDLVAACDALLAAYRGSEDEDAAA 394

Query: 458 LTDYNLHPKKRVA 470
           L D    P+ R+A
Sbjct: 395 LADPATPPRLRLA 407


>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
          Length = 437

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 100/404 (24%), Positives = 173/404 (42%), Gaps = 54/404 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
           V  +ED+      FS+P   V++++ +  N  +  +    +L+   E   LA+ L+YEK 
Sbjct: 47  VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +G KS W  +I  L R          + L +   EL  L GS       +  E +  +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159

Query: 229 NELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 282
             L     +   LF+  P     D+  E F+ E +K A   + S       V +A++   
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVAKQSFK 214

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINY 339
           A+VP+   +L +  + + M    D + Q   LV  + + AG  + +  GP  N KLL  Y
Sbjct: 215 AMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHKLLALY 272

Query: 340 GFVDEDNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEAISD- 395
           GFV   NP+D   VE  L  ++     +Q+K  +   NG         HA    E ++D 
Sbjct: 273 GFVIIGNPFD--AVEMWLPMDEASTKFFQEKEQLLLTNGL-------DHATNPFELVADE 323

Query: 396 ----MLPYLRLGYVS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPA 449
               +L   R+  +      E + + +       +S   E+  L +L    +  L  +P 
Sbjct: 324 SNDLLLMAARIQEIDCETVEEFEELANKALEGEMISLENEQEALTRLIYTLEKMLESFPT 383

Query: 450 TLSEDEAML------TDYNLHPKKRVATQLVRMEKKMLNACLQV 487
           ++ ED+ +L      TD NL+  +R+A  + R +K +L+  + +
Sbjct: 384 SIEEDDILLEQDDKKTD-NLN-HERMAVAVRRSDKYILSENINM 425


>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
 gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
          Length = 688

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
           +K Q + S W PYI  L          +++  LW +TEL+YL  SP   +  ER E I  
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
           E+ ++     +   LF +          + E F   +  V S     + +++     LV 
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603

Query: 287 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
           +  P+L + +   A  A +      +   +  DR     + I + CG   N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661

Query: 342 VDEDNPYDRLV 352
              +N YD ++
Sbjct: 662 TVPENRYDEVM 672


>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
          Length = 403

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 109/269 (40%), Gaps = 33/269 (12%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+ +NG      +  E  S++++ R +H   A+ DL+  +    VP   ++T+E  +G  
Sbjct: 37  WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89

Query: 144 TIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
           T           EL       L L+++ +++    +F+ PY   L             P+
Sbjct: 90  TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDILP------STLSNMPI 142

Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
            W   EL +L GS    +I ER   IK +Y  +  +W          P  I  +  T E 
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTLEE 190

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPY 317
           FK A + V S    +  V+ AR  A+VP    L  +  +  K        A  +   +  
Sbjct: 191 FKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKI 249

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDN 346
             G  I    G + N + L+NYGF  EDN
Sbjct: 250 SVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278


>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
          Length = 410

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 21/211 (9%)

Query: 153 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 212
           +L E   L+L+L+ EK + ++S W  +IR +         + ++   WSE  +A L   P
Sbjct: 19  QLCERQLLSLHLLVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKP 71

Query: 213 TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
             A    R   +  E+++L  +        W    +  +     +   +F+ E +  A  
Sbjct: 72  FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131

Query: 265 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 318
            V +   H ++ S  + R   V    P+L + +   A +AA     DA+  ++   R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191

Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
            GE +++  G   N+ LL +YGFV EDNP D
Sbjct: 192 EGEQVMIHYGNWSNAGLLEHYGFVLEDNPLD 222


>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
           domestica]
          Length = 437

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 119/283 (42%), Gaps = 28/283 (9%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK      +  LK++   +   RP  +      + A + LQ G+   S+P   ++T + V
Sbjct: 30  HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           +    + + +T     +S L  L  +L+ EK  G KS W PY+  L +          + 
Sbjct: 90  I-RSYLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L+  E E+  L   P + +  E+   +++ +      +    SLF +   D+    F + 
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYH 196

Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--Q 310
            F  A+  + +  V   H QK  L+    +  L P   LL +S +     A  ++    +
Sbjct: 197 AFLWAWCTINTRTVYMKHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYE 256

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           +      K  E + +  GP  N +LL+ YGFV  +NP+  + +
Sbjct: 257 IRTTSHCKKFEELFICYGPHDNHRLLLEYGFVASNNPHSAVYI 299


>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Galdieria sulphuraria]
          Length = 546

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 86/387 (22%), Positives = 148/387 (38%), Gaps = 82/387 (21%)

Query: 76  EDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           E   +L++W+  NG+P     +K KP    HN +        A   L+ G+   ++P   
Sbjct: 71  EKTEELENWLFDNGVPS----IKGKPVLSPHNCRT-----FRAKIPLKLGEEVLAIPERF 121

Query: 133 VVTL---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
            +T    E++LG          + LS+   +A  L+ E  + + SFW P+I  L      
Sbjct: 122 WLTKQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDEL 174

Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-D 248
                   L+WS  E  YL  S T  +IL   E     + EL+T       LF ++ Y  
Sbjct: 175 HHF-----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT------ELFPKFLYPQ 223

Query: 249 IPTEAFTFEIFKQAFVAVQSCVVH-----------------------LQKVSLARRF--- 282
              + FT   F  A   VQS  ++                       + + S  R++   
Sbjct: 224 YDVKYFTLPYFTWALSIVQSFGLYDIMDSCPLVIVPGLEWLTYKYSLITEESFFRQYFHI 283

Query: 283 ---ALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCGPQPNSK 334
              +L+ +GP    ++ + +  + A +D      V LV +      ++   W        
Sbjct: 284 SNVSLIRVGP---FFTQERRLKITASEDLKVGEPVSLVYEGNVSLIDTFCRWGWK----- 335

Query: 335 LLINYGFVDEDN--PYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEA 392
             ++ G +DE+         +  A+ T D  + DK  +        +Q F +     KE 
Sbjct: 336 --LDLGALDEEQLLKMGSYEISFAVTTTDQFFDDKEDILDAQRLELLQTFELRYDMSKEL 393

Query: 393 ISDMLPYLRLGYVSDTSE--MQSVISS 417
           +  +LP+LRL  + D     ++SV  S
Sbjct: 394 LQRILPFLRLICLKDKDSFILESVFRS 420


>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
          Length = 417

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 91/402 (22%), Positives = 171/402 (42%), Gaps = 54/402 (13%)

Query: 76  EDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           +DL   K WM   G+   P   ++  E    +        + A+  ++ GD    VP ++
Sbjct: 9   DDLVTFKQWMDDEGIYLNPSLDIVKLEDYGRS--------IIANTLIKEGDVLIRVPRNV 60

Query: 133 VVT---LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
           +++   +E  +  E I  ++ +N+  +      A+YLMY     K S+W  Y   L +Q 
Sbjct: 61  MMSRTGIELHIPKE-IRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQ- 117

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
                   + + + + E+  L  S  +     R  GI+R YN    ++    SL  ++  
Sbjct: 118 ------FTTSIYFDQDEMKELQLSKLRYFTESRLSGIERHYN---VIFKKLSSLNDEFK- 167

Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
               + +TFE+FK A   + S    L     +    +VPL     A   K K+ +     
Sbjct: 168 ---KKEYTFELFKWALSCIWSRAFSLS----SDDGGMVPLADMFNAIE-KAKSKVRPDSR 219

Query: 308 AVQLV--VDRPYKAGESIVVWCGPQP---NSKLLINYGFVDEDNPYDRLVVEAALN--TE 360
           A QL+    +  + GE +    G      N+++L++YGF   D+P +   ++  L+  ++
Sbjct: 220 ADQLIYYASKDIERGEQVFTPYGVYKTIGNAQMLMDYGFA-FDDPSEGDTIQLTLDNFSD 278

Query: 361 DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQ----SVIS 416
           D  Y D ++       + V+ F++   +  +   ++L Y R+  + + +E+Q       +
Sbjct: 279 DELYIDTKIDLLEQLDI-VREFNL---KRNQLPQELLIYARVKNLKE-NELQLAKEHYRN 333

Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
                 PVS   E+  L  L++Y    L  Y  TLS+D  +L
Sbjct: 334 DDNRNKPVSRRNEKTALRYLSNYLSRYLDSYETTLSDDLELL 375


>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 692

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/206 (23%), Positives = 82/206 (39%), Gaps = 35/206 (16%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++ +E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+   ++    G L          E+++++IF  A+  + S      +   +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
           R         P L    + + ML  VD +                + L V  P   GE I
Sbjct: 209 RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268

Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
          Length = 692

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/206 (23%), Positives = 82/206 (39%), Gaps = 35/206 (16%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++ +E+ +G+ S W PY+  L R        + S L + +++L +L G+        
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+   ++    G L          E+++++IF  A+  + S      +   +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
           R         P L    + + ML  VD +                + L V  P   GE I
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268

Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 477

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 72/269 (26%), Positives = 104/269 (38%), Gaps = 51/269 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
           V A++DL   +  FS+P + ++T      NET    L  N   EL      L L +++E 
Sbjct: 45  VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----------------- 211
            +G KS W PY   L           +S + WS+ EL  L GS                 
Sbjct: 97  LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKESADSTFTE 149

Query: 212 ---PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAV 266
              P  A+     +   R  +EL ++    GS    Y +D+  P  +             
Sbjct: 150 QLIPLIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEE 209

Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
           +   + L K        ++PL   L A +    A L   DD V +      +AGE +   
Sbjct: 210 EESAISLPK-------GMIPLADMLNANADHNNAKLFYQDDKVVMKTLHAVRAGEELFND 262

Query: 327 CGPQPNSKLLINYGFV-DEDNPYDRLVVE 354
            GP P S LL  YG+V D+   YD  VVE
Sbjct: 263 FGPLPRSDLLRRYGYVTDQYAKYD--VVE 289


>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
 gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
          Length = 1709

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 37/148 (25%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
            +AA +D+       ++PN L+++ ++V G        +   +++  N+  EL     C  
Sbjct: 954  IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013

Query: 160  -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
                 LALYLMYEK +G++SFW PY  EL+++         + L WS  ELA    S   
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063

Query: 215  AEILERAEGIKREYNELDTVWFMAGSLF 242
                     I +E N+ + ++F+  S+ 
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082


>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
 gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
          Length = 510

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 154/386 (39%), Gaps = 70/386 (18%)

Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
            A +A  L++ K+QG +S   P+I +L    G        P+LW E ++A L      A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187

Query: 217 ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
           + E+    ++ Y +L               D  W M  S  +   +  P    T +   +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245

Query: 262 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 294
                A +A  + V+ L   QK   A                   +++A+ PL   L  +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304

Query: 295 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
           SS  ++ +A     D+  +V  R +K GE + +  G Q N  L+  YGF + +NP D  V
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYV 364

Query: 353 VEAALN--TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
           +   L   T        R+ A +   L+  +  V   R     S+ L  +R    +D SE
Sbjct: 365 MTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFP-SETLQAVRFLLAAD-SE 422

Query: 411 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKR-- 468
             + +SS       SP  E  + + +A+  +  L    ++L ED A+L+        R  
Sbjct: 423 AGADVSSFSKSG--SPDQEAQLAEVVAEVVRRELGHLGSSLQEDLALLSSTGASAGGRKG 480

Query: 469 -------VATQLVRMEKK-MLNACLQ 486
                  VA    R+EKK +L A LQ
Sbjct: 481 GTAAAAAVAAVAFRVEKKRLLTAVLQ 506


>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
 gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
          Length = 474

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 122/311 (39%), Gaps = 55/311 (17%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
           +W+ ++G+     I  E   + +  R    V A++D+   +  F +P + ++++E  +L 
Sbjct: 13  AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69

Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            E  A   T   L     L L ++YE   G  S W PY   L  +         + + WS
Sbjct: 70  TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120

Query: 202 ETELAYLTGSPTKAEI-------------------------------LERAEGIKREYNE 230
           E ELA L  S    +I                                +RAE ++ E N 
Sbjct: 121 EDELAELQASAVLNKIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNV 180

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           L  +    GSL   Y +D+   A + +   +   A +     L K        ++PL   
Sbjct: 181 L--LMHKMGSLIMAYAFDVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADM 230

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYD 349
           L A +    A L   +  +++   +P +AGE +    GP P S LL  YG+V D    YD
Sbjct: 231 LNADADCNNARLFYEEKYLEMKALKPIRAGEEVFNDYGPLPRSDLLRRYGYVTDNYAQYD 290

Query: 350 RLVVEAALNTE 360
            + +   L TE
Sbjct: 291 VVEINMDLVTE 301


>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 719

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 133/312 (42%), Gaps = 36/312 (11%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+  +G     +   E PS +E    +    A  D+  GD    +P++L+++      + 
Sbjct: 26  WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81

Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
               +  L T  L     LALY+M E  + ++SF+ PY+R L           ES LL  
Sbjct: 82  KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141

Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
           + +L   T + ++ ++L     + RE     T+ F++ S    YP     + +TFE+F  
Sbjct: 142 DHKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDF 187

Query: 262 AFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DR 315
           A+  +Q+        +  +R    ALVP    L   + + K       +    +    + 
Sbjct: 188 AWRTIQA-------RAFGKRLKSSALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNN 240

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL---NTEDPQYQDKRMVAQ 372
            Y     ++   G + N  LL++YGF   DN +D   V  +L   + + P  + ++   +
Sbjct: 241 RYPRNSEVLNSYGRRANDNLLLDYGFAMLDNEWDAAEVICSLPPSHDQSPLDRRRKACLR 300

Query: 373 RNGKLSVQVFHV 384
            +G+ +V++  V
Sbjct: 301 ASGQHTVRILRV 312


>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 492

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 70/278 (25%), Positives = 107/278 (38%), Gaps = 51/278 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           V A   +  G+  FS+P  LV++ E    N  +  LL+ + L EL     L L ++YE  
Sbjct: 50  VVAQSAIVEGEELFSIPRDLVLSTE----NSKLKSLLSQD-LGELGPWLSLMLVMIYEYL 104

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 221
             ++S W PY R             ++ + WS  EL  L GS    +I  +         
Sbjct: 105 LREQSAWAPYYRIFPEN-------FDTLMFWSPAELQELQGSAIVDKIGRQGAEESILQM 157

Query: 222 ---------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQ 261
                          +G+     E  T   +      GSL   Y +DI       +   +
Sbjct: 158 IAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHVMGSLIMAYAFDIEKVNDEDDEDNE 217

Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
                 +     Q         +VPL   L A + +  A L   +D++ +   +P  AG+
Sbjct: 218 GEDGYMTDEEEDQSSK-----GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGD 272

Query: 322 SIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAALN 358
            I    G  P S LL  YG+V D   PYD  V+EA+L+
Sbjct: 273 EIFNDYGELPRSDLLRRYGYVTDNYAPYD--VIEASLD 308


>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
          Length = 440

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 67/283 (23%), Positives = 117/283 (41%), Gaps = 28/283 (9%)

Query: 86  HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
           HK      +  LKE+   +   RP  +      + A + LQ G+   S+P   ++T + V
Sbjct: 29  HKLEFIELRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTV 88

Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           + +  + + +T  T  +S L  L  +L+ E   G KS W PY+  L +          + 
Sbjct: 89  IKS-YLGDYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TC 139

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L+  E ++  L   P K +  E+   ++  +      +    SLF +   D+    F + 
Sbjct: 140 LVCLEPQVVRLLPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTE---DVK-HIFHYH 195

Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYS--SKCKAMLAAVDDAVQ 310
            F  A+  + +  V   H QK  L+    +  L P   LL +S   +  A         +
Sbjct: 196 AFLWAWCTINTRTVYMKHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYE 255

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           +      K  E + +  GP  N +LL+ YGFV  +NP+  + V
Sbjct: 256 IRTTSSCKKYEELFICYGPHDNHRLLLEYGFVAINNPHSAVYV 298


>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
           anatinus]
          Length = 499

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 64/267 (23%), Positives = 114/267 (42%), Gaps = 22/267 (8%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGK 172
           A++ L+AG+   S+P + ++T + VL    + + +   K  +S L  L  +L+ EK+ G 
Sbjct: 64  ATKSLKAGEMIISLPEACLLTTDTVL-KSPLGDYIWKWKPPVSPLLALCTFLIAEKQAGA 122

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W PY+  L +       A   P+      L+ L   P      E+   + RE     
Sbjct: 123 RSLWQPYLGVLPQ-------AYTCPVGLDAAVLSLLP-QPLGRRAREQRTAV-RELFAAS 173

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP 289
             +F   SL   +  D+    FT +    A+  V +  V   H Q+   +    +  L P
Sbjct: 174 RAFF--SSLQPLFSEDV-ERVFTLDALGWAWCTVNTRTVYMEHAQRDCFSAEADIYALAP 230

Query: 290 --PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
              LL +S  ++ +A         ++      +  E +++  GP  N +LL+ YGFV  +
Sbjct: 231 YLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLICYGPHDNRRLLLEYGFVCSN 290

Query: 346 NPYDRLVVEAALNTEDPQYQDKRMVAQ 372
           NP+  +VV   +        DK+M  +
Sbjct: 291 NPHSNVVVSPDVLVRHLPSGDKQMTKK 317


>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
          Length = 470

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 79/186 (42%), Gaps = 21/186 (11%)

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
           S + +SE EL    G+       +  + IK +Y +L      A  LF Q+P   P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160

Query: 256 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 302
            E           +K A   V S  +       +    L P    +L +SS   +C A  
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDP 362
               D + +   + Y+ G+ + ++ GP PN++LL  YGFV  DNP D   +    +   P
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPNDSYDLVLTTHPMAP 278

Query: 363 QYQDKR 368
            Y+ K+
Sbjct: 279 FYEQKQ 284


>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 813

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 63/247 (25%), Positives = 100/247 (40%), Gaps = 24/247 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A  D   G+    +P     TL   L ++ +  + +     +   +AL++  E+ +G+K+
Sbjct: 26  ALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKA 84

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
               ++  L R       + ++   WSE EL  LTG+    E +   E  K +Y  L   
Sbjct: 85  TRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKK 137

Query: 235 W--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
                 G   +++  D       +E +  A   + S    L      R  A+VP    + 
Sbjct: 138 MEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAMVPT-FDIF 189

Query: 293 AYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDN 346
            +S+K        L A  + V +     YKAGE   I    G   NSKLL  YGF  +DN
Sbjct: 190 NHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAANSKLLTWYGFCIDDN 249

Query: 347 PYDRLVV 353
           PY+ L V
Sbjct: 250 PYEELDV 256


>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
 gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
          Length = 511

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 75/362 (20%), Positives = 142/362 (39%), Gaps = 38/362 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A++D+ A      VP   + + E++   E  +    T   +    LA  L+ EK +G  S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  +         + L ++  ++  L G+   +  L +   I R+Y +L   
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------FALVPL 287
            +   S  +            +E+++ A   V +   +L    +A +        AL+P 
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNSPISALIPC 285

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
                    K  +   +    ++       KAG    ++ G +PN+ LL++ GFVD +N 
Sbjct: 286 WDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPNADLLVHNGFVDPNNN 345

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK-----EAIS-DMLPYLR 401
            D + +   L+  D        +A++  +L  ++   H G  +     E IS  +L ++R
Sbjct: 346 KDFVNIRLGLSPTDG-------LAEKRSRLLDRLNIEHKGEFRVLPAPEYISGQLLAFVR 398

Query: 402 LGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDE 455
           +  +S        S+++  +  L   C +   +E          FK  L    ATL E +
Sbjct: 399 VFNMSSDQLDHWCSDLERAVDLLHIDCALETDLETRTWQYFHQRFKLLLGVLEATLREAD 458

Query: 456 AM 457
            +
Sbjct: 459 EL 460


>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
 gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
          Length = 593

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 87/400 (21%), Positives = 156/400 (39%), Gaps = 48/400 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+ D++AG+   SVP  L+ + E  L  E   +L   N  + L  +   L+ EK +G  S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEE--LLPEKQRQLFR-NFPTHLK-VTYTLIMEKLRGADS 261

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W P+I  L  +         + L ++  ++  L G+   +  +     I R Y  +   
Sbjct: 262 PWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARLYASMYKC 314

Query: 235 WFMA---------GSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
            FM           +LF  Y   Y++   A +    +Q  V  Q        + ++   A
Sbjct: 315 AFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDAANLPIS---A 371

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           L+P        S K  +        ++      YK+GE   ++ G + N+  L++ GFVD
Sbjct: 372 LIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSNADRLVHNGFVD 431

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGREKEAISDMLP 398
             NP D + +   L+  D   + + ++      +R  +L V     H   E      +L 
Sbjct: 432 MQNPKDYVQIRLGLSPTDALAEQRAILLAELNIERKAELRVLPAPEHISGE------LLA 485

Query: 399 YLRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
           ++R+  +S        S+++  +  L   C +   +E      L    K  L    ATL 
Sbjct: 486 FVRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKLLLGVLEATLK 545

Query: 453 ED------EAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
           E       EA+    +      +  Q  R+E+++L+  LQ
Sbjct: 546 ETDELKQLEALQQQADASEIDIMVLQYRRLERRILSDALQ 585


>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
 gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
          Length = 453

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 73/332 (21%), Positives = 136/332 (40%), Gaps = 40/332 (12%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           +R V    EE       W+H+NG   C+ +  +     E  R +    A++ L+  +   
Sbjct: 19  TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72

Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
            +P  L++T++ ++ +  +A  +    ++L+    LA++LM EK + +KSFW PYI  L 
Sbjct: 73  VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
            +          P  ++E +   L  S  + +   +     +E+ EL   + M   LF  
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNS-LRGKAKAKKYECHKEFMELAPFFKMLADLFPD 183

Query: 245 YPYDIPTEAFTFEIFKQAFVAVQS----------CVVHLQKVSLARRFALVPLGPPL-LA 293
                  +AF F+ FK A+ A+++              L+         + PL   +  A
Sbjct: 184 -----QEDAFNFKDFKWAWSAIKTRAFDVPLGGETCYRLRDSEDTSNPTMFPLVDSINHA 238

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-- 351
             +K +         ++   +  Y+    ++   G   N  LL+ +GFV   NP D +  
Sbjct: 239 AQAKIRHRYNEKRRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPADTVTF 298

Query: 352 -VVEAALNTEDPQYQD----KRMVAQRNGKLS 378
            +V+  L    P+  +    K M   RN  +S
Sbjct: 299 HLVQDVLEYLQPENNELLERKIMFLARNNLIS 330


>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 85/377 (22%), Positives = 158/377 (41%), Gaps = 24/377 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP  L +  + V  +E I ++ +   L     +AL+L+ E+ +  
Sbjct: 82  LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCSG--LKPWLAVALFLIRERSR-S 137

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL+ L G+          + ++ E+  L+
Sbjct: 138 DSLWKHYFSILPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVQNEFRRLE 190

Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLG 288
               +     + +P  I  + F   F I + +AF  +++  +V +    L    A V   
Sbjct: 191 EEIIIPNK--KLFPSSITLDDFFWAFGILRSRAFSRLRNENLVVIPLADLINHSARVTTD 248

Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
               AY  K  A L + D    L      KAG+ + + +   + N++L ++YGF++ +  
Sbjct: 249 DH--AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNTD 306

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
            +   +   ++  DP + DK  +A+ NG      F +   R           L     +D
Sbjct: 307 RNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYNRPLPPGLLPYLRLVALGGTD 366

Query: 408 TSEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
              ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+ ED+  L +  L 
Sbjct: 367 AFLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLD 425

Query: 465 PKKRVATQLVRMEKKML 481
            +  +A  +   EK +L
Sbjct: 426 SRHAIAVGIREGEKNLL 442


>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 648

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 127/325 (39%), Gaps = 51/325 (15%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L  W   +G     V +KE  S +++ +      AS+D+        +P++++++   V 
Sbjct: 36  LVDWGRMHGANIENVEIKETASDDDR-KLTRGAYASKDIPPNSEICFIPSTILLSESDVR 94

Query: 141 GNETIAELLT--------TNKLSE---------LACLALYLMYEKKQ-GKKSFWLPYIRE 182
            +E    +LT          K+S+         L  +A +++++       S WLPY+  
Sbjct: 95  ASEIGKAILTYIDEHQDAKQKISDKIKHPHAEILLAMAAFIVHQVSLPTADSHWLPYLAS 154

Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYNELDTVWFMAGSL 241
           L +           PL+W+   +  L G  +   ++ ER E I+   N    V    G  
Sbjct: 155 LPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTKVVENACGHY 204

Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR------------FALVPLGP 289
           F       PT A T +  + A  ++ S      K SL  +             + + L P
Sbjct: 205 F-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWIGLSEICLFP 257

Query: 290 PLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
            L  ++ K   +      +  V  +       G  ++   GP+ N  LL NYGFV E+NP
Sbjct: 258 ILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSNYGFVIENNP 317

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQ 372
            D   V   L  EDP Y  K+ V +
Sbjct: 318 EDYFKVFLGLQQEDPLYTAKKAVLE 342


>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 966

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 70/319 (21%), Positives = 126/319 (39%), Gaps = 60/319 (18%)

Query: 71  VSKKEEDLGDLKS---WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
            ++K + L  L+S   W+H NG+    + +K+     +    I    ++  +  G+    
Sbjct: 548 TAEKLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIF---STRQIHKGECLVK 604

Query: 128 VPNSLVVTLERVLGNETIA-----ELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYI 180
           +P      L+ +L N+T A      ++ +N L  ++ + + +  + ++     S W PY 
Sbjct: 605 IP------LKLILSNDTSAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYF 658

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L R           P+L S  +LA  TG+    E++     + R+Y  L  +      
Sbjct: 659 DLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQHI------ 705

Query: 241 LFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL------- 287
            F+  P   IP   FTF  F  A   V +     C  +     + +   L+PL       
Sbjct: 706 -FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHK 764

Query: 288 -GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
            G     + +K  C   +A+ D             GE I +  G + N ++L+  GFVD 
Sbjct: 765 PGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLLYSGFVDP 814

Query: 345 DN-PYDRLVVEAALNTEDP 362
            N  YD + +  ++   DP
Sbjct: 815 TNIEYDHIKLSVSIPQSDP 833


>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
 gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
          Length = 508

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 93/403 (23%), Positives = 150/403 (37%), Gaps = 53/403 (13%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           + +SW+   GL    ++L+         R    + AS  L  G+    +P+ LV+T ER 
Sbjct: 24  EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
            G  ++  LL       LA        ++     + W PY+  L ++ G       + L 
Sbjct: 78  AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPG-------TLLD 124

Query: 200 WSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           W   E+   L GSP +             + EL+ +    G      P  +P      E 
Sbjct: 125 WPAKEVQQLLRGSPLQRLADSITSAASASWRELEPL-IAQGRADGLVPEHVPLSKGDLEW 183

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---LAYSSKC----------------K 299
              AF  + S  + L   S      L P    L   ++    C                 
Sbjct: 184 ---AFGVLLSRCIRLP--SRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGD 238

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYDRLVVEAAL 357
               A   A+ L  DRPY AG+ + V  GP+ + +LL++YGF      NP+    +  A+
Sbjct: 239 RAGGATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAV 298

Query: 358 NTE-DPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL--RLGYVSDTSEMQSV 414
           + + DP    K     R+G  S   F +      E +   L +L  R     +T E+ SV
Sbjct: 299 DRQGDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFELASV 358

Query: 415 ISSLGPICPVSPCMERAV--LDQLADYFKARLAGYPATLSEDE 455
           +   G   P+    +  V  L  L++   A L  YP ++  D+
Sbjct: 359 LFESGGF-PLLDGQDTLVLALRGLSNRCTAALKAYPTSMEADQ 400


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 60/271 (22%), Positives = 109/271 (40%), Gaps = 37/271 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+ ++ T+E    +  +   L + +  LS    LA Y+++ +        
Sbjct: 34  FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  LA       S + ++E EL   TG+   A   +    I+ +Y  L
Sbjct: 87  ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
                    L  Q+    P   FT E +K A   V S  +       + + L   FA   
Sbjct: 143 VV------RLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193

Query: 287 LGPPLLAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
               +L +SS  +   A   +   + ++  + YKAG+ + ++ G  PN++LL  YGF+  
Sbjct: 194 ---DMLNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIP 250

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
            NP D   +    +   P ++ K  + +  G
Sbjct: 251 SNPNDNYELVLETHPMAPFFEQKHKLWESAG 281


>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 84/376 (22%), Positives = 159/376 (42%), Gaps = 22/376 (5%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A +D+   +    VP  L +  + V  +E I ++     L     +AL+L+ E+ +  
Sbjct: 82  LVALKDISRNEVVLQVPKRLWINPDAVAASE-IGKVCIG--LKPWLAVALFLIRERSR-S 137

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            S W  Y   L ++        +S + WSE EL+ L G+          + ++ EY  L+
Sbjct: 138 NSLWKHYFSVLPKE-------TDSTIYWSEEELSELQGTQLLNTTRSVKQYVENEYRRLE 190

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQS-CVVHLQKVSLARRFALVPLGPP 290
               +        P  +    + F I + +AF  +++  +V +         A V     
Sbjct: 191 EEIILPNKKLFPSPLTLDDFFWAFGILRSRAFSRLRNENLVVIPFADFINHSARVTTEDH 250

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNPYD 349
             AY  K  A L + D    L      KAG+ + + +   + N++L ++YGF++ +   +
Sbjct: 251 --AYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGFIEPNADRN 308

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-HAGREKEAISDMLPYLRLGYVSDT 408
              +   ++  DP + DK  +A+ NG      F + ++      +   L  + LG  +D 
Sbjct: 309 AYTLTLQISESDPFFGDKLDIAESNGFGETAYFDIFYSRPLPPGLLPYLRLVALG-GTDA 367

Query: 409 SEMQSVI--SSLGPI-CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 465
             ++S+   S  G +  PVS   E  +   + +  K  LAGY  T+ ED+  L +  L  
Sbjct: 368 FLLESIFRNSIWGHLELPVSRDNEELICRVVRETCKTALAGYHTTIEEDQK-LKEAKLDS 426

Query: 466 KKRVATQLVRMEKKML 481
           +  +A  +   EK++L
Sbjct: 427 RHAIAVGIREGEKQLL 442


>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 797

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           R AL+P+   L   ++ C    +   +A  +  DR Y+AGE +    G   N  LL  YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650

Query: 341 FVDEDNPYDRLVVEAAL 357
           FV  DNP+D+L ++  L
Sbjct: 651 FVLPDNPWDQLCLDKVL 667


>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
          Length = 347

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 53/107 (49%), Gaps = 4/107 (3%)

Query: 252 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 308
           E   F+ F+ A++ V S CV + L   +    F LVPL   +  +SS C       D A 
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEA 355
            +L + RP K GE IV   G    + L   YGF++  NP++R+ + A
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERIDLTA 227


>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
           98AG31]
          Length = 509

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 57/216 (26%), Positives = 92/216 (42%), Gaps = 24/216 (11%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           A+VPL   L A S    A L      + +   +  + GE I       PN+ LL  YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319

Query: 343 DEDNPYD----------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHV-------- 384
           D++NP+D          RL  E +L+  DPQ Q+     +   K +++V  +        
Sbjct: 320 DDENPFDLAEVSLELCIRLAAE-SLHPSDPQNQNTLDELKSRAKWALEVSDIDEIFMLPT 378

Query: 385 HAGRE-KEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
            + RE KE + D L  +    +S   E Q+   S G +    P M   +          R
Sbjct: 379 KSQREPKEILPDELVIMLRILLSTEEEFQT-WKSKGKVP--KPAMSEPIAQLAIQILSNR 435

Query: 444 LAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 479
           L  Y  T+  D+ +L D +L  ++++ +  VR+ +K
Sbjct: 436 LNQYSTTIQNDQDLLKDQSL-SRRKLKSIKVRLGEK 470


>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 485

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 69/306 (22%), Positives = 119/306 (38%), Gaps = 45/306 (14%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +  SW+ +   P  KV  K K +          + A +D+   +  F++P +LV++ +  
Sbjct: 19  EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
             N ++ +LL  N+       CL + ++YE  QG  S W  Y + L           ++ 
Sbjct: 75  --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
           + W++ EL  L+GS     +L +      E N L  +  +       +P       +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSP 181

Query: 251 TE----------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                             A+ F+I K      +    ++          +VPL   L A 
Sbjct: 182 EGKAALLSLAHRMGSLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSKGMVPLADLLNAD 241

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVV 353
           + +  A L   D  + +   +P + GE I    G  P + LL  YG+V D    YD   V
Sbjct: 242 TDRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDNYAQYDE--V 299

Query: 354 EAALNT 359
           E ++ T
Sbjct: 300 EISMRT 305


>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
 gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
          Length = 692

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 47/206 (22%), Positives = 81/206 (39%), Gaps = 35/206 (16%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++++E+ +G+ S W PY+  L R          S L + + +L +L G+        
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               ++ EY+   ++    G L          E++ ++IF  A+  + S      +   +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
           R         P L    + + ML  VD +                + L V  P  +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268

Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
               GP  N +L+  YGF   DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294


>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
           23]
          Length = 462

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 82/392 (20%), Positives = 161/392 (41%), Gaps = 34/392 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P+ L  T++    +  +   L + +  LS    LA+++++ +  
Sbjct: 28  VKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFVRS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
            ++S +      ++R       +  S + +++ EL    G+       +  + I+ +Y +
Sbjct: 87  -RESGYDGLRSHVERLPA----SYSSSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRD 141

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
           L         +  QYP   P + FT   +K A  AV S  +  Q    +    L P    
Sbjct: 142 LVV------RVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD- 194

Query: 291 LLAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
           +L +SS   +C    A+  D + ++  + Y+AG+ + +  G  PN +LL  YGF+   NP
Sbjct: 195 MLNHSSESKQCHVYDASSGD-LSVLAGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNP 253

Query: 348 YDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 402
            D   +  A +   P ++ K+ +    G  S     +        ++D LP     YLR+
Sbjct: 254 NDSYDLVLATHPLAPFFELKQKLWALAGLDSTCTISL-------TLTDPLPKNVIRYLRI 306

Query: 403 GYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD-- 460
             + D S++ S+         +S   E  VL  L +   + L  +   L + E  L    
Sbjct: 307 QRL-DESDLASIALGQAADEKISNSNEVQVLQSLVESIASLLGSFGTRLEKLEEQLATGV 365

Query: 461 YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           Y +      A  +   E+++L    +   D++
Sbjct: 366 YPVGGNAWAAAHVSLGEQRVLKLAKKKAEDLL 397


>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 704

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 90/203 (44%), Gaps = 27/203 (13%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  +G + FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 124 FFLIGQYLKGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRM 178

Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCV 270
              K +Y    T    AG  FQ   QY +D+         + AF+ ++  +A   V+   
Sbjct: 179 GVWKEKYKNGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE--- 233

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
           +    VS+     L+P    L+ +    K    A    V  VV     +G+ I    GP+
Sbjct: 234 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPR 287

Query: 331 PNSKLLINYGFVDEDNPYDRLVV 353
            N +L++NYGF   DNP D  +V
Sbjct: 288 NNEQLMMNYGFCLPDNPCDYRIV 310


>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
          Length = 327

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 96/218 (44%), Gaps = 31/218 (14%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           +W+  +G    K+   E PS+    + I    A +D+ + +   S+P  L++T +  L +
Sbjct: 74  AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             I ++   N    S+   L + LM+E+ +G+ SF+ PY+  L R         ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182

Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TE 252
           +E  L++L       E+  R   +   Y  L      AG   LF + P D         +
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGAD 242

Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPL 287
            ++ E F+ A++ +Q+        +  RR    AL+PL
Sbjct: 243 PYSLENFRFAWLTIQA-------RAFGRRLPYSALIPL 273


>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
          Length = 469

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 83/373 (22%), Positives = 143/373 (38%), Gaps = 69/373 (18%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           L     L L ++YE  QG  S W PY   L +Q        ++P+ WS+ EL  L G+  
Sbjct: 91  LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
            AE + + E      +++  +     ++F       PTE     +  +    + +    L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203

Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           +                       +VP+   L A +++  A +   +      +    KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262

Query: 320 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 378
           G+ I+ + GP P S+LL  YG+V  E + YD + V   L  E         V   +  LS
Sbjct: 263 GDQILNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSSLSLS 313

Query: 379 VQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
            + +  V +  + E I D   Y  +   S         ++   +  VSP +    ++QL 
Sbjct: 314 AEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSPEL----VEQLK 366

Query: 438 DYFKA-----------------------------RLAGYPATLSEDEAMLTDYNLHPKKR 468
           ++ KA                             RLA YP ++  DE +L + +L  ++R
Sbjct: 367 EFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRR 426

Query: 469 VATQLVRMEKKML 481
           +A  +   EKK+L
Sbjct: 427 MAVVVRLGEKKLL 439


>gi|389741836|gb|EIM83024.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 502

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 103/463 (22%), Positives = 173/463 (37%), Gaps = 116/463 (25%)

Query: 118 DLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSEL----ACLALYLMYEK 168
           D+  G   F++P +L ++     L  +LG       L   K  EL    A L L +M+E+
Sbjct: 37  DIPEGHTLFTLPRNLTLSTRTSALPGLLG-------LDEWKQHELHIGWAGLILCMMWEE 89

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI----------- 217
            QG  S W  Y+  L         + ++P+ WS  +L  L G+    +I           
Sbjct: 90  AQGASSRWSTYLASLPS-------SFDTPMFWSPDDLEELKGTSVVDKIGRDGAEEDYRS 142

Query: 218 -----------LERAEGIKREYNELDTVWFMAGSLF------------------------ 242
                      L   E + R Y+ L+    M   +                         
Sbjct: 143 KVVPTLQSRPDLFAPEALSRHYS-LENYHLMGSRILSRSFSVERWEGHAADKQEDSASSP 201

Query: 243 -----QQYPYDIPTEAFTFE-------IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
                +    D+ TEA T         + + +FV         +        A+VP+   
Sbjct: 202 VADTGRDEAMDVDTEAVTATAPEAEDGVDEPSFVVDDENDSDDEDEEDPANVAMVPMADM 261

Query: 291 LLA-YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE----- 344
           L A Y S+   +    +D ++++  +P   GE I    G  PNS LL  YG VD      
Sbjct: 262 LNARYRSENAKLFYETED-LRMITTKPILKGEQIFNTYGDPPNSDLLRRYGHVDLVPLPN 320

Query: 345 ---DNPYDRLVVE-----AALNTEDPQYQDKRMVAQR------NGKLSVQVFHVHAGREK 390
               NP D  +VE     A  +  +   Q     A+R       G   V +   +     
Sbjct: 321 GDIGNPAD--IVELRGDLAFFSISERHKQPVESSAERVDWWLEEGGEDVFILETN----H 374

Query: 391 EAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPAT 450
           E   +++P+ RL   S +   ++   S  P   V    + ++L  +A+  + RLA YP +
Sbjct: 375 ELPDELVPFCRLLLQSQSEWEKTKSKSKLPKAKV----DESILSTIANALERRLAEYPTS 430

Query: 451 LSEDEAMLTD-YNLHPKKRVATQLVRMEKKMLNACLQVTADMI 492
           + ED+ +LT+  +L+ K  V  +L   EK++L+  L    + +
Sbjct: 431 VEEDQKLLTEPLSLNRKHAVIVRL--GEKRILHGTLSTVKEKL 471


>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 466

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 58/265 (21%), Positives = 109/265 (41%), Gaps = 27/265 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A    + G+   ++P++L  T++    +  +   L + +  L+    LA+Y+++ +  
Sbjct: 28  VKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLAVYVLFVRS- 86

Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
                     RE      R  +A       S + ++E EL    G+       +  + I+
Sbjct: 87  ----------RESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRIE 136

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
            +Y +L         +    P   P   FT   +K A   V S  +  +    +    L 
Sbjct: 137 DDYKDL------IARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMDFELYDGSSMRLLA 190

Query: 286 PLGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           P    +L +SS+ K   +  A    + ++  + Y+AG+ + +  G  PNS+LL  YGFV 
Sbjct: 191 PFAD-MLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHYGSIPNSRLLRLYGFVI 249

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKR 368
            DNP D   +  A +   P ++ K+
Sbjct: 250 PDNPNDSYDLVLATHPMAPFFEQKQ 274


>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
           2508]
 gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 459

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 112/261 (42%), Gaps = 31/261 (11%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P  ++ T++    +  +   L + +  LS    LA Y+++ K        
Sbjct: 34  FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S +L++E +L    G+       +  + I+ ++  L
Sbjct: 87  ----RESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
                    LF Q+P   P + FT E +K A   V S  +      LA   ++  L P  
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193

Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
            +L ++S+ K   +       + ++  + Y+AG+ + +  GP PNS+LL  YGFV   NP
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNP 253

Query: 348 YDRLVVEAALNTEDPQYQDKR 368
            D   +  + + + P ++ K+
Sbjct: 254 NDSYDLVLSTHPQAPFFEQKQ 274


>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
 gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
          Length = 394

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 108/263 (41%), Gaps = 46/263 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV------------------LGNETIAELLTTNKL 154
           V A E ++AG+    VP   ++ +E+                   +G++ I  +  T  L
Sbjct: 4   VRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTAAL 63

Query: 155 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
               C      YE+       W P +R L  +         S L W+  +L  + G    
Sbjct: 64  DAFEC-GDASAYEE-------WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLA 108

Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHL 273
             + E    +K +Y+ L    F A  L +Q P   P  AF  +  F++A+    S  + +
Sbjct: 109 NRLREYRRSVKVQYDAL----FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKV 162

Query: 274 QK-VSLARRFALVP----LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
           Q   SL  R  +VP        L A+S +  ++      A +L + R    GE+I +  G
Sbjct: 163 QDPDSLQIREVIVPGVFLCNHSLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYG 221

Query: 329 PQPNSKLLINYGFVDEDNPYDRL 351
              N+ LL+ YGF  E+NPYDR+
Sbjct: 222 RLDNADLLMFYGFSLENNPYDRV 244


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 76/354 (21%), Positives = 136/354 (38%), Gaps = 39/354 (11%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+S++ T+E    +  +   L + +  LS    L  YL++ +        
Sbjct: 43  FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 95

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S + ++E EL    G+       +  + I+ ++  L
Sbjct: 96  ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 151

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF Q+    P + F+ E +K A   V S  +  Q         L P    +
Sbjct: 152 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 204

Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +SS+ K   +       + ++  + Y+ G+ + +  G  PNS+LL  YGFV   NP D
Sbjct: 205 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPND 264

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGY 404
              +  + + + P Y+ K  +    G  S     +        ++D LP     YLR+  
Sbjct: 265 TYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQR 317

Query: 405 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
              +      + +      VS   E  +L  L + F   L G+   L + E  L
Sbjct: 318 ADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEKLEEQL 371


>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
 gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 459

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 111/261 (42%), Gaps = 31/261 (11%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P  ++ T++    +  +   L + +  LS    LA Y+++ K        
Sbjct: 34  FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S +L++E +L    G+       +  + I+ ++  L
Sbjct: 87  ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
                    LF Q+P   P + FT E +K A   V S  +      LA   ++  L P  
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193

Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
            +L ++S+ K   +       + +   + Y+AG+ + +  GP PNS+LL  YGFV   NP
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNP 253

Query: 348 YDRLVVEAALNTEDPQYQDKR 368
            D   +  + + + P ++ K+
Sbjct: 254 NDSYDLVLSTHPQAPFFEQKQ 274


>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
          Length = 496

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/254 (23%), Positives = 109/254 (42%), Gaps = 22/254 (8%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
            + IK+   E  T      S F  + Y    I + +F  E+   +    +      +++ 
Sbjct: 156 IKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELE 215

Query: 278 LARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
             R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL
Sbjct: 216 NERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELL 275

Query: 337 INYGFVDED-NPYD 349
             YG+V+ D + YD
Sbjct: 276 RRYGYVEWDGSKYD 289


>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 316

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KKS W PY+R L R        + + + W   EL  +  S    E +ER E   +E++ +
Sbjct: 38  KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF +            E F  A     S +V  +    +R  +L+P     
Sbjct: 93  KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137

Query: 292 LAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +     ++L      D  +++ DR Y  GE ++V  G   N+ L +N+GF    N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197

Query: 350 R 350
           +
Sbjct: 198 Q 198


>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
 gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
          Length = 504

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 159/410 (38%), Gaps = 69/410 (16%)

Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
           AG+   +VP S+   +        + EL+    +SE     LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168

Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           Y+  L DR          +PL ++  ++  L  +      L+  + I R+Y  +      
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFV--------AVQSCVVHLQKV--------SLARR 281
               F Q   D   + FT+++F+            AV + +    KV         +   
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRLLLFSLLIPHSWAVSTVMTRQNKVPVNLAEFDGMDHT 272

Query: 282 FALVPL------GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQP 331
            AL+PL        P  A  ++C A     A ++ ++  + R     A   I +  G + 
Sbjct: 273 LALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRT 332

Query: 332 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE 391
           +++ L++ GFV   NP+  +     L    P Y+++  + +  G  +   F     RE  
Sbjct: 333 DAEFLVHNGFVCPRNPHANVQKRFTLVPAIPLYKERAHLLELLGMPTTGTFSFGPAREPA 392

Query: 392 AISDMLP----YLRLGYVSDTS------------EMQSVISSLGPICPVSPC--MERAVL 433
           A +   P     + L  VS  +              +  + +   + P   C   ER   
Sbjct: 393 AATTTTPISQELISLARVSSMTAKELDEYTAMKETQRQTLRTYQALLPAELCARTER--- 449

Query: 434 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
             LA   K  L  YP T+ +DEA+L   N H  +R+  +    EK++L +
Sbjct: 450 -WLATVMKIMLLRYPTTIEQDEALLKT-NRHHIRRLLIEYRLGEKQILRS 497


>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 984

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/264 (22%), Positives = 108/264 (40%), Gaps = 43/264 (16%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET---IAELLTTNKLSEL------ACLALYLM 165
           A+ED+  G+   S+P  LV+T E    +E    +A  L   +   L      A L  YL+
Sbjct: 32  ATEDILPGEELCSIPVRLVLTTEIARKSEVGRLVAAHLNAVQGERLRVSAGRAILCAYLI 91

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
           +++   + +FW PY+R L +   R            + ++ +L G+     + E+ + I+
Sbjct: 92  HQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQIR 139

Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVVHLQKVSLA 279
             ++       +  +L   +P   P + FT++ F   F A       Q+ V      + A
Sbjct: 140 ESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQQPTATTSA 193

Query: 280 RR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
               + L+ +   LL         Y  K    L      ++ V +   + G       GP
Sbjct: 194 HADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGTEAFNNYGP 253

Query: 330 QPNSKLLINYGFVDEDNPYDRLVV 353
           + N +LL+ YGF  EDN  D +++
Sbjct: 254 KGNEELLMGYGFCIEDNEQDYVMI 277



 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
           V A++ + AG A  ++P  L++T++  L    + E L      L E   L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555

Query: 171 GKKSFWLPYI 180
           G+ S W P++
Sbjct: 556 GQASPWYPFL 565


>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 306

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 17/150 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL  G     +P + V++++    N  IA++L  +++     L + +MYE   GK
Sbjct: 4   VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +SFW  Y+ EL ++          PL W+E E + L G+  +    E  E  + ++    
Sbjct: 60  ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFET-- 110

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
                   L +Q+   +  ++FT E F+ A
Sbjct: 111 ----HVPPLVEQHADRLRADSFTLESFRVA 136


>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
 gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
          Length = 517

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 86/199 (43%), Gaps = 13/199 (6%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           +++ E+  L + +MYEK +G++S W PY+  +             PL W   E   L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232

Query: 212 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 269
               +++ + +       ++  +W  +     Q++P  ++P     +++++ A  AV S 
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292

Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 327
              L         A+VP+   L   + +    L   A    + ++  R    GE +V   
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349

Query: 328 GPQPNSKLLINYGFVDEDN 346
           G   N++LL  YGFV+  N
Sbjct: 350 GELSNAELLRGYGFVEARN 368


>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 753

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 28/227 (12%)

Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
           S+P S ++T+E  +G  T I   + T+ L   A     L +Y+++++K  G+ SF+ PY 
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
           + L       +     P+ W+  EL  L GS    +I +RAE IK +Y  + ++    G 
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICSIAPEFGD 252

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CK 299
           +             T E F+ A + V S    L  ++  R  ALVP    L     +  K
Sbjct: 253 I------------ATLEEFQWARMIVCSRNFGLL-INGHRTSALVPHADMLNHLRPRETK 299

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
              +    +  +   +    GE +    G + N + L+NYGF  E N
Sbjct: 300 WTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLLNYGFCVERN 346


>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
 gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
          Length = 527

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/227 (26%), Positives = 90/227 (39%), Gaps = 37/227 (16%)

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           T ER L    +  +L  N       +  +L+ E+ +GK+SFW PYI  L       +L++
Sbjct: 90  TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143

Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIP 250
             PLLW +ET    L GSP    I  R +  + ++  L T     F+ G           
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG---------- 193

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLA 303
                 E   +  V   S V+  +  SL            LVP    L   SS  +    
Sbjct: 194 ------ETLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCL 247

Query: 304 AVDDAVQ---LVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 346
             D   +   L   + Y  G+ +    GP    S+L ++YGFVD++N
Sbjct: 248 IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294


>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
          Length = 736

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/200 (20%), Positives = 79/200 (39%), Gaps = 22/200 (11%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA+ L+ E+ +G +SFW PY+R L  +          P+ ++ +E   +        +  
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQS 313

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           R   +            ++    +Q  +  P E    +      +    C    + +   
Sbjct: 314 RCRMLVN----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNI 363

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 333
                 PL  P++ +     +    + D      ++QLV  R  + G+++ +  G   N 
Sbjct: 364 PGLGSTPLMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNP 423

Query: 334 KLLINYGFVDEDNPYDRLVV 353
           +LL++YGF   DNP+DR  V
Sbjct: 424 QLLLDYGFTLSDNPHDRFEV 443


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 81/390 (20%), Positives = 149/390 (38%), Gaps = 41/390 (10%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+S++ T+E    +  +   L + +  LS    L  YL++ +        
Sbjct: 34  FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 86

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S + ++E EL    G+       +  + I+ ++  L
Sbjct: 87  ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF Q+    P + F+ E +K A   V S  +  Q         L P    +
Sbjct: 143 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 195

Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +SS+ K   +       + ++  + Y+ G+ + +  G  PNS+LL  YGFV   NP D
Sbjct: 196 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPND 255

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGY 404
              +  + + + P Y+ K  +    G  S     +        ++D LP     YLR+  
Sbjct: 256 TYDLVLSTHPQAPFYEQKHKLWVSAGLDSTSTIPL-------TLTDPLPKNVLRYLRIQR 308

Query: 405 VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YN 462
              +      + +      VS   E  +L  L + F   L G+   L + E  L    Y+
Sbjct: 309 ADASDLAAMALQNAKADEKVSDSNEVEILQFLVESFGHLLGGFGTPLEKLEEQLAQGVYS 368

Query: 463 LHPKKRVATQLVRMEKKMLNACLQVTADMI 492
                  A  +   E+++L    +   D++
Sbjct: 369 PGGNAWAAAHVSLGEQRVLRLAKKRAEDLL 398


>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
          Length = 510

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 74/301 (24%), Positives = 116/301 (38%), Gaps = 46/301 (15%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           SW+  NG+   K+  K K   N        V  + D+Q  +  F +P ++++  E     
Sbjct: 14  SWLTNNGV---KISPKLKVEDNRYKDEGRCVVTTTDIQKDELLFEIPRNVLLNCETSQLV 70

Query: 143 ETIAELLT---TNKLSE-------LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           + I  +LT   T   SE       + CL  Y MY  K   KS W PY   L        L
Sbjct: 71  KDIPAVLTELETFSGSEPLSWEPLILCL-FYEMYILKD--KSRWWPYFEVLPTLEDMNVL 127

Query: 193 AVESPLLWSETELAYLTGS-----------PTKAEILER------AEGIKREYNELDTVW 235
                +LWS+ +LA L  S               ++L+R       E +K   N+     
Sbjct: 128 -----VLWSDEDLAALEPSYVLSCIGKEQVENMYQLLKRFIEASDHEQLKSNLNKFSWDS 182

Query: 236 FM-AGSLFQQYPYDIPTEAF-------TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
           F+  GSL   Y +D+  E         +           +      + + +    ++VPL
Sbjct: 183 FIRIGSLIMSYSFDVGKEIHNEGKEGESMNENDNMTNGDEDEDEDEEDLEVEMIKSMVPL 242

Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
              L A + KC A L      ++++  R   +GE +    G   NS+LL  YG+V+ D  
Sbjct: 243 ADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYVEWDGS 302

Query: 348 Y 348
           Y
Sbjct: 303 Y 303


>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
          Length = 1313

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 19/215 (8%)

Query: 283  ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
            A+VP+   L A      A L    D +Q++  +P   GE I    G  PNS LL  YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119

Query: 343  DE-------DNPYDRLVVEAALNTEDPQ---YQDKRMVAQRNGKLSVQVFHVHAGREKEA 392
            D         +P D + + A    E  +   YQD+       G     V  V       +
Sbjct: 1120 DALTLPDGVGSPSDVVEINADTVVEAAKVQSYQDRIDWWLEEGGDDAFVLDV-----TYS 1174

Query: 393  ISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLS 452
            + D +  L    + +  + +   S      P  P ++    + L    + RLA YP +L+
Sbjct: 1175 VPDEMLSLVRLLLLNQEDWEKAQSK---GKPPKPKLDEKSYEVLLVVLQKRLAMYPISLT 1231

Query: 453  EDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 486
            E E ML   N L+ K+R A  +   E+++L+  L+
Sbjct: 1232 EQEGMLRSSNELNEKRRNALIVTTGEQRILHKTLE 1266


>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 674

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 88/201 (43%), Gaps = 23/201 (11%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  +G + FW PYIR L +      L++ +PL +   +L +L G+       +R 
Sbjct: 94  FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148

Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 272
              K +Y    T    AG     QY +D+         + AF+ ++  +A   V+   + 
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205

Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
              VS+     L+P    L+ +    K    A    V  VV     +G+ I    GP+ N
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNN 259

Query: 333 SKLLINYGFVDEDNPYDRLVV 353
            +L++NYGF   DNP D  +V
Sbjct: 260 EQLMMNYGFCLPDNPCDYRIV 280


>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
 gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
          Length = 499

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 63/285 (22%), Positives = 120/285 (42%), Gaps = 45/285 (15%)

Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE------LLTTNKLSELACLALYLM 165
           ++ A+ED++  +  F +P   ++    VL +  ++E      +L    +     L + ++
Sbjct: 42  FILATEDIKTDELLFEIPRESILN---VLTSSLVSEYPAWENILLDGDVGHWEGLIICML 98

Query: 166 YEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAE 216
           +E K  K  S W PY   L          + S + W+  EL  L         G+    +
Sbjct: 99  FEIKVKKNMSKWAPYFDVLPESTD-----LNSLMYWTAEELEALKPSLVLDRIGNDGAHQ 153

Query: 217 ILERAEGIKREYNELDTV-----------WFMAGSLFQQYPYDI---PTEAFTFEIFKQA 262
           + E+   + R + +  +V           +    S+   Y +D+   PT A   E   + 
Sbjct: 154 MHEKVMELIRTFEKDHSVDLSFGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEV 213

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
              V+  V +   +      +++PL   L + ++KC A L   +D++++      KAGE 
Sbjct: 214 EEDVEQTVRNEGSLK-----SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQ 268

Query: 323 IVVWCGPQPNSKLLINYGFVD-EDNPYD--RLVVEAALNTEDPQY 364
           +    G  PN+++L  YG+V+ E + YD   L +E  + T   QY
Sbjct: 269 VYNIYGNHPNAEILRRYGYVEWEGSKYDFGELPLEVIIETLHEQY 313


>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
          Length = 355

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/256 (23%), Positives = 102/256 (39%), Gaps = 34/256 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-------------LGNETIAELLTTNKLSELAC 159
             A +D+  G+   ++P+ +++  ERV             + +++I+       LSE   
Sbjct: 75  TTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNK 134

Query: 160 LALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
           + +Y    MY  K  K+SFW PY   +          + SP+ WSE EL  L GS     
Sbjct: 135 VLMYFLLQMYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNM 185

Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
                + ++  Y+EL         +F+        +AFT + +  A     S V+ L + 
Sbjct: 186 ARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQ 239

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNS 333
           +        +PL   +    S+ K  +       A  +  DR    G  +    G + N 
Sbjct: 240 TGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESYGNKSNY 299

Query: 334 KLLINYGFVDEDNPYD 349
           + L+  GFV EDNP D
Sbjct: 300 EYLLYNGFVMEDNPND 315


>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
 gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
          Length = 488

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 99/467 (21%), Positives = 178/467 (38%), Gaps = 90/467 (19%)

Query: 97  LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
           LK    H + H  IH            + AS D+   +  F +P+ LV++++       +
Sbjct: 24  LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83

Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
              L   +L     L + ++YE  QG++S W PY R L         + ++ + W++ +L
Sbjct: 84  E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134

Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
             L GS    +I + A          DT+      L Q  P   P       +       
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPRHFPPRPNMPPLNSSDSQN 186

Query: 266 VQSCVVH-LQKVSLARRF---------------------------ALVPLGPPLLAYSSK 297
              C+ H +  + +A  F                            +VPL     A + +
Sbjct: 187 ALLCLAHRMGSIIMAYAFDIEKTDEVDEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQR 246

Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
             A L   + +  +   +   +GE I    G  P + LL  YG+V  DN     VVE +L
Sbjct: 247 NNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEFSL 305

Query: 358 NT---------EDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI-SDMLPYLRLG 403
           ++          +P   + R+    N  +  + + +      G  K+ I  D L  LR  
Sbjct: 306 DSICKVAGLPDSEPSSTNPRLELLDNLDMLEEGYSIPRIPPNGTLKDTIPKDFLVLLRAL 365

Query: 404 Y--VSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--- 458
              + D + +++   +  P    S   E ++L  L      R + YP ++ EDE++L   
Sbjct: 366 TLPIEDLNRLKARNKAPKPEFSTS---EASLLRSLV---TCRQSEYPTSVQEDESILRCL 419

Query: 459 ------TDYNLHPKKRVATQLVRMEKKMLNACLQV--TADMIMLLPD 497
                  + ++  +K++A Q+ + EK++L   L +  T D  ++ PD
Sbjct: 420 EQQNGYINDSIPIRKKMAVQVRKGEKEILTQILTLLDTQDTHLVQPD 466


>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
           n-methyltransferase i [Nannochloropsis gaditana CCMP526]
          Length = 385

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 20/202 (9%)

Query: 78  LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
           LG+   WM      G+PP  ++L  +     E       +     +  G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           +  E+ L     A       ++E   +AL L+ E+  G  SFW PYI  L          
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTE 252
           V   L+W E +LA L  SP  A        +  E+  L+  +  A S +F          
Sbjct: 227 VNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMRARSDVFD-------PS 279

Query: 253 AFTFEIFKQAFVAVQSCVVHLQ 274
            FTFE +  AF+ + S  + ++
Sbjct: 280 VFTFEAYLWAFINIFSRAIRVK 301


>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
 gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
          Length = 448

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/280 (21%), Positives = 118/280 (42%), Gaps = 42/280 (15%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWL 177
           ++ G     +P  ++++ + VL +     + +   +L+ +  +  +L+Y+K  G+ SFW 
Sbjct: 65  IKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQAITTFLIYQKHIGETSFWK 124

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L  +          P+ + E +  YL  S  +A I  + +   + Y EL   +  
Sbjct: 125 PYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQECIKSYEELKPFFPS 176

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF--------ALVPL 287
              L   +        FTF+ ++ A+  V  +S  V  +  ++ R          +LVP+
Sbjct: 177 LEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTSLVPM 231

Query: 288 GPPLLAYSSKCKAML------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
              LL +S   +  L         D    +  +  YK G+ ++       N  LL+NYGF
Sbjct: 232 -VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYRRADNQTLLLNYGF 290

Query: 342 VDEDNPYDRL----------VVE-AALNTEDPQYQDKRMV 370
           V  DN  D +          ++E      EDP+++ ++++
Sbjct: 291 VLPDNHLDTIKFFLVKDIIGILELMNFEEEDPKFRRRKVL 330


>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
          Length = 468

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 80/199 (40%), Gaps = 29/199 (14%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  + K S+W PYI  L +       A+  P +W E ++  L  +     + E  
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161

Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
             IK+EY      L  V F     + Q  Y      FT   F+ + +  Q    H+    
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217

Query: 278 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
                 L P G          PLL       +S+ +  L  VD   QL+ +  Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQV 272

Query: 324 VVWCGPQPNSKLLINYGFV 342
               G + NS+LL+ YGF+
Sbjct: 273 FNNYGLKSNSELLLGYGFI 291


>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
          Length = 823

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 93/414 (22%), Positives = 160/414 (38%), Gaps = 97/414 (23%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL---VVTLE 137
           + +W+ ++G     V +K  P +         + +S++   GD   S+P  L    + L 
Sbjct: 4   ITTWVKEHGGHIDGVYVKNFPVYGNG------LCSSKEFHEGDTLLSIPYHLQLNTIELH 57

Query: 138 RVL-----GNET--IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
            V      G E   + E    N+  E + + LYL    K  +K F  PYI  L       
Sbjct: 58  NVFESMVPGFEVPRLGEG-AKNRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT----- 110

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
                 PL +SE EL  L G+    ++L   E  K    +L   +    +L  QYP    
Sbjct: 111 --TFSCPLSYSENELKMLKGT----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP---- 157

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV---PLGP-----PLLAYSSKCKAML 302
           T    F+ F Q  V          +V  +R F ++   P+G      P   +S+      
Sbjct: 158 TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYPDPIGDVASLIPFADFSNH----- 205

Query: 303 AAVDDAVQLVVDRPYKA----GESIVVWCGPQ--------PNSKLLINYGFVDEDNPYDR 350
              +  V  V +R  +         V+ CG Q        PN K+L+ YGFV  +NPYD 
Sbjct: 206 -NTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDE 264

Query: 351 LVV-----EAALNTEDPQYQDKRM--------------------VAQRNGKLSVQVFHVH 385
           +++     E     +  + ++ +M                    + Q   +  V  F  +
Sbjct: 265 VLLRINFKERHFEKQVEESEESKMEVENKENERMEVEEEDNEDEITQILKREGVDRFDYY 324

Query: 386 AGREKEAISDMLPYLRLGYVS--DTSEMQSVISSLGPICPVSPC-MERAVLDQL 436
             REKE  +D+L  LR+  +S  + ++    +  L  + P++     R++++Q+
Sbjct: 325 LTREKELPTDLLRVLRIVNLSLVEANQYSQALLDLSYVSPINEIKATRSLMEQI 378


>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 104/259 (40%), Gaps = 31/259 (11%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS------ELACLALYLMY 166
           V A+ DL+ G+   SVP+  V+T++     + + E +            +   L + +M 
Sbjct: 57  VRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEFVGDGDDEAPSPRLDKELLVIAVMC 116

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS-------PTKAEILE 219
           E   GK S W  Y+  +      G     S L W + ++  L G+           E L+
Sbjct: 117 EMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDETLD 172

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKVSL 278
                +  +  + T++F    LF +    +  EA         A VA  S  +   ++  
Sbjct: 173 LPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEIQ- 228

Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPNSK 334
               A+VP    +L ++  C+A +    D     +Q++  R  K GE +    GP  N++
Sbjct: 229 ----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTYGPLRNAE 283

Query: 335 LLINYGFVDEDNPYDRLVV 353
           LL  YGFV   NP+    V
Sbjct: 284 LLRRYGFVLPRNPHGGTTV 302


>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloropl [Ectocarpus
           siliculosus]
          Length = 460

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 64/270 (23%), Positives = 111/270 (41%), Gaps = 35/270 (12%)

Query: 84  WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
           W+ K+G  L    V+    P   E+      + A++ ++ G +  ++P SL +T    L 
Sbjct: 18  WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71

Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
           +  IA+ +        E   +AL +++E+ QG+ S   P+I  L ++   G+L  E PL 
Sbjct: 72  SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
           W E +L     S T+        G   + +E D  W ++ + F ++P   P + F    F
Sbjct: 127 WGEADLTLADASSTRG-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDF 179

Query: 260 KQAFVAVQSCVVHLQK-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
           + A     S    +         V  A   +L  +  P    +     +  +   AV L 
Sbjct: 180 RWAVGVALSRSFFVDGELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLR 233

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
             + Y+ GE   V  GP+  +  L   GFV
Sbjct: 234 AGKNYEEGEEFFVSYGPKGAAGYLEENGFV 263


>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 457

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 69/306 (22%), Positives = 121/306 (39%), Gaps = 63/306 (20%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNE---KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
            K+W+  NG    +  ++     +E   +  P   V A  D++ G++   +P+S   T E
Sbjct: 4   FKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACFTCE 63

Query: 138 RVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
                + + ++ L+  +   LA L   L+ E+  G  S W  Y+  L           + 
Sbjct: 64  NCAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSE------PDV 117

Query: 197 PLLWSET--ELAYLTGSPTKAEILERAEGIKREYNE-----LDTVWFMAGSLFQQYPYDI 249
            ++WSE      YL G+  +  + +     + E+       LDT+   A    +   +D 
Sbjct: 118 VMMWSEDGERRRYLCGTDIEQSLRDERAAARTEWTRHVKPVLDTLRGAA----KDVGFD- 172

Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLL--------------AY 294
                        F+A +S       V+ +R F + P +G  L+               Y
Sbjct: 173 ------------DFLAARS-------VASSRAFTVNPRVGAGLVPIADLFNHRTGGHHVY 213

Query: 295 SSKCKAMLA-------AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
            S  +   A       + DDA+ + V +  KAGE +    G   N+KLL +YGF   DNP
Sbjct: 214 LSDARGTAAVSERDEGSDDDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNP 273

Query: 348 YDRLVV 353
            D++ +
Sbjct: 274 ADKVTI 279


>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 595

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 129/289 (44%), Gaps = 38/289 (13%)

Query: 113 VAASEDLQAGDA------AFSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
           + A  DL++ +A        ++P+ LV++ E V      +    +LL    + S    + 
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202

Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           LYL+ +  Q  +      + W  YIR L R        +  P +W+E E   L G+  +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            +  +   + +E++ L  V        + +P+         E+  + +V V +     + 
Sbjct: 256 ALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRC 307

Query: 276 VSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQ 330
           + L R   A+VP G  ++ +SSK  A     D D V L++    P ++GE + +  G  +
Sbjct: 308 LELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAK 366

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
           P S++L +YGF+D +N  D+L +      +DP  + K  ++     L++
Sbjct: 367 PASEMLFSYGFIDPNNIVDKLTLRLDPFPDDPLARAKLRISNSGPTLTI 415


>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 446

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L  W +KNG+     I     S       +    A+  ++ G+   +VP +L  +++ V 
Sbjct: 60  LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116

Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            +   + I E      L + + +AL L+YE   G KS +  YI+ L R    GQ   + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L WS  E   L  S T+       + +  +Y      W  + +L +     +  ++F   
Sbjct: 172 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 225

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
            F+ A   V S     +      R A      PLL  +++ +          +       
Sbjct: 226 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 276

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           ++++ DR    G+ IV+  GP+   + L ++GFV
Sbjct: 277 LKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFV 310


>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
           troglodytes]
 gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
 gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
 gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
 gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
          Length = 307

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 101/250 (40%), Gaps = 22/250 (8%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
            LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S 
Sbjct: 67  SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
                LF +    I    F++     A+  V +  V+L   Q+  L+       L P   
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233

Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
           LL +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293

Query: 349 DRLVVEAALN 358
             + V    N
Sbjct: 294 ACVYVSRGWN 303


>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
 gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
          Length = 513

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 58/256 (22%), Positives = 108/256 (42%), Gaps = 28/256 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A++DL+ GD A  +P S++++ E V  ++    L   + ++    L L+ M E+      
Sbjct: 149 ATKDLKVGDIALEIPVSIIISEEHVHKSDMYHILEKIDGITSETMLLLWSMKERHNCSSK 208

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L  +   G       L +    +  L G+    EI++  E ++ +Y+EL   
Sbjct: 209 FKI-YFDTLPEEFKTG-------LSFGVDAIMALDGTLLLEEIMQAKEHLRVQYDEL--- 257

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
                 L + YP     E +T+E F  A     S  + +  V    R  L+P+       
Sbjct: 258 ---VPPLCKNYPDVFLPELYTWEQFLWACELWYSNSMKVMFVDGKLRTCLIPIAGFLNHS 314

Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
             P ++ Y       + +  + ++  + RP   GE   +  G   +S L+  YGF+ + D
Sbjct: 315 LYPHIVHYGK-----VDSATNTLKFPLTRPCCFGEQCCLSYGNFSSSHLITFYGFMPQGD 369

Query: 346 NPYDRLVVEAALNTED 361
           NP D + ++  +   D
Sbjct: 370 NPCDVIPLDIDVGDAD 385


>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 452

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           L  W +KNG+     I     S       +    A+  ++ G+   +VP +L  +++ V 
Sbjct: 66  LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122

Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
            +   + I E      L + + +AL L+YE   G KS +  YI+ L R    GQ   + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           L WS  E   L  S T+       + +  +Y      W  + +L +     +  ++F   
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 231

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
            F+ A   V S     +      R A      PLL  +++ +          +       
Sbjct: 232 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 282

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           ++++ DR    G+ IV+  GP+   + L ++GFV
Sbjct: 283 LKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFV 316


>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
 gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
 gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
          Length = 307

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 101/250 (40%), Gaps = 22/250 (8%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
            LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S 
Sbjct: 67  SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
                LF +    I    F++     A+  V +  V+L   Q+  L+       L P   
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233

Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
           LL +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293

Query: 349 DRLVVEAALN 358
             + V    N
Sbjct: 294 ACVYVSRGWN 303


>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
          Length = 475

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 66/290 (22%), Positives = 119/290 (41%), Gaps = 47/290 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A  DL+ G+    VP S ++T E V+ ++ + + +  +  LS    L + L+YE  +G
Sbjct: 55  LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           K S W PY+  L           +   ++ E E   L       ++ E     ++   + 
Sbjct: 115 KTSRWHPYLMHLPH-------TYDVLAMFGEFEKHAL-------QVDEAMWVTEKAMLKA 160

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
            + W  A SL Q   +    + FTF+ +  A   + S  +H   +       L P+G  L
Sbjct: 161 KSEWKEAHSLMQDLMF--KPQFFTFKAWVWAAATISSRTLH---IPWDEAGCLCPVG-DL 214

Query: 292 LAYSSKC--KAMLAAVDDAVQL---------------------VVDRPYKAGESIVVWCG 328
             Y +     + +  +D A QL                          YK G+ +++  G
Sbjct: 215 FNYDAPGIEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYG 274

Query: 329 PQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQYQDKRMVAQRNGK 376
              N +LL +YGF+ ++NP D++ +  E AL +    +  + +    NGK
Sbjct: 275 TYTNLELLEHYGFLLQENPNDKVFIPLEPALYSS-TSWSKESLYIHHNGK 323


>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 450

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 71/308 (23%), Positives = 122/308 (39%), Gaps = 51/308 (16%)

Query: 81  LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
            + WM  NG+    C++    +PS N ++      A  ++ Q        P  L +T   
Sbjct: 18  FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71

Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           VL +  +     +L+   ++ +   + L+L+ E+ +G+ SFW PY+  L  + G      
Sbjct: 72  VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125

Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
            +PL +SE EL+ L G+   +A   +   G+      LD     A S+F     +IP   
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDR----ANSVFWTRALNIPCP- 179

Query: 254 FTFEIFKQAFVAVQSCVVH----------LQKVSLARRFALVPLGPPLLAYSSKCKAM-- 301
                F   F        H             V +     +  L P +   +   KA+  
Sbjct: 180 ---HSFNNRFAVDLDSTTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVAL 236

Query: 302 ---------LAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
                    +  V +++ LV  +D     G  I +  G + N +LL  YGFV  +NP D 
Sbjct: 237 WEVDGPEGSVTGVPNSMYLVTGLDVVISNGSEIFISYGNKSNEELLYLYGFVLVENPDDY 296

Query: 351 LVVEAALN 358
           L+V + + 
Sbjct: 297 LMVRSTIG 304


>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
 gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
          Length = 402

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 106/219 (48%), Gaps = 35/219 (15%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
           +S    LAL+L+ ++  G KS W  ++  L DR+ G     ++ PL WS+ +   LT  P
Sbjct: 79  MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132

Query: 213 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 270
               +L++  +  + +Y++  T  F+A     +Y  D P +A+ +     A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179

Query: 271 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 318
                ++  ++ A      + L P   L+ +S     + C+   +++   + L   R Y 
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238

Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           A E I +  GP+ NS LL  YGF   +NP+D + +  AL
Sbjct: 239 ADEEIFLCYGPRSNSVLLCEYGFTVPENPWDDVDISDAL 277


>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
          Length = 482

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 89/225 (39%), Gaps = 47/225 (20%)

Query: 158 ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           + L L L+YE  Q      G    W PY+  L         A  +P+ WS  EL  L  S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154

Query: 212 PTKAEI-LERAEGIKREY--------------------NELDTVWFMAGSLFQQYPYDIP 250
           P  A++  E A+ + R                       ++  +    GS    Y +D+ 
Sbjct: 155 PAVAKVGRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLD 214

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
            E    E  +  +V  +     +  V +A           +L   ++    +   DD + 
Sbjct: 215 KEEDEDEDGEDGWVEDRDGKALMGMVPMA----------DILNADAEFNVHVNHGDDDLT 264

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVE 354
           +   RP +AGE I+ + GP PNS+LL  YG+V E +  YD  VVE
Sbjct: 265 VTALRPIRAGEEILNYYGPHPNSELLRRYGYVTERHARYD--VVE 307


>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
          Length = 495

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 122/326 (37%), Gaps = 51/326 (15%)

Query: 118 DLQAGDAAFSVPNSLVVTL-ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
           D+ + +  F++P SLV+T     +    + EL           L + ++YE  +G+ S W
Sbjct: 55  DIASDEELFAIPRSLVLTTATSSIPRSVLKELEDKGATGAWPPLIVTIIYEYLRGESSPW 114

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN------- 229
            PY + L            + + W++ ELA L  S    +I  R    + E+        
Sbjct: 115 HPYFKIL-------PTTFNTLMFWNDAELAELQASAVVDKIGRRQ--AEEEWQNTIIPTM 165

Query: 230 --------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
                         +L  +  MAGSL   Y +DI  +    +       A  +     + 
Sbjct: 166 ADHPDLFPVGGSSAKLIELAHMAGSLIMAYAFDIDRDDMEDDNDNDKDGADSADDEFEED 225

Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
                   +VP    L A + K  A L    D + +   +P  AGE I    GP P S L
Sbjct: 226 DEDEPFKGMVPFADMLNADADKNNARLFQEPDYLIMKATKPISAGEQIFNDYGPLPRSDL 285

Query: 336 LINYGFV-DEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS 394
           L  YG+V D    YD +     L  E         VA ++ K   QV+     RE+E   
Sbjct: 286 LRMYGYVTDNYAQYDVVEFSHDLLLE---------VAGKHSKSKDQVW-----REREQQL 331

Query: 395 DMLPYLRLGYV-----SDTSEMQSVI 415
           D L  L  GY       DT  +Q V+
Sbjct: 332 DELGVLDDGYAITRPEYDTQGLQDVL 357


>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
           gorilla]
          Length = 440

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 113/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G++S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGRRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
 gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
          Length = 536

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 86/217 (39%), Gaps = 34/217 (15%)

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
           +  +L  N       +  +L+ E+ +GK+SFW PYI  L       +L++  PLLW +ET
Sbjct: 97  VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153

Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFK 260
               L GSP    I  R +  + ++  L T     F+ G                 E   
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLS 197

Query: 261 QAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ--- 310
           +  V   S V+  +  SL            LVP    L   SS  +      D   +   
Sbjct: 198 EGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTAS 257

Query: 311 LVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 346
           L   + Y  G+ +    GP    S+L ++YGFVD++N
Sbjct: 258 LEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294


>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 492

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 97/225 (43%), Gaps = 33/225 (14%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           A+VP+   L A      A L    D +++V  +P K GE I    G  PN++LL  YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304

Query: 343 D--------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREK 390
           D          NP D + ++A L     +  P+      V     K  +  + +  G E 
Sbjct: 305 DFLSLPSGGHGNPGDVVEIKADLIISAVSSTPE-----AVKDDEAKERID-WWLEEGGED 358

Query: 391 EAISD--------MLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
             I D        M+ +++L  ++     ++   S     P    +E  + D L    + 
Sbjct: 359 VFILDYEYDLPPVMISFVKLLLLTQADWEKAREKS----KPPKSRLEGILYDILISTLEK 414

Query: 443 RLAGYPATLSEDEAMLT-DYNLHPKKRVATQLVRMEKKMLNACLQ 486
           RLA YP T+  D+A+LT D  L+ K  +  +L   EK++L+  LQ
Sbjct: 415 RLAEYPTTIETDKALLTNDTPLNNKNAIIVRL--GEKEILHGILQ 457


>gi|395332633|gb|EJF65011.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 502

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 61/223 (27%), Positives = 97/223 (43%), Gaps = 27/223 (12%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           A+VP+   L A      A L   +  +++V  +P +AGE I    G  PNS LL  YG V
Sbjct: 257 AMVPMADMLNARFESENAKLFYEERELKMVTTKPVEAGEQIWNTYGDPPNSDLLRRYGHV 316

Query: 343 D----------EDNPYDRLVVEAAL----NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 388
           D            NP D + V A L     ++  +Y  +  V     +    VF +    
Sbjct: 317 DVVPLRPPLSGMGNPRDIVEVRADLIVSAVSKKVEYSLQERVDWWLEEAEDDVFILRT-- 374

Query: 389 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 448
           + E   +++ + RL ++S+   +++   S  P   V P     VL    D   ARL  YP
Sbjct: 375 DCELPEELVSFERLLFLSEDEWIKTAKKSKLPKPKVDP----DVLTVAIDVLSARLKEYP 430

Query: 449 ATLSEDEAMLT-----DYNLHPKKRVATQLVRMEKKMLNACLQ 486
            ++ EDE +L+       +L+ K  V  +L   EK++L   L+
Sbjct: 431 TSIEEDEKLLSADKVESLSLNKKHAVIVRL--GEKRILQGTLK 471


>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 419

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 63/254 (24%), Positives = 109/254 (42%), Gaps = 38/254 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERV--LGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           A+ D Q GD    +P  L++   R   L N   A +     L +   +AL++ ++K    
Sbjct: 49  ATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIV----ALKQHPSIALFIAWQKIHPT 104

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
              W PYI  L R      L ++  LL     L Y        +I E A   K + ++LD
Sbjct: 105 PE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSKLD 149

Query: 233 TVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RFAL 284
           T +    +      Y+ IP +     IFK A++ V +  + +   ++++       +  +
Sbjct: 150 TDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQPI 204

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           + L P L   +    A ++A  D V+    +    PYK G  + +  GP  N+ LL  YG
Sbjct: 205 ITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINYGPHDNNFLLAEYG 264

Query: 341 FVDEDNPYDRLVVE 354
           F    NP++ +V++
Sbjct: 265 FAILKNPFNHVVLD 278


>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
 gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
          Length = 442

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 79/195 (40%), Gaps = 20/195 (10%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ E  +GK SFW PY+  L          +  P  W E ++AYL  +     I E  
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170

Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 273
             +K E+ +    L    F   + +    Y      FT   F+ + +   +   H+    
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230

Query: 274 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWC 327
            Q V L   F+++    PLL  ++     + + D     DA  LV    Y  G  +    
Sbjct: 231 PQSVEL-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNY 286

Query: 328 GPQPNSKLLINYGFV 342
           G + NS+LL+ YGF+
Sbjct: 287 GLKTNSELLLGYGFI 301


>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 509

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 92/221 (41%), Gaps = 23/221 (10%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           A+VP+   L A      A L   +  +++V  +P KAGE I    G  PNS LL  YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323

Query: 343 D----------EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG----KLSVQVFHVHAGR 388
           D            NP D + V A L       + K  + +R      +    VF +    
Sbjct: 324 DLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFVLRT-- 381

Query: 389 EKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYP 448
           + E   +++ ++RL  +      ++   S  P     P +++ VL    D  + RL  YP
Sbjct: 382 DCELAEELVSFVRLLLLPKDEWEKAAQKSKLP----KPKLDKDVLTIAVDVLEKRLKDYP 437

Query: 449 ATLSEDEAMLTDY---NLHPKKRVATQLVRMEKKMLNACLQ 486
            TL EDEA+        L   KR A  +   EK++L   L+
Sbjct: 438 TTLEEDEALFAPERFGELSLNKRHAVVVRLGEKRILRGTLK 478


>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
 gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
          Length = 472

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 69/329 (20%), Positives = 133/329 (40%), Gaps = 67/329 (20%)

Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
           ++ L + E E+ YL GSP   +I+   E   + Y++L    F    + +       +   
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220

Query: 255 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 306
            ++ F+ A   + +  +++       S  ++  L P+ PP++ Y +      A +D    
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280

Query: 307 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGF----VDEDNPYDRLVVE-- 354
             +V +   +  K GE I V     +CG    S LL++YG+    +D+ +  + L+ E  
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYLPNQIDDKSCVNVLMEELL 336

Query: 355 AALNTEDPQYQDK-----RMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTS 409
             +N +DP   DK     +++  ++ KL +                              
Sbjct: 337 ETINLDDPIKDDKYYLVNKLLETKDIKLKIS----------------------------- 367

Query: 410 EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--TDYN-LHPK 466
            M S+   L  I       + ++L+ L      ++  YP T+ +D+  L   +YN L  +
Sbjct: 368 -MDSLTEDLLKISKYMSYKQESLLEYLKRLVSLKIGHYPTTIIQDKEFLLSKEYNQLSAR 426

Query: 467 KRVATQLVRMEKKMLNAC---LQVTADMI 492
            ++A  L   EK++L+     LQ   D I
Sbjct: 427 SKLAFNLAFQEKQILSNVYTKLQENIDTI 455


>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 62/256 (24%), Positives = 105/256 (41%), Gaps = 28/256 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A EDL  GD    +P +++++ E V  +     L     +     + L+ M EK      
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMMLLWSMKEKHIVDSE 248

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L         A  + L +    +  L G+    E+++  E ++++YNEL   
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
            F A  L   +P   P E +++E F  A     S  + +       R  LVP        
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
           L P +L Y       + +  D+++  + RP +AGE   +  G    S L+  YGF+ E D
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGD 409

Query: 346 NPYDRLVVEAALNTED 361
           N  D + ++     +D
Sbjct: 410 NVNDVIPLDIDFGDDD 425


>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
          Length = 643

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 95/455 (20%), Positives = 183/455 (40%), Gaps = 63/455 (13%)

Query: 71  VSKKEEDLGDLKSWM-HKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           +S   EDL   + W+ +KN  L P   I+   P           + A+ +++  +    +
Sbjct: 202 ISTTPEDLKSFQQWLSNKNTYLNPSIDIVDLGPPFGRS------MVANTNIKKDEILVEI 255

Query: 129 PNSLVVTLERVLGN--ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
           P  +++T + ++ N    I + +   K+S     A+ ++Y     + S+W  Y+  L +Q
Sbjct: 256 PKGIMMTPKSMIKNLPRFIIDWMDEMKISRTDQQAIAIIYSILH-EDSYWYEYVSILPKQ 314

Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN-------------ELDT 233
                    + + ++  E+  L  SP       R  G+ R Y+             E D+
Sbjct: 315 -------FTTTVYFTREEMTQLQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDS 367

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
                 S       +   + +T + FK A   V S    L +        +VPL     A
Sbjct: 368 TKTKTKSQLDAMK-EFKDDRYTLDQFKWALGCVWSRAFSLSE----EDGGMVPLADMFNA 422

Query: 294 YS----SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP---NSKLLINYGFVDED- 345
            +    SK    ++A   ++     +  +AGE I    G      + ++L++YGF+ ED 
Sbjct: 423 DTVISRSKVHPKISASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDG 482

Query: 346 NPYDRLVVEAA-LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 404
           +  D  +V  A +   +P Y  KR + Q NG +  + F +    + +   ++  + R+  
Sbjct: 483 SSADSTIVTVAPIPPSEPLYDLKRHLMQSNG-IESEEFTI---TKNKLAKELFLFARIKS 538

Query: 405 VS--DTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML---- 458
           ++  ++ +  +   S      ++P  E+A L  L++     L  Y  T+ +D  +L    
Sbjct: 539 INKKESDQASAHFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQDNQILKEIE 598

Query: 459 ---TDYNLHPKKRVAT----QLVRMEKKMLNACLQ 486
              T+ N H      T    +L  MEK +LN+ L+
Sbjct: 599 KDKTNTN-HSSVTFNTINAIKLRLMEKNILNSFLK 632


>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
 gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
          Length = 478

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 125/305 (40%), Gaps = 51/305 (16%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
           SW+  +G P  KV  K + +          V A  D+  G+  F++P   V++ +    N
Sbjct: 22  SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75

Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
             + +LL+ +  +L     L L ++YE   G +S W  Y + L R+        ++ + W
Sbjct: 76  SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128

Query: 201 SETELAYLTGSPTKAEILER--AEGIKREYNELDTVWFMAG-SLF------QQYPYDIPT 251
           S +EL  L GS     I++R   EG +    E+      A  SLF        Y  D  T
Sbjct: 129 SPSELQELQGSA----IVDRIGKEGAEESILEMIAPIVRANPSLFPPVDGLASYDGDAGT 184

Query: 252 EAF-------TFEIFKQAF-----------VAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
           +A           I   AF              +S  V   +  L++   +VPL   L A
Sbjct: 185 QALLNLAHVMGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSK--GMVPLADLLNA 242

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLV 352
            + +  A L   +  + +   +P  AG  I    G  P + LL  YG+V D  +PYD  V
Sbjct: 243 DADQNNARLFQEETGLVMKAIKPISAGAEIFNDYGEIPRADLLRRYGYVTDNYSPYD--V 300

Query: 353 VEAAL 357
           VE +L
Sbjct: 301 VELSL 305


>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 403

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
           A LA+ LM +   G  + W  Y   L         AV+S ++WS+ EL  L GS  +   
Sbjct: 47  ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 276
           + R +  KREY+ L    F A  L +  P      EA++F++F+ A+  V          
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 321
            +AR F L  L         +C A+L  +D      DA + VV+R   A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184


>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
          Length = 440

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 69/333 (20%), Positives = 127/333 (38%), Gaps = 39/333 (11%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           SR V    + +   LK W+         ++    P           + +   L+ G    
Sbjct: 22  SRGVNESHKPEFIKLKKWLKDRNFEDTNLMPARFPGTGRG------LMSKTSLREGQMII 75

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDR 185
           S+P S ++T + V+ +   A ++      S L  L  +L+ EK  G +S W PY+  L +
Sbjct: 76  SLPGSCLLTTDTVIRSSLGAYIIKWKPPPSPLLALCTFLVSEKHAGDQSVWKPYLDILPK 135

Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
                  +   P+   E E+  L   P KA+  E+   +++ +      +     LF++ 
Sbjct: 136 -------SYTCPVC-LEPEVVNLLPEPLKAKAEEQRMSVQQFFASSRDFFSSLQPLFEEA 187

Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS--------- 296
              +    F++     A+  V +  V+L+     RR   + L P   A +          
Sbjct: 188 TDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLSLEPDTCALAPYLDLLNHSP 239

Query: 297 --KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----D 349
             + KA         ++     Y+  + + +  GP  N +LL+ YGFV   NP+      
Sbjct: 240 NVQVKAAFNEETGCYEIRTASDYRKHKEVFICYGPHDNHRLLLEYGFVSLCNPHACVYVS 299

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           R ++   L + D Q   K  + + +G L    F
Sbjct: 300 REILVKYLPSTDKQMNKKISILKDHGFLENLTF 332


>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
          Length = 653

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
           AA  +++  D    VP  +++T+ER L +      +  A +    +  +   L ++L+YE
Sbjct: 56  AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
            ++G +SFW PY   +D     G L    P  WS+  +  L  S  K +I +  +  + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167

Query: 228 YNEL 231
           ++ L
Sbjct: 168 WDML 171


>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 714

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 86/224 (38%), Gaps = 46/224 (20%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
           A +L+ +   G KS+W PYI  L         Q   E+ LLW       L G+  KA   
Sbjct: 128 AFFLLEQLVLGDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFA 180

Query: 219 ERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQ 274
             A   K  Y        + G   L Q    +    A+T+E F+ A      +S    + 
Sbjct: 181 AEAARWKEMY--------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVL 232

Query: 275 KVSLARRFALVP------------LGP----------PLLAYSSK---CKAMLAAVDDAV 309
             +L    AL+             LG           PL+  S+     K    A    V
Sbjct: 233 DATLPADKALLQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFV 292

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
            L V  PY++G+ I    GP+ N  LL+ YGF   DNP+D +V+
Sbjct: 293 GLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVI 336


>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
           troglodytes]
 gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
 gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
 gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
 gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
          Length = 440

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
 gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
 gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
 gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
          Length = 440

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
          Length = 440

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S +   KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 485

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 63/291 (21%), Positives = 111/291 (38%), Gaps = 42/291 (14%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +  SW+ +   P  KV  K K +          + A +D+   +  F++P SLV++ +  
Sbjct: 19  EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
             N  + +LL  N+       CL + ++YE  QG  S W  Y + L           ++ 
Sbjct: 75  --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
           + W++ EL  L+GS     +L +      E N    +  +       +P       +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANIFRNILPLVSGNPSLFPPMSGVASFDSP 181

Query: 251 ----------------TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                             A+ F+I K      +    ++          +VPL   L A 
Sbjct: 182 EGKAALLSLAHRMGSLVMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNAD 241

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
           + +  A L   D  + +   +P + GE I    G  P + LL  YG+V ++
Sbjct: 242 ADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDN 292


>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
 gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
          Length = 469

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 81/373 (21%), Positives = 142/373 (38%), Gaps = 69/373 (18%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           L     L L ++YE  QG  S W PY   L +Q        ++P+ WS+ EL  L G+  
Sbjct: 91  LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
            AE + + E      +++  +     ++F       PTE     +  +    + +    L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203

Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
           +                       +VP+   L A +++  A +   +      +    +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262

Query: 320 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLS 378
           G+ ++ + GP P S+LL  YG+V  E + YD + V   L  E         V      LS
Sbjct: 263 GDQVLNYYGPLPTSELLRRYGYVTPEHSRYDVVEVPWTLVKE---------VIVSCLSLS 313

Query: 379 VQVF-HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLA 437
            + +  V +  + E I D   Y  +   S         ++   +  VSP +    ++QL 
Sbjct: 314 AEAWKQVESQIDDEEIED---YFVIERDSGEPGPDGRFTAPAVLREVSPEL----VEQLK 366

Query: 438 DYFKA-----------------------------RLAGYPATLSEDEAMLTDYNLHPKKR 468
           ++ KA                             RLA YP ++  DE +L + +L  ++R
Sbjct: 367 EFLKAVKKLDSERIPDKRKRDEICDAVIAEVLKVRLAQYPTSIETDEKLLAEADLPARRR 426

Query: 469 VATQLVRMEKKML 481
           +A  +   EKK+L
Sbjct: 427 MAVVVRLGEKKLL 439


>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
           24927]
          Length = 468

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 68/299 (22%), Positives = 121/299 (40%), Gaps = 41/299 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTT--NKLSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+S++ T+E    +  I  +L +    LS    LA+Y+++ +        
Sbjct: 34  FKEGERILTIPSSILWTVEHAYADSIIRPVLQSMQGALSVDDTLAIYILFVRS------- 86

Query: 177 LPYIRELDRQRGRGQL-----AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +     +  S + +++ EL    GS       +  + I+ +Y  L
Sbjct: 87  ----RESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL 142

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                    LF QY        FT E +K A   V S  +   +        L P    +
Sbjct: 143 ------VERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-M 195

Query: 292 LAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
           L +SS   KC     +  D + ++  + Y+ G+ + +  G  PN++LL  YGFV  +NP 
Sbjct: 196 LNHSSDVKKCHVYDTSSGD-LSILAGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPN 254

Query: 349 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRL 402
           D   +      E P ++ K+ +    G  SV    +       +++D LP     YLR+
Sbjct: 255 DSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISL-------SLNDPLPKSVLQYLRI 306


>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
          Length = 440

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
          Length = 440

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 64/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L+   +  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELDTCALAPYLDL 234

Query: 292 LAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
 gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
          Length = 986

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 68/285 (23%), Positives = 111/285 (38%), Gaps = 67/285 (23%)

Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
           V A+E++      GD  FS+P + ++T      + T   + EL   ++ + +   L  +L
Sbjct: 45  VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
             E+++G  S W PYIREL            +PL WS  E   L G+        R  G 
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT--------RLGGA 149

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA----- 279
            +         F   +L Q     +P  AF   +  Q  ++  +  +    +SLA     
Sbjct: 150 TK---------FHDCALLQLTEVCVP--AFIAILRAQLILSANTKAIASGAISLAQDALS 198

Query: 280 -----------------------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--- 313
                                  R  ALVPLG  +L +S   +      D A Q ++   
Sbjct: 199 PDRLAWSHSCVSSRAFSLFLNGQRTIALVPLG-DMLDHSPDAQIEWRTDDTAGQFLIISH 257

Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
           DR   AG  +    G + N +L++ YGF  + +  + L V  A++
Sbjct: 258 DR-LPAGSIMFNNYGAKSNEELILGYGFFMKSSVLETLYVRLAVD 301


>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 724

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 81/331 (24%), Positives = 127/331 (38%), Gaps = 54/331 (16%)

Query: 71  VSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP- 129
           + K  E     + W  ++G+    + L E          +  +AA++++  G+   S+P 
Sbjct: 67  IQKSGEGPLGFQEWALQSGITSPSLRLAEFAG-------LRGMAAADNIAKGEVLVSLPV 119

Query: 130 -NSLVVT-LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQR 187
             +LVV+  ER     T       +K      +AL L+YE++ G  S   PY+  L    
Sbjct: 120 AAALVVSPKERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALP--- 176

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL------------DTVW 235
               +   +PL WSE +L  L       E+  + EG+KR + EL            D +W
Sbjct: 177 ----VDFSTPLSWSEAQLQALCYPQLIREVATQREGLKRLHAELAVSTPGTPITEQDLIW 232

Query: 236 FMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL-GPPLL 292
            +     + +  PY  PT     + F        + +     ++ A   AL  L    +L
Sbjct: 233 ALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVL 292

Query: 293 AYSSKCKAMLAAVD-----DAVQLVVDRPYKA-------------GESIVVWCGPQPNSK 334
           +   K  AM   VD       VQ  V+  Y A             GE + +  G Q N  
Sbjct: 293 SQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSVRCQSYFSKGEQVFISYGKQSNDS 352

Query: 335 LLINYGFVDEDNPYDRLVV----EAALNTED 361
           LL  YGFV+   P+D   +     AAL   D
Sbjct: 353 LLQYYGFVEPGIPHDTYTIPDLRAAALALSD 383


>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
 gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
 gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
          Length = 416

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 113/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+ +  +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 44  LQEGQMIISLPESCLLTTDTVIRS-YLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 102

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 103 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 154

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 155 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 210

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 211 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 270

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 271 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 308


>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 59/140 (42%), Gaps = 6/140 (4%)

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
           LE  E +++   E    W    + F+    D+P E +T+        +  +      K  
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
              R AL+P+          CK   +A  D   +V DR YK GE + +      N  +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226

Query: 338 NYGFVDEDNPYDRLVVEAAL 357
            YGF+ +++  D + ++ A+
Sbjct: 227 EYGFIPDESLDDDVYIDDAV 246


>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
          Length = 440

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 64/278 (23%), Positives = 112/278 (40%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  + +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S +   KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSRDKQMDKKISILKDHGYIENLTF 332


>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
          Length = 1658

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 76/351 (21%), Positives = 139/351 (39%), Gaps = 27/351 (7%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
            V  ++ ++  +   SVP   ++ ++    +  +  +L      L++   L L+++YEK +
Sbjct: 1212 VVTTKKVEENECVVSVPRKFLINVDCARKHPVLNSILFEEATGLNDDTILFLFVIYEK-E 1270

Query: 171  GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
               SFW P+   L            + + ++ TEL  L G+     + E    IK     
Sbjct: 1271 NPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQIKEHLES 1319

Query: 231  LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
            +  + F    L  QYP   P   FT E F  A     S  + L K+       LVP+   
Sbjct: 1320 IRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCLVPMADM 1376

Query: 291  LLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
            +  +      +      +D  +++      A   I +  G   + +L + YGFV  +N Y
Sbjct: 1377 INHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWELALYYGFVISNNHY 1436

Query: 349  DRLVVEAALNTED-PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
            D + +   +  ED P+ ++++        L+V   ++H        S +L  LR+  +++
Sbjct: 1437 DSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHYLHRSN---IPSKLLASLRVALLAE 1493

Query: 408  TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
              E    +    PI   S   E  VL  L       L  + +T  ED+ +L
Sbjct: 1494 -DEFNPHVDVWNPI---SRSNEEVVLYTLYSTVLMLLKQFSSTCDEDQQLL 1540


>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
          Length = 761

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 23/141 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
           V A++D+ A  A   VP  L+++ E+  L + +I      EL   N+ S  E   L  YL
Sbjct: 46  VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
             EKK+G+KSF+ PY++ +         +  + + WS+ EL Y+       E       I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152

Query: 225 KREYNELDTVWFMAGSLFQQY 245
            RE  +L  +W  A  +F ++
Sbjct: 153 VRE--DLKDLWNQAKEIFNEF 171


>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
 gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
          Length = 440

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 71/323 (21%), Positives = 136/323 (42%), Gaps = 47/323 (14%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
           L+ W+++ G     +I            P+++      + A++ ++A ++  S+P   ++
Sbjct: 37  LRRWLNERGFTSQSLI------------PVNFHDTGRGLMATQTIKAKNSVISLPEECLL 84

Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           T   VL +  +A+ +      +S L  L  +L+ E+  G+ S W PYI  L +       
Sbjct: 85  TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
               PL + +  +  L  S  K +  ++ E  +  ++   T +     LF Q     PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190

Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAA 304
             F+ +  + A+ +V +  V   H Q   L+R   +  L P   LL +  + + +A    
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNK 250

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNT 359
                ++      K  +   +  GP  N +LL+ YGFV   NP+     D   ++  L+ 
Sbjct: 251 ETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDE 310

Query: 360 EDPQYQDKRMVAQRNGKLSVQVF 382
           +D Q ++K +  + N  L    F
Sbjct: 311 KDKQLKEKLLYLKDNDFLRNLTF 333


>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
 gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
          Length = 440

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 71/318 (22%), Positives = 127/318 (39%), Gaps = 35/318 (11%)

Query: 39  NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
           N G     +RR+  F+    SS+     SR V    + +  +LK W+         +I  
Sbjct: 3   NGGGRTSRIRRRKLFT----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDTTLIPA 53

Query: 99  EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
             P           + +   LQ G    S+P S ++T + V+ +   A +       S L
Sbjct: 54  HFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
             L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L  +P K + 
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
            E+   +   ++     +     LF +    I    F++   + A+ AV +  V++++  
Sbjct: 160 WEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAVYMKRPP 215

Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWCGPQ 330
            + L+       L P   LL +S   +   A  ++     +    + G+   + +  GP 
Sbjct: 216 LLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICYGPH 275

Query: 331 PNSKLLINYGFVDEDNPY 348
            N +LL+ YGFV   NP+
Sbjct: 276 DNHRLLLEYGFVCVSNPH 293


>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 513

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 97/222 (43%), Gaps = 26/222 (11%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
           A+VP+   L    +   A L   D+ V +++     KAGE I    G  PNS LL  YGF
Sbjct: 271 AMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYGF 330

Query: 342 VD----------EDNPYD------RLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVH 385
           +D            NP D       LVVEAA      + QD+  V     +    VF V 
Sbjct: 331 IDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDR--VDWWLEEAEDDVFVV- 387

Query: 386 AGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLA 445
            G + E   +M+   RL  +   +E +    + G +    P M+  +     D  ++RL 
Sbjct: 388 -GTDCELPPEMVSLARL-LLQPKAEWEKT-KAKGKVP--KPTMDTTIAAIAMDVLQSRLK 442

Query: 446 GYPATLSEDEAMLTDYN-LHPKKRVATQLVRMEKKMLNACLQ 486
            YP ++ EDE +L D + L   +++A  +   EK++L   L+
Sbjct: 443 EYPTSVEEDERLLADESQLGFNRKMAVTVRLGEKRILAGTLR 484


>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
 gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
          Length = 490

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 44/289 (15%)

Query: 80  DLKSWMHKNG---LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
           DL  W+ + G    P  +V      +H E    +   A   D+  GD   ++P  L + L
Sbjct: 60  DLVRWVQREGGFVHPALRVA-----NHPEHGLGVSAAAPDGDIPPGDVLIALPGRLPLRL 114

Query: 137 ERVLGN-ETIAELLTTNKLSELACLALYL-MYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
            R  G  + +   L      EL  + L L + +++    SFW PYI  L           
Sbjct: 115 RRPTGAADDVLVQLAQQVPEELWAMKLGLRLLQERAKSDSFWWPYIANLPE-------TF 167

Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
             P+ +   ++  L  +P   ++ +R   +     E+           QQ  + +P+   
Sbjct: 168 TVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEKEI-----------QQKLHTVPSVDH 216

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC------------KAML 302
            F  + Q   +  S +      + +R F L    P LL     C            +  +
Sbjct: 217 PF--YGQDVNS--SSLGWAMSAASSRAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGSV 272

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
            ++D +V++V ++  +   SI +  G  PN   L++YGFV   NPYD++
Sbjct: 273 NSLDMSVKVVAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYDQV 321


>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 440

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 111/278 (39%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q+  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA         ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSAHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
          Length = 439

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 126/318 (39%), Gaps = 35/318 (11%)

Query: 39  NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
           N G     +RR+  F     SS+     SR V    + +  +LK W+         +I  
Sbjct: 3   NGGGRTSRIRRRKLFR----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDATLIPA 53

Query: 99  EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
             P           + +   LQ G    S+P S ++T + V+ +   A +       S L
Sbjct: 54  RFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107

Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
             L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L  +P K + 
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
            E+   ++  ++     +     LF +    I    F++   + A+  V +  V++++  
Sbjct: 160 WEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMKRPP 215

Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWCGPQ 330
            + L+       L P   LL +S   +   A  ++     +    + G+   + +  GP 
Sbjct: 216 QLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICYGPH 275

Query: 331 PNSKLLINYGFVDEDNPY 348
            N +LL+ YGFV   NP+
Sbjct: 276 DNHRLLLEYGFVSVSNPH 293


>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 513

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 104/442 (23%), Positives = 163/442 (36%), Gaps = 86/442 (19%)

Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
           A +D+  G   FSVP SL ++     L +++G E   + L  NK      L L +M+E+ 
Sbjct: 34  ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           +   S W  Y   L R       A ++P+ W+  EL  L G+    +I +  E  +R+Y 
Sbjct: 91  RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141

Query: 230 EL---------DTVWFMAGSLFQQYPYD--------IPTEAFTFEIFKQAFVAVQS---- 268
           E+         D   F  G +   Y  +        I + +F  E +K+     QS    
Sbjct: 142 EILNPAVRTRPDL--FDPGHIASFYSLENYHVMGSRILSRSFHVEKWKEQTPGSQSRASS 199

Query: 269 -------CV------VHLQKVSLAR--------------RFALVPLGPPLLAYSSKCKAM 301
                  C+       +L  V                    A+VP+   L A      A 
Sbjct: 200 ELHENGDCMDIDDESSNLSAVGAENGGDDDSDDEAENPSDIAMVPMADMLNAQYGSENAK 259

Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD----------EDNPYDRL 351
           L      + +V  +P + GE I    G  PNS LL  YG VD          E NP D +
Sbjct: 260 LFYEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGHVDLVPLPGVPWKEGNPADVV 319

Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAIS------DMLPYLRLGYV 405
            + A L          R+ A+ + K  +  +    G +   +       D++       +
Sbjct: 320 EIPADLALHAVLSSQARVDAE-SLKERIDWWLEEGGDDVFVLGTDLELPDVMISFLKLLL 378

Query: 406 SDTSEMQSVISSLGPICP-VSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
               E +   S   P  P +    +      +    + RLA YP TL  DEA+L+     
Sbjct: 379 LSKLEWEKARSKSKPPKPKLDMDSKLQTFPLVLGMLERRLAKYPTTLEHDEALLSGQTSL 438

Query: 465 PKKRVATQLVRM-EKKMLNACL 485
           P       +VR+ EK +L  C+
Sbjct: 439 PYNVRNAIIVRIGEKHILVGCM 460


>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
          Length = 424

 Score = 48.1 bits (113), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 76/349 (21%), Positives = 137/349 (39%), Gaps = 59/349 (16%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK---LSELACLALYLMYEKKQG 171
           A+  + +G+    +P  L+++ +    +  +  +   N+     +   LAL+L+ E    
Sbjct: 43  AAAAVASGEPMLCIPRRLLISEDLCWRDPQLGRVFQDNRDVFTRDDPVLALFLVRELLLA 102

Query: 172 KKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTKAEILER-AEGIKR 226
            +SF+ PY            LAV    ES   W++ EL  L          ER  +   R
Sbjct: 103 DRSFFHPY------------LAVLPYPESVQDWTQAELGELHD--------ERLVDAAAR 142

Query: 227 EYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
             +E+D  +       Q +YP + P   +TF+ FK A+  +Q+        +  RR    
Sbjct: 143 RTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA-------RTFGRRLPWT 195

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINY 339
           ALVP    L   +   K      D+ +  +       +  G  +    G + N +LL++Y
Sbjct: 196 ALVPFADCLNHTNVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNSYGRRSNFQLLLDY 255

Query: 340 GFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPY 399
           GF   DN +D + VE   +   P          R  KL      V   R+  ++ ++ P 
Sbjct: 256 GFALPDNEWDYVDVEIGKDRAGP----------RGRKLRFMKRVVRIDRQS-SLDELFPP 304

Query: 400 LRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
             L  ++D       SE  + +S    +C     +   +++ +AD+  A
Sbjct: 305 SFLAGLADPVPDEEQSEAAAELSERTALCDALEWLRSILIETIADWGTA 353


>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
          Length = 485

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 68/302 (22%), Positives = 115/302 (38%), Gaps = 45/302 (14%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+ +   P  KV  K K +          + A +D+   +  F++P +LV+  +    N 
Sbjct: 23  WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76

Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            + +LL  N+       CL + ++YE  QG  S W  Y + L           ++ + W+
Sbjct: 77  RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129

Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
           + EL  L+GS     +L +      E N L  +  +       +P       +D P    
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185

Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                         A+ F+I K      +    ++          +VPL   L A + + 
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 357
            A L   D  + +   +P + GE I    G  P + LL  YG+V D    YD   VE ++
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDNYAQYDE--VEISM 303

Query: 358 NT 359
            T
Sbjct: 304 RT 305


>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
          Length = 440

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 65/260 (25%), Positives = 108/260 (41%), Gaps = 22/260 (8%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
           LQ G    S+P S ++T + V+ +   A +      LS L  L  +L+ EK  G +S W 
Sbjct: 68  LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L   P KA+  E+   ++  +      +  
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PL 291
              LF +    I    F++  F  A+  V +  V++ K    R F+  P    L P   L
Sbjct: 180 LRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYM-KPRRRRCFSAEPDTYALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +S   + +A         ++      +  E + +  GP  N +LL+ YGFV   NP+ 
Sbjct: 235 LNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPHA 294

Query: 350 RLVVEAALNTEDPQYQDKRM 369
            + V   +  +     DK+M
Sbjct: 295 CVYVSKDILVKYLPSTDKQM 314


>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
          Length = 732

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 94/208 (45%), Gaps = 36/208 (17%)

Query: 81  LKSWMHKNG----LPPCKVI--LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
            K+W+  NG    L   K++  L E             + A+ +++ GD    VP  L +
Sbjct: 70  FKNWLASNGCQESLDKVKIVRTLAEGTG----------LIANTEIKEGDEFIKVPLKLFM 119

Query: 135 TLE---RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           + E   + +G++   E L   K+     L ++L+ E ++ ++SFW PYIR L +      
Sbjct: 120 SQETAFKSIGDKVSREPLF--KMLPNMLLVIHLIQETQKQQQSFWAPYIRMLPK------ 171

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
            + ++ L ++  E   L GSP    +LE  E I    N L    F+    F + P  + T
Sbjct: 172 -SYKTALYFTLAEFQLLIGSP----VLE--ESINTYRNTLRQYCFLY-DFFGKNPGILST 223

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
             FT+E F+Q  +A    +V L K  LA
Sbjct: 224 SNFTWE-FEQNELAAYKSIVSLLKKRLA 250


>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 62/256 (24%), Positives = 104/256 (40%), Gaps = 28/256 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A EDL  GD    +P +++++ E V  +     L     +       L+ M EK      
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVDSE 248

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F + Y   L         A  + L +    +  L G+    E+++  E ++++YNEL   
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
            F A  L   +P   P E +++E F  A     S  + +       R  LVP        
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
           L P +L Y       + +  D+++  + RP +AGE   +  G    S L+  YGF+ E D
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGD 409

Query: 346 NPYDRLVVEAALNTED 361
           N  D + ++     +D
Sbjct: 410 NVNDVIPLDIDFGDDD 425


>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
          Length = 485

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 68/302 (22%), Positives = 115/302 (38%), Gaps = 45/302 (14%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           W+ +   P  KV  K K +          + A +D+   +  F++P +LV+  +    N 
Sbjct: 23  WLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLGFQ----NS 76

Query: 144 TIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
            + +LL  N+       CL + ++YE  QG  S W  Y + L           ++ + W+
Sbjct: 77  RLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTLMFWT 129

Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIPTE-- 252
           + EL  L+GS     +L +      E N L  +  +       +P       +D P    
Sbjct: 130 DEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSPEGKA 185

Query: 253 --------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
                         A+ F+I K      +    ++          +VPL   L A + + 
Sbjct: 186 ALLSLAHRMGSLIMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNADADRN 245

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 357
            A L   D  + +   +P + GE I    G  P + LL  YG+V D    YD   VE ++
Sbjct: 246 NARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDNYAQYDE--VEISM 303

Query: 358 NT 359
            T
Sbjct: 304 RT 305


>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
 gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
          Length = 607

 Score = 48.1 bits (113), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 95/220 (43%), Gaps = 16/220 (7%)

Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
           +A  +   +L+  A LAL++++E     +S    Y+  L    G+   +V  PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193

Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY---DIPTEAFTFEIFKQ 261
           +A L G+PT   +L RA+ +   +  L       G   +++ +    + + A + +    
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAALFGSGGGGGVPLEKFAWALSSVLSRAASGDRMPY 253

Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
           AF+     + H   V      + V L P         +  +   D  V  V D P  AGE
Sbjct: 254 AFLPGVDLLNH-GGVDANCELSAVKLAP------GGNEENVTWGDVEVTCVKDTP--AGE 304

Query: 322 SIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEAALNTE 360
            + +  G +  N +LL  YGF    N +DR  +E  L  +
Sbjct: 305 QLTISYGDESDNCRLLRLYGFATRGNVHDRRTIELRLTGD 344


>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
 gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
          Length = 236

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           + S W PYI  L    G     +++  LW +TEL+YL  SP   +  ER E I  E+ ++
Sbjct: 74  QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                +   LF +          + E F   +  V S     + +++     LV +  P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172

Query: 292 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
           L + +   A  A +      +   +  DR     + I + CG   N++L ++YGF     
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229

Query: 347 PYDRLV 352
           P +RL+
Sbjct: 230 PENRLI 235


>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
           chloroplastic-like, partial [Cucumis sativus]
          Length = 923

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 92/219 (42%), Gaps = 39/219 (17%)

Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
           ++ ASE ++AGD    VP       +SL + +  +LGNE          +  +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           + E K G  S W PYI  L +        + + + W E+EL  +  S    E L +   I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           KRE+  +           + +P  I  +  + + F  A+  V S     +        +L
Sbjct: 839 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 884

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 321
           +P     L +    +AML   DD    ++V DR +  GE
Sbjct: 885 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922


>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
 gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
          Length = 427

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 79/343 (23%), Positives = 139/343 (40%), Gaps = 46/343 (13%)

Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           +N+ +E A LA+ L  E+++G  S + P++                  L+  T       
Sbjct: 79  SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121

Query: 211 SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
           S    E L    AE + R Y+E D V  W  A   F+ +P     + FT   F++A   V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180

Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
           ++    + +    R   LVPL   L+  +S     +  VDD   + VD  ++AG+ +   
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239

Query: 327 CGPQPNSKLLINYG----FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL--SVQ 380
            G   +++    +G    + +E+N  D +            + D+  + +  G    +  
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARDVITF---------TFPDEVHLKEEIGSCGPAED 290

Query: 381 VFHVHAGREKEAISDMLPYLRL--GYVSDTSEMQSVISSLGPIC--PVSPCMERAVLDQL 436
           +     G    A ++++  LRL     ++ SEM+     L  +   P+S   E AV D L
Sbjct: 291 IGFTRDG----ASAELMCALRLVSANATEWSEMRKPNFDLQSLKNRPLSEESEVAVYDAL 346

Query: 437 ADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKK 479
                  L  YP +  +DE +L    L   +R A ++   EK+
Sbjct: 347 FATLTDLLNSYPYSDVDDEHLLRGDRLADDERRAVKIRLREKR 389


>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
          Length = 320

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 95/227 (41%), Gaps = 33/227 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +  G+    VP  L++   + L   ++  LL   +L +  C+ L LM E      S
Sbjct: 32  ASKRISPGETFLKVPRHLLLGPHQ-LRASSLDRLLEGWQLPD--CMLLLLMCESVN-SSS 87

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F+ PY+  L          V++P+ WS+ E   L GSP     ++    + R + E    
Sbjct: 88  FFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE---- 136

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
             M   +F +YP   P   F++E ++ A+  ++S             + L+PL   +  +
Sbjct: 137 --MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG--------NYTLMPLIDLMNHH 186

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINY 339
                A     D +  L+  R Y       VW   G + ++ LL+NY
Sbjct: 187 PDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLLLNY 227


>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1188

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 22/202 (10%)

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           PL W++ EL +L G+     I ER   ++ ++  +  V      L ++ P   P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298

Query: 257 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKC------------KAMLA 303
           E +  AF    S    L+  V+         +G P+     +C             A + 
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358

Query: 304 AVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
              D  +V+       + GE +    GP+ N +LL+ YGF   +N  D + ++  +   D
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQLTVGN-D 417

Query: 362 PQYQDKRMVAQRNGKLSVQVFH 383
           P  + K  + + +G       H
Sbjct: 418 PDGEAKLAILRWHGLSLTHFLH 439


>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
 gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
          Length = 353

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 37/71 (52%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
             ++PL       S+K    +  V++  Q++ ++P K GE I    G   N  LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230

Query: 342 VDEDNPYDRLV 352
           ++ DNP+D  +
Sbjct: 231 IEFDNPHDHFI 241


>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 372

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 57/245 (23%), Positives = 100/245 (40%), Gaps = 24/245 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A+ D+QAG+    VP  L  TL+ V   + I+  L  N +S  A LA YL  +K    
Sbjct: 33  IIATRDIQAGETILFVPFKLFRTLKHV--PKAISRRLPRN-MSLHALLATYLSLDKTD-- 87

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
            +F +P     D       +    P LW   EL      P    ++++    KR+     
Sbjct: 88  -TFAIPNKTLPDLSSFEAGM----PFLWP-AELHPFLPKPALDLLMKQQRSFKRD----- 136

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             W +    +     D    A+   +  ++F         ++++    R A++P+     
Sbjct: 137 --WDIVSKAYSNISQDQYLHAWLL-VNTRSFYCTTPI---MERLPHDDRLAILPVADLFN 190

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
                C+A  A+  +    + DR Y+ GE + +  G      LL  YGFV  +N +D + 
Sbjct: 191 HADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYGFVPTENCWDVVC 248

Query: 353 VEAAL 357
           ++ A+
Sbjct: 249 LDEAI 253


>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
 gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
          Length = 418

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 76/172 (44%), Gaps = 22/172 (12%)

Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 256
           W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F   + 
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 311
           E  K  +  V S     + +++     LV +  P+L +     +S  K     + +   +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311

Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ 363
             DR Y   + I +  G   N++L ++YGF   +NPYD    E  L T+ P+
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD----ETELLTQFPE 359


>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
 gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
          Length = 491

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 64/273 (23%), Positives = 110/273 (40%), Gaps = 45/273 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + A  D+   +  F +P  L++++E     E +   L   +L     L + ++YE  QG+
Sbjct: 61  LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYNE 230
           +S W PY   L         + ++ + W+E +L  L G     +I + A  E I ++   
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVP 171

Query: 231 L--------------------DTVWFMA---GSLFQQYPYDI-PTEAFTFEIFKQAFVAV 266
           L                    D +  +A   GSL   Y +DI  TE    +  +  ++  
Sbjct: 172 LIQANPHHFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTD 231

Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
                   +   A+   +VPL     A + +  A L   + +  +   R  +AGE I   
Sbjct: 232 -------DEDEPAK--GMVPLADIFNADAQRNNARLFQEEGSFVMKAIRNIQAGEEIFND 282

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
            G  P + LL  YG+V  DN     VVE +L++
Sbjct: 283 YGELPRADLLRRYGYV-TDNYAQYDVVEFSLDS 314


>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
          Length = 515

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
          Length = 515

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
 gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
           Full=SET domain-containing protein 7
 gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
 gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
 gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
 gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
 gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
 gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
 gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 494

 Score = 47.4 bits (111), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 171 -GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
          Length = 565

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 126/322 (39%), Gaps = 65/322 (20%)

Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
           +P ++ A S+    G A+  +P  LVVT ++  G +   +  T    +  + L +YL Y 
Sbjct: 28  KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKD--TYKNANHSSLLKIYLCYS 81

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
           + Q  +SF  PY+  L   +     A++SP +WS  + A L G+     + E    +  E
Sbjct: 82  RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134

Query: 228 YNELDTVWFMAGSLFQQYPYDIP------------------TEAFTFEIFKQAFVAVQSC 269
                  W+ A +L    P D+P                  T+   +  F +   +  + 
Sbjct: 135 -------WWNAINLL---PEDVPKPEQHFINLKFYYENKFYTDDDYYSYFNEVDTSNWTS 184

Query: 270 VVHLQKVSL---ARRFALVPLGPP-------------LLAYSSKCKAMLAAVDDAVQLVV 313
             +    SL   +R F    + P              LL ++ K K   +  D       
Sbjct: 185 FPNYLWASLVLKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKTKVQWSGTDGGFLFQS 244

Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
           D    +GE +    G + N +LL+ YGF  E+NP D     AAL  + P   D ++   +
Sbjct: 245 DDA-SSGEELFNNYGQKGNEELLLAYGFAIENNPADS----AALKIKIP---DSKLQVVK 296

Query: 374 NGKLSVQVFHVHAGREKEAISD 395
           +  + +   H +     + +SD
Sbjct: 297 DLGIKLPSIHDYTNSVIDQVSD 318


>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
 gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
          Length = 494

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 60/258 (23%), Positives = 112/258 (43%), Gaps = 34/258 (13%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--LTTNKLSELACLALYLMYE-KK 169
           V AS+D+ + +  F +P S V+ +          +L  +   +L     L L ++YE K 
Sbjct: 42  VLASQDIGSDEVLFEIPRSSVLNVATSQLVRDFPQLKDVFWQELGHWEGLILCMVYEIKV 101

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKAE----I 217
            G++SFW  Y++ L + +    L     + WS  +LA L         G+    E    I
Sbjct: 102 MGQQSFWWNYLQVLPKSQDLNTL-----VYWSADQLAALEPSLVVGRLGADESQEMYRQI 156

Query: 218 LERAEGIKREYN----ELDTVWFM-AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ----S 268
           L+  +    E+     +L    F+   S+   Y +D+  +    E  +      +    S
Sbjct: 157 LKYIQNFGPEFQSKIGQLTFEEFVHVASVIMSYSFDVDLKGEDDEDDEDEDEGEEEEGES 216

Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
            V H + +      ++VPL   L A + +  A L    +++++V  +P K G+ +  + G
Sbjct: 217 NVAHDKYMK-----SMVPLADTLNADTKQFNAHLVYDKESLKMVSVKPIKMGQQVYNFYG 271

Query: 329 PQPNSKLLINYGFVDEDN 346
             PN+++L  YG+V+ D 
Sbjct: 272 EHPNAEILRRYGYVEWDG 289


>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
 gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
          Length = 486

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 254
           S   W +TEL+YL  SP   +  ER E I  E+ ++    F    L Q    D+  + F 
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369

Query: 255 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 307
             + E  K  +  V S     + +++     LV +  P+L +     +S  K     + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
              +  DR Y   + I +  G   N++L ++YGF   +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464


>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
 gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
          Length = 1339

 Score = 47.0 bits (110), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 52/243 (21%), Positives = 100/243 (41%), Gaps = 17/243 (6%)

Query: 113  VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
            V  ++ +   +A   VP   ++ ++    +  +  +     L++   L L+++YEK    
Sbjct: 787  VVTTKKVDENEAVVVVPKKYLINVDVAKAHPILGPIFEELHLNDDTILFLFVIYEKGNAN 846

Query: 173  KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
             SFW P+   L            + + +S TEL  L G+     + E     K++ N   
Sbjct: 847  -SFWRPFYDTLPS-------YFTTSIHYSATELLELEGT----NLFEETLHTKQQLNSFR 894

Query: 233  TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
               F    L +QYP   P   F++E F  A   + S  + L K+  + +  LVP+   + 
Sbjct: 895  DYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIKSCLVPMADMIN 951

Query: 293  AYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
             +++    +        + +++      A   I +  G   N +L + YGF+  +N YD 
Sbjct: 952  HHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQNWELALYYGFIIPNNIYDS 1011

Query: 351  LVV 353
            L +
Sbjct: 1012 LHI 1014


>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 481

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 62/263 (23%), Positives = 107/263 (40%), Gaps = 50/263 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
           + A  ++   +  F++P +LV++ +    N  + +LL  +   L    CL L ++YE  Q
Sbjct: 50  IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI--LERAEGIKRE- 227
           G  S W  Y + L  +        ++ + W++ EL  L+GS    +I   +    I R+ 
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGSAVLNKIGKSDAEAAILRDI 158

Query: 228 -------------------YNELD---TVWFMA---GSLFQQYPYDIPTEAFTFEIFKQA 262
                              Y+  D   T+  +A   GSL   Y +DI       E  +  
Sbjct: 159 FPIVSTNPHLFPPISGLGSYDSPDGRATLLSLAHRMGSLIMAYAFDI-------EKGEDE 211

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
              VQ   +  +   L +   +VPL   L A + +  A L   D  + +   +P + GE 
Sbjct: 212 EGEVQDGYITDEGEELTK--GMVPLADLLNADADRNNARLFQEDGYLAMKSIKPIRNGEE 269

Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
           I    G  P + LL  YG+V ++
Sbjct: 270 IFNDYGELPRADLLRRYGYVTDN 292


>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 454

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 72/336 (21%), Positives = 141/336 (41%), Gaps = 38/336 (11%)

Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIREL 183
            ++P+ L+ T++R   +  +  LL++ +  LS    LA Y+++ + + K  +  P     
Sbjct: 41  LTIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLATYILFIRAR-KSGYDGP----- 94

Query: 184 DRQRGRGQL--AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
             Q     L  +  S + +++ EL    GS         A  I+ +Y +L         L
Sbjct: 95  --QSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARL 146

Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKC 298
           F ++    P++ FT + +K A   V S  +   K+       L+     +L +S    +C
Sbjct: 147 FGRHRDVFPSDKFTIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQC 205

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
                   + + ++  + Y+ G+ + +  GP PN++L   YGFV   NP D   +  + +
Sbjct: 206 HVYDPQSGN-LSILAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPNDSYDLVLSTH 264

Query: 359 TEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-----YLRLGYVSDTSEMQS 413
              P ++ K  +    G  S     +        ++D LP     YLR+  +++T ++ +
Sbjct: 265 PMAPFFEQKHKLWIAAGLDSTSTVSL-------TLTDPLPRSVLRYLRIQRLNET-DLAA 316

Query: 414 VISSLGPIC--PVSPCMERAVLDQLADYFKARLAGY 447
           V +    +    +S   E  VL  L +   A L G+
Sbjct: 317 VGTRQSDVAFEKISDSNETEVLTFLVESISALLDGF 352


>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
          Length = 397

 Score = 47.0 bits (110), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ EK  G +S W
Sbjct: 25  LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 83

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L  SP KA+  E+   ++  +      + 
Sbjct: 84  KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 135

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF + P D     F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 136 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 191

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +S   + KA         ++      +  + + +  GP  N +LL+ YGFV   NP+ 
Sbjct: 192 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 251

Query: 350 RLVVEA-----ALNTEDPQYQDKRMVAQRNG 375
            + V A      L   D Q   K  + + +G
Sbjct: 252 CVPVSADMLVKFLPAADKQLHRKITILKDHG 282


>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
          Length = 440

 Score = 47.0 bits (110), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 111/278 (39%), Gaps = 27/278 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+    +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L   Q   L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDL 234

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY- 348
           L +S   + KA       + ++     ++  E + +  GP  N +L + YGFV   NP+ 
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHA 294

Query: 349 ----DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
                R ++   L + D Q   K  + + +G +    F
Sbjct: 295 CVYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
          Length = 378

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ EK  G +S W
Sbjct: 6   LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 64

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L  SP KA+  E+   ++  +      + 
Sbjct: 65  KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 116

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF + P D     F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 117 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 172

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +S   + KA         ++      +  + + +  GP  N +LL+ YGFV   NP+ 
Sbjct: 173 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 232

Query: 350 RLVVEAA-----LNTEDPQYQDKRMVAQRNG 375
            + V A      L   D Q   K  + + +G
Sbjct: 233 CVPVSADMLVKFLPAADKQLHRKITILKDHG 263


>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
           reading frame 18 (C21orf18)) [Danio rerio]
          Length = 440

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 70/323 (21%), Positives = 136/323 (42%), Gaps = 47/323 (14%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
           L+ W+++ G     +I            P+++      + +++ ++A ++  S+P   ++
Sbjct: 37  LRRWLNERGFTSQSLI------------PVNFHGNGRGLMSTQTIKAKNSLISLPEECLL 84

Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
           T   VL +  +A+ +      +S L  L  +L+ E+  G+ S W PYI  L +       
Sbjct: 85  TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
               PL + +  +  L  S  K +  ++ E  +  ++   T +     LF Q     PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190

Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAA 304
             F+ +  + A+ +V +  V   H Q   L+R   +  L P   LL +  + + +A    
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNK 250

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-----DRLVVEAALNT 359
                ++      K  +   +  GP  N +LL+ YGFV   NP+     D   ++  L+ 
Sbjct: 251 ETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDLETLKVGLDE 310

Query: 360 EDPQYQDKRMVAQRNGKLSVQVF 382
           +D Q ++K +  + N  L    F
Sbjct: 311 KDKQLKEKLLYLKDNDFLRNLTF 333


>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
 gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
          Length = 480

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 96/234 (41%), Gaps = 31/234 (13%)

Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
           +T+E +LG            LS    L+ Y+ +EK++G  SFW P+I  L  +     LA
Sbjct: 135 LTMEEMLG------------LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA 181

Query: 194 VESPLLWS------ETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYP 246
              PL+W         EL  L  + TK  + +  +  + +YN + D +      +     
Sbjct: 182 ---PLVWKVLKVDHYEELLKLLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNER 238

Query: 247 YDIPTEAFT----FEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCK 299
            +  T+A       E++  +++ + S  +++   Q  + A  F + P     L +S   +
Sbjct: 239 SNDLTDAIRHLVPIELYLWSWMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQ 297

Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
             L       Q+     Y   E + +  GP  N  LL  YGF   +N ++ L V
Sbjct: 298 CGLKIDGTGFQVYTTCSYNPDEQLFLSYGPHSNEFLLCEYGFTLPENKWNDLDV 351


>gi|242059429|ref|XP_002458860.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
 gi|241930835|gb|EES03980.1| hypothetical protein SORBIDRAFT_03g041640 [Sorghum bicolor]
          Length = 491

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 68/320 (21%), Positives = 118/320 (36%), Gaps = 67/320 (20%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            K WM  +G     V+  +  S +       +V A   L+ GD   ++P    +T  R  
Sbjct: 13  FKRWMRAHG-----VVCSDALSLDVSDPLGVHVRAVTPLRDGDLVATIPRGACLT-PRTT 66

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLL 199
           G           +L     LA+ +MYE+ +G  S W  Y++ L DR+          PL+
Sbjct: 67  GAAAAI---EAAELGGCLALAVAVMYERARGTDSPWDAYLQLLPDRE--------SVPLV 115

Query: 200 WSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
           W   E    L G+     + +  E +  ++ E      ++G L      D+  + F+ E 
Sbjct: 116 WPADEAECLLAGTELDKIVKQDREFLCEDWKECIEPLLLSGEL------DVDPDDFSLEK 169

Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML---------------- 302
           +  A   V S    +        F +VPL   L  + + C+ +                 
Sbjct: 170 YFSAKTLVSSRSFQIDSY---HGFGMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEDAD 225

Query: 303 ----------------------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
                                  + D+ +++++ R    GE +    G   N+ LL  YG
Sbjct: 226 DDQSDASADDESTIENPTSSSPGSKDEDLEMIIVRDVNEGEEVYNTYGTMGNAALLHRYG 285

Query: 341 FVDEDNPYDRLVVEAALNTE 360
           F + DN YD + ++ AL T+
Sbjct: 286 FTELDNQYDIVNIDLALVTK 305


>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
          Length = 447

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 89/450 (19%), Positives = 182/450 (40%), Gaps = 74/450 (16%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT-----LER 138
           W   NG    K I+    + +   R +  VA   D++A  + F++P  +V++      + 
Sbjct: 10  WFTTNGGEFSKDIVAIGENVDGMGRGLVAVA---DIKAQTSLFTIPRDIVLSTRTSSFKE 66

Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
            +G +   +L   N +     L + + +E  QG  S W  Y + L +Q         S +
Sbjct: 67  KVGQDVYKQLENDN-IGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLM 118

Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
            WS+ EL+ L G+    +I    E I+ E+  +  +     ++F     DI    +T ++
Sbjct: 119 FWSKEELSLLKGTTVVDKI--GLEDIENEFERVRDIVKQNENVFG----DIAN--YTLDL 170

Query: 259 FKQ--AFVAVQSCVVHLQKV---------------------SLARRFALVPLGPPLLAYS 295
           FK+  + +  +S  V   K                      +     A+VP+   L + +
Sbjct: 171 FKRMGSLILSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRT 230

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD-----ED----- 345
               A     ++ ++++  +  KAG+ I       PN+ L+  YG VD     +D     
Sbjct: 231 DSVNAHTEYEENCLRMISLQDIKAGDQIFNTYNDPPNADLIRRYGHVDYSPLSQDPDFMG 290

Query: 346 NPYD------RLVVEAALNTEDPQYQDKRM--VAQRNGKLSVQVFHVHAGREKEAISDML 397
           N  D       +++E AL      ++++R+  +    G+ S ++ H       + + ++L
Sbjct: 291 NKNDVVELPADILLELALPDAKESHKERRVEFLLDECGEDSFELTH------DDLVPELL 344

Query: 398 PYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAM 457
               L +    +E ++   S     P +    +   + L    K R+  Y +TL +D + 
Sbjct: 345 KICVLLFTESEAEFKTREKSRK--LPKASGFTKGKAEFLIKAIKQRMEQYGSTLEDDISK 402

Query: 458 LTDYNLHPKKRVATQLVRM-EKKMLNACLQ 486
           L + +  P+      +V + E+++LN  ++
Sbjct: 403 LDNKDSLPENNFKALVVTVGERRILNKAIE 432


>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 229

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           D   W+ K G     +   E  +  E  R    V A  +++ G    +VP  L+++    
Sbjct: 5   DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58

Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
             +   AE LL TNK  +     LA +L++E  +G++SFW PY+  L RQ
Sbjct: 59  KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108


>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
          Length = 417

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)

Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
           V A++ ++  +  F +P S  L VT  +++ +    +    N+      L + ++YE + 
Sbjct: 41  VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
             ++S W PY +  ++      L     + W + EL  L         G     E+ ER 
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDXELQLLKPSLVLERIGKKEAKEMHERI 155

Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            + IK+   E   V   A S  F  + Y    I + +F  E+   +    +      +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212

Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
              R   +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272

Query: 336 LINYGFVDED-NPYD 349
           L  YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287


>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
          Length = 482

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 36/74 (48%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           R AL+PL   +  YS     M    D A+ L V   +  G+      GP  N  LL  YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274

Query: 341 FVDEDNPYDRLVVE 354
           FV++DNP D  V+E
Sbjct: 275 FVEQDNPSDTYVLE 288


>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
          Length = 591

 Score = 47.0 bits (110), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 69/277 (24%), Positives = 118/277 (42%), Gaps = 47/277 (16%)

Query: 100 KPSHNEKH-RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK----- 153
           KP   E+  R I  +A  E++      FS+P  ++++ +  + + +IA +   +K     
Sbjct: 35  KPHDGERGVRVISDIAPCEEM------FSIPEKILMSRKSCMAS-SIAHVFRKHKDVLFS 87

Query: 154 -LSELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
              ELA L L ++YEK  QG  SFW P I  L    G       +   WSE EL  L   
Sbjct: 88  SRDELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDE 139

Query: 212 PTKAEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
             KAE +     +++ Y   L  +    G +F         + +T+E F+ A + V+S  
Sbjct: 140 SLKAEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRT 192

Query: 271 VHLQKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGES 322
                     RF    ++VP    L   + +        +     + D      ++ GE 
Sbjct: 193 FG--------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEE 244

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
             +  GP+ N++LL++YGF  + N Y+ + +   +NT
Sbjct: 245 AFMSYGPRSNAELLLHYGFALQSNRYEAVELNFRINT 281


>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 691

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 15/193 (7%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
            +L+ +  +G + FW PYI  L +    G L   +PL +   +L +L G+ + A   E+ 
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168

Query: 222 EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
           E + +E     + EL    F      ++Y +++   A T  + +     V + VV   ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
                  L+P    +L +    K    A +  V  VV     AGE +    GP+ N +L+
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLM 284

Query: 337 INYGFVDEDNPYD 349
           +NYGF  ++NP D
Sbjct: 285 MNYGFCLQNNPCD 297


>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 791

 Score = 46.6 bits (109), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 70/317 (22%), Positives = 120/317 (37%), Gaps = 36/317 (11%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           + D+K W+ +NG     V +K    +         + A+++ +  +   S+P S  + + 
Sbjct: 1   MEDIKKWVIQNGGVIDGVDVKTFEGYGRG------LCANKEFKKDEVIMSIPYS--IQIN 52

Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           R+  N    E+         +   +L  L    +   K   K F  PYI  L        
Sbjct: 53  RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
              + PL ++  EL  + G+   A + E+          +  V +    L QQ+P     
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYF-- 157

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
           ++F  ++FK+   A QS       V   + F  V    P   +S+ C           Q 
Sbjct: 158 QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQT 216

Query: 312 ------VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
                   +   K GE I      + N KLL+ YGFV+E+NP D L++      +D QY 
Sbjct: 217 ETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQYN 276

Query: 366 DKRMVAQRNGKLSVQVF 382
           +   + ++    S   F
Sbjct: 277 EIEEILKQEEIKSFDFF 293


>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
 gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 46.6 bits (109), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 36/72 (50%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
             ++PL       S+K    +  V+   Q++ ++P K GE I    G   N  LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230

Query: 342 VDEDNPYDRLVV 353
           ++ DNP+D  + 
Sbjct: 231 IEFDNPHDHFIT 242


>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 406

 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 86/392 (21%), Positives = 163/392 (41%), Gaps = 37/392 (9%)

Query: 81  LKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           + +W++K+G +    + L + P           V A    +  +   ++P + + T++  
Sbjct: 1   MDAWLNKSGAVGLGDLDLADFPETGRG------VKAQRPFKEDERILTIPANCLWTVKGA 54

Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
             +     +L + +  LS    LALY+++ + +G+   +       +RQ     L  E  
Sbjct: 55  YADPLFGPVLQSVQPPLSVEDTLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYT 108

Query: 198 L--LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAF 254
           L   +++ EL    GS            +  +Y +L T  FM    LF       P + F
Sbjct: 109 LSMYFTDEELRVCAGSSLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKF 161

Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
           +F+ +K A  ++ S  +    +S      L+     +L ++S  K   A       L V 
Sbjct: 162 SFQHYKWALSSIWSRGMDF-TISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVL 220

Query: 314 -DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQ 372
             R Y+ G+ + ++ G   NS+LL  YGFV  DNP D   +    ++  P Y+ K    Q
Sbjct: 221 ACRDYEVGDQVFIYYGNVSNSRLLRLYGFVLPDNPNDNYELVLQTSSMAPLYEQK----Q 276

Query: 373 RNGKLSV--QVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLG--PICPVSPCM 428
           R  KL+   ++  +    +      +L YLR+  + D S++ ++   +       +S   
Sbjct: 277 RLWKLAGLDEISTIPLSLQNPLPDSVLRYLRIQRL-DASDLGTMTMQIATESYTKISDEN 335

Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
           E  +L  L+   +A L G+  +L + E  L +
Sbjct: 336 ESQILLFLSQSIEALLEGFEISLEKLETQLAE 367


>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 481

 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 111/277 (40%), Gaps = 24/277 (8%)

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
           + +S+ E+    GS       +    I  +Y +L T   M      ++    P   F  E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168

Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 314
            +K A   V S  +    VS      L+     +L +SS   +C A      D + ++  
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           + Y  G+ + ++ GP PN++LL  YGFV  +NP+D   +    +   P Y+ K  + +  
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHDSYDLVLQTSPMAPLYEQKERLWKLA 286

Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSL------GPICPVSPCM 428
           G  +     + A       +D LP   L Y+      +S++ ++      G    +S   
Sbjct: 287 GLDTACTIPLTA-------NDPLPRSVLRYLRIQRLDESLLGAMTMQIATGADEKISDDS 339

Query: 429 ERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHP 465
           E  +L  L D   A L G+   L    A L   +++P
Sbjct: 340 ETLILQFLIDSISAILEGFSIPLDILTAQLAAGDVYP 376


>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
 gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
          Length = 484

 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL+ G     +P S + +      N +IA LL  +++  +  L +  +YE    K
Sbjct: 40  VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95

Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            KS W PY++ + R      L V  P  WSETE   L GS
Sbjct: 96  EKSHWFPYLKSI-RIYNDDGLLVLPPSHWSETEKLLLKGS 134


>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
 gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
          Length = 1335

 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 95/230 (41%), Gaps = 23/230 (10%)

Query: 129  PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
            P   ++ ++    N  +  +     L++   L L+++YEK +   +FW P+   L     
Sbjct: 849  PRKYLINVDVAKSNPILGPIFEELHLNDETILFLFVIYEK-ENPNTFWRPFYDTLPS--- 904

Query: 189  RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK--REYNELDTVWFMAGSLFQQYP 246
                   + + +S TEL  L G+   AE L   + ++  R+Y        +   L  QYP
Sbjct: 905  ----YFTTSIHYSSTELLELEGTNLFAETLAVKQQLQAFRDY--------LFPELSNQYP 952

Query: 247  YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
               P   F++E F  A   + S  + L K+    +  LVP+   ++ + +  +      D
Sbjct: 953  DIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMAD-MINHHTNAQISERHFD 1010

Query: 307  ---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
               +  ++V      A   I +  G   NS L + YGFV  +N YD   V
Sbjct: 1011 QDSNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGFVIPNNIYDSFHV 1060


>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 497

 Score = 46.6 bits (109), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 82/202 (40%), Gaps = 27/202 (13%)

Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
            L L +++E  +G  S W PY+  L  +        E+P+ WS  ELA L  SP  A + 
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155

Query: 219 ERAEG-------IKREYNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 261
            RAEG       I     E + ++F AG           L  +    I   AF  E    
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215

Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
           A     +      +    R    +VP+   +L   ++  A +   ++A+  +  R   AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274

Query: 321 ESIVVWCGPQPNSKLLINYGFV 342
           E I+ + GP PN +L   YG+ 
Sbjct: 275 EEILNYYGPLPNGQLCRRYGYT 296


>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
          Length = 452

 Score = 46.6 bits (109), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 77/192 (40%), Gaps = 36/192 (18%)

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
           V+ PL W + +   L G       ++      R Y+++    F A + F      +  EA
Sbjct: 143 VDLPLYWDDKQFEELQGCEEARRAMQHG---ARFYSQVYKHLFGANNQF------VNAEA 193

Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY---SSKCKAMLAAVDDAVQ 310
           F + I         S ++        + FAL+P           S  C+  L + D+ VQ
Sbjct: 194 FFWAI---------SILMSRATSGQNQPFALIPFFDWFNHAGNGSDNCRHALDS-DECVQ 243

Query: 311 ---------LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL-- 357
                    +   R Y+ GE + +  G   N +LL NYGF   +NPYD   L + AAL  
Sbjct: 244 DFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMPNNPYDVVNLPMPAALQQ 303

Query: 358 -NTEDPQYQDKR 368
            N  DP +  KR
Sbjct: 304 PNEADPAFAQKR 315


>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
          Length = 228

 Score = 46.6 bits (109), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 5   GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 58

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 59  LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 114

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 115 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 162

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 163 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 205


>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
 gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
          Length = 439

 Score = 46.6 bits (109), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ EK  G +S W
Sbjct: 67  LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 125

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L  SP KA+  E+   ++  +      + 
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 177

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF + P D     F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 178 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 233

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +S   + KA         ++      +  + + +  GP  N +LL+ YGFV   NP+ 
Sbjct: 234 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 293

Query: 350 RLVVEAA-----LNTEDPQYQDKRMVAQRNG 375
            + V A      L   D Q   K  + + +G
Sbjct: 294 CVPVSADMLVKFLPAADKQLHRKITILKDHG 324


>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
 gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
          Length = 439

 Score = 46.6 bits (109), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 112/271 (41%), Gaps = 27/271 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ EK  G +S W
Sbjct: 67  LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 125

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L  SP KA+  E+   ++  +      + 
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 177

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF + P D     F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 178 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 233

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +S   + KA         ++      +  + + +  GP  N +LL+ YGFV   NP+ 
Sbjct: 234 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHA 293

Query: 350 RLVVEAA-----LNTEDPQYQDKRMVAQRNG 375
            + V A      L   D Q   K  + + +G
Sbjct: 294 CVPVSADMLVKFLPAADKQLHRKITILKDHG 324


>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
 gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
          Length = 485

 Score = 46.2 bits (108), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 67/318 (21%), Positives = 116/318 (36%), Gaps = 61/318 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           + AS D+   +  F +P  L+++++       +   L   +L     L + ++YE  QG+
Sbjct: 51  ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +S W PY   L         + ++ + W++ +L+ L GS    +I + A          D
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKIGKAAAD--------D 153

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
           T+      L Q      P         +     + S       +SLA R A +     ++
Sbjct: 154 TILQKVVPLIQANSLHFPP--------RSDMPPLNSPDSQSALLSLAHRMASL-----IM 200

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
           AY+   +    A +D  +                            Y   DED P   +V
Sbjct: 201 AYAFDIEKAEEADEDTAE--------------------------DGYMTDDEDEPAKGMV 234

Query: 353 VEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYL----RLGYVSDT 408
             A +   D Q  + R+  +    +   V ++H+G E       LP      R GYV+D 
Sbjct: 235 PLADIFNADAQRNNARLFQEEGSFVMKAVRNIHSGEEIFNDYGELPRADLLRRYGYVTDN 294

Query: 409 SEMQSVIS-SLGPICPVS 425
                V+  SL  IC V+
Sbjct: 295 YTQYDVVEFSLDSICKVA 312


>gi|156374449|ref|XP_001629819.1| predicted protein [Nematostella vectensis]
 gi|156216828|gb|EDO37756.1| predicted protein [Nematostella vectensis]
          Length = 281

 Score = 46.2 bits (108), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 64/278 (23%), Positives = 116/278 (41%), Gaps = 30/278 (10%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHY-VAASEDLQAGDAAFSVPNSLVVT-----LE 137
           W H N L      L  K S  +K     Y + A ED+   +  F VP  L++      + 
Sbjct: 22  WCHDNDLK-----LNNKVSSMQKGSCHRYGMVAMEDISPDECLFKVPRGLLLEPKTCGIS 76

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
           ++L  + I  +L+ ++      L L LMYE      S W PY+  +      G   ++ P
Sbjct: 77  KILTGKVIQNMLSQHE--GWVPLLLALMYEYTN-PTSLWKPYMDIV-----PGIDILDQP 128

Query: 198 LLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           + W  ET  + L G+  + ++ +  + I+R+Y      + +A  + +++      +  + 
Sbjct: 129 MFWPDETRQSLLQGTGFEDDVEDDKQRIERQY------FTVAVPIMKKFKKFFDLKRHSL 182

Query: 257 EIFKQ--AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 314
            ++K   AF+   S               +VP+   +L + S   A L   ++ + +V  
Sbjct: 183 SLYKHMAAFIMAYSFTEDSPSFHGNNVPVMVPMAD-ILNHHSNNNARLEFGEEELSMVST 241

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRL 351
           +    G  +    G   N  LL +YGFV+  DNP D +
Sbjct: 242 QHILKGGEVFNTYGQLANCHLLQSYGFVEGPDNPNDTV 279


>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
           24927]
          Length = 611

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 130/313 (41%), Gaps = 43/313 (13%)

Query: 158 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
           AC  L    L+ E+ Q    FW PYIR L +         ++PL +++ E+  L G+   
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159

Query: 215 A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 266
           A ++L R +    E+       F+ G      ++Y +D+   A   +T   F    V + 
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217

Query: 267 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
             S +     +S    F  L+PL   +L +    K +      +  L+       G  + 
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276

Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV-QVFH 383
              GP+ N +LL+ YGFV  +NP D L ++  ++   P+ Q  ++  QR  K +  +VFH
Sbjct: 277 NNYGPKGNEELLMGYGFVIPENPGDSLAMKFTIS---PRGQAAQIWEQRALKQTWREVFH 333

Query: 384 VHAGREKEAISDMLPYLRLGY-----------VSDTSEMQSVISSLGPICPVSPCMERAV 432
           +    +    +  +P L   +           V++ +E+  + +      P+S   E AV
Sbjct: 334 LTKSADSGQKTSTVPALESDWPEAFVDLFRILVANENEIDDLENGDINATPISIRNELAV 393

Query: 433 LDQLADYFKARLA 445
              L    K +LA
Sbjct: 394 ALGLKAAIKQKLA 406


>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
          Length = 502

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 67/307 (21%), Positives = 120/307 (39%), Gaps = 58/307 (18%)

Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
           L++    + I + +    L+ + C        +K    S WLPY+  L +          
Sbjct: 142 LKKCFEQDMIVKTMDNVALALMVCC-------QKLSPDSSWLPYLDALPQ-------TFS 187

Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS--------------- 240
           +PL +S  EL  L+ SP   E L     + R++     V+F+A                 
Sbjct: 188 TPLYFSALELRKLSPSPAYEESLIMYRNVARQF-----VYFLAAVQRSERSRSAKKDKNH 242

Query: 241 -------LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL----QKVSLARRFALVPLGP 289
                  LF   P+ +    FTF++++ A   V + +  +     K S  +  A VP   
Sbjct: 243 AAVGMEPLFLNAPFTVSN--FTFDLYRWAVACVTTRINFIPSQYAKDSNGQPVA-VPCLI 299

Query: 290 PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-D 343
           PLL      +        +   D   +   + YKAG+ + ++ G + N +  ++ GFV D
Sbjct: 300 PLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFYGIRTNRQFFLHNGFVPD 359

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKRMV---AQRNGKLSVQVFHVHAGREKEAISDMLPYL 400
            +N  D   ++      D Q + +  +   A  N +  V VF V+A      +S +L + 
Sbjct: 360 GENKNDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVFEVNASERPVPLS-LLDFA 418

Query: 401 RLGYVSD 407
           R+  V +
Sbjct: 419 RVFLVEN 425


>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
 gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
          Length = 372

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 62/285 (21%), Positives = 112/285 (39%), Gaps = 44/285 (15%)

Query: 80  DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
           +L SW  + G+    +  +  PS          + A+ D+QAG+    VP  +  +L+ V
Sbjct: 6   ELLSWATERGVKLSGIKPQNIPSRGTG------IIATRDIQAGETILFVPFKVFRSLKHV 59

Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVESP 197
              + IA  L  N +S  A LA YL  +K       +  LP +   +            P
Sbjct: 60  --PKAIARRLPRN-MSLHALLAAYLTLDKTDTFAIANQTLPDLSSFE---------AGMP 107

Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
            LW   EL      P    + ++    +R++  +  V+                   + E
Sbjct: 108 FLWP-AELHPFLPKPALDLLKKQQRNFQRDWATVSKVY----------------SNVSHE 150

Query: 258 IFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
            +  +++ V +   +    S+ R     R A++P+          C+A  A+  +    +
Sbjct: 151 QYLHSWLLVNTRSFYCTTPSMERLPHDDRLAILPVADLFNHADVGCEAQFAS--ENYSFI 208

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
            DR Y+AGE + +  G      LL  YGFV  +N +D + ++ A+
Sbjct: 209 ADRTYRAGEELYISYGTHSTDFLLAEYGFVPAENRWDVVCLDEAI 253


>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
          Length = 471

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 59/265 (22%), Positives = 111/265 (41%), Gaps = 35/265 (13%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P  ++ T++    +  +   L + +  LS    LA Y+++ K        
Sbjct: 42  FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 94

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
               RE      R  +A       S +L++E +L    G+       +  + I+ ++  L
Sbjct: 95  ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 150

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEI----FKQAFVAVQSCVVHLQKVSLARRFALVPL 287
                    LF Q+P   P + FT E     +K A   V S  +      LA   ++  L
Sbjct: 151 VV------RLFVQHPDLFPLDKFTVEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLL 201

Query: 288 GP--PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
            P   +L ++S+ K   +       + +   + Y+AG+ + +  GP PNS+LL  YGFV 
Sbjct: 202 APFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVI 261

Query: 344 EDNPYDRLVVEAALNTEDPQYQDKR 368
             NP D   +  + + + P ++ K+
Sbjct: 262 PGNPNDSYDLVLSTHPQAPFFEQKQ 286


>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Entamoeba nuttalli P19]
          Length = 791

 Score = 46.2 bits (108), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 71/318 (22%), Positives = 120/318 (37%), Gaps = 38/318 (11%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           + D+K W+ +NG     V +K    +         + A+++ +  +   S+P S  + + 
Sbjct: 1   MEDIKKWVIQNGGVIDGVDVKTFDGYGRG------LCANKEFKKDEIIMSIPYS--IQIN 52

Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           R+  N    E+         +   +L  L    +   K   K F  PYI  L        
Sbjct: 53  RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
              + PL ++  EL  + G+   A + E+          +  V +    L QQ+P Y  P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
            +    ++FK+   A QS       V   + F  V    P   +S+ C           Q
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQ 215

Query: 311 L------VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
                    +   K GE I      + N KLL+ YGFV+E+NP D L++      +D QY
Sbjct: 216 TETFSFQTNEALVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQY 275

Query: 365 QDKRMVAQRNGKLSVQVF 382
            +   + ++    S   F
Sbjct: 276 NEIEEILKQEEIKSFDFF 293


>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
 gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
          Length = 508

 Score = 46.2 bits (108), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 58/244 (23%), Positives = 104/244 (42%), Gaps = 22/244 (9%)

Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
           V A +D+  G+  F +P +S++  L   L ++      T   +     L L L+YE K  
Sbjct: 41  VIAVKDIAEGEVLFEIPRDSILNVLTSSLSSDFSDLEETLQSIGSWEGLILCLLYEWKGK 100

Query: 172 K-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           K KS W  Y   L         A+   + W+E EL +L  S     I +++   K  Y++
Sbjct: 101 KEKSKWWKYFNVLPSSN-----AMNGLMYWNEQELEHLRPSLVLDRIGKKSA--KNMYHK 153

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFE---IFKQAFVAVQSCVVHLQKVSLARRF----- 282
           + T+  +  S F +   ++  E F +    I   +F         L +    +       
Sbjct: 154 VLTL--VKESKFPEVLCNVEWEDFVYAASVIMAYSFDVENGESQTLNEEDDDQDEEENTG 211

Query: 283 ---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
              +++PL   L + + +C A L   D  +++   +P K GE +    G  PN+++L  Y
Sbjct: 212 YIKSMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNAEILRRY 271

Query: 340 GFVD 343
           G+V+
Sbjct: 272 GYVE 275


>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 791

 Score = 46.2 bits (108), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 124/318 (38%), Gaps = 38/318 (11%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           + D+K W+ +NG     V +K    +         + A+++ +  +   S+P S  + + 
Sbjct: 1   MEDIKKWVIQNGGIIDGVDVKTFEGYGRG------LCANKEFKQDEIIMSIPYS--IQIN 52

Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
           R+  N    E+         +   +L  L    +   K   K F  PYI  L +      
Sbjct: 53  RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWPYINVLPK------ 106

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
              + PL ++  EL  + G+     + E+          +  V +    L QQ+P Y  P
Sbjct: 107 -TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-KAMLAAVDDAV 309
            +    ++FK+   A QS       V   + F  V    P   +S+ C +A +  + +  
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTR 215

Query: 310 QLVV-----DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQY 364
                    +   K GE I      + N KLL+ YGFV+E+NP D L++      +D QY
Sbjct: 216 TETFSFQTNEEVVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRIYFEVDDNQY 275

Query: 365 QDKRMVAQRNGKLSVQVF 382
            +   + ++    S   F
Sbjct: 276 NEIEEILKQEEIKSFDFF 293


>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
 gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
          Length = 484

 Score = 46.2 bits (108), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           +++PL   L A +SKC A L    +++++   +P   GE +    G  PNS+LL  YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272

Query: 343 D-EDNPYD 349
           + E + YD
Sbjct: 273 EWEGSKYD 280


>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
          Length = 527

 Score = 46.2 bits (108), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 2/63 (3%)

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           S  CK + +A+  +VQ   DR YK GE + V  GP  N  LL  YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251

Query: 355 AAL 357
             +
Sbjct: 252 EVI 254


>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
           206040]
          Length = 463

 Score = 45.8 bits (107), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 77/353 (21%), Positives = 141/353 (39%), Gaps = 59/353 (16%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            Q G+   ++P   + T+E    +  +  +L + +  LS    LA+YL++ +        
Sbjct: 34  FQQGERILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDTLAVYLLFVR-------- 85

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
              +RE   +  R  +A       S + ++E EL    G+       +  E I+ +Y   
Sbjct: 86  ---LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR-- 140

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
                +   +F Q+P  +P    + + +K A   V S  +            ++P G PL
Sbjct: 141 ----VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDF----------VLPNGKPL 186

Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYG 340
                    + +S + K   A    +  L V   + Y+ G+ I +  G  PN++LL  YG
Sbjct: 187 RVLAPFADMINHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYISYGSIPNNRLLRLYG 246

Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLP-- 398
           FV  +NP D   +  + +   P Y+ K+ +    G  S     +        + D LP  
Sbjct: 247 FVIPENPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPL-------TLIDPLPKS 299

Query: 399 ---YLRLGYVSDTSEMQSV-ISSLGPICPVSPCMERAVLDQLADYFKARLAGY 447
              YLR+  + D S++ ++ +  L     +S   E  +L  L +   A L G+
Sbjct: 300 VLRYLRIQRL-DASDLAAIALQKLDTNEKISNSKEVEILQFLVESISALLDGF 351


>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 622

 Score = 45.8 bits (107), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
            A+VPL   L A +    A L    D +++   R  K GE I    G  PNS LL  YG 
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404

Query: 342 VDEDNPYDRLVVEAALNT 359
           VD+ N +D  VVE ++ T
Sbjct: 405 VDDPNRFD--VVEISIKT 420


>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
 gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
 gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
 gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
 gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
 gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
 gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
          Length = 296

 Score = 45.8 bits (107), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273


>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
          Length = 292

 Score = 45.8 bits (107), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273


>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 449

 Score = 45.8 bits (107), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 61/250 (24%), Positives = 99/250 (39%), Gaps = 36/250 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A+ DL + +  F +P + ++T E     + I + LT   LS    L L +++E   G 
Sbjct: 42  VVATSDLTSDEEIFRIPRTSILTTETTDLPQEILQQLTDPWLS----LILAMIFEYLLGT 97

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY--- 228
            S + PY+  L         +  + + W++ EL YL GS   ++I  E A+    E    
Sbjct: 98  NSRFKPYLDILPE-------SFNTLMFWTDNELQYLQGSAILSKIGKEEADNTFSEQLLP 150

Query: 229 ----------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
                            +L  +    GS+   Y +D+     T      +          
Sbjct: 151 IITKNPEIFKIGTCNNQDLLALCHRMGSIIMSYAFDLDPPPTT--TTSSSEEWESDSDSE 208

Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
            +K+S     AL+PL   L A      + L    D+  +   +P  AGE ++   GP P 
Sbjct: 209 NEKISPK---ALIPLADMLNANGDLTNSKLFFSSDSFIMKTLQPVAAGEELLNDFGPLPP 265

Query: 333 SKLLINYGFV 342
           + LL  YGFV
Sbjct: 266 ADLLRRYGFV 275


>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
 gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
          Length = 459

 Score = 45.8 bits (107), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 66/262 (25%), Positives = 108/262 (41%), Gaps = 41/262 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYE 167
           + A E+L+ G+  F V  S V++ E      +L  ET     + +  S      L LMYE
Sbjct: 53  MVAQEELEEGECLFKVDKSAVLSTETTEIAHLLKEETSLHGDSLHGDSGWVPQILALMYE 112

Query: 168 KKQGKKSFWLPYIR------ELDRQRGRGQLAVESPLLWSETELAY---LTGSPTKAEIL 218
                 S W PY++      +LD+           P+ W+E E+      TG P  +   
Sbjct: 113 YT-NPNSRWRPYLQLVPDFSQLDQ-----------PMFWTEDEIERDLCNTGIPEASS-- 158

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKV 276
                +K EY  L      A    +++ +    E  +FE++K+  AF+   S    +   
Sbjct: 159 SDLTKMKLEYTSL------ALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFFEPVNGR 212

Query: 277 SLARRFALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
                 + +PL  P+   L + +K  A L    D +++V  R   AGE +    G   N 
Sbjct: 213 EDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMVTTRTVAAGEEVFNTFGQLANW 272

Query: 334 KLLINYGFVDE--DNPYDRLVV 353
           +LL  YGF +   +N YD + +
Sbjct: 273 QLLHMYGFAEAWPENIYDTVDI 294


>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 693

 Score = 45.8 bits (107), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 84/194 (43%), Gaps = 11/194 (5%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LAL++ +++ + K S W PY+  L R        + S L +   +L +L G+        
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
               +K EY+   ++    G L  + Y +D+   A+T  I  +AF + +   V+L +   
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220

Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
            ++     +  PL+  S+     K    A    + L V  P  + E I    GP  N +L
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQL 280

Query: 336 LINYGFVDEDNPYD 349
           +  YGF   DNP D
Sbjct: 281 MTTYGFCIVDNPCD 294


>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
          Length = 462

 Score = 45.8 bits (107), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 65/288 (22%), Positives = 120/288 (41%), Gaps = 39/288 (13%)

Query: 81  LKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT---L 136
           L SW+ H +G     +I+ +    ++       + A+ DL AG    S P++L +T    
Sbjct: 10  LGSWLRHHDGFIHEHLIVVQDELGDKS------IIATTDLPAGTCIASCPHTLAITPTSA 63

Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL--PYIRELDRQRGRGQLAV 194
              LG+          +LS+   + LYL+  K        L   Y+  L  +      A+
Sbjct: 64  RAALGHHA-------TELSDHQAMVLYLVLHKHPSPAVCCLHQAYVDTLPPRS-----AM 111

Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTE 252
            +PL ++  E+  L G+     + +R    + E+    TV   AG   LF+        E
Sbjct: 112 RTPLWFNPAEVQLLQGTNLAGAVTDRQRDWQLEWM---TVLRRAGQSGLFKASF----EE 164

Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDA 308
            +   ++    ++ ++   HL  +    + +   L P + A++     K     ++    
Sbjct: 165 TWPSALWAATILSSRAFPSHL--IDGNEQASTPVLFPGVDAFNHQQARKVTWQTSSASGR 222

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
             LV D P  AG+ +    GP+ N + L+ YGF+  +NP D +V++ A
Sbjct: 223 FNLVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKLA 270


>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 701

 Score = 45.8 bits (107), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)

Query: 84  WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
           WM + G+    V +   P      R      A+ D+  GD   SVP   ++T E  +  +
Sbjct: 37  WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90

Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
            + +  L  +++S       E   L L ++ E  +G +S + PY+  L   R     A  
Sbjct: 91  ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145

Query: 196 SPLLWSETELAYLTGS 211
           SPL WS  ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161


>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
          Length = 371

 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           R AL+P+       S  C    +   +   +  DR Y+A E +    G   N  LL  YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235

Query: 341 FVDEDNPYDRLVVEAAL 357
           F+ +DNP+D+L ++A L
Sbjct: 236 FMLQDNPHDQLCLDAVL 252


>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 607

 Score = 45.8 bits (107), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           L  LK W  +N +   +  ++ +   N   R    V A + L+ GD   ++P   +++++
Sbjct: 4   LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
               N  +A++L    L     L + LM+E+  G+KS W  YI+ L        L    P
Sbjct: 60  ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108

Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
           L W + + A L G+   A +LE   + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141


>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
          Length = 284

 Score = 45.8 bits (107), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 100/223 (44%), Gaps = 29/223 (13%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
            K+E+   DL  W  +NG     V   E  +  E+   +    A+ D++A +    VP  
Sbjct: 73  GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126

Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
           L++T+E    N  +  L + +++ +      LA +L+ E+     SFW PYI+ L  +  
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182

Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
                 ++PL + E E+ YL  +    ++  + +   R+Y     V        Q +P+ 
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 230

Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
             +P  ++FT+E ++ A  +V +    +     +R   AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273


>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
          Length = 478

 Score = 45.4 bits (106), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           R   +P+   L  +   CK + +A+  +VQ   DR YK GE + V  GP  N  LL  YG
Sbjct: 178 RLVCMPVAD-LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYG 234

Query: 341 FVDEDNPYDRLVVEAAL 357
           F+ + N +D + ++  +
Sbjct: 235 FILDTNRWDEVYLDEVI 251


>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 684

 Score = 45.4 bits (106), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 84/194 (43%), Gaps = 11/194 (5%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++ +++ + K S W PY+  L R    G+L   S L +   +L +L  +        
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165

Query: 220 RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
             + +K EY+   ++    G  L + Y ++I   A+T  I  +AF + +    ++ K   
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223

Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
            R+     +  PL+  S+     K    A    + L V  P  A E I    GP  N +L
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQL 283

Query: 336 LINYGFVDEDNPYD 349
           +  YGF   DNP D
Sbjct: 284 MATYGFCIVDNPCD 297


>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
          Length = 370

 Score = 45.4 bits (106), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 65/279 (23%), Positives = 113/279 (40%), Gaps = 48/279 (17%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE--------LACLALYLMY 166
           A + +  G     +P+ L++T     GN+    L   N +          +  ++++L +
Sbjct: 38  AKKPITKGSILLEIPDPLLIT-----GNKVCKWLERNNWIGHQQISSVQGVLLVSIFLFF 92

Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
           E +Q   SFW PY++ L        L  +  LL      +Y+T    +A+I++  E ++R
Sbjct: 93  ESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQMVESVRR 141

Query: 227 EYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
                     +    FQ Y  P+     D       F+ F + + AV S + +L      
Sbjct: 142 ----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICYLPDDIAG 191

Query: 280 RRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLV-VDRPYKAGESIVVWCGPQPNSKLL 336
              ALVPLG      A  +    + A  +    +    R +  G  + V  G   N++L+
Sbjct: 192 ---ALVPLGDIFNHEAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYGALSNTELM 248

Query: 337 INYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
           + YGF   DNP+D L        E  ++ + R+V  R G
Sbjct: 249 MYYGFTLNDNPWDTLSFYPHELDESIKFYE-RVVLDREG 286


>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
 gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
          Length = 619

 Score = 45.4 bits (106), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 51/207 (24%), Positives = 85/207 (41%), Gaps = 19/207 (9%)

Query: 157 LACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
           L    ++L+YE   + +KS   PY+  L R+         + L + E E+A L  +    
Sbjct: 106 LIVFYMFLIYELHVEKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYK 158

Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            +    + +K+ Y E    + M      +YP     + F++E F  AF AV S V  ++ 
Sbjct: 159 SVQSIRQNLKQIY-ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEY 212

Query: 276 -VSLARRFALVP-LGPPLLAYSSKCKA---MLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
                    +VP L P +   + K  A        D    L      K+G+ +    G +
Sbjct: 213 PAENGEGVEIVPTLLPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAK 272

Query: 331 PNSKLLINYGFVDEDNPYDRLVVEAAL 357
            N   L++YGFV  +N  D L V+  +
Sbjct: 273 SNDSFLLSYGFVIPNNSEDTLYVQFGI 299


>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
          Length = 353

 Score = 45.4 bits (106), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
           ++ L+R FA    G   L +S              Q+V ++ +K G+S+ +  G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241

Query: 335 LLINYGFVDEDNPYDRLVV 353
           LL++YGF+  DNP D  V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260


>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
          Length = 749

 Score = 45.4 bits (106), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 91/227 (40%), Gaps = 23/227 (10%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           LYL   K   K   W PYI  L           + PL ++  EL  + G+   A + E+ 
Sbjct: 42  LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
                    +  V +    L QQ+P     ++F  ++FK+   A QS       V   + 
Sbjct: 93  NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144

Query: 282 FALVPLGPPLLAYSSKC-KAMLAAVDDAVQLVV-----DRPYKAGESIVVWCGPQPNSKL 335
           F  V    P   +S+ C +A +  + +           +   K GE I      + N KL
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 204

Query: 336 LINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           L+ YGFV+E+NP D L++      +D QY +   + ++    S   F
Sbjct: 205 LLGYGFVEENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFF 251


>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
          Length = 766

 Score = 45.1 bits (105), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 39/164 (23%), Positives = 67/164 (40%), Gaps = 23/164 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
           V A ED++  +A   VP  L++T+E    +  I  +   NK          E   L +++
Sbjct: 27  VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
           +YE ++G +SFW PY+  L          VE    WS++++  +       E       I
Sbjct: 87  LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
           + E+ E++        L   YP    +      +F + F  V S
Sbjct: 139 EIEWKEIEL-------LLLHYPQHFSSALIDKHLFMRIFAQVCS 175


>gi|321462357|gb|EFX73381.1| hypothetical protein DAPPUDRAFT_58066 [Daphnia pulex]
          Length = 425

 Score = 45.1 bits (105), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 67/306 (21%), Positives = 118/306 (38%), Gaps = 44/306 (14%)

Query: 56  RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPC-KVILKEKPS-HNEKHRPIHYV 113
           R+ +    +  SR +      +  +L  WM  NG     K  L  KP+  N   R +   
Sbjct: 8   RIRNRLVRIVHSRPLRIDSHSEFVELCKWMSANGWNAVSKNCLVTKPALFNSTGRGL--- 64

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
            A  ++        +P SL++T E+VL    I++LL  + ++   CL  +++  K  G  
Sbjct: 65  MAMSNIAPNHLLVQIPQSLLITKEKVLAE--ISDLLQFS-MTTAECLTFFILNSKFNGLY 121

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
           S    YI  L +    G L           E+A L  S  + +I+     + ++Y ++  
Sbjct: 122 S---SYISTLPKSFSVGGLC-------KSQEIAALP-SFLQEKIMCNQNFVLKKYEKIFA 170

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL--------- 284
           +W             I     + E+F+ A+  V +  V  Q  S      L         
Sbjct: 171 IW-----------RKIYGSTLSLELFQWAWFCVNTRAVFYQD-SKQHSHGLNKVDGMENN 218

Query: 285 VPLGPPLLAYSSKCKAMLAA----VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           + L P L  ++   + ++ A         ++  DR  K  + + +  GP  N KL + YG
Sbjct: 219 MALAPYLDMFNHDAEVVVEAGFNKTTQCYEIRSDRHIKKYQQVFINYGPHDNMKLFLEYG 278

Query: 341 FVDEDN 346
           F+   N
Sbjct: 279 FLATKN 284


>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 664

 Score = 45.1 bits (105), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 67/141 (47%), Gaps = 17/141 (12%)

Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
           +T+ +++  A +AL+L++E   Q +KS W P++  L R        VE+PLLW+  ELA 
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241

Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
           L GS     I  R   +K    + D ++     L Q++P   P E F  E +  A   V 
Sbjct: 242 LEGSNL---IGFRDAVLKGWTTQRDALF---PKLTQKFPSLFPEEHFRTERWAWAMAIVW 295

Query: 268 SCVVHLQKVSLARRFALVPLG 288
           S       V + R  A+ P G
Sbjct: 296 SRAA---DVPVPRPEAIFPSG 313


>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
           206040]
          Length = 378

 Score = 45.1 bits (105), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 36/171 (21%), Positives = 72/171 (42%), Gaps = 24/171 (14%)

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
             V  P++W   EL +L     +  + +R +  + ++++           F +   DI  
Sbjct: 102 FEVGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISY 149

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
           E +T+     A++ V +   + +     K     R AL+P+        + C+   +   
Sbjct: 150 EEYTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP-- 202

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           +   +V DR YK GE + +      N   L+ YGFV ++NP D + ++  +
Sbjct: 203 EGYHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYIDDVI 253


>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
 gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
          Length = 479

 Score = 45.1 bits (105), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 14/126 (11%)

Query: 97  LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           LK    H + H  IH        A   +   +  F +PN L+++++       +   L  
Sbjct: 24  LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            +L     L + ++YE  QG++S W PY R L         + ++ + W++ +L+ L GS
Sbjct: 82  KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134

Query: 212 PTKAEI 217
               +I
Sbjct: 135 AVVGKI 140


>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 465

 Score = 45.1 bits (105), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 53/221 (23%), Positives = 83/221 (37%), Gaps = 47/221 (21%)

Query: 150 TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 209
           T  + S L  L   L  E+  G KS +  Y R L R       A      W++ E +YL 
Sbjct: 89  TKTEASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLA 142

Query: 210 GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
           G+  +  + + A   K E+        +   +F+++  +      +FE F +A   V S 
Sbjct: 143 GTEVETSLRDEAAAAKNEWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSS- 190

Query: 270 VVHLQKVSLARRFALVP-----LGPPLLAYSSKCKAMLAAVDDA--------------VQ 310
                     R F L P     L P   A++         V D               V+
Sbjct: 191 ----------RAFTLSPNAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVK 240

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           +  ++  + G+ I    G   N+KLL +YGF   DNP D +
Sbjct: 241 VTNEQGVRRGDEIFNTYGFHGNAKLLNSYGFTQNDNPADEV 281


>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
          Length = 508

 Score = 45.1 bits (105), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 62/225 (27%), Positives = 90/225 (40%), Gaps = 33/225 (14%)

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
           +VP+   L A      A L   +  +++V  +P KAGE I    G  PNS LL  YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317

Query: 344 ----------EDNPYD------RLVVEAALNTEDPQYQDK--RMVAQRNGKLSVQVFHVH 385
                       NP D       L V AA      + QDK    +   N    V      
Sbjct: 318 LVPLEPPLAGLGNPADIVEIGADLAVFAAKKDSPEKLQDKIDWWLEVANDDTFV------ 371

Query: 386 AGREKEAISDMLPYLRLGYV-SDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 444
            G + +   +++ + RL ++  D  E     S L       P ++  VL    D    R+
Sbjct: 372 IGTDCQLPEELVSFARLLFLPRDEWEKVRQKSKL-----PKPKIDAQVLSVAEDVLSRRI 426

Query: 445 AGYPATLSEDEAMLTDYNLHP---KKRVATQLVRMEKKMLNACLQ 486
             Y  T+ +DEA+L   N  P    K+ A  +   EK++L+  LQ
Sbjct: 427 NEYSTTIEDDEALLALENAQPLSLNKKHALIVRHGEKRILHGTLQ 471


>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
 gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
 gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
          Length = 491

 Score = 45.1 bits (105), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 57/253 (22%), Positives = 106/253 (41%), Gaps = 38/253 (15%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYL-MYEKKQG 171
           AA  D+  GD   ++P+ L + L R     + +   L      EL  + L L + +++  
Sbjct: 93  AAYGDIPIGDVLIALPSQLPLRLRRPTSAADDVLVQLAQQVPDELWAMKLGLRLLQERAK 152

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA----EGIKRE 227
             SFW PYI  L             P+ +   ++  L  +P   ++ +R     E  K  
Sbjct: 153 SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNKRCRFLLEFEKEV 205

Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
             +L TV  +    + Q   D+ + +  +     A  A  S    L           VP+
Sbjct: 206 QQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH--------GEVPM 249

Query: 288 GPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
             PL+     +++   + +    + ++D +V+++ ++  K  E+I +  G  PN   L++
Sbjct: 250 LLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLD 309

Query: 339 YGFVDEDNPYDRL 351
           YGFV   NPYD++
Sbjct: 310 YGFVITQNPYDQV 322


>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 436

 Score = 45.1 bits (105), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 48/210 (22%), Positives = 78/210 (37%), Gaps = 19/210 (9%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 210
           LS    L +Y+  E ++GK SFW P+   LD         +  PL+W   ++ +L  L  
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172

Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
            P +    +       +YN +        +   Q   D        + F  A++ + S  
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224

Query: 271 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
           +++      S +  F + P     L +S      L       Q+     Y   E + +  
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283

Query: 328 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           GP  N  LL  YGF   DN ++ L V   L
Sbjct: 284 GPHSNDFLLCEYGFTISDNKWNDLDVTEYL 313


>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
 gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
           SB210]
          Length = 726

 Score = 45.1 bits (105), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 31/188 (16%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLM 165
           AA++D+    A  S+PN ++++ +R   +E       +E L + K ++ A    L ++ M
Sbjct: 67  AATKDIAPLTAFISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFM 126

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
           YE+ +GKKS W  Y   L+          E+ L W+  E+  +     + +         
Sbjct: 127 YERLKGKKSLWHEYFEILENN--------ETILTWTAEEINRIPDPYIQKQA-------- 170

Query: 226 REYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 283
           REY E +D +W     L    P        T E+F  A+  V S C  + QK +     +
Sbjct: 171 REYKEQVDELWDELKELLHSQPNFFQKATATKELFLWAYNIVMSRCFGYTQKGT-----S 225

Query: 284 LVPLGPPL 291
           +VP    L
Sbjct: 226 IVPFADCL 233


>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
          Length = 748

 Score = 45.1 bits (105), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
           V A++D+ A  A   VP +L+++ E+  L + +I      EL   N+ S  E   L  YL
Sbjct: 46  VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
             EKK+G++SF+ PYI+ +            + + W++ EL+ +       E    +E +
Sbjct: 106 FNEKKKGEQSFFYPYIQAIQTNN--------TLIDWTKEELSQIEDPIVLDEFAIVSEDL 157

Query: 225 KREYNELDTVWFMAGSLFQQY 245
           K        +W  A  +F ++
Sbjct: 158 K-------VLWNYAQDIFNEF 171


>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
          Length = 386

 Score = 45.1 bits (105), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 110/249 (44%), Gaps = 23/249 (9%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSF 175
           +++ GD   S+P S ++T   VL N  +   + + K  LS L  L ++L+ E+ +G+ S 
Sbjct: 65  NVKPGDMLISLPESCLLTTSTVL-NSYLGSFIKSWKPHLSPLLALCVFLVCERHRGEASD 123

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W PYI  L +       +   P  +++  +A L  S  + +  E+ E + RE +  +  +
Sbjct: 124 WFPYIDVLPK-------SYTCPAYFTDEVMALLPPS-VQRKAREQREAV-REIHSSNKAF 174

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAY 294
           F +       P +   +  T+E  + A+ +V +  V  L   +   R   V    P L  
Sbjct: 175 FRSLQPVLTQPAE---DVLTYEALRWAWCSVNTRSVFMLHSSNDFLRGQDVYALAPFLDL 231

Query: 295 SSKC-----KAMLAAVDDAVQL-VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
            + C     KA         ++  V R  +  ++ + + G   N +L++ YGFV   NP+
Sbjct: 232 LNHCPDVQVKASFNEETKCYEIRSVSRMLQYQQAFINY-GSHDNQRLMLEYGFVAPCNPH 290

Query: 349 DRLVVEAAL 357
             + V+  L
Sbjct: 291 SVVYVDKDL 299


>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
           Y34]
          Length = 373

 Score = 45.1 bits (105), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 24/163 (14%)

Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 251
           P +W + EL  L   PT A +    E  + +YN E +TV     S+    FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
             F +E      V+   C       S   R ALVPL          C+  ++ + +   +
Sbjct: 162 RTFLYE------VSETECY------SWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207

Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
             DR Y+AGE + +  G   N  LL  YGF+   N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250


>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
          Length = 805

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)

Query: 69  EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
           E  +K+ EDL +   W+ +NG+   KV L+    H       + + A + ++  +  F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMG----NSLYARQMIKEDELLFRI 595

Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YE-------- 167
           P  +    + V  + T+  ++   ++      E   L+L LM        YE        
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655

Query: 168 ---KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
              K   + SFWLPYI+ L +          +P+ W+E E   L GS     + +     
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708

Query: 225 KREYNELDTV 234
           +RE+  +  V
Sbjct: 709 RREWEMMKIV 718


>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
          Length = 305

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 54/250 (21%), Positives = 99/250 (39%), Gaps = 42/250 (16%)

Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
           D+Q G+  F++P S V++      +  + +L     L   A L + ++YE  +G  S W 
Sbjct: 3   DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----------IKRE 227
           PY   L           ++ + WS+ ELA L  S    +I + +            ++R 
Sbjct: 61  PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKIGKDSANEMFTNTIIPLVRRH 113

Query: 228 YN---------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
            +               +L  +    GS    Y +DI  +  + ++ ++ + +       
Sbjct: 114 ASVFFPDPNTAQGASDGDLLALAHRMGSTIMAYAFDIEPDPASKQVDEEGYASDD----- 168

Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
            +  +L +   +VPL   L A + +  A L    DA+ +       AG+ +    G  P 
Sbjct: 169 -EDEALPK--GMVPLADMLNADADRNNARLHYGPDALTMEAVTNISAGDEVFNDYGSLPR 225

Query: 333 SKLLINYGFV 342
           S LL  YG+V
Sbjct: 226 SDLLRRYGYV 235


>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
          Length = 440

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 109/277 (39%), Gaps = 25/277 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
           LQ G    S+P S ++T + V+ +   A +       S L  L  +L+ EK  G +S W 
Sbjct: 68  LQEGQMIISLPESCLLTTDTVIQSYLGAYIAKWKPPPSPLLALCTFLVSEKHAGDRSLWK 127

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L     KA+  E+   ++  +      +  
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLLPISLKAKAEEQRAHVQEFFASSRDFFSS 179

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PLL 292
              LF +    I    F++     A+  V +  V+L   Q   L+       L P   LL
Sbjct: 180 LQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDLL 235

Query: 293 AYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY-- 348
            +S   + KA         ++     ++  E + +  GP  N +L + YGFV   NP+  
Sbjct: 236 NHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNHRLFLEYGFVSGHNPHAC 295

Query: 349 ---DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
               R ++   L + D Q   K  + + +G +    F
Sbjct: 296 VYVSREILVKYLPSTDKQMDKKISILKDHGYIENLTF 332


>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 451

 Score = 44.7 bits (104), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 38/70 (54%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           +D + LV  + + +G  +    GP PNS+LL+ +GF   DNP++ + + A +    P + 
Sbjct: 229 NDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPDNPFESVELWAMMEPGAPGFA 288

Query: 366 DKRMVAQRNG 375
           +K  +   NG
Sbjct: 289 EKNKIMLDNG 298


>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
          Length = 440

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 74/330 (22%), Positives = 133/330 (40%), Gaps = 44/330 (13%)

Query: 65  AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASED 118
           A SR V    + +  +LK W            LK++   +    P H+      + +   
Sbjct: 20  AESRGVNESFKCEFIELKKW------------LKDRKFEDTNLMPAHFPGTGRGLMSKTS 67

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P + ++T + V+ +  +   +T  K   S L  L  +L+ EK  G +S W
Sbjct: 68  LQEGQMIISLPENCLLTTDTVIES-YLGAYITKWKPPPSPLLALCTFLVSEKHAGDQSPW 126

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
            PY+  L +       A   P+   E E+  L   P KA+  E+   ++  +      + 
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLKAKAEEQRAHVQEFFASSRDFFS 178

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF +    I    F++     A+  V +  V+L+   +  L+       L P   L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRHRRRECLSAEPDTCALAPYLDL 234

Query: 292 LAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           L +S   +   A  ++    ++     ++  E + +  G   N +LL+ YGFV   NP+ 
Sbjct: 235 LNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEVFICYGHHDNQRLLLEYGFVSIQNPHA 294

Query: 350 RLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
            + V   +  +     DK+M    N K+S+
Sbjct: 295 CVYVSREILVKYLPSTDKQM----NKKISI 320


>gi|390602144|gb|EIN11537.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 503

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 24/220 (10%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           A+VP+   L A      A L      +++V  +P  +GE I    G  PNS LL  YG V
Sbjct: 255 AMVPMADMLNARYGSENAKLFYESRDLRMVTTKPIASGEQIWNTYGDPPNSDLLRRYGHV 314

Query: 343 D---------EDNPYDRLVVEA--ALNTEDPQYQDKRMVAQ-----RNGKLSVQVFHVHA 386
           D           NP D + V A   LN  + + Q   +  +       G   V VF    
Sbjct: 315 DLLALSDGDGMGNPSDIVEVRADLVLNHVNSKKQSHELEERIDWWLEEGGDDVFVFT--- 371

Query: 387 GREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAG 446
            R+ E  S+++  +RL  +  T   ++      P   V       +L  +      RL  
Sbjct: 372 -RDAELPSELVSLIRLLILPPTEWTKTRDKGKLPKGKVDDVR---ILHVVTGALHERLQQ 427

Query: 447 YPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
           YP ++ +DEA+L    L   KR A  +   EK +L   L 
Sbjct: 428 YPTSIEDDEALLA-TALSENKRQAVIVRLAEKHILRKALH 466


>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
 gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
          Length = 401

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 10/119 (8%)

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           ++AE L   +L     L + +M+E+  G+ S W  Y   L     RG+  +  P+ W+  
Sbjct: 38  SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92

Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
           +L +L G+     + E AE ++ ++NE      +   L   +P   P    T E + +A
Sbjct: 93  QLEHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146


>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
          Length = 533

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 114/274 (41%), Gaps = 38/274 (13%)

Query: 113 VAASEDLQAGDAA--FSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           + AS+ ++A       SVP S       +L +  +A L       E A L+L  +YE   
Sbjct: 101 IFASQAIEASTTTPLLSVPLSTFFARFTLLDSPMMAALAVRPVAREEAKLSLLFLYEYFD 160

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
              SFW P+ +   R+       ++    W +  L  L  +  +  I +    I+ EY++
Sbjct: 161 -PDSFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRDAIRQLEALIEYEYDQ 212

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--- 287
           LD       +L  ++P     + F+++ FK AF+ + S  + +  V+ A    ++P    
Sbjct: 213 LDL-----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-SVNNAPCTVMIPFVDF 266

Query: 288 ----GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRPYKAGESIVVWCGPQP 331
               G   +A+S   +A  A+       DD+V+      +  +  +  GE + +      
Sbjct: 267 FNHNGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHS 326

Query: 332 NSKLLINYGFVDEDNPYDRLVVEAALN---TEDP 362
           N  LL++YGF    N +D  +V    +   T DP
Sbjct: 327 NEVLLLHYGFALPHNEHDTFLVRLHFDREKTNDP 360


>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 706

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 55/206 (26%), Positives = 87/206 (42%), Gaps = 22/206 (10%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 210
           + E   LA +LM +   G +SFW PYI+ L  D Q  R +      L W E T L  L  
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179

Query: 211 SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
              +    +   G++  +E+   +T    W  F+  S        I + AF+ E+ K   
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233

Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
               + V  L+  S+     LVPL   +  +    +   A   + + L+V +    GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287

Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
               GP+ N +L++NYGF    N  D
Sbjct: 288 PNNYGPRSNERLMMNYGFCIRGNVCD 313


>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
          Length = 424

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 42/208 (20%), Positives = 89/208 (42%), Gaps = 24/208 (11%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           LS    ++L+L  E  +GK+S+W P+I+ L          + SP LW       + G   
Sbjct: 114 LSSFQIMSLFLELESSRGKESWWDPFIQMLPTIND----FLTSPFLWQ------IQG--- 160

Query: 214 KAEILER-----AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
           K E++E+      +   + +N  ++ +    +L +   ++   +    + F   ++ + S
Sbjct: 161 KYELIEKLPKSTQKHSLKMFNRFESDFKAVKTLLE--THNASKDIINHDKFVLYWMCINS 218

Query: 269 CVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
             +++   QK + +  F + P     + +S+  +  L        ++    YK  + + +
Sbjct: 219 RCLYMEIPQKKTTSDNFTMAPY-VDFINHSTNDQCKLKIDRTGFHVITTSNYKENDELYL 277

Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVV 353
             GP  N  LL  YGF   +N ++ L +
Sbjct: 278 SYGPHSNEFLLCEYGFHLSNNEWNDLDI 305


>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
 gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
          Length = 728

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)

Query: 83  SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
           +WM K G+    V +   P H  +        A+ D++ GD    VP + ++T +  V G
Sbjct: 51  AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104

Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
           +  +A  L    L        E   L L +M E  +G++S + PY+  L   R     A 
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159

Query: 195 ESPLLWSETELAYLTGS 211
            SPL WS  EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176


>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
          Length = 770

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
           A  ED+Q  +A   +PN  ++T+ER   +E          +  +++  +   L +++M E
Sbjct: 67  AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126

Query: 168 KKQGKKSFWLPYIRELD 184
            ++G +SFW PY   +D
Sbjct: 127 FQKGDQSFWYPYFNAVD 143


>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 400

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 88/204 (43%), Gaps = 23/204 (11%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 219
           A+  + E     +SFW PY+ EL         AV +   W++ ELA +  +    E++E 
Sbjct: 49  AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
           + + I  +Y  +   +    +  Q +   IP+E    E +++A   V S      +  L 
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153

Query: 280 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
                +P     L +S      + C        D  +L+    Y  GE + +  G + +S
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSS 212

Query: 334 KLLINYGFVDEDNPYDRLVVEAAL 357
           +LL +YGF+ E+N  D + + A+L
Sbjct: 213 QLLASYGFMYENNAEDTVDITASL 236


>gi|340503949|gb|EGR30449.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 518

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 91/211 (43%), Gaps = 25/211 (11%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQ 365
           D+   +   +P K G+ I    G + N  LL+ YGF    N YD   +   +N    Q  
Sbjct: 250 DNYFVVTTQKPEKKGQQIYNCYGQRTNKFLLMWYGFCFNKNRYDSYSLRLWINMRQEQLN 309

Query: 366 D---KRMVAQ---------------RNGKLSVQVFHVHAGREKEAIS-DMLPYLRLGYVS 406
           +   +++V Q               +  K+++     +   +K  I+ D++ YLRL  + 
Sbjct: 310 NDLFEKIVFQEFLEKEDCKGGFVWKKQEKVNLDDITQNFRIKKNKINIDLIIYLRLYLMM 369

Query: 407 DTS--EMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH 464
                +++ V+ SL    PVSP  E  VL          L+ +  T+ +D+ +L + NL+
Sbjct: 370 HYKGPDLKRVMVSL----PVSPVYECFVLSFAIRLLSYLLSRFTTTIKDDKELLQNQNLN 425

Query: 465 PKKRVATQLVRMEKKMLNACLQVTADMIMLL 495
            K R A      +K++L   + +    ++LL
Sbjct: 426 YKYRFAIIYRLNQKEILQEQISLMNQALILL 456


>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
 gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
          Length = 393

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 27/192 (14%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LAL ++ E+ +G+ + W PYI  L +        +++   W +TEL+YL  SP   +  E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKVSL 278
           R E I  E+ ++    F    L Q             +++ Q F  V    + H+     
Sbjct: 221 RLEMITTEFGQVQND-FCTCVLEQ-----------ALDVWPQLFGKVSLEDLKHVYATVF 268

Query: 279 ARRFALVPLGP---PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
           +R  A+        P+L +     +S  K     + +   +  DR Y   + I +  G  
Sbjct: 269 SRSLAIGEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 328

Query: 331 PNSKLLINYGFV 342
            N++L ++YGF 
Sbjct: 329 SNAELALDYGFT 340


>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 392

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
           K+E+   DL  W   NG            +  E    +++     + A+ +++A +    
Sbjct: 74  KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121

Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
           VP  L++T+E    N  +  L + +++ +      LA +L+ E+     SFWLPYI+ L 
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            +        ++PL + E E+ YL  +    ++  + +   R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216


>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
           [Brachypodium distachyon]
          Length = 495

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 67/318 (21%), Positives = 113/318 (35%), Gaps = 63/318 (19%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            K WM K+G     V+  +    +       YV A   L+ GD   ++P    +T  R  
Sbjct: 13  FKRWMSKHG-----VVCSDALCLDASEAGGVYVRALSALREGDLVATIPRRACLT-PRTS 66

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           G           +L     LA+ +MYE+ +G +S W  Y+R +             PL+W
Sbjct: 67  GAAAAI---EAAELGGTLALAVAVMYERARGAESPWNAYLRLIPD-------CEPVPLVW 116

Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
            + E    L+G+     + +  E +  ++ E       +G L       +  E F+ E +
Sbjct: 117 PDEEAERLLSGTELDKIVKQDREFLCEDWKECIEPLISSGDL------GVNPEDFSLEKY 170

Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA----------- 308
             A   + S   H+     +    +VPL       +         V DA           
Sbjct: 171 FAAKSLLSSRSFHIDSYHGS---GMVPLADLFNHKTDGEHVHFTKVSDASDSDEGEDDDD 227

Query: 309 --------------------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
                                     +++++ R   AGE +    G   N+ LL  YGF 
Sbjct: 228 QSNAGSDEEPTVENSATNPSGYNDEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFT 287

Query: 343 DEDNPYDRLVVEAALNTE 360
           + DNPYD + ++  L T+
Sbjct: 288 ELDNPYDIVNIDLTLVTK 305


>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 514

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 99/417 (23%), Positives = 168/417 (40%), Gaps = 65/417 (15%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
           VA + ++ AG+    VP    +       +  +   +  +  S  A LA +++ E    G
Sbjct: 85  VATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFDAG 144

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYN 229
            KS + P++R L R        V+S + W+E EL+ L+GS     +  RA     + EY+
Sbjct: 145 SKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVVVFTRAIKAQWRMEYD 195

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
            LD       +L +++P     E    +TF+ F  A   + S  + L   S       + 
Sbjct: 196 ALDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIR 248

Query: 287 LGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
           +  PLL     A   K +    A  +AV++     ++    +      +P+   L+ YGF
Sbjct: 249 VLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGF 308

Query: 342 VDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKE------AIS- 394
           + E NP +   VEA +   D         + R+ K  +   H    +++       +I  
Sbjct: 309 IPETNPAE--CVEATVRVSDHD-------SLRDAKEELLRLHGLDPKKRNFEWKPRSIDY 359

Query: 395 DMLPYLRLGYVSDTSEMQSVIS-----SLGPICPVSPCMERAV-LDQLADYFKARLAGYP 448
           D+L   R+    D +EM    S     S   +   +    +AV L  LA +    L  Y 
Sbjct: 360 DLLAATRV-ITMDEAEMSDATSLTLAVSGASVSAKNDARTKAVLLKSLASF----LESYT 414

Query: 449 ATLSEDEAML-------TDYNLHPKKRVATQLVRMEKKMLNACLQVTADMIML-LPD 497
            TL+ED   +        D  L  K++    L+RM +K +   L  +AD +   LPD
Sbjct: 415 TTLAEDNEYVARVDDESNDEPLPGKRKRFAVLLRMREKQI---LLASADALFKELPD 468


>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
          Length = 440

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 75/347 (21%), Positives = 132/347 (38%), Gaps = 39/347 (11%)

Query: 41  GSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEK 100
           G     +RR+  F   VS         R V    + +  +LK W+         +I    
Sbjct: 5   GGRTSRIRRRKLFRCSVS---------RGVNESYKPEFIELKKWLKDRKFEDTNLIPACF 55

Query: 101 PSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELAC 159
           P           + +   L+ G    S+P S ++T + V+ +     +       S L  
Sbjct: 56  PGTGRG------LMSKTSLREGQMIISLPESCLITTDTVIRSYLGTYIAKWQPPPSPLLA 109

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L  +L+ EK  G +S W PY+  L +       A   P+   E E+  L   P KA+  E
Sbjct: 110 LCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLKAKAEE 161

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKV 276
           +   ++  ++     +     LF +    I    F++     A+  V +  V   H Q+ 
Sbjct: 162 QRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVKHRQRQ 217

Query: 277 SLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPN 332
             +       L P   LL +S + +   A  ++    ++      +  E + +  GP  N
Sbjct: 218 CFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICYGPHDN 277

Query: 333 SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
            +LL+ YGFV   NP+  + V   +  +     DK+M    N K+S+
Sbjct: 278 QRLLLEYGFVSIHNPHACVYVSEDILVKYLPTTDKQM----NKKISI 320


>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
 gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
          Length = 389

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 57/255 (22%), Positives = 104/255 (40%), Gaps = 35/255 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+  ++AG+    +P+ LV+T E++  ++ + +LL+T    +   L L ++ E+ +G+ S
Sbjct: 14  AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PY+  L            S + W + EL +L  +       ER E I  EY  +  V
Sbjct: 70  RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEYISVKNV 124

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
                      P+    +   F+ F  A+  V   V      +L+   ++ P       +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRP-------F 166

Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINYGFVDEDNPYDRLV 352
              C        D V           ++ VV+   G + N+ L ++YGFV  +N  D+  
Sbjct: 167 VDFCN------HDPVSHATVSHDSCKDATVVFISYGKRSNAVLAVDYGFVLPNNLSDQAE 220

Query: 353 VEAALNTEDPQYQDK 367
           +   +   DP  + K
Sbjct: 221 LWMEIPWNDPLREKK 235


>gi|397642897|gb|EJK75526.1| hypothetical protein THAOC_02751 [Thalassiosira oceanica]
          Length = 395

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 69/319 (21%), Positives = 131/319 (41%), Gaps = 45/319 (14%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            K W    G+        E+   ++K R I Y   +   +AG A   VP  L+++  + +
Sbjct: 82  FKYWASTMGIEKNDCFKLEE--QDKKQREI-YAMTTRSTEAGTAVLYVPEHLILSSSKAM 138

Query: 141 ----------GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
                       E +A +   ++L E   L L ++ E ++G  S W  ++  L R     
Sbjct: 139 AELRTDGMAEAEEYLASVGAESQLREY-YLMLKVLLEYQKGSDSEWHKWLDALPRYYSNA 197

Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
                  +  +E  L  L     K  + ER       Y+ + +V F+A  + + +P D+ 
Sbjct: 198 -------VAMTEFCLTCLPPLMKKLAVEERDAQKLLSYDSIQSVPFLADDIKEGFPRDMV 250

Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
           T A+               +V+ + V        ++P+G     ++S    ++   D+A 
Sbjct: 251 TWAYQ--------------IVYTRSVETEDGDLKIIPMG-DFFDHASDYAEIVPQYDEAG 295

Query: 310 QLVVDRPYK--AGESI-VVWCGPQPNSKLLINYGFVDEDNP--YDRLVVEAALNTE--DP 362
                  Y   AG+ +  ++  P+  S LL  YGF+DE  P  Y +L +   +N E  + 
Sbjct: 296 NYYAVTAYDVPAGKKLRYIYSNPRNPSHLLARYGFIDEICPATYCKL-LPPTVNEEMIEL 354

Query: 363 QYQDKRMVAQRNGKLSVQV 381
            Y  ++M+  R+G+++ +V
Sbjct: 355 GYSQEKMLFYRSGEVADEV 373


>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
          Length = 458

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 66/316 (20%), Positives = 125/316 (39%), Gaps = 55/316 (17%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
           ++ W+ ++G     + L   P+     R +         + G+   ++P+  + T+E   
Sbjct: 1   MEGWLRESGAELDGLELAHFPAIGRGVRTLRC------FKQGERILTIPSGCLWTVEHAY 54

Query: 141 GNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV---- 194
            +  +  +L + +  LS    LA+Y+++ +            RE      R  +A     
Sbjct: 55  ADAVLGPVLRSAQPPLSVEDTLAIYILFVRS-----------RESGYDGLRSHVAALPAS 103

Query: 195 -ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT-VWFMAGSLFQQYPYDIPTE 252
             S + + + EL    GS       +  + I+ +Y  L   V+ +   LF       P  
Sbjct: 104 YSSSIFFEDDELEVCAGSSLYTITRQLEQRIEEDYRGLVVRVFGLHLDLF-------PLN 156

Query: 253 AFTFEI--FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---------LAYSSKCKA- 300
            FT E   +K A   V S  +            ++P G PL         + +S + K  
Sbjct: 157 KFTIENVGYKWALCTVWSRAMDF----------VLPNGNPLRLLAPFADMVNHSPEVKQC 206

Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT 359
            +  A    + ++  + Y+A + + ++ GP PNS+LL  YGFV  DNP D   +  + + 
Sbjct: 207 HVYDASSGNLSILAGKDYEAEDQVFIYYGPMPNSRLLRLYGFVIPDNPNDSYDLVLSTHP 266

Query: 360 EDPQYQDKRMVAQRNG 375
             P Y+ K+ +    G
Sbjct: 267 LAPFYEQKQKLWASAG 282


>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
          Length = 565

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 15/148 (10%)

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQ 372
           G  I +  GP  N +LL  YGFV E N +DRL       EAA+  E   +  +R  +VA+
Sbjct: 362 GREIWMSYGPLQNWELLQFYGFVLEGNEHDRLPFPLDFPEAAVGDE---WDGRRAALVAK 418

Query: 373 RNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAV 432
               L+   +  H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V
Sbjct: 419 YGLHLAGCCWICHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARV 473

Query: 433 LDQLADYFKARLAGYPATLSEDEAMLTD 460
              +AD  +  L  +  +L EDE +L +
Sbjct: 474 FATIADTIRCILDLFSTSLEEDERLLEN 501


>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
          Length = 422

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 53/112 (47%), Gaps = 17/112 (15%)

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED- 361
           V  +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED 
Sbjct: 315 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEEDS 374

Query: 362 ---------PQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGY 404
                      +  +RM+  R  +  +Q     + ++   +     YLRLG+
Sbjct: 375 SSPSVTTSQTSHMGERMLG-RQSRTGLQ----RSTKKDSFVHCYFVYLRLGH 421


>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 354

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           +++PL   L A +SKC A L    + +++V  R  +  E +    G  PNS+LL  YG+V
Sbjct: 80  SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139

Query: 343 DED-NPYD 349
           + D + YD
Sbjct: 140 EWDGSKYD 147


>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
 gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
          Length = 534

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 9/144 (6%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A  DL AG     +P S + +      N TI+ LL   ++  +  L L  +YE    +
Sbjct: 40  VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95

Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNE 230
            KS W PY++ +     +G ++V  P  WSE     L G  T  + L  A   ++E Y  
Sbjct: 96  EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRG--TTLDTLYDALSPQQEVYEG 152

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF 254
            +    +A    Q++   +P E F
Sbjct: 153 FEISLHVAKKWNQEFSLPLPEEYF 176


>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 391

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 10/120 (8%)

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
           ++FK  +  V S   H +       F  L P    +    S     +       ++  +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223

Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFV---------DEDNPYDRLVVEAALNTEDPQYQD 366
            Y AGE ++   G  PN KLL++YGF+         D+D   D  +++   NT   Q QD
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRLDHYILDNLSNTTRDQLQD 283


>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
 gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
          Length = 490

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 71/185 (38%), Gaps = 18/185 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V A +DL AGD    VP S  ++  R  G   IA LL  + L  +A L +  +YE+  G 
Sbjct: 36  VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNEL 231
           +S W  +   L           E P  WS  E   L       EI     G   E Y EL
Sbjct: 92  QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGTEVEEIGGLETGEDEEVYQEL 148

Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
              +F            I  E  +F+ FK+  V + S    + +    R   LVP G  L
Sbjct: 149 IVPFFEDNGKL------INLECPSFDEFKKLVVVIASRAFEVDQF---RELCLVP-GACL 198

Query: 292 LAYSS 296
             +S 
Sbjct: 199 FNHSD 203


>gi|428181778|gb|EKX50641.1| hypothetical protein GUITHDRAFT_135258 [Guillardia theta CCMP2712]
          Length = 254

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 44/158 (27%), Positives = 66/158 (41%), Gaps = 25/158 (15%)

Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAA 356
           + K+ L  V+  VQL+   P KAGE I ++ G    +  L  +GF D DNP D +  E  
Sbjct: 72  RYKSELGRVE--VQLLA--PVKAGEQIFIYYGALSTASELTRFGFCDRDNPNDTVPFELD 127

Query: 357 LNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS 416
           L +E  + Q K M          +V+      ++    D LP  RL            + 
Sbjct: 128 L-SEMTELQRKAM----------EVWEFRPDVQQLLKRDGLPSWRL----------LAML 166

Query: 417 SLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSED 454
            +  +  +S   E+ V   + +   A  AGYP  L ED
Sbjct: 167 RILHLNQLSVANEKLVWGTMEELLNAVTAGYPTRLEED 204


>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
 gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
          Length = 689

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 46/207 (22%), Positives = 81/207 (39%), Gaps = 36/207 (17%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           LA ++++E+ +G+ S W PY+  L R        + S L + + +L +L G+        
Sbjct: 104 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDNDLEWLQGTNLYQTHQA 158

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
               +K EY+   ++    G L          E++ ++IF  A+  + S      +   +
Sbjct: 159 YRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 204

Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
           R         P L    + + ML  VD +                + L V  P  +GE +
Sbjct: 205 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFSGEEV 264

Query: 324 VVWCGPQPNSK-LLINYGFVDEDNPYD 349
               G   N + ++  YGF   DNP D
Sbjct: 265 HNNYGSLNNQQSVMTTYGFCIVDNPCD 291


>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
          Length = 512

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
           V A++D+ AG     VP  L++++E    +  +   L  ++  L+    LA++L+ E  +
Sbjct: 37  VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96

Query: 171 GKKSFWLPYIREL 183
           G  SFW PY+R L
Sbjct: 97  GAASFWQPYLRSL 109


>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 492

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           +S    A +   ++ V+LV  R  KAG+ I +  G   N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303


>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
 gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
          Length = 555

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 117/275 (42%), Gaps = 45/275 (16%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
            + G+   ++P+ ++ T+E    +  +  +L + +  LS    LA Y+++ +        
Sbjct: 121 FKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLATYILFIRS------- 173

Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYN 229
               RE      R  +A       S + ++E EL    G+   T  + L+R+  I+ +Y 
Sbjct: 174 ----RESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYR 227

Query: 230 ELDTVWFMAGS--LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRF 282
            L  V  +A S  LF       P + F+ E +K A   V S  +         + L   F
Sbjct: 228 TL-VVRVLAQSRDLF-------PLDKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPF 279

Query: 283 ALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           A       +L +SS+ +   +  A    + ++  + Y+AG+   ++ G  PNS+LL  YG
Sbjct: 280 A------DMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYG 333

Query: 341 FVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNG 375
           FV   NP D   +  + +   P ++ K+ +    G
Sbjct: 334 FVMPGNPNDSYDLVISTHPSAPFFERKQKLWASAG 368


>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 429

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 69/326 (21%), Positives = 122/326 (37%), Gaps = 83/326 (25%)

Query: 72  SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY--------VAASEDLQAGD 123
           S + E++ + KSW+ + G               + H  +H+        VAA  D+ +  
Sbjct: 3   SIEPENVANFKSWIAQQG--------------GQIHAGVHFEPVEFGFNVAARSDIPSDA 48

Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELAC--LALYLMYEKKQGKKSFWL 177
              S+P SL +T    +    I +LL T   +    +L C  + L+ + E          
Sbjct: 49  TVVSIPFSLAITPN--VARHAIKQLLNTEPQNWSERQLECTYIVLHSIVEPIDPSILRHR 106

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L          + +PL ++E EL+   GS      L+R    + E+ +       
Sbjct: 107 PYLDTLPSPE-----QLRTPLHFTEAELSSFRGSNLFGATLDRKHEWETEWQQCKNTVSA 161

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG--PPLLAYS 295
           A + + Q        +FT+E +  A   + S           R F    L   P L+   
Sbjct: 162 AIAGWGQ--------SFTWEKYLTAATYLSS-----------RAFPSTILSDTPSLVTTE 202

Query: 296 SKCKAMLAAVD---------------------------DAVQLVVDRPYKAGESIVVWCG 328
           +    +L  +D                            ++ LV+  P   G  ++   G
Sbjct: 203 TSYPVLLPGIDALNHARGHPVSWVVSAPSQTSSSQRSESSISLVIHTPTPRGSELLNNYG 262

Query: 329 PQPNSKLLINYGFVDEDNPYDRLVVE 354
           P+PNS+L++ YGF   +NP D +V++
Sbjct: 263 PKPNSELILGYGFSLPNNPDDTIVLK 288


>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 62/301 (20%), Positives = 121/301 (40%), Gaps = 57/301 (18%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE----DLQAGDAAFSVPNSLVVTL 136
           L +W   NG+    +  ++    +   +    VA +E    D +A D   +VP+ L +TL
Sbjct: 11  LSTWAKLNGMSLEGIAFQKLHGEHGTDKGTAIVATAEKKDEDAEA-DTLLTVPSDLALTL 69

Query: 137 ERVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWL 177
           E V  +           + + +   T + + L  L + + +         +K G  + W 
Sbjct: 70  EYVHNHAKTDRHLREVLDAVGDFGRTARGAILIFLIVQITHASPDFANQRQKIGVSNPWT 129

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----D 232
            YIR +         ++  P  +S  E   L G+  +  +  +   +++E+  L     D
Sbjct: 130 EYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFEHLRQATED 182

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
             W       Q++ +D  T  FTF+  K      +S VV L +   A       + P + 
Sbjct: 183 IHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA-------IVPCVD 229

Query: 293 AYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDE 344
             +  C+ ++ A  D        +QL   +  + GE + +  G + P S+++ +YGFV+ 
Sbjct: 230 MANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVEN 289

Query: 345 D 345
           +
Sbjct: 290 E 290


>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
          Length = 440

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 128/322 (39%), Gaps = 32/322 (9%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           SR V    + +  +LK W+         +I    P           + +   LQ G    
Sbjct: 22  SRGVNESYKPEFIELKKWLKDRKFEDTNLIPACFPGTGRG------LMSKTSLQEGQVII 75

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDR 185
           S+P + ++T + V+ +   A +       S L  L  +L+ EK  G +S W PY+  L +
Sbjct: 76  SLPETCLLTTDTVIRSYLGAYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWKPYLEILPK 135

Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
                  A   P+   E E+  L   P +A+  E+   ++  ++     +     LF + 
Sbjct: 136 -------AYTCPVC-LEPEVVNLFPKPLRAKAEEQRARVREFFSSSRGFFSSLQPLFSEA 187

Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSSKCK 299
              I    F++     A+  V +  V++ K    R F+  P    L P   LL +S   +
Sbjct: 188 VGSI----FSYRALLWAWCTVNTRAVYV-KPRRRRCFSAEPDTCALAPYLDLLNHSPHVQ 242

Query: 300 AMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
              A  ++    ++      +  E + +  GP  N +LL+ YGFV   NP+  + V   +
Sbjct: 243 VEAAFNEETRCYEIRTASSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPHACVYVSEDI 302

Query: 358 NTEDPQYQDKRMVAQRNGKLSV 379
             +     DK+M    N K+S+
Sbjct: 303 LVKYLPSTDKQM----NKKISI 320


>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 889

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 15/135 (11%)

Query: 79  GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
           G L+ W+   GLPP KV +  +            V   E L       +VP  L++T + 
Sbjct: 32  GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85

Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
            L +     LL +  +   + LA +L   ++Q  G K+ W  Y+  L  Q G        
Sbjct: 86  ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138

Query: 197 PLLWSETELAYLTGS 211
            L W+  E+  L G+
Sbjct: 139 VLEWASEEVDLLRGT 153


>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
 gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
          Length = 573

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 9/79 (11%)

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD- 349
           LL + +  K      +D V  V     K GE +    G + N  LL++YGFV + NPYD 
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYDL 288

Query: 350 --------RLVVEAALNTE 360
                   + +++ ALN E
Sbjct: 289 TRLTLRLTKEMIDEALNAE 307


>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
          Length = 209

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP SL +T + +       +   +  +  +A LA  L+  K  G+ S
Sbjct: 66  ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PYI  L  Q       + + + W+E+EL  +  S    E + +   I++++ E+  V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177


>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
          Length = 483

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 63/281 (22%), Positives = 117/281 (41%), Gaps = 58/281 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
           +AA+ DL+ G+    +P + ++T +RV  ++  IA  ++ +K  LS +  L + L+ E  
Sbjct: 51  LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
           +G  S W PY+ +L                   T LA  T +  + E L+  + I   ++
Sbjct: 111 KGSNSVWYPYLCQLPSYY---------------TILA--TFNDFEVEALQVDDAIWVAQK 153

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
             + + + W  A  L ++  +    +   F+ +  AF  V S  +H   ++      L P
Sbjct: 154 AKSAIKSDWEDATPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 208

Query: 287 LG-------------------PPLLAYSSKCKAMLAAVD----------DAVQLVVDRPY 317
           +G                     L  Y  K   M  + +          +A  L   + Y
Sbjct: 209 VGDLFNYAAPDDDTLLEDEDTAELTNYQQK-NGMTNSSERLTDGGYEDCNAYCLYARKNY 267

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
           K GE +++  G   N +LL +YGF+  +NP ++  +E  L+
Sbjct: 268 KKGEQVLLAYGTYTNLELLEHYGFLLGENPNEKTFIELDLD 308


>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 467

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 78/192 (40%), Gaps = 13/192 (6%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L+L+ E  +  KSFW PYIR L +  Q  + Q A+     W + E   L G+  +  I +
Sbjct: 95  LFLIKEYLKRDKSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDK 152

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE--IFKQAFVAVQSCVVHLQKVS 277
               ++R+  E   +  + G     +   + TE + +   IF          +   Q+ S
Sbjct: 153 IRNDVRRDLQEAQELLRLHGDADGAFGKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRS 212

Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVVWCGPQ 330
           L R   +      L  +      M   +    DD  Q   L V R +  G+ +      +
Sbjct: 213 LPRGVTMDDFSVLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMK 272

Query: 331 PNSKLLINYGFV 342
            N++LL+ YGF+
Sbjct: 273 TNAELLLGYGFM 284


>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
           melanoleuca]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 72/321 (22%), Positives = 124/321 (38%), Gaps = 30/321 (9%)

Query: 67  SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
           SR V    + +  +LK W+         +I    P           + +   L+ G    
Sbjct: 22  SRGVNESYKPEFIELKKWLKDRKFEDTNLIPACFPGTGRG------LMSKTSLREGQMII 75

Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIRELDR 185
           S+P S ++T + V+ +   A +       S L  L  +L+ EK  G +S W PY+  L +
Sbjct: 76  SLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPK 135

Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
                  A   P+   E E+  L   P KA+  E+   ++  ++     +     LF + 
Sbjct: 136 -------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSSLQPLFSEA 187

Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKC-- 298
              I    F++     A+  V +  V   H Q+   +       L P   LL +S +   
Sbjct: 188 VESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLLNHSPRVQV 243

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
           KA         ++      +  E + +  GP  N +LL+ YGFV   NP+  + V   + 
Sbjct: 244 KAAFNEETRCYEIRTASGCRKHEEVFICYGPHDNQQLLLEYGFVSIQNPHACVYVSEDVL 303

Query: 359 TEDPQYQDKRMVAQRNGKLSV 379
            +     DK+M    N K+S+
Sbjct: 304 VKYLPLTDKQM----NKKISI 320


>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
 gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 421

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 312


>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
 gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
          Length = 395

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 61/276 (22%), Positives = 104/276 (37%), Gaps = 56/276 (20%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
           A+  L   +   SVP  L+ + E     R+ G  T A  L          LA  L+ EK 
Sbjct: 59  ATRPLAKDELVLSVPRKLIFSEESNSDCRLFGKMTQATHLN---------LAYDLVIEKI 109

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
           +G+ S W PYI  L  +         + L ++  ++  L G+   +  L +   I ++Y 
Sbjct: 110 RGEFSEWRPYIDVLPAK-------YSTVLYFTTKQMELLRGTAAASLALRQCRVIAKQYA 162

Query: 230 ELDTVWFMA--------------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
            L                     G  F Q+          +E+++ A     S V+  Q 
Sbjct: 163 FLYRYAHTMTEPSTGNRSHPGERGLFFTQH-------GLCYELYRWAV----STVMTRQN 211

Query: 276 VSLARR----------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
           +  + +           AL+P          K  +  AAV   ++        AGE   +
Sbjct: 212 LVPSEKQESEDTPKLISALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFI 271

Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
           + G + N+ LL++ GFVD++N  D + +   L+  D
Sbjct: 272 YYGDRSNTDLLVHNGFVDDNNLKDYVNIRVGLSLTD 307


>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 356

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 41/168 (24%), Positives = 69/168 (41%), Gaps = 22/168 (13%)

Query: 197 PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
           P  W       L G  T   +LE+ +    R++  L +           YPY +P+E + 
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150

Query: 256 FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
              F    V+ ++     Q+  L     R A++P+       S  CK    A  ++  +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205

Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
            DR Y  G+ +    G   N  LL  YGF+ ++N  DR   +  +++E
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRFDPDDLISSE 253


>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 490

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 66/297 (22%), Positives = 123/297 (41%), Gaps = 49/297 (16%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
           L SW   NG+    +  ++  S +   +    VA +E   + G+A    +VP+ L +TLE
Sbjct: 11  LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70

Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
            V  +           + + +   T + + L  L + + +         +K G  + W  
Sbjct: 71  YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
           YIR +         +V  P  +S  E   L G+  +  +  +   +++E++ L     + 
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
            W       Q++ +D  T  FTF+ +K      +S VV L +       A+VP   +   
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233

Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
               S K K       +AV QL   +  + GE + +  G + P S+++ +YGFV+ +
Sbjct: 234 ACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENE 290


>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score = 43.1 bits (100), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 123/329 (37%), Gaps = 66/329 (20%)

Query: 83  SWMHKNG-LPPCKVILKEKPSHNEKHRPIHY--VAASEDLQAGDAAFSVPNSLVVTLERV 139
           +W  KN  L P   +     S  EK     Y  V A+ D+ + D    +P     T+  V
Sbjct: 76  AWRVKNNILAPNVEVAYVGGSEKEKGGDDLYRGVKATSDIASEDDLVRLPRE--ATMLVV 133

Query: 140 LGNETIAELLTTNKLSELAC-------LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
            G E   E   +N+L   A        +AL L+YEK  G +S +  YI +L +       
Sbjct: 134 EGQENPHEEYISNELWAKAGDERWALRVALVLLYEKSLGSRSKFYEYIEQLPK------- 186

Query: 193 AVESPLLWSETE---LAYLTGSP-TKAEILE------------RAEGIKREYNELDTVWF 236
           + E+   W+E E   L Y  G    K + LE            R  G+K    E + +W 
Sbjct: 187 SFENLGTWTEEEVRELQYSVGEKFAKEQRLENEKACELIQEYARDGGLKTIERE-EVIWA 245

Query: 237 M--------AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQS------CVVHL------ 273
           +        +G +  Q       +P       +F  +F+  Q+      CV  L      
Sbjct: 246 LDVVRSRVFSGKIADQEALQRKLLPRALSVGTVFA-SFLTAQTTELKWLCVFALLALVVF 304

Query: 274 -----QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
                  V     + L+PL      + +  K      +    L   + YK GE +++  G
Sbjct: 305 DSTKENDVKTDTAYVLMPL-IDAFNHQTMLKTEFEFTNSEFALKSPKSYKKGEEVLISYG 363

Query: 329 PQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
             PN +LL+ YGFVD+ N  D    E  L
Sbjct: 364 LMPNDELLLRYGFVDDQNVADTYQFEGLL 392


>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 493

 Score = 43.1 bits (100), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 25/203 (12%)

Query: 158 ACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
           ACL  L L++E+  G+ S +  Y+  L +           PL W+E E+  L G  T AE
Sbjct: 133 ACLTVLRLLHERGLGESSPFHSYLSVLPQDH-------RLPLEWTEAEVGLLQG--TSAE 183

Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
            L  A  +  ++    +V         Q+P        T   F +    V+S     +  
Sbjct: 184 PLVGAGSLDSQFEAFQSV-------VAQHPTVWEPSVCTKAAFAKGVNWVRS-----RGF 231

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSK 334
           ++     ++P G  +  +    +++    D     V+   +P KAGE +    G   N++
Sbjct: 232 TVMGDPHMIP-GADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQ 290

Query: 335 LLINYGFVDEDNPYDRLVVEAAL 357
           LL +YGFV   N +D +++   L
Sbjct: 291 LLNSYGFVLPGNSFDTVLIPTQL 313


>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
          Length = 415

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 64/269 (23%), Positives = 109/269 (40%), Gaps = 24/269 (8%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
           L+ G    S+P S ++T + V+ +   A +       S L  L  +L+ EK  G +S W 
Sbjct: 44  LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 103

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
           PY+  L +       A   P+   E E+  L   P KA+  E+   ++  ++     +  
Sbjct: 104 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 155

Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
              LF +    I    F++     A+  V +  V   H Q+   +       L P   LL
Sbjct: 156 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 211

Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
            +S +   KA         ++      +  E + +  GP  N +LL+ YGFV   NP+  
Sbjct: 212 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICYGPHDNQQLLLEYGFVSIQNPHAC 271

Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
           + V   +  +     DK+M    N K+S+
Sbjct: 272 VYVSEDVLVKYLPLTDKQM----NKKISI 296


>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
 gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
           77-13-4]
          Length = 471

 Score = 43.1 bits (100), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 36/230 (15%)

Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
           A++L+ +   G++SFW PYI+ L +       A+  PLLW E++L +L G+  +  +  +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-------- 271
           + + +KR        W  A    Q+Y +D P++ FT E+   A+    S           
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYSRYFWSIILEPD 218

Query: 272 ---------HLQKVSLARRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
                    HL K  +        L P L  L ++ +        D  + +  +   K G
Sbjct: 219 VANIKPEFQHLVKAGMNLDDTAKILLPILETLNHAQETNTEYNLDDKGLSVSKNIELKPG 278

Query: 321 ESIVVWCGPQP----NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD 366
           +   +    +     N+ LL ++GF+  DN    LV+ +  +   P + D
Sbjct: 279 DPFYIAYDKETQRFNNTVLLKDFGFILPDNEAAELVLSSPFDLTRPMHLD 328


>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
 gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
          Length = 504

 Score = 43.1 bits (100), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319


>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
          Length = 199

 Score = 42.7 bits (99), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 53/205 (25%), Positives = 89/205 (43%), Gaps = 30/205 (14%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L ++L+ E+ +G  SFW PY+  L         +    L W+  E+  L    TK    +
Sbjct: 1   LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52

Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKV 276
                +  +N L   +     L +Q P      AFT+++FK A+ +V +  V++   Q  
Sbjct: 53  LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNS 108

Query: 277 SLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVW 326
            L+     + AL P    LL ++   +   A  DD+        L   +PY   + + + 
Sbjct: 109 VLSPDEEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFIN 163

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRL 351
            GP  N KLL+ YGF    NP++ +
Sbjct: 164 YGPHSNEKLLLEYGFTLPCNPHNNI 188


>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 510

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A+ DL+ G+    VP S ++T +  L +  +   +  +  LS    L + L+YE  +G
Sbjct: 57  LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116

Query: 172 KKSFWLPYIRELDR 185
           + SFW PY+  L R
Sbjct: 117 QSSFWYPYLMHLPR 130


>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Metaseiulus occidentalis]
          Length = 278

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 37/182 (20%), Positives = 83/182 (45%), Gaps = 12/182 (6%)

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           R YK  E + ++ G + N++ +++ GFV ++N +D L ++  L+  D  ++ KR + ++ 
Sbjct: 96  REYKKNEQVNIFYGNRANAQFMLHNGFVPDENQWDSLAIKIGLSKADKLFEMKRRLCEQM 155

Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYV--------SDTSEMQSVISSLGPICPVSP 426
              +  VF +    + + +  ++P + L  V        SD +   + + +  P     P
Sbjct: 156 KIPTSDVFELKKAPDGDGV--LVPKVLLHLVHILQWKAPSDGTTSGTDVGA-DPSDATDP 212

Query: 427 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML-NACL 485
              +     L    +  +   P ++ E   +L D     + ++A +    E++ML NAC 
Sbjct: 213 VRTKKAKTFLHVRCQLLMKALPRSVEELTEILNDPTTSLESKLAIRYRLSEQRMLTNACN 272

Query: 486 QV 487
           ++
Sbjct: 273 KI 274


>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
 gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
          Length = 195

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAALNTED 361
           V  +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 14  VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 72


>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
           FGSC 2508]
 gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 503

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 29/50 (58%)

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           A  +   RPY AGE + +  G   N  LLI YGF+ ++N +D + ++ A+
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCIDDAI 319


>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 494

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 54/253 (21%), Positives = 99/253 (39%), Gaps = 36/253 (14%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
           + A ED+  G+  F +P   ++ +              T++L E   L L ++YE    G
Sbjct: 43  MVAVEDVAEGETLFEIPRGSILNVNTSALTRDYPSF-GTSQLGEWEELILCMLYEMFVLG 101

Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-E 222
           + S W PY   L          + S + WS+ EL  L         G     E+  +   
Sbjct: 102 ENSRWYPYFNVLP-----SSAELNSLIYWSDRELGLLKPSFVIERIGRGKSQEMFSKVLS 156

Query: 223 GIKREYNELDTV--------WFMAGSLFQQYPYDI----PTEAFTFEIFKQAFVAVQSCV 270
            I+ + ++L  +        +    S+   Y +D+    P      EI      +  S  
Sbjct: 157 YIENQDSDLSLIAKYLTWENFVYVASIIMSYSFDVEDLNPQSDEDDEIEDDDNDSEMSPD 216

Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
             ++        +++PL   L + +  C A L    + +++   +P +AGE +    G  
Sbjct: 217 KSIK--------SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEH 268

Query: 331 PNSKLLINYGFVD 343
           PNS++L  YG+V+
Sbjct: 269 PNSEILRRYGYVE 281


>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
          Length = 479

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 7/188 (3%)

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKR 368
           + ++  + Y+ G+ I ++ G  PN++LL  YGFV  DNP D   +    +   P Y+ K 
Sbjct: 239 LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLDNPNDSYDLVLQTSPMAPLYEQKE 298

Query: 369 MVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYV--SDTSEMQSVISSLGPICPVSP 426
            +    G  S     + A  +     ++L YLR   +  +D ++M   + + G    V+ 
Sbjct: 299 RLWALAGLDSTCTIPLTA--KHPLPKNVLRYLRTQRLDAADVADMTLQLLN-GTDGKVND 355

Query: 427 CMERAVLDQLADYFKARLAGYPATLSEDEAMLTD--YNLHPKKRVATQLVRMEKKMLNAC 484
             E  VL  L D   + L G+   L + EA L    Y        A Q+   E+ +L   
Sbjct: 356 GNEIQVLQFLIDSLGSVLEGFGIPLEKLEAQLAGGFYPAGGNAWAAAQVSAGEQGILTRA 415

Query: 485 LQVTADMI 492
            +   DM+
Sbjct: 416 KKTAEDML 423


>gi|356511297|ref|XP_003524363.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3-like [Glycine max]
          Length = 449

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 70/172 (40%), Gaps = 19/172 (11%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD--RLVVEAAL------ 357
           D  +++V +   K  + +++  G   N   L++YGFV   NPYD   L  + AL      
Sbjct: 232 DSKMKVVAETAIKEDDPLLLCYGCLNNDLFLLDYGFVMHSNPYDCIELKYDGALLDAAST 291

Query: 358 -------NTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
                  N   P    + +++Q N         V  G ++     +L  LR+   ++   
Sbjct: 292 AAGVSSPNFSTPAPWQELILSQLNLAGETPDLKVSLGGQETVEGRLLAALRVILSTNVET 351

Query: 411 MQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
           MQ    S++ SL    P+    E AV   L       L  +P  + +DE++L
Sbjct: 352 MQKYDLSILQSLDAEAPLGVANEIAVFRTLIALCVIALGHFPTKIMDDESLL 403


>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
           +AA+ DL  G+   +VP S ++T + +L +E ++  +  +  LS    L + L+ E  +G
Sbjct: 51  LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110

Query: 172 KKSFWLPYIRELDR 185
           K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124


>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
           +AA+ DL  G+   +VP S ++T + +L +E ++  +  +  LS    L + L+ E  +G
Sbjct: 51  LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110

Query: 172 KKSFWLPYIRELDR 185
           K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124


>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
          Length = 553

 Score = 42.7 bits (99), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 25/45 (55%)

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
            D+ V +++    KAGE I    G   NS LL  YGF  EDNP+D
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHD 363


>gi|363747293|ref|XP_003643967.1| PREDICTED: N-lysine methyltransferase SETD6-like [Gallus gallus]
          Length = 447

 Score = 42.7 bits (99), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 113/273 (41%), Gaps = 37/273 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           A+ DL+ G+  FSVP S +++      R L ++    L +    S    L L L++E   
Sbjct: 51  AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SGWVPLLLALLHEYTT 107

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
           G  S W PY         +   +++ P+ W E E +  L G+     + +    I+ EY+
Sbjct: 108 G-TSHWRPYFS-----LWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
            +  + FM     + +P     E  T E++KQ  AFV   S    L++     +    P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215

Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
             P   +L + +   A L      +++V  +P   G+ I    G   N +LL  YGF + 
Sbjct: 216 MVPVADILNHVANHNASLKYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
             PY         NT D    D +MV  R   L
Sbjct: 275 --PYPG-------NTNDT--ADIQMVTVRKAAL 296


>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
           Y34]
          Length = 464

 Score = 42.4 bits (98), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 3/129 (2%)

Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
           L  Q+    P E FT E +K A   V S  +            L P    +L +S   K 
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204

Query: 301 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
             A  +    + ++  + Y+AG+ + ++ GP  NS+LL  YGFV   N  D   +  A +
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSNDNYDLVLATH 264

Query: 359 TEDPQYQDK 367
            E P +  K
Sbjct: 265 PEAPFFARK 273


>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
          Length = 777

 Score = 42.4 bits (98), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 39/165 (23%), Positives = 74/165 (44%), Gaps = 31/165 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLM 165
           VAA + +   +A   +PN L++  +++  +E          E   T K S+   L  ++ 
Sbjct: 112 VAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPNEFKNTEK-SDQTSLIFFVA 170

Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVES--PLLWSETELAYLTGSPTKAEILERAEG 223
            E  +G++S+W PY           + A +S  P  W +  +  L  +  KAE+      
Sbjct: 171 LELLKGEESYWHPYF----------ETAQDSDLPQFWEDQNIDELEDALIKAEL------ 214

Query: 224 IKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
              + +++D +  + +A  +   YP  +  E FT EI+K+A+  V
Sbjct: 215 ---QMHQVDFIGDYEIAHGIANHYPDLVHAEKFTIEIYKRAYNIV 256


>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
 gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
          Length = 278

 Score = 42.4 bits (98), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 57/254 (22%), Positives = 105/254 (41%), Gaps = 36/254 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A+ D +  +   S+P  L++T   +       ++     L     L  + + EK+Q  K 
Sbjct: 35  ATTDFRENETIISIPVGLIITAGFIAEMPDYCDVFKRYCLKPFEALVYFFLVEKEQNSK- 93

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
            W PY+  L             P  +S     + +  P       R +   ++ NEL   
Sbjct: 94  -WTPYLEVL-------------PKSFSTPASLHPSLKPEDFPYCLRKQWYVQK-NELKI- 137

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKV------SLARRFALVPL 287
                 +++++   I  +   ++ F  A+  V + C+    K+      +     A+VPL
Sbjct: 138 ------MYEKF-VTILADNTIWDHFLWAWHIVNTRCIYRNNKLHPLIDNTEDDSLAIVPL 190

Query: 288 GPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
              +L +S  S+C A+  +  +  +++V RP + GE I +  G   N  L I YGF  +D
Sbjct: 191 -IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKD 249

Query: 346 NPYDRLVVEAALNT 359
           N  D+  VE +L +
Sbjct: 250 NICDK--VEISLGS 261


>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
 gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
          Length = 377

 Score = 42.4 bits (98), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 42/171 (24%), Positives = 69/171 (40%), Gaps = 22/171 (12%)

Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
           LA   PL WS   L      P +A  L RA+  K   +     W    + F         
Sbjct: 104 LATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP-------- 148

Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVD 306
            A   + F+ A++   +   + +    AR     R  L P+   L  +++     +A   
Sbjct: 149 -ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAFTP 206

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
            +  +  DR Y  GE +++  G   N  LL+ YGFV E N +D + ++ A+
Sbjct: 207 ASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEVGLDEAV 257


>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 165

 Score = 42.4 bits (98), Expect = 0.56,   Method: Composition-based stats.
 Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
           V A E++ A     ++PN+L+++   V  +E    L        L  +  ++   LALYL
Sbjct: 51  VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
           M EK +G+ SFW PY+          Q+A ES  L  W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143


>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
           gallopavo]
          Length = 241

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 9/100 (9%)

Query: 94  KVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE 147
           K  LK++   +   RP  +      +  +  LQAG+   S+P   +VT   VL N  + E
Sbjct: 36  KKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVL-NSCLGE 94

Query: 148 LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
            +   K  +S L  L  +L+ EK  G+KS W PY+  L +
Sbjct: 95  YIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK 134


>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
 gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
          Length = 593

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 62/252 (24%), Positives = 98/252 (38%), Gaps = 25/252 (9%)

Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYL 164
           P+  + A   +  GD    VP  L+++ E      LG    A  L  +  S         
Sbjct: 191 PLRGLRADTAVAPGDVVLHVPADLLISYETAKKSDLGKVLSALPLDLSDDSIALIWTCVE 250

Query: 165 MYEKKQGKKSFW--LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
            +E +     FW  LP+             +  + L  S+ ++A L G+P   + + RA 
Sbjct: 251 RHEPEAPHAPFWAALPH-------------SFSTALSASQEDVALLEGTPLHGDAV-RAR 296

Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
               E  E  +  F   SL   YP     E F++E +  A     S  + +Q  S   R 
Sbjct: 297 QHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYLWAAELWYSYGIQVQFASGDIRT 354

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
            L P    +  +        + VD     +++   RP +AG  + +  GP  N+KLL+ Y
Sbjct: 355 CLAPYLGLMNHHPLPHVVHFSKVDPETGCLRVRAFRPCEAGNQLFLSYGPYSNAKLLLFY 414

Query: 340 GFVDEDNPYDRL 351
           GF   DNP D +
Sbjct: 415 GFAVRDNPADEV 426


>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
          Length = 712

 Score = 42.4 bits (98), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 7/101 (6%)

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRN 374
           RP +AG+ + +  GP PN KLL  YGFV   NP+D  +V   L   +   + +++ A   
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD--LVPLQLEPPEGPLKQQQLAAMEA 501

Query: 375 GKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVI 415
             L ++    H+ ++      +L  LRL  V+ ++E+Q V+
Sbjct: 502 LGLGLE----HSLQDGPLSKQLLACLRL-IVATSAELQLVV 537


>gi|383863095|ref|XP_003707018.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Megachile
           rotundata]
          Length = 277

 Score = 42.4 bits (98), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 4/91 (4%)

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE---DPQYQDKRMVA 371
           R +K G+ I +  GP+PNS   ++ GFV  D+ +D L        E   DP   ++R + 
Sbjct: 80  RDFKKGDQIFISYGPRPNSDFFLHSGFVYMDHKHDTLKFWVGSFLESNLDPHLAERRQLL 139

Query: 372 QRNGKLSVQVFHVHAGREKEAISDMLPYLRL 402
           ++        F V++GRE    S +L Y+R+
Sbjct: 140 KKLHLQPWSEFVVNSGREPIPGS-VLAYMRV 169


>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
 gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
          Length = 495

 Score = 42.4 bits (98), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 61/263 (23%), Positives = 103/263 (39%), Gaps = 53/263 (20%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----------VLGNETIAELLTTNKLSELACLA 161
           V ASE ++  +  F +P   ++ ++            + G   I E+     L  + CL 
Sbjct: 43  VVASEHIEKDEVLFEIPRDSILNVDTSELFKNHYEGYIDGKTVIEEIGLWETL--ILCL- 99

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER- 220
            Y M+ KK+  +SFW  Y   L +      L     + W + EL  L  S     ILER 
Sbjct: 100 FYEMFVKKE--ESFWSQYFAVLPKATDFNTL-----MYWEDRELENLKPSF----ILERI 148

Query: 221 ------------AEGIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQA 262
                        E +++  + ++T  F      +  S+   Y +DI       E  +  
Sbjct: 149 GKDKSVAMHEKLMEFVEKNLDVIETSSFTWDRFLLVASIIMAYSFDI-------ERGECD 201

Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
               +       + SL +  +++PL   L A + +C A L      +++   +P KA E 
Sbjct: 202 ADEEEEEEEEDIERSLIK--SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQ 259

Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
           I    G   N +LL  YG+V+ D
Sbjct: 260 IYNTYGNHANFELLRRYGYVEVD 282


>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 468

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 10/119 (8%)

Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
           ++A+ L   +L     L + +M E+  G +S W  Y   L   RG   L    P+ W+E 
Sbjct: 85  SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139

Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
            L  L G+     + E AE ++ +Y+E      +   L   +P     E  TFE + +A
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193


>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 466

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 67/284 (23%), Positives = 115/284 (40%), Gaps = 59/284 (20%)

Query: 113 VAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAEL---------LTTNKLSE 156
           V A++++ A +    +P       N+++  + R  G E++ +L          TT++ +E
Sbjct: 73  VYATQNVSAKETLVRIPHSFLMNTNTIIKHISRFNGKESVPDLGYSVSLPSEYTTDQWTE 132

Query: 157 LAC---------------LALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVE 195
           L                  ALY+  EKK+ + SFW      LP + ELD           
Sbjct: 133 LYAKIPISKWLQLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF---------- 182

Query: 196 SPLLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEA 253
           +P++W  E+E   LTGS   A+  E      R + +  +V F    +   ++     +E 
Sbjct: 183 APIVWEVESE---LTGSKA-ADFFELLPRSSRNHAKKVSVRFNEDYTAVSEFLTAAKSEP 238

Query: 254 FTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAV 309
                F  A++ + S  +++      + A  F L P    L      KC   + +   +V
Sbjct: 239 LNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFSV 298

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
              VD  + AG+ ++   GP  N  LL  Y F  E N ++ L V
Sbjct: 299 ISCVD--HAAGQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340


>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
 gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
          Length = 656

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)

Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
           A L GSP  AE  +    +   +      +    SL + YP       F++E +  A   
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAF---ESLLKAYPDYFQPHWFSWESYLWAAEL 220

Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 322
             S  + +Q  +   R  LVP    +  +        + VD A   +++   RP   G  
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           + +  GP PNSKLL+ YGF   DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309


>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 490

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 65/297 (21%), Positives = 123/297 (41%), Gaps = 49/297 (16%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
           L SW   NG+    +  ++  S +   +    VA +E   + G+A    +VP+ L +TLE
Sbjct: 11  LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70

Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
            V  +           + + +   T + + L  L + + +         +K G  + W  
Sbjct: 71  YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130

Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
           YIR +         +V  P  +S  E   L G+  +  +  +   +++E++ L     + 
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
            W       Q++ +D  T  FTF+ +K      +S VV L +       A+VP   +   
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233

Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
               S K +       +AV QL   +  + GE + +  G + P S+++ +YGFV+ +
Sbjct: 234 ACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENE 290


>gi|226492747|ref|NP_001140859.1| uncharacterized protein LOC100272935 [Zea mays]
 gi|194701488|gb|ACF84828.1| unknown [Zea mays]
 gi|413951742|gb|AFW84391.1| hypothetical protein ZEAMMB73_159573 [Zea mays]
          Length = 495

 Score = 42.0 bits (97), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 71/324 (21%), Positives = 118/324 (36%), Gaps = 73/324 (22%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
            K WM  +G     V+  +  S +       +V A+  L+ GD   ++P    +T  R  
Sbjct: 15  FKRWMRAHG-----VVCSDALSLDVSDPLGVHVRAATPLRDGDLVATIPRGACLT-PRTT 68

Query: 141 GNETIAELLTTNKLSELACLALYL--MYEKKQGKKSFWLPYIRELDRQRGRGQLAVES-P 197
           G     E           CLAL +  MYE+ QG  S W  Y++ L           ES P
Sbjct: 69  GAAAAIEAAELG-----GCLALTVAVMYERAQGADSPWDAYLQLLPD--------CESVP 115

Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
           L+W   E    L G+     + +  E +  ++ E      ++G L      D+  + F+ 
Sbjct: 116 LVWPAGEAECLLAGTELDKIVKQDKEFLCEDWKECIEPLMLSGEL------DVDPDDFSL 169

Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML-------------- 302
           E +  A   V S    +     +    +VPL   L  + + C+ +               
Sbjct: 170 EKYLSAKTLVSSRSFQIDSYHGS---GMVPLAD-LFNHKTDCEHVHFTSASDASDSDGEE 225

Query: 303 --------------------------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
                                      A D+ +++++ R    GE +    G   N+ LL
Sbjct: 226 EEDDRSDASADDKPTTKNPTSSPPGSRANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALL 285

Query: 337 INYGFVDEDNPYDRLVVEAALNTE 360
             YGF + DN YD + ++ AL T+
Sbjct: 286 HRYGFTELDNQYDIVNIDLALVTK 309


>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
          Length = 319

 Score = 42.0 bits (97), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 361
           V  +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 179 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 237


>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
          Length = 254

 Score = 42.0 bits (97), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 37/173 (21%), Positives = 82/173 (47%), Gaps = 15/173 (8%)

Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVF 382
           I++  G + +++ L+  GFV   NP++ + +   ++  D     +  + +     S  + 
Sbjct: 58  ILMDYGKRTSAEFLMFSGFVPATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSPLIL 117

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVIS---------SLGPICPVSPCMERAVL 433
           H+    +  ++SD + + R+ +V D+ ++ + +S            P+CP  P  ++A+ 
Sbjct: 118 HITG--DLSSLSDAIAFARV-FVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIA 174

Query: 434 DQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNACLQ 486
             L   F+  ++ Y   +SEDE      NL P +R   +L   E ++L +C++
Sbjct: 175 -FLIMRFELLVSAYGPMVSEDEVGYE--NLTPIQRYCERLRVQEVQILRSCIE 224


>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
 gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
          Length = 573

 Score = 42.0 bits (97), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 64/287 (22%), Positives = 114/287 (39%), Gaps = 42/287 (14%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           A  D+ A +   SVP  L+ + E  L  E   EL   N  + L  +   L+ EK +G  S
Sbjct: 158 AKRDIAAEELVLSVPRKLIFSEE--LLPEWKRELFR-NFPTHLN-VTYTLIIEKVRGAAS 213

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--- 231
            W P+I  L  +         + L ++  ++  L G+   +  +     I R Y  +   
Sbjct: 214 AWQPFIDTLPTR-------YSTVLYFTVDQMQRLRGTSACSAAMRHCLVIARLYASMYKC 266

Query: 232 ------DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKV------ 276
                 D V     +LF +Y   Y++   A +    +Q  V  + S V  + +V      
Sbjct: 267 AYIQPGDNVMAAKANLFTEYGLCYELYRWAVSTVTTRQNLVPRELSTVGEVDQVCQLGGF 326

Query: 277 ----------SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESI 323
                     + AR   +  L P     + +C  + +  D A Q         +KAGE  
Sbjct: 327 EGTEIKRDAETGARNAPISALIPYWDMTNHRCGKITSYYDRAAQQMECTAQEAFKAGEQF 386

Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
            ++ G + N+  L+++GF+D  N  D + +   L+  DP  + + ++
Sbjct: 387 FIYYGDRSNADRLVHHGFLDMHNLKDYVQIRLGLSPTDPLVEQRSLL 433


>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 42.0 bits (97), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 144/353 (40%), Gaps = 48/353 (13%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL- 218
           L L L+YE+   + S W     EL      G   V  P  W   +LA L G     ++L 
Sbjct: 201 LVLSLIYERYVAETSHW----NELLLSCPGGYPNV--PSFWDWEDLAELEGLDVLDDVLA 254

Query: 219 ERAEGIKREYNELDTVWFMAGSLFQ--QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
           ++A+  + +   +  + F+  +L    ++  D   E F+ E    A     S   +L  V
Sbjct: 255 KKAQLAQFQTETMAVLPFIHEALAGGCRFGKDEFLECFSIEAMMWARATFDSRAFNL-NV 313

Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYK-AGESIVVWCGPQ 330
                 ALVP+   ++ + ++   ++  V     D  +Q+      +  G  I +  GP 
Sbjct: 314 DGRVVIALVPVAD-MINHHNRSDVLVRKVEPNGGDFVMQIGASLTAQDIGREIWMSYGPL 372

Query: 331 PNSKLLINYGFVDEDNPYDRL-----VVEAALNTEDPQYQDKR--MVAQRNGKLSVQVFH 383
            N +LL  YGFV E N +DRL       E  +  E   +  +R  +VA     L+ + + 
Sbjct: 373 QNWELLQFYGFVLEGNEHDRLPFPFDFPEGVVGDE---WDGRRAALVATYGLHLAGRCWI 429

Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
            H GR   A+  +L      ++++  E  + +   GP   +    E  V+  +AD  +  
Sbjct: 430 CHDGRPPPALVALLRV----HLAEAEEFDT-MERKGPFASLGAGTEARVVATIADTIRCI 484

Query: 444 LAGYPATLSEDEAML------------TDYNLHP---KKRVATQLVRMEKKML 481
           L  +  +L EDE +L             D N  P    KR+A  L+RM  K +
Sbjct: 485 LDLFSTSLEEDERLLENGSGPVATHSGDDGNTQPLSCNKRLAI-LLRMGMKRI 536


>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 42.0 bits (97), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 70/164 (42%), Gaps = 23/164 (14%)

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL-----NTEDPQYQDKRM 369
           + Y+ GE + +  G   N +LL NYGF   +NPYD + +   +     N  DP +  KR 
Sbjct: 247 KAYEPGEQLYINYGSHSNLRLLRNYGFTTPNNPYDVVTLPMPIALQQPNPADPAFLQKRG 306

Query: 370 VAQR-NGKLSVQV-------FHVHAGREKEAISDMLPYL-----RLGYVSDTSEMQS--V 414
           + Q   G  S  +       F+ H G+        L  L      L  +   +  QS   
Sbjct: 307 LLQSATGSHSTDIPALRSLRFN-HDGQLAPNAEHWLEILLATPEELSEIITQAASQSGAA 365

Query: 415 ISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
            S++    P+S  ++  V  ++     ARL  + +TL ED+A L
Sbjct: 366 DSTISLALPMS--LKHKVHSEVGSLVTARLKQHSSTLEEDDAFL 407


>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
          Length = 437

 Score = 42.0 bits (97), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 2/56 (3%)

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR--LVVEAALNTED 361
           +++  + RP KAGE   +  G  P S L+  YGF+  DNPYD   L ++ +++ ED
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDLDTSVDEED 355


>gi|10177069|dbj|BAB10511.1| unnamed protein product [Arabidopsis thaliana]
          Length = 447

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 80/179 (44%), Gaps = 22/179 (12%)

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV--EAALNTEDPQY 364
           +A  L   R Y+ GE +++  G   N +LL +YGF+ E+N  D++ +  E +L +    +
Sbjct: 213 NAYCLYARRNYQLGEQVLLCYGTYTNLELLEHYGFMLEENSNDKVFIPLETSLFSLASSW 272

Query: 365 QDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPV 424
               +   ++GKLS                 ++  LRL  +  +   +SV+  +     +
Sbjct: 273 PKDSLYIHQDGKLSFA---------------LISTLRLWLIPQSQRDKSVMRLVYAGSQI 317

Query: 425 SPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLNA 483
           S   E  V+  +++   + L   P +++ED  +     LH   ++    +R+E+K   A
Sbjct: 318 SVKNEILVMKWMSEKCGSVLRDLPTSVTEDTVL-----LHNIDKLQDPELRLEQKETEA 371


>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
 gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
          Length = 496

 Score = 42.0 bits (97), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 19/121 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
           A +DL  G   F++P +L ++     L  + G E    L    KL +  A L L +M+E 
Sbjct: 42  ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            QGK+S W  Y+  L         A ++P+ W+E +L+ L G+    ++ +  E  +R+Y
Sbjct: 98  AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148

Query: 229 N 229
           +
Sbjct: 149 D 149


>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
 gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
           nagariensis]
          Length = 658

 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----------------LS 155
           + A+ DLQ G+A   VP  L++T      +  +A  L  +                  L 
Sbjct: 32  IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91

Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
               LA +L+ E  +G +SFW PY+++L R
Sbjct: 92  PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121



 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 12/91 (13%)

Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL----- 357
           D+A Q   +VV RPY+ GE +++  G   N +LL  YGFV E N +D   ++ AL     
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHDTARLDPALLPLPS 371

Query: 358 ----NTEDPQYQDKRMVAQRNGKLSVQVFHV 384
                   P           NG+ S Q+ H+
Sbjct: 372 AARTAGGAPHLAPSDCFLHANGQPSWQLLHL 402


>gi|156717956|ref|NP_001096520.1| N-lysine methyltransferase setd6 [Xenopus (Silurana) tropicalis]
 gi|325530258|sp|A4QNG5.1|SETD6_XENTR RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|140832737|gb|AAI35641.1| LOC100125156 protein [Xenopus (Silurana) tropicalis]
          Length = 454

 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 56/246 (22%), Positives = 98/246 (39%), Gaps = 37/246 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELAC-----LALYLMYEK 168
           A EDL  G+  FS+P S +++      N T I +L+   + S  +C     L + L+YE 
Sbjct: 55  AREDLSDGELLFSIPRSAILS-----QNTTRIRDLIEKEQDSLQSCSGWVPLLISLLYEA 109

Query: 169 KQGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGI 224
                S W PY     ELD          + P+ WSE E    L G+     + +  + I
Sbjct: 110 TDS-SSHWAPYFGLWPELD--------PPDMPMFWSEEEQTKLLQGTGILEAVHKDLKNI 160

Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
           ++EYN +           ++ P        T +++K+    V +      +         
Sbjct: 161 EKEYNSI------VLPFIRRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPQEEDEEEDIE 214

Query: 285 VPLGPP-------LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
             + PP       LL + ++  A L    + ++++  +   AG+ +    G   N +LL 
Sbjct: 215 KDILPPMMVPVADLLNHVAQHNAHLEFTPECLRMITTKSVCAGQELFNTYGQMANWQLLH 274

Query: 338 NYGFVD 343
            YGF +
Sbjct: 275 MYGFAE 280


>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 65/137 (47%), Gaps = 9/137 (6%)

Query: 328 GPQPNSKLLINYGFVDEDNPYDRLVVEAAL--NTEDPQYQDKR--MVAQRNGKLSVQVFH 383
           GP  N +LL  YGFV E+N +DRL             ++  +R  +VA     L+ + + 
Sbjct: 370 GPLQNWELLQFYGFVVEENEHDRLPFPFDFPEGVAGDEWDRRRATLVATYGLHLAGRCWI 429

Query: 384 VHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKAR 443
            H GR   A   ++  LR+ ++++  E  ++  + GP   +    E  V+  +AD  +  
Sbjct: 430 CHDGRPPPA---LVALLRV-HLAEAEEFDTMERN-GPFASLGAGTEARVVATIADTIRCI 484

Query: 444 LAGYPATLSEDEAMLTD 460
           L  +  +L EDE +L +
Sbjct: 485 LDLFSTSLEEDEWLLEN 501


>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
           6054]
 gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 453

 Score = 42.0 bits (97), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 51/227 (22%), Positives = 88/227 (38%), Gaps = 42/227 (18%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-------ETELA 206
           L+    L+LYL +E+++   SFW P++  L            +PL+W        E  + 
Sbjct: 122 LTSFQLLSLYLCFERQRIHSSFWKPFLEMLPDISDFSL----NPLIWQVLQVDQWEELIQ 177

Query: 207 YLTGSPTKAEILERAEGIKREYNE------------LDTVWFMAGSLFQQYPYDIPTEAF 254
           +L  S  +     RAE +   + E            LD +     S  +  P D      
Sbjct: 178 FLPESAKR-----RAEDVYERFLEDYVVVRALVSRILDDLKLSESSADEYIPVD------ 226

Query: 255 TFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
              +F  A++ + S  +++   Q  + A  F + P     L +S   +  +        +
Sbjct: 227 ---LFLWAWMCINSRCLYMTIPQGKTNADNFTMAPY-VDFLNHSCNDECSILIDTTGFHV 282

Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEAAL 357
               PY  G+ + +  GP  N  LL  YGFV   DN ++ L + A +
Sbjct: 283 RTTTPYMPGDQLFLSYGPHCNEFLLCEYGFVIPHDNKWNDLDISAYI 329


>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
          Length = 494

 Score = 42.0 bits (97), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           +++PL   L A +SKC A L     +++++  R  +  E +    G  PNS++L  YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279

Query: 343 DED-NPYD 349
           + D + YD
Sbjct: 280 EWDGSKYD 287


>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
 gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
          Length = 573

 Score = 41.6 bits (96), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 28/54 (51%)

Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
           SK +  L   DD V +V  R    GE I +  GP PN+ LL   GF   DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388


>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
          Length = 396

 Score = 41.6 bits (96), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 2/87 (2%)

Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
           ++++   + AL P+   L      C+ +         +  DR YK GE + +  G   N 
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251

Query: 334 KLLINYGFVDEDNPYDRLVVEAALNTE 360
            L++ YGF  E+N +D + ++  +  E
Sbjct: 252 FLMVEYGFCPEENKWDEVCIDEVVLEE 278


>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
           distachyon]
          Length = 480

 Score = 41.6 bits (96), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 65/284 (22%), Positives = 115/284 (40%), Gaps = 60/284 (21%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTN--KLSELACLALYLMYEKKQ 170
           AA+ DL+ G+    VP + ++T +RV+ ++  IA  +     +LS +  L + L+ E  +
Sbjct: 45  AAARDLRRGELVLRVPRAALLTSDRVMADDPEIASCIAARHPRLSSVQRLIVCLLAEVGK 104

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI-KREYN 229
           GK S W  Y+ +L                   T LA       +A  ++ A  I ++  +
Sbjct: 105 GKSSSWYLYLSQLPSYY---------------TVLATFNDFEIEALQVDDAIWIAQKSLS 149

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
            + + W  A  L Q   +    +   F+ +  AF  V S  +H   V+      L P+G 
Sbjct: 150 AIRSEWEDATPLMQGLKF--KPKLLIFKTWLWAFATVSSRTLH---VAWDDAGCLCPVG- 203

Query: 290 PLLAYS----------------SKCKA---MLAAV----------------DDAVQLVVD 314
            L  Y+                +KC+    ML  V                 +A  L   
Sbjct: 204 DLFNYAAPDDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDGGYEDSEAYCLYAR 263

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN 358
           + Y  GE +++  G   N +LL +YGF+  +NP ++  ++  L+
Sbjct: 264 KCYTKGEQVLLGYGTYTNLELLEHYGFLLAENPNEKTYIQLDLD 307


>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
          Length = 1243

 Score = 41.6 bits (96), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           AS+ +Q GD    VP  + +TL+++         L  + + + + LA  L+ E+  G +S
Sbjct: 66  ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            W PYI+ L  +       + + +LW   EL  +  S    E +E  E  K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171


>gi|449472508|ref|XP_002187588.2| PREDICTED: N-lysine methyltransferase SETD6 [Taeniopygia guttata]
          Length = 383

 Score = 41.6 bits (96), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 104/243 (42%), Gaps = 26/243 (10%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQGKK 173
           A+E+L+AG+  F++P + +++      +  + E   +    S    L L L++E      
Sbjct: 3   AAEELEAGEVLFTIPRTALLSQHTTSIHALLQEAQESLQSQSGWVPLLLALLHEYT-ASN 61

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL---TGSPTKAEILERAEGIKREYNE 230
           S W PY       R     +++ P+ W + E   L   TG P   +  +    I+ EYN 
Sbjct: 62  SHWQPYFSLWQDFR-----SLDHPMFWPQEERTRLLQGTGIPEAVD--KDLANIQLEYNS 114

Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPLG 288
           +  + FM     + +P     +  T E++K+  AFV   S    L++     +    P+ 
Sbjct: 115 I-ILPFM-----ETHPDIFDPKLHTLELYKELVAFVMAYSFQEPLEEEEEDEKGPNPPMM 168

Query: 289 PP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
            P   +L + +   A L      +++V  +P + G+ I    G   N +LL  YGF +  
Sbjct: 169 VPVADILNHVANHNANLEYSPQCLRMVTTQPVRKGQEIFNTYGQMANWQLLHMYGFAE-- 226

Query: 346 NPY 348
            PY
Sbjct: 227 -PY 228


>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
           DL-1]
          Length = 499

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 88/220 (40%), Gaps = 31/220 (14%)

Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
           GN+ + E      L++   L L L YE   G++S W  Y+  L  +         S + W
Sbjct: 74  GNQEVLE-----TLNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFW 121

Query: 201 SETELAYLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQ 243
           S  EL  L  S     I  E+AE              G K+  EY  +D  + +  S+  
Sbjct: 122 SSEELEKLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIM 180

Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
            Y +D+       E  +              K     + ++VPL   L + ++   A L+
Sbjct: 181 SYSFDVDDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLS 239

Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
             +DA+ +   +  K GE I    G  PNS++L  YG+V+
Sbjct: 240 YENDALVMTATKDIKKGEQIYNIYGELPNSEILRKYGYVE 279


>gi|149059901|gb|EDM10784.1| hypothetical protein RDA279, isoform CRA_d [Rattus norvegicus]
          Length = 399

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ E+  G  S W
Sbjct: 27  LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 85

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L   P +A+  E+   ++  +      + 
Sbjct: 86  KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 137

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF +    I    F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 138 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 193

Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
           L +S   +   AA ++  +        R  K  E+ + + GP  N +LL+ YGFV   NP
Sbjct: 194 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 251

Query: 348 Y 348
           +
Sbjct: 252 H 252


>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
          Length = 577

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 87/222 (39%), Gaps = 58/222 (26%)

Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 236
           P+I  L   R  G     SP  W+E E + +    T A++      I    N+L   W+ 
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157

Query: 237 MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 280
           +   L   FQ Y Y    + F            FE F    V+  S   +L   ++  +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217

Query: 281 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 319
            F      P L++ + +C+     ML  + D                  +    ++  K 
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTED 361
           G+ I    GP+ N +LL  YGFV ++N YD+ V+  AL  +D
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL--ALRLQD 311


>gi|170093191|ref|XP_001877817.1| SET-domain protein [Laccaria bicolor S238N-H82]
 gi|164647676|gb|EDR11920.1| SET-domain protein [Laccaria bicolor S238N-H82]
          Length = 524

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 60/228 (26%), Positives = 100/228 (43%), Gaps = 32/228 (14%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
            A+VP+   L A      A L   ++ ++++  RP K GE I    G  PN++LL  YG 
Sbjct: 276 IAMVPMADILNARYGSENAKLFYEENYLKMISTRPIKGGEQIWNTYGDLPNAELLRRYGH 335

Query: 342 VD--------EDNPYD------RLVVEAA-----LNTEDPQYQDKRMVAQRNGKLSVQVF 382
           VD        + NP D       L+V  A     L+T+D   +    + +        VF
Sbjct: 336 VDVIQLPNGGQGNPGDVAEIRADLIVSVAAEQHSLSTDDTHERIDWWLEEGGD----DVF 391

Query: 383 HVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKA 442
            ++   + E    ++  +RL  + D  E + +        P  P M+   L  L +  + 
Sbjct: 392 DLYF--DLEIPPSIISVIRLLLLPD-EEWEKIKEKA---KPPKPKMDAVALTVLHEVLQR 445

Query: 443 RLAGYPATLSEDEAML-TDYNLHPKKRVATQLVRMEKKMLNACLQVTA 489
           RL  YP ++ +DE +L T  +L+ +  +  +L   EKK+L+  L  TA
Sbjct: 446 RLKEYPTSIQDDEQLLMTAPSLNLRHAIIVRL--GEKKILDGILTKTA 491


>gi|158508540|ref|NP_001025734.2| N-lysine methyltransferase SETD6 [Gallus gallus]
          Length = 447

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 113/273 (41%), Gaps = 37/273 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           A+ DL+ G+  FSVP S +++      R L ++    L +    S    L L L++E   
Sbjct: 51  AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SVWVPLLLALLHEYTT 107

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
           G  S W PY         +   +++ P+ W E E +  L G+     + +    I+ EY+
Sbjct: 108 G-TSRWRPYF-----SLWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
            +  + FM     + +P     E  T E++KQ  AFV   S    L++     +    P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215

Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
             P   +L + +   A L      +++V  +P   G+ I    G   N +LL  YGF + 
Sbjct: 216 MVPVADILNHVANHNASLEYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
             PY         NT D    D +MV  R   L
Sbjct: 275 --PYPG-------NTNDT--ADIQMVTVRKAAL 296


>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
          Length = 256

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 57/233 (24%), Positives = 95/233 (40%), Gaps = 23/233 (9%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           L  L  L  +L+ EK  G +S W PY+  L +       A   P+   E ++  L   P 
Sbjct: 7   LLALCTLCTFLVSEKHAGDQSLWKPYLDILPK-------AYTCPVC-LEPKVVNLFPEPL 58

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV-- 271
           KA+  E+   ++  ++     +     LF +   +I    F++     A+  V +  V  
Sbjct: 59  KAKAEEQRARVQGFFSSSRDFFSSLQPLFSEAVENI----FSYSALLWAWCTVNTRAVYM 114

Query: 272 -HLQKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
            H Q+   +       L P   LL +S   + KA         ++      +  E + + 
Sbjct: 115 KHGQRKCFSPEPDTYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFIC 174

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSV 379
            GP  N +LL+ YGFV   NP+  + V A L  +     DK+M    N K+S+
Sbjct: 175 YGPHDNQRLLLEYGFVSIQNPHACVYVSADLLVKYLPSTDKQM----NKKISI 223


>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
 gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
          Length = 177

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 47/98 (47%), Gaps = 6/98 (6%)

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
            L  S  CK  + AV +++++   R  KAG    +  G  PN  LL  YGFV E+NP+D 
Sbjct: 54  FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111

Query: 351 LVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGR 388
           + VE     E P+   K  + +R   +S   F +   R
Sbjct: 112 IPVE----LEVPESPAKVALMERYNLVSHISFELRGFR 145


>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
           + A  +L+ G+    VP + ++T E ++  +      ++    LS    L++ L+YE  +
Sbjct: 54  LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113

Query: 171 GKKSFWLPYIRELDR 185
           GK+SFW PY+  L R
Sbjct: 114 GKRSFWYPYLVHLPR 128


>gi|143584415|sp|Q5ZK17.2|SETD6_CHICK RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
          Length = 447

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 70/273 (25%), Positives = 113/273 (41%), Gaps = 37/273 (13%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           A+ DL+ G+  FSVP S +++      R L ++    L +    S    L L L++E   
Sbjct: 51  AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SVWVPLLLALLHEYTT 107

Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
           G  S W PY         +   +++ P+ W E E +  L G+     + +    I+ EY+
Sbjct: 108 G-TSRWRPYF-----SLWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
            +  + FM     + +P     E  T E++KQ  AFV   S    L++     +    P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215

Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
             P   +L + +   A L      +++V  +P   G+ I    G   N +LL  YGF + 
Sbjct: 216 MVPVADILNHVANHNASLEYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKL 377
             PY         NT D    D +MV  R   L
Sbjct: 275 --PYPG-------NTNDT--ADIQMVTVRKAAL 296


>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
          Length = 475

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ E+  G  S W
Sbjct: 103 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 161

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L   P +A+  E+   ++  +      + 
Sbjct: 162 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 213

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF +    I    F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 214 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 269

Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
           L +S   +   AA ++  +        R  K  E+ + + GP  N +LL+ YGFV   NP
Sbjct: 270 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 327

Query: 348 Y 348
           +
Sbjct: 328 H 328


>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
 gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
          Length = 548

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 60/257 (23%), Positives = 105/257 (40%), Gaps = 40/257 (15%)

Query: 126 FSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
            SVP  L+ + E     R+ G  T A  L          LA  L+ EK +G+ S W PYI
Sbjct: 164 LSVPRKLIFSEENNSDCRLFGKMTQATHLN---------LAYDLLIEKIRGEFSEWRPYI 214

Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
             L  +         + L ++  ++  L G+   +  L +   I ++Y  L   +  A +
Sbjct: 215 DVLPAK-------YSTVLYFTTKQMERLRGTAACSLALRQCRVIAKQYAFL---YRYAHT 264

Query: 241 LFQQ------YPYD----IPTEAFTFEIFKQAFVAV---QSCV-VHLQKVSLARRF--AL 284
           L +       +P +           +++++ A   V   Q+ V    Q+   + +F  AL
Sbjct: 265 LAESSTGNRSHPGERGLFFTQRGLCYKLYRWAVSTVMTRQNLVPSEKQEAQDSPKFISAL 324

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
           +P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFVD 
Sbjct: 325 IPYWDMANHRPGKITSFYAAVSRQLECTAQEAVAAGEQFFIYYGDRSNTDLLVHNGFVDV 384

Query: 345 DNPYDRLVVEAALNTED 361
           +N  D + +   L+  D
Sbjct: 385 NNLKDYVNIRVGLSPTD 401


>gi|159464317|ref|XP_001690388.1| hypothetical protein CHLREDRAFT_144255 [Chlamydomonas reinhardtii]
 gi|158279888|gb|EDP05647.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 486

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           D+A Q   +VV R   AG+ +++  G   N +LL +YGFV +DNP+D   ++AAL
Sbjct: 244 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 298


>gi|67538920|ref|XP_663234.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
 gi|40743533|gb|EAA62723.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
 gi|259484901|tpe|CBF81518.1| TPA: SET domain protein (AFU_orthologue; AFUA_4G11040) [Aspergillus
           nidulans FGSC A4]
          Length = 707

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 91/241 (37%), Gaps = 28/241 (11%)

Query: 118 DLQAGDAAF-----SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           D  AGDA F       P+S +   ERV G+E +A                +L+ +  +G 
Sbjct: 78  DFHAGDAHFPAHDVKFPSSFI---ERV-GSEEVA--------------IFFLIGQYLRGP 119

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           +SFW PYIR L +      L         E           K ++    E  +   NEL 
Sbjct: 120 ESFWHPYIRTLPQPGSLTTLPYYEEEEDLEWLEGTSLLQARKRKVALLREKYESSSNELR 179

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
              F      ++Y +D+   A T  + +     V S V+   ++       L+P    +L
Sbjct: 180 ESGFQDA---ERYSWDLYLWASTIFVSRAFSEKVLSGVIPEHEMP-ENTSVLLPF-IDIL 234

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
            +    K    A    V  VV       E I    GP+ N +L++NYGF   +NP D   
Sbjct: 235 NHRPLAKVEWRAGLQNVDFVVLEDVSVNEEIANNYGPRNNEQLMMNYGFCLANNPCDYRT 294

Query: 353 V 353
           V
Sbjct: 295 V 295


>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
 gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
          Length = 439

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T + V+   ++   +   K  +S L  L  +L+ E+  G  S W
Sbjct: 67  LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 125

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E E+  L   P +A+  E+   ++  +      + 
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 177

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
               LF +    I    F++  F  A+  V +  V+L+   +  L+       L P   L
Sbjct: 178 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 233

Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
           L +S   +   AA ++  +        R  K  E+ + + GP  N +LL+ YGFV   NP
Sbjct: 234 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 291

Query: 348 Y 348
           +
Sbjct: 292 H 292


>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 498

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 90/203 (44%), Gaps = 18/203 (8%)

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
           W  YIR +         ++  P  ++E EL  L G+  +  +  +   +++E+  L    
Sbjct: 134 WTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS- 185

Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLL 292
                  Q+Y +D  T   TF+ +K      +S VV L +       A+VP   +     
Sbjct: 186 TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHAS 241

Query: 293 AYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFV-DEDNPYD 349
             S K +   ++ +DA+ QL   R   +GE + +  G + P S+++ +YGFV +E     
Sbjct: 242 EDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTDAK 301

Query: 350 RLVVEAALNTEDPQYQDKRMVAQ 372
           ++ ++  +  +DP    K+M  +
Sbjct: 302 QIFLDLEIPDDDPLRMAKQMFCK 324


>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 415

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
           K++ A  +A+ P         S C+    A      ++ DR Y+AGE + V  GP  N  
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247

Query: 335 LLINYGFVDEDNPYD 349
           LL+ YGF+ + N  D
Sbjct: 248 LLVEYGFLLDANSND 262


>gi|297845640|ref|XP_002890701.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336543|gb|EFH66960.1| hypothetical protein ARALYDRAFT_472886 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 471

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 19/175 (10%)

Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL----------V 352
           A  +  V++V +   K  + +++  G   N   L++YGFV E NPYD +           
Sbjct: 252 AESNTLVKVVAETELKENDPLLLNYGCLSNDFFLLDYGFVIESNPYDTIELKYDEQLMDA 311

Query: 353 VEAALNTEDPQYQ-----DKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSD 407
              A     P++        ++++Q N    +    V  G  +     +L  +R+    +
Sbjct: 312 ASMAAGVSSPKFSSPAPWQHQLLSQLNLAGEMPNLKVTIGGPEPVEGRLLAAIRILLCGE 371

Query: 408 TSEMQ----SVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML 458
             E++      + SL  I P+    E AV   +       L+ +P  + EDEA++
Sbjct: 372 MVEVEKHDLDTLKSLSAIAPLGIANEIAVFRTVIALCVIALSHFPTKIMEDEAII 426


>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
 gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
          Length = 479

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 94/450 (20%), Positives = 169/450 (37%), Gaps = 84/450 (18%)

Query: 97  LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           LK    H + H  IH        A   +   +  F +P+ L+++++       +   L  
Sbjct: 24  LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
            +L     L + ++YE  QG++S W  Y R L         + ++ + W++ +L+ L GS
Sbjct: 82  KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134

Query: 212 PTKAEILERA-------------EGIKREY---------------NELDTVWFMAGSLFQ 243
               +I + A             +   R +               N L  +    GS+  
Sbjct: 135 SVVGKIGKAAADDTILQKVVPLIQANSRHFPPRPNMPPLNSPDSQNALLCLAHRMGSIIM 194

Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
            Y +DI       E       A    +   +         +VPL     A + +  A L 
Sbjct: 195 AYAFDIEKTDEADE-----HTADDGYMTDDEDEPAK---GMVPLADIFNADAQRNNARLF 246

Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE--------- 354
             + +  +   +   +GE I    G  P + LL  YG+V  DN     VVE         
Sbjct: 247 QEEGSFVMKAIKNIYSGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEFSLDAICKV 305

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHV----HAGREKEAI-SDMLPYLRLGY--VSD 407
           A L   +P   + R+    N  +  + +++      G  ++AI  D L  LR     + D
Sbjct: 306 AGLPDSEPSPSNPRLELLDNLDMLEEGYNISRIPRNGTLEDAIPEDFLVLLRALTLPIED 365

Query: 408 TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAML--------- 458
            + + +   +  P    S   E ++L  L      R + YP ++ EDE++L         
Sbjct: 366 LNRLGARNKAPKPEFSAS---EASLLRSLV---TLRQSEYPTSVQEDESILNCLEQQNGY 419

Query: 459 -TDYNLHPKKRVATQLVRMEKKMLNACLQV 487
             D  L+ +K++A Q+ + EK++L   L +
Sbjct: 420 INDSGLN-RKKMAVQVRKGEKEILTQILSL 448


>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
          Length = 497

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
           + A  DL+ G+    VP S ++T E V+ ++ + + +  +  LS    L + L+YE  +G
Sbjct: 55  LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114

Query: 172 KKSFWLPYIREL 183
           K S W PY+  L
Sbjct: 115 KTSRWHPYLMHL 126


>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 661

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517

Query: 209 TGSPTKAEI--LERAEGIKREYNELDTVWFMAGSL 241
             S     +  L+RA  I  E   L  VW+  G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552


>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
          Length = 606

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 27/49 (55%)

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           +Q     P K G  + +  GP  N++LL+ YG+ ++DNPY    +E  L
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIELEL 488


>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
 gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
          Length = 539

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 71/360 (19%), Positives = 137/360 (38%), Gaps = 40/360 (11%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           + DL  G+   +VP  L+ + E +   +    +     L+    +   L+ EK +G  S 
Sbjct: 151 TRDLAEGELVLTVPRQLIFSEELLPEAQRKLFIDFPTHLN----VTYMLIIEKVRGAASN 206

Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL---- 231
           W P+I  L  +         + L ++  ++  L G+   +  +     I R Y  +    
Sbjct: 207 WQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARIYASMYKCA 259

Query: 232 -----DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
                D+V     +LF +Y   Y++   A +    +Q  V  Q        V  +   AL
Sbjct: 260 YMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATDSDGVRNSPMSAL 318

Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
           +P          K  +        ++ +    +KAGE   ++ G + N+  L+++GF+D 
Sbjct: 319 IPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCNADRLVHHGFLDM 378

Query: 345 DNPYDRLVVEAALNTEDPQYQDKRMV-----AQRNGKLSVQVFHVHAGREKEAISDMLPY 399
           +N  D + +   L+  D   + + ++      +R  +L V     H   E      +L +
Sbjct: 379 NNLKDYVHIRLGLSPTDALAEQRALLLSELNIERKAELRVLPAPEHISGE------LLAF 432

Query: 400 LRLGYVSD------TSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSE 453
           +R+  +S        S+++  +  L   C +   +E      L    K  L    ATL E
Sbjct: 433 VRVFNMSKEQLEHWCSDLERAVDLLHIDCALETDLETRTWQYLYQRLKLLLGVLDATLKE 492


>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
           [Gallus gallus]
          Length = 283

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 42/182 (23%), Positives = 85/182 (46%), Gaps = 23/182 (12%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y 
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQYA 218

Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
               V        Q +P    +P  ++FT++ ++ A  +V +    +     +R   AL+
Sbjct: 219 YFYKV-------IQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271

Query: 286 PL 287
           PL
Sbjct: 272 PL 273


>gi|297598048|ref|NP_001044988.2| Os01g0879500 [Oryza sativa Japonica Group]
 gi|255673923|dbj|BAF06902.2| Os01g0879500 [Oryza sativa Japonica Group]
          Length = 263

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 33/63 (52%)

Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           C   +   D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL
Sbjct: 15  CSYYVGDDDEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLAL 74

Query: 358 NTE 360
            T+
Sbjct: 75  VTK 77


>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
          Length = 377

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 39/164 (23%), Positives = 71/164 (43%), Gaps = 25/164 (15%)

Query: 197 PLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
           P+LW   EL  L   P ++++ LER E       E    W      F++   D+P + +T
Sbjct: 107 PMLWPR-ELKQLL--PLESQVTLERRE------KEFQDNW----DDFKEAFPDVPRDDYT 153

Query: 256 FEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
           +     A++ V +   + +     K     R AL+P+       +  C+   +  +    
Sbjct: 154 Y-----AWLVVNTRTFYHETPETLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYH 207

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           +V DR YK GE + +      N   L+ YGF+ ++N  D + ++
Sbjct: 208 VVADRAYKKGEELFISYSSHSNDYNLLEYGFIPDENSLDDVYID 251


>gi|145354661|ref|XP_001421597.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581835|gb|ABO99890.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 341

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 72/313 (23%), Positives = 124/313 (39%), Gaps = 30/313 (9%)

Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG-----IKREYN-ELDTVWFMA 238
           R+R +G ++  +P + S  E        T      R EG       REY  + +  W  A
Sbjct: 2   RERAKGGVSAYAPFVESLYEHTPARAVETSRAARARLEGHAAAETMREYERDAEDGWRAA 61

Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
              F+ +P       FT   F++A   V++     +     R  ALVP+   LL  +   
Sbjct: 62  RRTFETFPSIFSVHEFTRAAFEEALAIVRANSFEARSEDGTRARALVPMAHLLLHDTGSE 121

Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG----FVDEDNPYDRLVVE 354
              +  VD    + VD  ++ G+ +    G   +++    +G    +  E N  ++  ++
Sbjct: 122 VPCVKIVDGVFVINVD-EHEEGDELSCSHGDYSDAETFARFGVSAFYSAEKNARNK--IK 178

Query: 355 AALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSV 414
            A  +E   Y  K +   R G +    F   AG    A  + +  LRL   ++T E  ++
Sbjct: 179 FAFPSE--IYSMKSL--DRCGSVENIAF-TDAG----ATEEFMCALRLASANET-EWAAI 228

Query: 415 ISSLGPI-----CPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLH--PKK 467
             S   +      P+S   E AV + L       L  YP++ +EDE +L    L   P +
Sbjct: 229 SKSKASVRALRKKPLSEESEIAVYEALFATLTELLNSYPSSDNEDERLLQSRTLQSAPDE 288

Query: 468 RVATQLVRMEKKM 480
             A  +   EK++
Sbjct: 289 ERAVTIRLREKRL 301


>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
          Length = 699

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 35/55 (63%), Gaps = 3/55 (5%)

Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           D+A Q   +VV R   AG+ +++  G   N +LL +YGFV +DNP+D   ++AAL
Sbjct: 396 DEARQQYVIVVRRRVAAGQQVLLCYGRHTNLELLEHYGFVMQDNPHDTAPLDAAL 450


>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
           UAMH 10762]
          Length = 633

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 93/230 (40%), Gaps = 27/230 (11%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILER 220
            YLM +    ++SFW PY+  L            +PL + +  +LA+L G+     +L R
Sbjct: 87  FYLMTQYLNKEQSFWKPYLDVLPSPS-----EFSTPLWFDAPADLAWLDGTDVLHTMLAR 141

Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQS---CVVHLQ 274
            E   + Y     V   +G     Y +D+   A   FT   F    +  Q+     VH  
Sbjct: 142 REVYAQYYQSGLKVLSESGIDVTLYTWDLFRWAITTFTSRSFTSRVLLPQNRKYWPVHRT 201

Query: 275 KVSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKA 319
             +  R+  L+ +              P L + +    A +    DA Q  +   +P +A
Sbjct: 202 STNGRRQTVLLDMSHSPAEDLDFSVLFPGLDSGNHDPNAQVDWSFDANQFSIALVQPIEA 261

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRM 369
           G  +    GP+ N +LL+ YGF   +NP D +++      E  Q + KR+
Sbjct: 262 GAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLLTLKAPPEALQVELKRI 311


>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
          Length = 371

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 35/77 (45%), Gaps = 2/77 (2%)

Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
           R   +P           CK   +A+  +VQ   DR Y  GE + V  GP  N  LL  YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235

Query: 341 FVDEDNPYDRLVVEAAL 357
           F+ + N +D + ++  +
Sbjct: 236 FILDTNRWDEVYLDEVI 252


>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
 gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
          Length = 441

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 15/152 (9%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
           ASEDL+ GD A  +P S +++ E V  ++    L T + ++    L L+ M EK      
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
           F  PY   L      G       L +    +  L G+    EI++  E ++  Y+EL   
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL--- 281

Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
                 L   +    P E +T+E +  A+  V
Sbjct: 282 ----IPLLSNHREVFPPELYTWEHYLWAYFDV 309


>gi|71019075|ref|XP_759768.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
 gi|46099291|gb|EAK84524.1| hypothetical protein UM03621.1 [Ustilago maydis 521]
          Length = 685

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 38/83 (45%), Gaps = 6/83 (7%)

Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
            ++ P+   L A      A L      +++   +P  AGE I       PNS LL  YG 
Sbjct: 368 ISMTPMADMLNAKFESDNARLFYKSHVLEMRATKPIAAGEQIFNTYADPPNSDLLRRYGH 427

Query: 342 VDEDNPYD------RLVVEAALN 358
           VDE N  D      +LVV+AA+N
Sbjct: 428 VDEPNGNDVVELDAKLVVQAAVN 450


>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 466

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 111/282 (39%), Gaps = 55/282 (19%)

Query: 113 VAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAEL---------LTTNKLSE 156
           V A++++ A +    +P       N+++  + R  G E++ +L          TT++ +E
Sbjct: 73  VYATQNVSAKETLVRIPHSFLMNTNTIIKHISRFNGKESVPDLGYSVLLPSEYTTDQWTE 132

Query: 157 LAC---------------LALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVE 195
           L                  ALY+  EKK+ + SFW      LP + ELD           
Sbjct: 133 LYAKIPISKWLQLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF---------- 182

Query: 196 SPLLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEA 253
           +P++W  E+E   LTGS   A+  E      R + +   V F    +   ++     +E 
Sbjct: 183 APIVWEVESE---LTGSKA-ADFFELLPRSSRNHAKKVLVRFNEDYTAVSEFLTAAKSEP 238

Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 313
                F  A++ + S  +++   S         L P +   +  C    A   D+   +V
Sbjct: 239 LNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFLV 298

Query: 314 DR--PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
                + AG+ ++   GP  N  LL  Y F  E N ++ L V
Sbjct: 299 ISCVDHAAGQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340


>gi|148237199|ref|NP_001085404.1| N-lysine methyltransferase setd6 [Xenopus laevis]
 gi|82184826|sp|Q6INM2.1|SETD6_XENLA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|48734800|gb|AAH72257.1| MGC82362 protein [Xenopus laevis]
          Length = 455

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 28/242 (11%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI-AELLTTNKLSELACLALYLMYEKKQGKK 173
           A ED+  G+  F+VP S +++       E +  E  +    S    L + L+YE      
Sbjct: 55  AREDIADGELLFTVPRSAILSQNTTRIQELLEKEQESLQSTSGWVPLLISLLYEATDSS- 113

Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
           S W PY     ELD          + P+ WSE E    L G+     I    + I+ EYN
Sbjct: 114 SLWAPYFGLWPELD--------PPDMPMFWSEEEQTKLLQGTGVLEAIRNDLKNIEEEYN 165

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFV---AVQSCVVHLQKVSLARRFAL 284
            +            + P        T +++K+  AFV   + Q  +    +     +  L
Sbjct: 166 SI------VLPFITRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPLEENDEEDEDEKDIL 219

Query: 285 VPLGPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
            P+  P   LL + +   A L    + +++V  +   AG+ +    G   N +LL  YGF
Sbjct: 220 PPMMVPVADLLNHVAHHNAHLEFTPECLRMVTTKSVHAGQELFNTYGEMANWQLLHMYGF 279

Query: 342 VD 343
            +
Sbjct: 280 AE 281


>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
 gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
 gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
          Length = 451

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 13/86 (15%)

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNT------EDPQ 363
            L  D  Y+AGE I +  G   N+KLL+ YGF    NP D   VE  + T       DP+
Sbjct: 276 NLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIPSNPDD--FVELTIGTINAFMKHDPE 333

Query: 364 YQDKRMVAQR-----NGKLSVQVFHV 384
            +  R+  ++     + +L  Q+F V
Sbjct: 334 LRCLRLPREKYRFLADHRLDEQLFFV 359


>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
 gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
          Length = 430

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 46/204 (22%), Positives = 86/204 (42%), Gaps = 22/204 (10%)

Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
           LS    LA+YL+ EK++G  SFW P+I   D      +L++ +P++W   ++      P 
Sbjct: 117 LSSFQLLAIYLVLEKERGAASFWKPFI---DMLPSIEELSL-APVVWKVLQV------PH 166

Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DIPT-EAFTFEIFKQAFVAVQSCV 270
             ++        R++ E      +     + Y    D+P+  AF    F  A++ + S  
Sbjct: 167 CDDLWRMLSRSARKHAES-----VVARFEKDYAVVCDLPSVPAFERSSFLWAWMCINSRC 221

Query: 271 VHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
           +++   Q    +  F + P     L +S++ +  +        ++    YK  E +    
Sbjct: 222 LYMSMPQAKDTSDNFTMAPY-VDFLNHSNEDQCGIKIDPHGFHVLTSSAYKPQEELYFSY 280

Query: 328 GPQPNSKLLINYGFVDEDNPYDRL 351
           GP  N  LL  YGF    N ++ +
Sbjct: 281 GPHSNEFLLCEYGFTLPHNKWNYI 304


>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
          Length = 1916

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 27/46 (58%)

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
           DA  +   R Y AG+ +    G + N++L+ NYGF++  NP+D  V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339


>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
          Length = 637

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
           V ASEDL +      +P SL+++ ++   +  I       E+     TN+ +E   L  Y
Sbjct: 46  VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
           +  EKK+G++SF+ PYI+ +  Q     +A        +E PL+  E +L      G  +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162

Query: 214 KAE-ILERAEGI 224
           KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174


>gi|387219019|gb|AFJ69218.1| set domain-containing protein 3, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 265

 Score = 40.4 bits (93), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 62/163 (38%), Gaps = 29/163 (17%)

Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQ-YQDKRMVAQRNG 375
           YK GE +    G + N++LL+ YGF   DN ++ + +        P  +Q         G
Sbjct: 18  YKKGEEVFTSYGRRTNAELLLFYGFALLDNEHESVALSMPGIPSPPSWFQASHSALGTAG 77

Query: 376 KLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT---SEMQSVISSLGPICPVS------- 425
            +         GR +    D+L    L +   T   SE+ +   +L   C ++       
Sbjct: 78  SV--------GGRARSMAEDVLRPSHLLFAGATELPSELVAYFRALTACCSMNEKDLVEQ 129

Query: 426 --------PC--MERAVLDQLADYFKARLAGYPATLSEDEAML 458
                   PC   ER     L  +  A LA +P ++ EDE  L
Sbjct: 130 KLDYMQHFPCSRHERDAFSTLGAHMSASLAAFPTSIEEDEVEL 172


>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
          Length = 482

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 68/300 (22%), Positives = 119/300 (39%), Gaps = 59/300 (19%)

Query: 128 VPNSLVVTLERVLG----NETIAELL-TTNKLSELACLALYLMYEKKQ------GKKSF- 175
           +P  LV++ E V      ++   +LL      S    + LYL+    Q      G ++F 
Sbjct: 57  IPRDLVLSAEAVEEYAKVDQNFKQLLDVAGHQSTRGDIMLYLLTHLVQSKATSPGTRAFA 116

Query: 176 ---WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
              W  YIR L R        +  P +W+  E   L G+  +A +  +   +  EY++L 
Sbjct: 117 STPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSSEYDKLC 169

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
                A +L   +   + +E+ T E +  A    +S  + L +       A+VP G  + 
Sbjct: 170 E---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP-GLDMA 219

Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFVDEDNPY 348
            +S    A      D   +++ RP     AG  I +  G  +P +++L +YGF+D+D+  
Sbjct: 220 NHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFIDKDSTV 279

Query: 349 DRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDT 408
             L +      +DP              L    FH++ G          P +RL  ++D 
Sbjct: 280 KELTLHLEALPDDP--------------LGRAKFHIYKGP---------PTVRLSIINDN 316


>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 514

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 57/230 (24%), Positives = 95/230 (41%), Gaps = 20/230 (8%)

Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
           N +  +  LAL LMYE+ +   S W  ++R L          +ES L W++ EL  +   
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183

Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
             K    ER   ++  Y  + T           Y  D+    FT E F  A V  Q+  +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237

Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
           H    S     +L+P+   ++ +  +  A++ A    + +      KAGE I +      
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITI-DYEMS 292

Query: 332 NSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGKLSVQV 381
           +  LL+ YGFV+     D   +  +  ++D  Y  +  + +  G LS QV
Sbjct: 293 SHVLLLLYGFVEMSENLDFYPIRLSWESKDIDYPRRLRLLEGRG-LSRQV 341


>gi|300124011|emb|CBK25282.2| unnamed protein product [Blastocystis hominis]
          Length = 366

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 74/183 (40%), Gaps = 19/183 (10%)

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
            + E  + + SF+ PY   L             P++W+ +E+  L GS     I  R   
Sbjct: 57  FLLEDMENEDSFYKPYYDTLPED------ISNIPVIWTNSEINQLHGSYFSICIRSRVVE 110

Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
           I R+Y ++  V     S F +YP+D         I  + F +  + + +   V LA    
Sbjct: 111 IYRDYQKMCDV----NSFFCRYPFDQYLRV-RLLIGSRNFGSFFNSLNNGILVPLADMLN 165

Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
                     Y  K KA +  +   + L      + G  ++   G + N +LL +YGFV+
Sbjct: 166 HTRPRQTTWEYDDKEKAFV--ITSLLNL------RQGAQVMDSYGRRDNRRLLFSYGFVE 217

Query: 344 EDN 346
           +DN
Sbjct: 218 DDN 220


>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%)

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
            V+L+  R   +GE I +  G   N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317


>gi|432119027|gb|ELK38252.1| SET domain-containing protein 4 [Myotis davidii]
          Length = 339

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 56/241 (23%), Positives = 100/241 (41%), Gaps = 26/241 (10%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKK 169
           H+ A +   + G    S+P S ++T + V+ +   A +       S L  L  +L+ EK 
Sbjct: 22  HFRAGASGAREGQVIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVAEKH 81

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
            G +S W PY+  L +       A   P+   E E+  L   P +A+  E+   ++    
Sbjct: 82  AGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVALLPRPLEAKAREQRTRVR---- 129

Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL----- 284
           EL T      S  Q    +     F++  F+ A+  V +  V++++    RR  L     
Sbjct: 130 ELFTSSRGRFSSLQPLLSEAAASVFSYRAFRWAWCTVNTRAVYMER---GRRQGLSAEPD 186

Query: 285 -VPLGPPL-LAYSSKCKAMLAAVDDAV---QLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
              L P L L  +S    + AA ++     ++      +  E + +  GP  + +LL+ Y
Sbjct: 187 TCALAPYLDLLNNSPAVQVKAAFNEETRCYEIRTGSGCRRHEEVFICYGPHDSRRLLLEY 246

Query: 340 G 340
           G
Sbjct: 247 G 247


>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 93/433 (21%), Positives = 172/433 (39%), Gaps = 56/433 (12%)

Query: 90  LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL 149
           L P   +L+  P      R   Y+  +E+++ G    S+P S V+++E       +  LL
Sbjct: 16  LAPMSTVLQ--PEGFNFGRGTAYIT-TENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLL 71

Query: 150 TTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
             N  L     L L+L+ E+  G  S +  ++  L         A+ S L +SE E+  L
Sbjct: 72  EVNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMKGL 126

Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVA 265
            GS  +   L RA+ +   Y+ L         +  +   D P      FT + F+ A   
Sbjct: 127 EGSQLQRFTLGRAQAVDAFYDAL------VQPVTSREAVDPPIFHKSEFTLDKFRWAMGV 180

Query: 266 VQSCVVHLQK----VSLARRFALVPLGPPLLAYSSK-CKAMLAAVD-DAVQLVV--DRPY 317
           V S      +    V LA     + +   L    ++ C      VD D  +L V     Y
Sbjct: 181 VWSSTFQFGENEDDVILAPVLNTIGICTDLNQEGNEACPETSIKVDTDTQRLTVYASVAY 240

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEAALNTEDPQYQDKRMVAQRNGK 376
             G+ + +    + +++L++++GF     +  D+L +   L++ D     K  + Q    
Sbjct: 241 SKGQEVRLSMPGKSSTQLMLSHGFARARASKLDKLDLTVTLDSSDTLAPLKNYLLQTQLN 300

Query: 377 LSVQV---FHVHAGREKEAIS-----------DMLPYLRLGYVSDTSEMQSVISSLGPIC 422
            S+     F   + +  E +S           ++  Y  L   ++  E + ++S      
Sbjct: 301 ESINATYEFFYGSSKIDEYVSTSLRMKLLSGGELARYKELLTPTEGEEHRPIVSLRNEF- 359

Query: 423 PVSPCMERAVLDQLADYFKARLAGYPATLSEDE---AMLTDYNLHPKKRVA--TQLVRME 477
                  RAV+       K     YP ++ +D+   A L+D +     R+A   +++ ME
Sbjct: 360 ----VFTRAVISTCTTLLKQ----YPTSIEQDQENLAKLSDKDDVESVRIAHVQRILIME 411

Query: 478 KKMLNACLQVTAD 490
           K++LN  +++  D
Sbjct: 412 KQILNETMELALD 424


>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
           nagariensis]
 gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
           nagariensis]
          Length = 163

 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 36/73 (49%)

Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
           + + A E ++ G     VP  L+++ +    +E     +    L+E   L L+L+ E+  
Sbjct: 77  YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136

Query: 171 GKKSFWLPYIREL 183
           G +SFW PY+  L
Sbjct: 137 GSRSFWAPYLDTL 149


>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
          Length = 492

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 134/337 (39%), Gaps = 48/337 (14%)

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKR----- 226
           +S W PY   L  +       ++S + WS  ELA L  S    ++  ++AE I       
Sbjct: 104 ESKWAPYFNVLPTK-------LDSLVFWSPEELAELQASAVLKKVGKDKAEEIFHQSISK 156

Query: 227 ---EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH-LQKVSLARRF 282
              E  ++D ++    S    Y +DIP      E   +        V    QK SLA   
Sbjct: 157 VTPEGTDVD-IFHRVASTIMAYAFDIPD----IEQEDEEGANEDDLVDDDEQKTSLA--- 208

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
            ++PL   +L   +   A L    + +++    P K GE I+   G  P S LL  YG+V
Sbjct: 209 -MIPLAD-MLNADADNNARLHYDGEELEMRTINPIKTGEEILNDYGQLPRSDLLRRYGYV 266

Query: 343 -DEDNPYDRLVVEAALNT-EDPQYQD---KRMVAQRNGKLSVQVFHV------------- 384
            D+   +D  V E + +T  D  YQD   +  V  R G++  ++                
Sbjct: 267 TDKYATFD--VAEISTSTITDHIYQDLAGELKVYLRAGEIEARLELARREDVYEDAHDVG 324

Query: 385 HAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQLADYFKARL 444
           HA  E   ISD L  L    +     + ++ SS       S      V   L    + R 
Sbjct: 325 HATEEWPCISDELVALVYLLLVGEETLAAIQSSKMSFPSRSKMETELVGKALQRILERRE 384

Query: 445 AGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKML 481
             Y  T+ EDE +L   N   + ++A Q VRM +K++
Sbjct: 385 REYATTVVEDENLLQSGNHSNRVKMAIQ-VRMGEKVV 420


>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 705

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
           +AA+ED+ A      +PN ++++L ++   E + +++  N           +E   +A+Y
Sbjct: 21  IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79

Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
           +++EK +G+KSF+ PY   + R        +E   L    E+ Y
Sbjct: 80  VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123


>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
 gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
          Length = 2474

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           ++ L+       G+ +    GP+PNS+L+++YGF  +DNP D ++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILK 255


>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
          Length = 321

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)

Query: 57  VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
           V++ D   A   +  ++ +ED      W   NG +   K+ +K +            V  
Sbjct: 50  VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
            E ++ G+   S P +L +  +  +  +   + + L  +K      + L++++E K GK 
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162

Query: 174 SFWLPYIRELDRQ 186
           SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175


>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
 gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
          Length = 537

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 36/79 (45%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           AL+P          K  +  AAV   ++        AGE   ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHRQGKITSFYAAVPRQLECTAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV 378

Query: 343 DEDNPYDRLVVEAALNTED 361
           D+ N  D + +   L+  D
Sbjct: 379 DDYNLKDYVNIRVGLSLTD 397


>gi|41054567|ref|NP_955894.1| N-lysine methyltransferase setd6 [Danio rerio]
 gi|82177062|sp|Q803K4.1|SETD6_DANRE RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|27882107|gb|AAH44440.1| SET domain containing 6 [Danio rerio]
          Length = 460

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 96/244 (39%), Gaps = 31/244 (12%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----LSELACLALYLMYEKK 169
           A ED++ G   F++P   ++      G   + ++L   K      S    L L LMYE  
Sbjct: 53  AKEDIEEGHVLFTIPREALLHQ----GTTKVKKVLEEGKKCLESASGWVPLLLSLMYEYT 108

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREY 228
               S W PY+      R      ++ P+ WSE E    L G+     ++     ++ EY
Sbjct: 109 SST-SHWKPYLSLWPDFR-----TLDQPMFWSEEECDKLLKGTGIPESVITDLRKLQDEY 162

Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQS----CVVHLQKVSLARRF 282
           N +  + FM     + +P     E    E++K   AFV   S         +      + 
Sbjct: 163 NSV-VLPFM-----KSHPDLWDPEKHNLELYKSLVAFVMAYSFQEPVEDDDEDEEDDEKK 216

Query: 283 ALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
             +P+  P+   L + SK  A L    + +++V  R    GE +    G   N +LL  Y
Sbjct: 217 PNLPMMVPMADMLNHISKHNANLEYTPECLKMVSIRRIGKGEEVFNTYGQMANWQLLHMY 276

Query: 340 GFVD 343
           GF +
Sbjct: 277 GFAE 280


>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
 gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
          Length = 489

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 79/382 (20%), Positives = 149/382 (39%), Gaps = 75/382 (19%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
           +AA+ DL+ G+     P + ++T +RV  ++  IA  ++ ++  LS +  L + L+ E  
Sbjct: 57  LAAARDLRRGELVLRAPRAALLTSDRVTADDPRIAACVSAHRPRLSSVQILIVCLLAEVG 116

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
           +G+ S W PY+ +L                   T LA  T    + E L+  + I   ++
Sbjct: 117 KGRNSVWYPYLSQLPSYY---------------TILA--TFDDFEVEALQVDDAIWVAQK 159

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
             + + + W     L ++  +    +   F+ +  AF  V S  +H   ++      L P
Sbjct: 160 AKSAIKSDWEDVTPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 214

Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQL---------------VVDRPY-------------- 317
           +G  L  Y++         +D  +L               + D  Y              
Sbjct: 215 VG-DLFNYAAPDDDTSLEAEDTAELTNYQQKNEMINSSERLTDGGYEDSNAYCLYARKNY 273

Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALN-TEDPQYQDKRMVAQRNGK 376
           K GE +++  G   N +LL +YGF+  +NP ++  +E  L+      +    M    NG 
Sbjct: 274 KQGEQVLLGYGTYTNLELLEHYGFLLGENPNEKTFIELDLDICSGGTWPKDSMYIHSNGH 333

Query: 377 LSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERAVLDQL 436
            S                 +L  LRL + + T+  ++V   +     +S   E  ++  L
Sbjct: 334 PSFA---------------LLCALRL-WSTPTNHRKAVGHQIYSGSMLSTENEMGIMKWL 377

Query: 437 ADYFKARLAGYPATLSEDEAML 458
               +  L   P T+  DE++L
Sbjct: 378 ISRCEGTLQQLPTTVEFDESLL 399


>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
 gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
          Length = 544

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 37/79 (46%)

Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
           AL+P          K  +  AAV   ++       +AGE   ++ G + N+ LL++ GFV
Sbjct: 319 ALIPYWDMANHKPGKITSFYAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV 378

Query: 343 DEDNPYDRLVVEAALNTED 361
           D +N  D + +   L+  D
Sbjct: 379 DVNNLKDYVNIRVGLSPTD 397


>gi|354502761|ref|XP_003513450.1| PREDICTED: SET domain-containing protein 4 [Cricetulus griseus]
          Length = 440

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 56/239 (23%), Positives = 98/239 (41%), Gaps = 22/239 (9%)

Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
           LQ G    S+P S ++T   V+   ++   +   K   S L  L  +L+ E+  G +S W
Sbjct: 66  LQEGQMIISLPESCLLTTNTVI-RSSLGPYMKKWKPPPSPLLALCTFLISERHAGGQSLW 124

Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
             Y+  L +       +   P+   E ++  L   P KA+  E+   ++  +      + 
Sbjct: 125 KSYLDILPK-------SYTCPVCL-EPDVVDLLPQPLKAKAEEQRADVQDFFASSRAFFS 176

Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV---SLARRFALVPLGP--PL 291
               LF + P D     F++  F  A+  V +  V+L+      L+       L P   L
Sbjct: 177 TLQPLFVE-PVD---GIFSYSAFLWAWCTVNTRAVYLRSTRQECLSAEPDTCALAPYLDL 232

Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
           L +S   + KA  +      ++      +  E + +  GP  N +LL+ YGFV   NP+
Sbjct: 233 LNHSPHVQVKAAFSEKTGCYEIRTASRCRKHEQVFICYGPYDNQRLLLEYGFVSVCNPH 291


>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
          Length = 546

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437


>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 1 [Galdieria
           sulphuraria]
          Length = 487

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 47/331 (14%)

Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LD 232
           S W PYI  L      G +       WS +ELA L   P   E+      I + Y E L 
Sbjct: 166 SLWKPYIDILPHALNTGLVY------WSSSELAQLQYRPLIEEV-----KINQYYREALY 214

Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
           T  F + S     P  +  +     +F  A   VQS    +  V   + +AL+P+   L 
Sbjct: 215 TRVFESLS----SPVRVWLQNEKENVFFWALDMVQSRAFGIPDVG-NKTYALLPMMDMLN 269

Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
              +S+   +  ++ +  ++        G  I +  GP  N  LL  YGF+  +NP D  
Sbjct: 270 HRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYGFLQTNNPSDYF 329

Query: 352 VVEAALNTEDPQYQDKRMVAQ------------------RNGKLSVQVFHVHAGREKEAI 393
            V+         Y+ +   AQ                   NGK +  ++H H   E + I
Sbjct: 330 QVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHIYENGK-TFHLYHDHYDDEIDII 388

Query: 394 SDMLPYLRLGYVSDTSEMQSVISSLGPICPVSPCMERA--VLDQLADYFKARLAGYPATL 451
                 LR+   S T   Q   +    +   +  +E    V   +    K  L     ++
Sbjct: 389 ------LRVFMASKTDWQQIQENFAMGLFHKALSLENQLHVWQVIIGGCKHLLKDMKTSV 442

Query: 452 SEDEAMLTDYN-LHPKKRVATQLVRMEKKML 481
            EDE +L + + L  K ++A Q  R+EKK +
Sbjct: 443 EEDEQLLKNKDQLSTKLQLAIQF-RLEKKYI 472


>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 544

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)

Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
           L ++ +  +AC+A Y+ YEKKQ + +  L Y R L       Q+ V++  LW+   L   
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400

Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
           Y +        L+RA  I  E   L  VW+  G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435


>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 441

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 29/46 (63%)

Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           + LV+  P   G  ++   GP+PN++L++ YGF   +NP D +V++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLK 295


>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
           2508]
 gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 533

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 83/205 (40%), Gaps = 30/205 (14%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 218
           L L LM+E  QG  S W PY+  L  Q        ++P+ W+E ELA L  S   A++  
Sbjct: 131 LILILMHEYLQGSSSNWSPYLSILPHQ-------FDTPMFWTEAELAELQASALVAKVGK 183

Query: 219 ERAEGIKRE-----YNELDTVWFMAGSLFQQ-----------YPYDIPTEAFTFEIFKQA 262
           + A+ + R        E + V++ AG+   Q           +       A+ F++ K+ 
Sbjct: 184 DEADKMIRTKIVKVVQENEDVFYPAGTPKTQRLDEGELLKLGHRMGSAIMAYAFDLAKEE 243

Query: 263 FVAVQSCVVHLQKV-----SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
                        V      +     +VP+   +L   +   A +   +  +     R  
Sbjct: 244 DDDEDEEEEEDGWVEDKIGGMNDTMGMVPMA-DMLNADAVFNAHINHGEACLTATSLREI 302

Query: 318 KAGESIVVWCGPQPNSKLLINYGFV 342
           K GE I+ + GP  +++LL  YG+V
Sbjct: 303 KEGEEILNYYGPLSSAELLRRYGYV 327


>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
 gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
          Length = 562

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 66/299 (22%), Positives = 121/299 (40%), Gaps = 47/299 (15%)

Query: 78  LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
           L D   W  KNG    + I  ++ S +     +  + A+E+         +P+ L++T E
Sbjct: 4   LEDCIQWAVKNGSIVDERIHFKQSSISGISAVVEGILATEE-----PLIQIPSKLLITNE 58

Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKK----QGKKSFWLPYIRELDRQRGRGQLA 193
           +    E+    + ++ + + A  AL  +Y  K    +G  S + PYI  L        L 
Sbjct: 59  K--AQESFQ--VDSDVIDKNAPNALVQLYVAKLKFAKGMPSIYQPYIDLLP-------LK 107

Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--------------------DT 233
           +E P  W   EL  + G+     + +R   +  E+  L                    D 
Sbjct: 108 LEQPYFWDWKELQVIKGTDLYLVMKQRLPKLLEEWTTLLKKLSLEPSDDLGQLETPGLDL 167

Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS---CVVHLQKVSLARRFALVPLGPP 290
           V ++A   +++    +P  +F   ++     A ++     ++ Q +S+   F L P+   
Sbjct: 168 VDYVAR--YRETNEQLPWNSFAAYVWSAGIFASRAFPKIALNDQCLSINEAF-LYPI-VD 223

Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
            L + +  K      D  +  V     K+GE +    G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282


>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
          Length = 253

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
           V A+ D+       +VP  L++++E       + +L+  +K L  +    L+++L+ EK 
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
           +G  SFW PYI  L +          + L +S  EL  L GSPT    L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225


>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
          Length = 352

 Score = 39.7 bits (91), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 47/205 (22%), Positives = 87/205 (42%), Gaps = 37/205 (18%)

Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
           L L L+ E+ +   SFW PYI  L             P+ +   ++  L  +P   ++ +
Sbjct: 3   LGLRLLQERAK-SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNK 54

Query: 220 RA----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
           R     E  K    +L TV  +    + Q   D+ + +  +     A  A  S    L  
Sbjct: 55  RCRFLLEFEKEVQQKLHTVPLVDHPFYGQ---DVNSSSLGW-----AMSAASSRAFRLH- 105

Query: 276 VSLARRFALVPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVW 326
                    VP+  PL+     +++   + +    + ++D +V+++ ++  K  E+I + 
Sbjct: 106 -------GEVPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLN 158

Query: 327 CGPQPNSKLLINYGFVDEDNPYDRL 351
            G  PN   L++YGFV   NPYD++
Sbjct: 159 YGCYPNDFFLLDYGFVITQNPYDQV 183


>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
          Length = 854

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
           V A E +Q G+    + N  V+ L   L +    +  +     N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571

Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
              ++ S + P++ +L             P+L S+ +L +L GS    E+      + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626

Query: 228 Y 228
           +
Sbjct: 627 F 627


>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
           [Gallus gallus]
          Length = 225

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
           + A+ +++A +    VP  L++T+E    N  +  L + +++ +      LA +L+ E+ 
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166

Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
               SFWLPYI+ L  +        ++PL + E E+ YL  +    ++  + +   R+Y
Sbjct: 167 N-PNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217


>gi|407920105|gb|EKG13323.1| hypothetical protein MPH_09605 [Macrophomina phaseolina MS6]
          Length = 574

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 49/224 (21%), Positives = 86/224 (38%), Gaps = 32/224 (14%)

Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
           E +  LL  N LS +A +   L+     G+KS W PYI  L +        + +P+ ++E
Sbjct: 71  EVVQGLLPNNVLSNIALIKELLL-----GEKSLWAPYINCLPKSE-----QLNTPIYFAE 120

Query: 203 -----------TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
                       + A+L G+        R E  + E+    +V    G         I T
Sbjct: 121 EMTQEAINGRRNDTAWLLGTNLDKSWRPRKEQWEEEWKNAVSVLKRQG---------IAT 171

Query: 252 EAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
           E +T++ +  A      +S +        + ++A++     LL +    K      +   
Sbjct: 172 EGYTWDAYAWAATIFTSRSFISDPGLSKESSQYAVLMPVIDLLNHRFPTKVAWFFNEGNF 231

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           Q + + P   G  I    G + N +LL  YGF   +N  D + +
Sbjct: 232 QFITEEPVPKGHEIFNNYGGKGNEELLNGYGFCIPNNHCDEVAI 275


>gi|194764087|ref|XP_001964163.1| GF21412 [Drosophila ananassae]
 gi|190619088|gb|EDV34612.1| GF21412 [Drosophila ananassae]
          Length = 1017

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 17/52 (32%), Positives = 30/52 (57%)

Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMV 370
           AGE   ++ G + N++ L+N GFVD DN  D + +   L+  DP  + + ++
Sbjct: 838 AGEQFFIYYGDRTNTEFLVNNGFVDPDNRNDYVNIRLGLSPTDPLAEKRAII 889


>gi|156054286|ref|XP_001593069.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980]
 gi|154703771|gb|EDO03510.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 418

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 94/228 (41%), Gaps = 24/228 (10%)

Query: 147 ELLTTNKLSELACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
           E L T K  +   +   +LM +  + +KS W  YIR L +      L +  P+ W E + 
Sbjct: 77  EFLETLKQDDPNIIGHFFLMQQYLKCEKSPWWQYIRLLPQPGDPKSLGI--PIWWPEEDQ 134

Query: 206 AYLTGSPT----------------KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
            +L G+                  K  +L R     +EY+ +   W  A ++F    +  
Sbjct: 135 KFLAGTNAGPPLQKREQMWRDQWKKGVVLLRELPNHKEYSYILYQW--AATIFDSRSFR- 191

Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
           P+     E   ++   +   + H++    +  + LV +G        + K  L++  ++ 
Sbjct: 192 PSLTICPEALSESSKEMDLNLDHVRNDRFSILYPLVDIGNHNGINQVEWKKDLSS--NSF 249

Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
            LV       G+ I  + G + NS+LL+ YGF+  ++  +R VV   L
Sbjct: 250 DLVHSAGVSEGDQIYNYYGNKSNSELLLGYGFILPNDIVNRNVVNLKL 297


>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
 gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
          Length = 323

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 24/35 (68%)

Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           GE +V+  G + N +LL  +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233


>gi|406978090|gb|EKE00118.1| hypothetical protein ACD_22C00090G0009 [uncultured bacterium]
          Length = 478

 Score = 39.3 bits (90), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 44/191 (23%), Positives = 74/191 (38%), Gaps = 28/191 (14%)

Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
           P+HY+   ++   G   FS P   + T   + G E + E L     SE     +Y MY K
Sbjct: 77  PMHYMQKEKEHVKG---FS-PELAIAT---IAGGEKLTEELAIRPTSETI---MYDMYRK 126

Query: 169 KQGKKSFWLPYIRELD----------RQRGRGQLAVE-SPLLWSETELAYLTGSPTKAEI 217
                  W    R+L           R   R  L +  S  LW E   A+LT   +   +
Sbjct: 127 -------WTNSWRDLPVLINQWCNVVRWEKRTYLFLRTSEFLWQEGHCAHLTHEESTDTV 179

Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
           +      K+ YN+L +++ + G   +   +    + +TFE       ++Q+C  H    +
Sbjct: 180 IWAINAYKKTYNDLMSIYGIVGVKSESEKFAGAVKTYTFESLMPNGKSLQTCTSHDLGQN 239

Query: 278 LARRFALVPLG 288
            ++ F     G
Sbjct: 240 FSKSFEWTVQG 250


>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
           prasinos]
          Length = 866

 Score = 39.3 bits (90), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 62/269 (23%), Positives = 111/269 (41%), Gaps = 53/269 (19%)

Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC---------LALYL 164
           A +ED++ GD    +P S   +LE    +E + + +     +             +A+++
Sbjct: 28  AVTEDVRRGDVLLEIPLSRCFSLESAQKSEMLTKAMAKAAAAAAGTRFTPTHDQYMAMFI 87

Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEILERAE 222
           + E+  GK+S    +I  + +       A + PL WSE E   + L G+ T AE L   E
Sbjct: 88  LLEQNLGKQSSHYEHILSIPK-------AYDLPLFWSEEERQRSLLFGTTTYAETLALDE 140

Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSC----------- 269
            + ++Y  L     +    F++       +  T + FK  +A +  + C           
Sbjct: 141 EVIQDYELLKH--HLGEDFFRE-------QNITMDRFKWVRATLWSRQCDLLRPAPETTR 191

Query: 270 -VVHLQKVSLARRFALVPLGPP-LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
             V + +  +    + VPLG    L YS   + ++ A   A     + P      I    
Sbjct: 192 LRVLIPEFDMFNHSSKVPLGSSHKLNYS---RGLVTAFATA-----NVPKGEQAYISYGS 243

Query: 328 GPQPNSKLLINYGFV---DEDNPYDRLVV 353
           G   +SKLL+ YGF    + +NP+++L V
Sbjct: 244 GEASSSKLLLWYGFAPLNEGENPFEQLDV 272


>gi|321257099|ref|XP_003193469.1| nucleus protein [Cryptococcus gattii WM276]
 gi|317459939|gb|ADV21682.1| nucleus protein, putative [Cryptococcus gattii WM276]
          Length = 491

 Score = 39.3 bits (90), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 58/260 (22%), Positives = 99/260 (38%), Gaps = 47/260 (18%)

Query: 115 ASEDLQAGDAAFSVPNSLVVT--LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           A +D++ G   F V ++L+++     +  +   +E    NK    A L L +M+E  +G 
Sbjct: 46  AVKDIEEGTPLFHVTDNLILSPYTSDLKDHLDASEWDQLNK--GWAQLILVMMWETIKGS 103

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
           KS W  Y+  +        +  E+P+ W+E +   L+G+       + A+ I RE  E +
Sbjct: 104 KSRWAGYLTNM-------PVMFETPMFWTEQQRDQLSGT-------DIADRIGREDAEAE 149

Query: 233 TVWFMAGSLFQQ---YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------- 281
               +A  +      +P D P         + + +  +S  V L +   ++         
Sbjct: 150 YTSLLAPFIKAHPDLFPVDSPHTTIDAFHIQGSRILSRSFTVPLHRFGRSQSQSQSDGNE 209

Query: 282 -----------FALVPLGPPLLAYSSKCKAML-------AAVDDAVQLVVDRPYKAGESI 323
                        ++P    L A   K  A L          D+ V +   R  K  E I
Sbjct: 210 TESDDEEEEEVVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTRLVKQSEQI 269

Query: 324 VVWCGPQPNSKLLINYGFVD 343
                  PNS+LL  YG VD
Sbjct: 270 YNTYDSPPNSELLRKYGHVD 289


>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
           B]
          Length = 457

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 30/49 (61%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
           D AV L++  P   G  ++   GP+PN++L++ YGF    NP D +V++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLK 323


>gi|335300684|ref|XP_003358991.1| PREDICTED: SET domain-containing protein 4 [Sus scrofa]
          Length = 440

 Score = 38.9 bits (89), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 74/332 (22%), Positives = 127/332 (38%), Gaps = 40/332 (12%)

Query: 65  AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDA 124
           A SR V    + +  +LK W+         +I    P           + +   LQ G  
Sbjct: 20  AESRGVNESYKPEFIELKKWLKDRNFEDTNLIPARFPGTGRG------LMSKTSLQEGQL 73

Query: 125 AFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIREL 183
             ++P S ++T + VL +     +       S L  L  +L+ EK  G +S W PY+  L
Sbjct: 74  VIALPESCLLTTDTVLRSYLGPYIAKWQPPPSPLLALCTFLVSEKHAGDQSPWKPYLEVL 133

Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
            +           P+   E E+  L   P K++  E+     R +    +      SL  
Sbjct: 134 PK-------TYTCPVC-LEPEVVNLLPGPLKSKAREQR---TRVWEFFSSSRDFFSSLQP 182

Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----FALVP----LGP--PLLA 293
            +P  + +      IF  + +    C V+ + V + +R    F+  P    L P   LL 
Sbjct: 183 LFPEAVES------IFSYSALLWAWCTVNTRAVYMKQRPRQCFSTEPDTCALAPYLDLLN 236

Query: 294 YSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
           +S   + KA         ++      +  E + +  GP  + +LL+ YGFV   NP+  +
Sbjct: 237 HSPAVQVKAAFNEESRCYEIRTGTSCRKHEEVFICYGPHGSHRLLLEYGFVSPRNPHACV 296

Query: 352 VVEAALNTEDPQYQDKRMVAQRNGKLSVQVFH 383
            V   +  +     DK+M    N K+S+   H
Sbjct: 297 YVPKDILVKYLPSTDKQM----NKKISILKDH 324


>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 241

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)

Query: 73  KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
           + ++ +  L  W  KNG      IL     H  ++   + + A++++  GD   +VP +L
Sbjct: 81  RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134

Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
           ++T E +    L      +++  N  +    LA++++ E  ++ KKSFW  Y+  L    
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189

Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
               +   +P+ +   +L  L GSP     L+    I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226


>gi|42820762|emb|CAF32075.1| SET domain protein, putative [Aspergillus fumigatus]
          Length = 530

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 65/289 (22%), Positives = 117/289 (40%), Gaps = 41/289 (14%)

Query: 81  LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
           L SW   NG+    +  ++  S +   +    VA +E   + G+A    +VP+ L +TLE
Sbjct: 59  LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 118

Query: 138 RVLGNETIAELL-----TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
            V  +  I   L            +     ++   +K G  + W  YIR +         
Sbjct: 119 YVHNHAKIDRHLREVLDAVGDFGRVCYSPDFVNKRQKIGISNPWTEYIRFM-------PA 171

Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPY 247
           +V  P  +S  E   L G+  +  +  +   +++E++ L     +  W       Q++ +
Sbjct: 172 SVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEIPWC------QEHWW 225

Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLLAYSSKCKAMLAA 304
           D  T  FTF+ +K      +S VV L +       A+VP   +       S K +     
Sbjct: 226 DEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANHACEDSVKARYDEEG 281

Query: 305 VDDAV-QLVVDRPYKAGE----SIVVWC---GPQPNSKLLINYGFVDED 345
             +AV QL   +  + GE    +  V C     +P S+++ +YGFV+ +
Sbjct: 282 AGNAVLQLRTGKKLRVGEEKLHADAVACRYGDEKPASEMVFSYGFVENE 330


>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 508

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 17/109 (15%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEK 168
           A +D+  G   FS+P  L ++L       T+  LL  ++  E         L L +M+E+
Sbjct: 43  ALQDIHEGTTLFSLPRELTLSLR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEE 98

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
            +G +S W  Y+  L           ++P+ WS  +L  L G+    +I
Sbjct: 99  SRGVESKWDVYLSSLPS-------TFDTPMFWSAEDLEELKGTAVPDKI 140


>gi|308802011|ref|XP_003078319.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116056770|emb|CAL53059.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 429

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 76/185 (41%), Gaps = 17/185 (9%)

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQD-KRMVAQR 373
           + Y+ GE +++  G   N +L+  YGFVD DN  D    E  ++     Y   KR +   
Sbjct: 246 KDYETGEEVLISYGVLNNDELITRYGFVDVDNVADIYRFEGLMSYLQASYDPMKRALGAD 305

Query: 374 NGKLSVQVFHVH-----AGREKEAISD------MLPYLRLGYVSDTSEMQSVISSLGPIC 422
             +LS  +   H     A  E   ISD      +L  LR   V  T E  +    +    
Sbjct: 306 QKRLST-LKRTHPELDQALWEGNFISDGNADPKLLWALRT--VLATPEEYAAAKGVDGFK 362

Query: 423 PVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTDYNLHPKKRVATQLVRMEKKMLN 482
                 ER   D +    ++RLA YP T+ EDE  L   N    +R A Q    +K++L 
Sbjct: 363 LGGGAPERRAADAVRAAVESRLAEYPTTIEEDEEALKTAN--GNERTAIQYRIRKKRILR 420

Query: 483 ACLQV 487
              ++
Sbjct: 421 DASRI 425


>gi|413951745|gb|AFW84394.1| hypothetical protein ZEAMMB73_159573, partial [Zea mays]
          Length = 339

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 99  ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 155


>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
 gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
          Length = 634

 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 45/203 (22%), Positives = 89/203 (43%), Gaps = 23/203 (11%)

Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
           L L+ ++ +GK+S W  YI  L      G  ++ +PL + + ++A+L G+       ER 
Sbjct: 105 LLLIEQRNKGKESPWHAYIACLP-----GAESMTTPLWFDDEDMAFLAGTSLAPAAKERK 159

Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSLAR 280
               +++ +   +   AG        D  +  +   IF  +AF++      H        
Sbjct: 160 SLYYQQWEQALGIMKDAGVALAD-EVDFESLLWAATIFTSRAFISTHILPDH-------- 210

Query: 281 RFALVPLGPPLL-----AYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWCGPQPNSK 334
               VPL  P++     + S+K +     +   +++L+    + AG+ +     P+ N +
Sbjct: 211 --ETVPLLFPIVDILNHSVSAKVEWEFQPLASFSLKLLEGDTFTAGQELFNNYAPKQNDE 268

Query: 335 LLINYGFVDEDNPYDRLVVEAAL 357
           LL+ YGF  E NP ++  ++ A 
Sbjct: 269 LLLGYGFCLEHNPIEQFPLKLAF 291


>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 579

 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 28/47 (59%)

Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
           +A  L  D+   +G+ + +  GP+ N +LL  YGFV+ +NP D  V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407


>gi|125528589|gb|EAY76703.1| hypothetical protein OsI_04658 [Oryza sativa Indica Group]
          Length = 495

 Score = 38.9 bits (89), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
           D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309


>gi|21952799|dbj|BAC06215.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|22202682|dbj|BAC07340.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|215769224|dbj|BAH01453.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222619626|gb|EEE55758.1| hypothetical protein OsJ_04288 [Oryza sativa Japonica Group]
          Length = 495

 Score = 38.9 bits (89), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
           D+ ++++V R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 255 DEDLEMIVVRDVNEGEEVFNTYGTMGNAALLHRYGFTEMDNSYDIVNIDLALVTK 309


>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
          Length = 216

 Score = 38.5 bits (88), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
           V  ++ ++  +   +VP S+ +T E+V  N       T    S     +L+L+ +K  GK
Sbjct: 22  VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76

Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
           KSF+ PYI  L ++       + + L + E ++ Y+  +  +  + ER   ++ ++++L
Sbjct: 77  KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128


>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
          Length = 147

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
           +AA+ DL+ G+    +P + ++T +RV  ++  IA  ++ +K  LS +  L + L+ E  
Sbjct: 51  LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110

Query: 170 QGKKSFWLPYIREL 183
           +G  S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124


>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
          Length = 789

 Score = 38.5 bits (88), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 42/194 (21%), Positives = 84/194 (43%), Gaps = 26/194 (13%)

Query: 62  TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
           T +   +  + K++E   + + W+  NG+    V   + P    +   +  +AA +D+  
Sbjct: 28  TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84

Query: 122 GDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLMYEKKQGKKS 174
             A   VP  L+++ E  + N  IA       E+   ++ +E   +  ++ +E  +G+ S
Sbjct: 85  QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143

Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-NELDT 233
           FW PY + ++          + P+LWS+ E+        + +I    +  K EY NE   
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEYENEWKL 191

Query: 234 VW--FMAGSLFQQY 245
           V+  F     + +Y
Sbjct: 192 VYEAFSKDETYDEY 205


>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
 gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
          Length = 391

 Score = 38.5 bits (88), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)

Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 353
           +  DR YKAGE + V  G   N  LL+ YGF+ + N     P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250


>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 297

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
           A+ D++AG+    VP  LV+T+E    N  +  L + +++ +      LA +L+  ++  
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170

Query: 172 KKSFWLPYIREL 183
             SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182


>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
 gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
          Length = 429

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 30/59 (50%)

Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVAQR 373
           R YK GE I +  GP PN  L + YGF  E N  D + ++  +  +    + + ++ QR
Sbjct: 264 RLYKKGEEIYMSYGPHPNDFLFVEYGFYLETNESDAIFLDDIIFKDFTVAEKEELIRQR 322


>gi|223946389|gb|ACN27278.1| unknown [Zea mays]
          Length = 289

 Score = 38.5 bits (88), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%)

Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEAALNTE 360
           A D+ +++++ R    GE +    G   N+ LL  YGF + DN YD + ++ AL T+
Sbjct: 47  ANDEDLEIIIVRDVNEGEEVYNTYGTMGNAALLHRYGFTELDNQYDIVNIDLALVTK 103


>gi|443699166|gb|ELT98776.1| hypothetical protein CAPTEDRAFT_151537 [Capitella teleta]
          Length = 413

 Score = 38.5 bits (88), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 104/262 (39%), Gaps = 42/262 (16%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL-----TTNKLSELACLALYLMYE 167
           + A+ D+  GD  F +P SL++T +    N TI  LL     +  + S    L + LMYE
Sbjct: 1   MVATSDISQGDTIFEIPRSLLLTPQ----NSTIGVLLNEEADSLQEASRWVPLLITLMYE 56

Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEILERAEGIKR 226
                 S W PY    D      QL +  P+ WS  E+   L G+   + +      I +
Sbjct: 57  YT-SPSSRWKPY---FDLVPDFDQLDL--PMFWSSDEVKRELKGTGIPSLVESDLLNISK 110

Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHL----------- 273
           E+N+L           Q++      E    + +K+  AFV   S                
Sbjct: 111 EFNDL------VLPFIQKHSNVFSDECKCLKFYKKMVAFVMAYSFTEPPPSPDLDDSDDL 164

Query: 274 --QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGP 329
              +  L  +  +VP+   +L + +K  A L       ++++V  +  + GE I    G 
Sbjct: 165 SGDEHDLMPQPMMVPMA-DILNHVAKNSARLDFPKGSSSLKMVATQDIQKGEEIFNTYGE 223

Query: 330 QPNSKLLINYGFVDED--NPYD 349
             N  LL  YGF ++   N YD
Sbjct: 224 LANMNLLHMYGFAEDIGCNEYD 245


>gi|403366800|gb|EJY83208.1| hypothetical protein OXYTRI_19172 [Oxytricha trifallax]
          Length = 869

 Score = 38.5 bits (88), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 37/170 (21%), Positives = 75/170 (44%), Gaps = 18/170 (10%)

Query: 302 LAAVDDAVQLVVDRPYKAG---ESIVVWC-GPQPNSKLLINYGFVDEDNPYDRLVVEAAL 357
           ++ V D    ++ R Y  G    S V  C G   N ++L  YGF    N Y+ + ++  L
Sbjct: 454 VSQVPDDFNFII-RTYNDGFPKGSQVFLCYGRMSNREMLKRYGFCLTYNKYNYIFIKLRL 512

Query: 358 NTEDPQYQDKRMVAQR-------NGKLSVQVFHVHAGREKEAISDMLPYLRLGYVSDTSE 410
             +DP +  ++ V ++         K+ +   H     +K   + +L ++++ Y +   +
Sbjct: 513 EQQDPDFIYRKYVLRKFFSIEPETDKMDISSRHFRIYFQK-LNTKVLKFIKILYFNVQED 571

Query: 411 MQSVISSLGPICPVSPCMERAVLDQLADYFKARLAGYPATLSEDEAMLTD 460
             S I     +   S  +E     +L D ++  L  +P T+ ED+ +L++
Sbjct: 572 DISCI-----VETRSLSLEYLAFQRLRDVYETFLKSFPTTIGEDKKILSE 616


>gi|428183324|gb|EKX52182.1| hypothetical protein GUITHDRAFT_150712, partial [Guillardia theta
           CCMP2712]
          Length = 205

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 55/116 (47%), Gaps = 15/116 (12%)

Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
           ++D+++     S+P+ L   L R         +   +KL     +AL ++YEK + ++SF
Sbjct: 88  TQDVKSNSVVCSIPSKLF--LSRSTTRAAFGSM--ADKLDVRTAMALQILYEKSKKEESF 143

Query: 176 WLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
           W  +++ L DR+       + +P LW E +   L G+     + E  E  K  Y++
Sbjct: 144 WCEWLKVLPDREN------LGTPCLWPEDDQNLLKGT----SVFEEVEASKSLYSK 189


>gi|424513789|emb|CCO66411.1| predicted protein [Bathycoccus prasinos]
          Length = 532

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 75/332 (22%), Positives = 131/332 (39%), Gaps = 53/332 (15%)

Query: 75  EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
           E+D  DL +W  KNG+         K S  +  R +           GDAA  +  S++ 
Sbjct: 219 EKDRDDLLNWGVKNGVDFIAASFVRKGSDIDYIRSV----------LGDAAPKI--SIIS 266

Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
            +E + G +   +++  +    +A   L +    +Q     +L   R + R    G+   
Sbjct: 267 KVENMEGLDNFEDIVDKSDGVMVARGDLGMEIRMEQ----IFLAQKRMIKRCNEAGK--- 319

Query: 195 ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
             P++ +   L  +TG+P  T+AE  + A  I    +  D V     +    YP  +   
Sbjct: 320 --PVITATQMLESMTGAPRPTRAEATDVANAI---LDGTDCVMLSGETAAGDYP--LEAV 372

Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
           +   +I ++A   + S  V  Q                LLAY S    +L ++  +    
Sbjct: 373 SCMADICREAEAYIDSAAVFQQ----------------LLAYQSVPMNILESLASSS--- 413

Query: 313 VDRPYKAGESIVVWCGPQPN-SKLLINYGFVDEDNPYDRLVVEAALNTEDPQYQDKRMVA 371
           V    K G  ++V      N S+L+  Y     D P   + V    NT DP+   +RM+A
Sbjct: 414 VRSAQKVGAKLIVTLAKSGNTSRLIAKY---RPDCPVLSVCVNMEENTHDPENTARRMLA 470

Query: 372 QRNGKLSVQ--VFHVHAGREKEAISDMLPYLR 401
            R  K  ++   +H  +G  +E  ++ + Y R
Sbjct: 471 SRGLKPMIEPAEWHAQSGHPQEISANAILYAR 502


>gi|301119251|ref|XP_002907353.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105865|gb|EEY63917.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 109/248 (43%), Gaps = 24/248 (9%)

Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
           V  +ED+      FS+P   V++++ +     +  +    +L+   E   LA+ L+YEK 
Sbjct: 43  VFIAEDVTPHTEVFSIPLDSVLSVKSLQDISALQSITFFQQLTPEREDDQLAIALLYEKY 102

Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
            QG KS W  +I  L +          + L +   E+  L GS       +  E +  +Y
Sbjct: 103 MQGDKSKWAKHIELLPK-------TYHNALYFEAGEIKALEGSNLFFIAQQMEEKVASDY 155

Query: 229 NEL-DTVWF-MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALV 285
             L ++V F +  ++ +    D+  E F+ + +K A   + S  V+ + K S     A+V
Sbjct: 156 AVLKESVLFELFENITEGITVDLFDEIFSLDNYKWALSTIWSRFVLPVAKQSFK---AMV 212

Query: 286 PLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
           P+   L   +   +A ++   D      +LV  + + AG  + +  G   N KLL  YGF
Sbjct: 213 PVFDML---NHDPEAEMSHFFDMETQCFKLVSHQHWNAGAQMFINYGALSNHKLLSLYGF 269

Query: 342 VDEDNPYD 349
           V   N +D
Sbjct: 270 VIIGNLFD 277


>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 510

 Score = 38.1 bits (87), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)

Query: 92  PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
           P   +L+  P      R   Y+ A E+++ G    S+P S V+++E       +  LL  
Sbjct: 88  PMSTVLQ--PEGFNFGRGTAYITA-ENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEV 143

Query: 152 N-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
           N  L     L L+L+ E+  G  S +  ++  L         A+ S L +SE E+  L G
Sbjct: 144 NPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMNELEG 198

Query: 211 SPTKAEILERAEGIKREYNEL 231
           S  +   L RA+ ++  Y+ L
Sbjct: 199 SQLQRFTLGRAQAVEAFYDAL 219


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.133    0.392 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,692,010,624
Number of Sequences: 23463169
Number of extensions: 308674708
Number of successful extensions: 825864
Number of sequences better than 100.0: 885
Number of HSP's better than 100.0 without gapping: 331
Number of HSP's successfully gapped in prelim test: 554
Number of HSP's that attempted gapping in prelim test: 824424
Number of HSP's gapped (non-prelim): 1222
length of query: 512
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 365
effective length of database: 8,910,109,524
effective search space: 3252189976260
effective search space used: 3252189976260
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)